Commit Graph

  • 20f4064dfa Merge branch 'distributed' of github.com:nicklashansen/tdmpc2 into distributed Nicklas Hansen 2024-01-08 10:51:00 -08:00
  • c6d1bd85bf support distributed training Nicklas Hansen 2024-01-07 11:52:53 -08:00
  • 0f3bc77011 amend slicesampler indexing Nicklas Hansen 2024-01-08 10:48:04 -08:00
  • fabf01a5ec solves episodic variant of cartpole-balance-sparse Nicklas Hansen 2024-01-07 19:28:41 -08:00
  • 26c72119cd init Nicklas Hansen 2024-01-07 18:16:33 -08:00
  • 31249a8961 separate episodes with nans Nicklas Hansen 2024-01-07 14:21:38 -08:00
  • 33876d124f add instructions for distributed training Nicklas Hansen 2024-01-07 11:55:07 -08:00
  • 33555b5982 support distributed training Nicklas Hansen 2024-01-07 11:52:53 -08:00
  • e5c9029c86 disable uncertainty estimation when coef=0 Nicklas Hansen 2024-01-04 19:39:44 -08:00
  • a7ff00b0cd add option to disable planning with mpc=false Nicklas Hansen 2024-01-04 19:17:47 -08:00
  • 194c92331c add uncertainty regularization Nicklas Hansen 2024-01-03 18:11:32 -08:00
  • 13cac07759 fix minor logging issue in offline trainer Nicklas Hansen 2024-01-03 10:12:28 -08:00
  • 1d224cec3a update documentation Nicklas Hansen 2023-12-31 14:38:22 -08:00
  • e3c876670a add launcher Nicklas Hansen 2023-12-29 16:37:26 -08:00
  • 1f6c7771b9 Merge pull request #10 from nicklashansen/experimental Nicklas Hansen 2023-12-28 16:37:27 +01:00
  • 6cb779aa3a allow missing env dependencies + update readme Nicklas Hansen 2023-12-28 07:33:03 -08:00
  • 54145a4d8c integrate slicesampler as default Nicklas Hansen 2023-12-27 08:49:04 -08:00
  • 2f86a1e4d8 fix sampler https://github.com/pytorch/rl/pull/1762 Nicklas Hansen 2023-12-25 10:11:42 -08:00
  • ca4dfa1db3 further reduce buffer differences Nicklas Hansen 2023-12-23 09:39:13 -08:00
  • eef1d1b407 unit test buffer implementations Nicklas Hansen 2023-12-23 07:43:28 -08:00
  • 70fe242adc does not reproduce results w/ previous buffer Nicklas Hansen 2023-12-22 14:26:48 -08:00
  • 2929cfdb44 fix computation of mem requirements Nicklas Hansen 2023-12-22 13:57:01 -08:00
  • 34ea3662cd compare new/old buffers Nicklas Hansen 2023-12-22 13:34:12 -08:00
  • fea0936e69 set logging defaults Nicklas Hansen 2023-12-22 07:44:34 -08:00
  • 95bf14e343 Merge branch 'main' of github.com:nicklashansen/tdmpc2 into experimental Nicklas Hansen 2023-12-22 07:43:44 -08:00
  • bfb1971898 naive support for pixels Nicklas Hansen 2023-12-22 07:34:40 -08:00
  • 3ded0ebc83 faster replay buffer implementation Nicklas Hansen 2023-12-22 05:55:43 -08:00
  • 445af9d81d easier customization of architecture: all args can now be set freely when model_size is not specified Nicklas Hansen 2023-12-06 08:15:54 -08:00
  • f3139291e2 update dependencies Nicklas Hansen 2023-11-25 19:20:52 -08:00
  • 58a95e431b Merge pull request #1 from asmith26/patch-1 Nicklas Hansen 2023-10-28 10:04:29 -07:00
  • d36529dea4 Fix small typo asmith26 2023-10-28 12:26:53 +01:00
  • b67b21c5c6 first commit Nicklas Hansen 2023-10-25 18:26:00 -07:00