91 Commits

Author SHA1 Message Date
Nicklas Hansen
f0af4b6b27 update dockerfile + pin all versions 2024-07-02 10:12:06 -07:00
Nicklas Hansen
a2d9d0c8ff minor fix in print 2024-07-02 10:12:06 -07:00
Nicklas Hansen
ab43880945 migrate to slicebuffer from torchrl-nightly 2024-07-02 10:12:06 -07:00
Nicklas Hansen
5f6fadec0f update pinned torchrl version 2024-03-27 12:51:40 -07:00
Nicklas Hansen
57158282b4 Merge branch 'main' of github.com:nicklashansen/tdmpc2 into main 2024-02-02 15:56:56 -08:00
Nicklas Hansen
718966c28d reduce # wandb calls 2024-02-02 15:56:54 -08:00
Nicklas Hansen
01cdf0f799 Update README.md 2024-01-24 21:43:20 -08:00
Nicklas Hansen
02b18a48b1 update dockerfile 2024-01-22 17:37:31 -08:00
Nicklas Hansen
e8f1ed6785 update dockerfile + pin all versions 2024-01-21 21:21:44 -08:00
Nicklas Hansen
8b6fe61bed minor fix in print 2024-01-11 18:19:08 -08:00
Nicklas Hansen
aa9c6f33f5 migrate to slicebuffer from torchrl-nightly 2024-01-10 19:53:30 -08:00
Nicklas Hansen
ff02f41e73 fix 2024-01-08 17:18:22 -08:00
Nicklas Hansen
e86c343a67 Merge branch 'episodic-rl' of github.com:nicklashansen/tdmpc2 into episodic-rl 2024-01-08 10:55:10 -08:00
Nicklas Hansen
cc62c4c9ce init 2024-01-08 10:51:27 -08:00
Nicklas Hansen
0f3bc77011 amend slicesampler indexing 2024-01-08 10:48:04 -08:00
Nicklas Hansen
fabf01a5ec solves episodic variant of cartpole-balance-sparse 2024-01-07 19:28:41 -08:00
Nicklas Hansen
26c72119cd init 2024-01-07 18:16:33 -08:00
Nicklas Hansen
31249a8961 separate episodes with nans 2024-01-07 14:21:38 -08:00
Nicklas Hansen
33876d124f add instructions for distributed training 2024-01-07 11:55:07 -08:00
Nicklas Hansen
a7ff00b0cd add option to disable planning with mpc=false 2024-01-04 19:17:47 -08:00
Nicklas Hansen
13cac07759 fix minor logging issue in offline trainer 2024-01-03 10:12:28 -08:00
Nicklas Hansen
1d224cec3a update documentation 2023-12-31 14:38:22 -08:00
Nicklas Hansen
e3c876670a add launcher 2023-12-29 16:37:26 -08:00
Nicklas Hansen
1f6c7771b9 Merge pull request #10 from nicklashansen/experimental
[Feature] Faster replay buffer + support pixel observations
2023-12-28 16:37:27 +01:00
Nicklas Hansen
6cb779aa3a allow missing env dependencies + update readme 2023-12-28 07:33:03 -08:00
Nicklas Hansen
54145a4d8c integrate slicesampler as default 2023-12-27 08:49:04 -08:00
Nicklas Hansen
2f86a1e4d8 fix sampler https://github.com/pytorch/rl/pull/1762 2023-12-25 10:11:42 -08:00
Nicklas Hansen
ca4dfa1db3 further reduce buffer differences 2023-12-23 09:39:13 -08:00
Nicklas Hansen
eef1d1b407 unit test buffer implementations 2023-12-23 07:43:28 -08:00
Nicklas Hansen
70fe242adc does not reproduce results w/ previous buffer 2023-12-22 14:26:48 -08:00
Nicklas Hansen
2929cfdb44 fix computation of mem requirements 2023-12-22 13:57:01 -08:00
Nicklas Hansen
34ea3662cd compare new/old buffers 2023-12-22 13:34:12 -08:00
Nicklas Hansen
fea0936e69 set logging defaults 2023-12-22 07:44:34 -08:00
Nicklas Hansen
95bf14e343 Merge branch 'main' of github.com:nicklashansen/tdmpc2 into experimental 2023-12-22 07:43:44 -08:00
Nicklas Hansen
bfb1971898 naive support for pixels 2023-12-22 07:34:40 -08:00
Nicklas Hansen
3ded0ebc83 faster replay buffer implementation 2023-12-22 05:55:43 -08:00
Nicklas Hansen
445af9d81d easier customization of architecture:
all args can now be set freely when model_size is not specified
2023-12-06 08:15:54 -08:00
Nicklas Hansen
f3139291e2 update dependencies 2023-11-25 19:20:52 -08:00
Nicklas Hansen
58a95e431b Merge pull request #1 from asmith26/patch-1
Fix small typo
2023-10-28 10:04:29 -07:00
asmith26
d36529dea4 Fix small typo 2023-10-28 12:26:53 +01:00
Nicklas Hansen
b67b21c5c6 first commit 2023-10-25 18:26:00 -07:00