91 Commits

Author SHA1 Message Date
Sue Hyun Park
8bbc14ebab Fix: handle _action_masks buffer in single-task scenarios (#67) 2025-05-19 20:12:12 -07:00
Nicklas Hansen
7992fa193e update readme + dockerfile 2025-05-13 14:51:16 -07:00
Nicklas Hansen
7ec6bc83a8 only instantiate termination pred head if episodic=true 2025-05-02 16:51:24 -07:00
Nicklas Hansen
38b31a5d72 only instantiate termination pred head if episodic=true 2025-05-02 16:24:02 -07:00
Nicklas Hansen
7942e9082b update readme + clean up 2025-04-15 16:32:15 -07:00
Nicklas Hansen
eece80123d full support for episodic rl 2025-04-15 15:55:05 -07:00
Nicklas Hansen
38f853efc4 clean up 2025-04-15 10:16:02 -07:00
Nicklas Hansen
62be41ab58 experimental changes to termination prediction 2025-04-10 00:32:13 -07:00
Nicklas Hansen
c95b755655 add walker2d 2025-04-09 15:55:57 -07:00
Nicklas Hansen
81eb17068e QoL improvements to termination signal debugging 2025-04-08 19:15:31 -07:00
Nicklas Hansen
add30b5a74 merge main into branch + fix termination in td-targets 2025-04-08 12:40:10 -07:00
Nicklas Hansen
0a914570dc fix multitask model api conversion 2025-02-27 16:25:21 -08:00
Nicklas Hansen
55bde9745f update torchrl version 2025-02-05 16:46:34 -08:00
Nicklas Hansen
5ced6dfeb4 auto-convert old checkpoints to new format 2025-02-05 16:26:19 -08:00
Nicklas Hansen
dddc226d25 partial fix to loading checkpoints 2025-01-21 00:10:53 -08:00
Vincent Moens
ae4238946f Conversion tools for state-dicts (#55)
* init

* init

* amend
2025-01-20 15:49:36 -08:00
Nicklas Hansen
a19f91c0b5 enable torch.compile for offline rl + rgb inputs 2024-12-25 12:22:39 -08:00
Nicklas Hansen
e452ca7539 factor pi outputs 2024-12-25 12:08:07 -08:00
Nicklas Hansen
db1865334e refactor pi outputs 2024-12-25 12:02:33 -08:00
Nicklas Hansen
804f9b3949 refactor pi outputs 2024-12-24 03:05:00 -08:00
Nicklas Hansen
66f8c21f58 cache buffer values in offline training 2024-12-19 09:40:04 -08:00
Nicklas Hansen
9cac7c5775 faster offline data loading 2024-12-19 06:52:31 -08:00
Nicklas Hansen
df8a465c8e update offline trainer to use new torch.load api 2024-12-10 16:30:05 -08:00
Nicklas Hansen
2e27fbb6f4 partial fix for loading old checkpoints 2024-12-10 16:04:27 -08:00
Nicklas Hansen
6117bc427d simplify dmcontrol wrappers + upgrade to gymnasium==0.29.1 2024-12-10 15:16:34 -08:00
Nicklas Hansen
32fc2bdf93 refactor policy 2024-12-03 12:22:02 -08:00
Nicklas Hansen
0a79c8bd38 update Dockerfile + environment.yaml 2024-11-28 11:35:15 -08:00
Nicklas Hansen
1bfbcb7794 Merge pull request #49 from nicklashansen/speedups
Speedups
2024-11-10 12:33:32 -08:00
Nicklas Hansen
0c3fcc4619 update readme 2024-11-10 12:32:57 -08:00
Nicklas Hansen
fb07cdac3f update compile print 2024-11-10 12:27:24 -08:00
Nicklas Hansen
c694d286f0 add assertion for compile=true compatibility 2024-11-10 12:25:43 -08:00
Nicklas Hansen
1a77207646 support newest version of myosuite 2024-11-04 15:15:40 -08:00
Nicklas Hansen
b7725e74a5 move cfg conversion to parser.py 2024-10-31 14:52:59 -07:00
Nicklas Hansen
d477619f8d Merge branch 'speedups' of github.com:nicklashansen/tdmpc2 into speedups 2024-10-27 14:24:39 -07:00
Nicklas Hansen
c1dd0c0338 minor QoL improvements in offline pipeline 2024-10-27 14:24:19 -07:00
Nicklas Hansen
c0d3faac77 Merge pull request #48 from vmoens/patch-1
Use torch.compiler.cudagraph_mark_step_begin() in eval
2024-10-27 14:22:51 -07:00
Vincent Moens
3b5f67592c Update offline_trainer.py 2024-10-26 00:33:27 +01:00
Vincent Moens
fad0d1be03 Use torch.compiler.cudagraph_mark_step_begin() in eval 2024-10-26 00:32:16 +01:00
Nicklas Hansen
836547d76f fix eval index + clean up 2024-10-21 14:49:21 -07:00
Nicklas Hansen
970792e2b6 clean up prints 2024-10-18 15:31:25 -07:00
Nicklas Hansen
c3a912e10d Merge pull request #46 from vmoens/cudagraphs
[WIP,Feature] Add cudagraphs and compile option
2024-10-18 09:35:33 -07:00
Nicklas Hansen
38eb46df04 Merge branch 'cudagraphs' of https://github.com/vmoens/tdmpc2 into cudagraphs 2024-10-17 14:57:32 -07:00
vmoens
8b731819a6 merged commits 2024-10-08 10:24:02 -07:00
Nicklas Hansen
a7890b6985 add more results 2024-10-06 21:11:38 -07:00
vmoens
603b67ce66 merged commits 2024-10-03 16:09:48 +01:00
Nicklas Hansen
88095e7899 add more results 2024-08-30 11:28:02 -07:00
Nicklas Hansen
3789fcd5b8 update pinned torchrl version 2024-07-02 10:12:06 -07:00
Nicklas Hansen
d51feb0e9f Update README.md 2024-07-02 10:12:06 -07:00
Nicklas Hansen
2dc668ecaf reduce # wandb calls 2024-07-02 10:12:06 -07:00
Nicklas Hansen
39be86fc52 update dockerfile 2024-07-02 10:12:06 -07:00