Commit Graph

101 Commits

Author SHA1 Message Date
Nicklas Hansen
6116eb3fa5 fix merge error 2025-05-20 13:59:12 -07:00
Nicklas Hansen
491d367fc6 Merge branch 'vectorized_env' of github.com:nicklashansen/tdmpc2 into vectorized_env 2025-05-20 13:44:41 -07:00
Nicklas Hansen
10f368f20d init 2025-05-20 13:42:28 -07:00
Nicklas Hansen
829e329b3b init 2025-05-20 13:40:02 -07:00
Sue Hyun Park
8bbc14ebab Fix: handle _action_masks buffer in single-task scenarios (#67) 2025-05-19 20:12:12 -07:00
Nicklas Hansen
7992fa193e update readme + dockerfile 2025-05-13 14:51:16 -07:00
Nicklas Hansen
7ec6bc83a8 only instantiate termination pred head if episodic=true 2025-05-02 16:51:24 -07:00
Nicklas Hansen
38b31a5d72 only instantiate termination pred head if episodic=true 2025-05-02 16:24:02 -07:00
Nicklas Hansen
7942e9082b update readme + clean up 2025-04-15 16:32:15 -07:00
Nicklas Hansen
eece80123d full support for episodic rl 2025-04-15 15:55:05 -07:00
Nicklas Hansen
38f853efc4 clean up 2025-04-15 10:16:02 -07:00
Nicklas Hansen
62be41ab58 experimental changes to termination prediction 2025-04-10 00:32:13 -07:00
Nicklas Hansen
c95b755655 add walker2d 2025-04-09 15:55:57 -07:00
Nicklas Hansen
81eb17068e QoL improvements to termination signal debugging 2025-04-08 19:15:31 -07:00
Nicklas Hansen
add30b5a74 merge main into branch + fix termination in td-targets 2025-04-08 12:40:10 -07:00
Nicklas Hansen
0a914570dc fix multitask model api conversion 2025-02-27 16:25:21 -08:00
Nicklas Hansen
55bde9745f update torchrl version 2025-02-05 16:46:34 -08:00
Nicklas Hansen
5ced6dfeb4 auto-convert old checkpoints to new format 2025-02-05 16:26:19 -08:00
Nicklas Hansen
dddc226d25 partial fix to loading checkpoints 2025-01-21 00:10:53 -08:00
Vincent Moens
ae4238946f Conversion tools for state-dicts (#55)
* init

* init

* amend
2025-01-20 15:49:36 -08:00
Nicklas Hansen
a19f91c0b5 enable torch.compile for offline rl + rgb inputs 2024-12-25 12:22:39 -08:00
Nicklas Hansen
e452ca7539 factor pi outputs 2024-12-25 12:08:07 -08:00
Nicklas Hansen
db1865334e refactor pi outputs 2024-12-25 12:02:33 -08:00
Nicklas Hansen
804f9b3949 refactor pi outputs 2024-12-24 03:05:00 -08:00
Nicklas Hansen
66f8c21f58 cache buffer values in offline training 2024-12-19 09:40:04 -08:00
Nicklas Hansen
9cac7c5775 faster offline data loading 2024-12-19 06:52:31 -08:00
Nicklas Hansen
df8a465c8e update offline trainer to use new torch.load api 2024-12-10 16:30:05 -08:00
Nicklas Hansen
2e27fbb6f4 partial fix for loading old checkpoints 2024-12-10 16:04:27 -08:00
Nicklas Hansen
6117bc427d simplify dmcontrol wrappers + upgrade to gymnasium==0.29.1 2024-12-10 15:16:34 -08:00
Nicklas Hansen
32fc2bdf93 refactor policy 2024-12-03 12:22:02 -08:00
Nicklas Hansen
0a79c8bd38 update Dockerfile + environment.yaml 2024-11-28 11:35:15 -08:00
Nicklas Hansen
10a0be2724 fix indexing 2024-11-10 23:16:32 -08:00
Nicklas Hansen
ad2342e258 Merge branch 'vectorized_env' of github.com:nicklashansen/tdmpc2 into vectorized_env 2024-11-10 13:04:54 -08:00
Nicklas Hansen
fa41a3e450 init 2024-11-10 13:00:56 -08:00
Nicklas Hansen
1bfbcb7794 Merge pull request #49 from nicklashansen/speedups
Speedups
2024-11-10 12:33:32 -08:00
Nicklas Hansen
0c3fcc4619 update readme 2024-11-10 12:32:57 -08:00
Nicklas Hansen
fb07cdac3f update compile print 2024-11-10 12:27:24 -08:00
Nicklas Hansen
c694d286f0 add assertion for compile=true compatibility 2024-11-10 12:25:43 -08:00
Nicklas Hansen
1a77207646 support newest version of myosuite 2024-11-04 15:15:40 -08:00
Nicklas Hansen
b7725e74a5 move cfg conversion to parser.py 2024-10-31 14:52:59 -07:00
Nicklas Hansen
d477619f8d Merge branch 'speedups' of github.com:nicklashansen/tdmpc2 into speedups 2024-10-27 14:24:39 -07:00
Nicklas Hansen
c1dd0c0338 minor QoL improvements in offline pipeline 2024-10-27 14:24:19 -07:00
Nicklas Hansen
c0d3faac77 Merge pull request #48 from vmoens/patch-1
Use torch.compiler.cudagraph_mark_step_begin() in eval
2024-10-27 14:22:51 -07:00
Vincent Moens
3b5f67592c Update offline_trainer.py 2024-10-26 00:33:27 +01:00
Vincent Moens
fad0d1be03 Use torch.compiler.cudagraph_mark_step_begin() in eval 2024-10-26 00:32:16 +01:00
Nicklas Hansen
836547d76f fix eval index + clean up 2024-10-21 14:49:21 -07:00
Nicklas Hansen
970792e2b6 clean up prints 2024-10-18 15:31:25 -07:00
Nicklas Hansen
c3a912e10d Merge pull request #46 from vmoens/cudagraphs
[WIP,Feature] Add cudagraphs and compile option
2024-10-18 09:35:33 -07:00
Nicklas Hansen
38eb46df04 Merge branch 'cudagraphs' of https://github.com/vmoens/tdmpc2 into cudagraphs 2024-10-17 14:57:32 -07:00
vmoens
8b731819a6 merged commits 2024-10-08 10:24:02 -07:00