Sue Hyun Park
|
8bbc14ebab
|
Fix: handle _action_masks buffer in single-task scenarios (#67)
|
2025-05-19 20:12:12 -07:00 |
|
Nicklas Hansen
|
7992fa193e
|
update readme + dockerfile
|
2025-05-13 14:51:16 -07:00 |
|
Nicklas Hansen
|
7ec6bc83a8
|
only instantiate termination pred head if episodic=true
|
2025-05-02 16:51:24 -07:00 |
|
Nicklas Hansen
|
38b31a5d72
|
only instantiate termination pred head if episodic=true
|
2025-05-02 16:24:02 -07:00 |
|
Nicklas Hansen
|
7942e9082b
|
update readme + clean up
|
2025-04-15 16:32:15 -07:00 |
|
Nicklas Hansen
|
eece80123d
|
full support for episodic rl
|
2025-04-15 15:55:05 -07:00 |
|
Nicklas Hansen
|
38f853efc4
|
clean up
|
2025-04-15 10:16:02 -07:00 |
|
Nicklas Hansen
|
62be41ab58
|
experimental changes to termination prediction
|
2025-04-10 00:32:13 -07:00 |
|
Nicklas Hansen
|
c95b755655
|
add walker2d
|
2025-04-09 15:55:57 -07:00 |
|
Nicklas Hansen
|
81eb17068e
|
QoL improvements to termination signal debugging
|
2025-04-08 19:15:31 -07:00 |
|
Nicklas Hansen
|
add30b5a74
|
merge main into branch + fix termination in td-targets
|
2025-04-08 12:40:10 -07:00 |
|
Nicklas Hansen
|
0a914570dc
|
fix multitask model api conversion
|
2025-02-27 16:25:21 -08:00 |
|
Nicklas Hansen
|
55bde9745f
|
update torchrl version
|
2025-02-05 16:46:34 -08:00 |
|
Nicklas Hansen
|
5ced6dfeb4
|
auto-convert old checkpoints to new format
|
2025-02-05 16:26:19 -08:00 |
|
Nicklas Hansen
|
dddc226d25
|
partial fix to loading checkpoints
|
2025-01-21 00:10:53 -08:00 |
|
Vincent Moens
|
ae4238946f
|
Conversion tools for state-dicts (#55)
* init
* init
* amend
|
2025-01-20 15:49:36 -08:00 |
|
Nicklas Hansen
|
a19f91c0b5
|
enable torch.compile for offline rl + rgb inputs
|
2024-12-25 12:22:39 -08:00 |
|
Nicklas Hansen
|
e452ca7539
|
factor pi outputs
|
2024-12-25 12:08:07 -08:00 |
|
Nicklas Hansen
|
db1865334e
|
refactor pi outputs
|
2024-12-25 12:02:33 -08:00 |
|
Nicklas Hansen
|
804f9b3949
|
refactor pi outputs
|
2024-12-24 03:05:00 -08:00 |
|
Nicklas Hansen
|
66f8c21f58
|
cache buffer values in offline training
|
2024-12-19 09:40:04 -08:00 |
|
Nicklas Hansen
|
9cac7c5775
|
faster offline data loading
|
2024-12-19 06:52:31 -08:00 |
|
Nicklas Hansen
|
df8a465c8e
|
update offline trainer to use new torch.load api
|
2024-12-10 16:30:05 -08:00 |
|
Nicklas Hansen
|
2e27fbb6f4
|
partial fix for loading old checkpoints
|
2024-12-10 16:04:27 -08:00 |
|
Nicklas Hansen
|
6117bc427d
|
simplify dmcontrol wrappers + upgrade to gymnasium==0.29.1
|
2024-12-10 15:16:34 -08:00 |
|
Nicklas Hansen
|
32fc2bdf93
|
refactor policy
|
2024-12-03 12:22:02 -08:00 |
|
Nicklas Hansen
|
0a79c8bd38
|
update Dockerfile + environment.yaml
|
2024-11-28 11:35:15 -08:00 |
|
Nicklas Hansen
|
1bfbcb7794
|
Merge pull request #49 from nicklashansen/speedups
Speedups
|
2024-11-10 12:33:32 -08:00 |
|
Nicklas Hansen
|
0c3fcc4619
|
update readme
|
2024-11-10 12:32:57 -08:00 |
|
Nicklas Hansen
|
fb07cdac3f
|
update compile print
|
2024-11-10 12:27:24 -08:00 |
|
Nicklas Hansen
|
c694d286f0
|
add assertion for compile=true compatibility
|
2024-11-10 12:25:43 -08:00 |
|
Nicklas Hansen
|
1a77207646
|
support newest version of myosuite
|
2024-11-04 15:15:40 -08:00 |
|
Nicklas Hansen
|
b7725e74a5
|
move cfg conversion to parser.py
|
2024-10-31 14:52:59 -07:00 |
|
Nicklas Hansen
|
d477619f8d
|
Merge branch 'speedups' of github.com:nicklashansen/tdmpc2 into speedups
|
2024-10-27 14:24:39 -07:00 |
|
Nicklas Hansen
|
c1dd0c0338
|
minor QoL improvements in offline pipeline
|
2024-10-27 14:24:19 -07:00 |
|
Nicklas Hansen
|
c0d3faac77
|
Merge pull request #48 from vmoens/patch-1
Use torch.compiler.cudagraph_mark_step_begin() in eval
|
2024-10-27 14:22:51 -07:00 |
|
Vincent Moens
|
3b5f67592c
|
Update offline_trainer.py
|
2024-10-26 00:33:27 +01:00 |
|
Vincent Moens
|
fad0d1be03
|
Use torch.compiler.cudagraph_mark_step_begin() in eval
|
2024-10-26 00:32:16 +01:00 |
|
Nicklas Hansen
|
836547d76f
|
fix eval index + clean up
|
2024-10-21 14:49:21 -07:00 |
|
Nicklas Hansen
|
970792e2b6
|
clean up prints
|
2024-10-18 15:31:25 -07:00 |
|
Nicklas Hansen
|
c3a912e10d
|
Merge pull request #46 from vmoens/cudagraphs
[WIP,Feature] Add cudagraphs and compile option
|
2024-10-18 09:35:33 -07:00 |
|
Nicklas Hansen
|
38eb46df04
|
Merge branch 'cudagraphs' of https://github.com/vmoens/tdmpc2 into cudagraphs
|
2024-10-17 14:57:32 -07:00 |
|
vmoens
|
8b731819a6
|
merged commits
|
2024-10-08 10:24:02 -07:00 |
|
Nicklas Hansen
|
a7890b6985
|
add more results
|
2024-10-06 21:11:38 -07:00 |
|
vmoens
|
603b67ce66
|
merged commits
|
2024-10-03 16:09:48 +01:00 |
|
Nicklas Hansen
|
88095e7899
|
add more results
|
2024-08-30 11:28:02 -07:00 |
|
Nicklas Hansen
|
3789fcd5b8
|
update pinned torchrl version
|
2024-07-02 10:12:06 -07:00 |
|
Nicklas Hansen
|
d51feb0e9f
|
Update README.md
|
2024-07-02 10:12:06 -07:00 |
|
Nicklas Hansen
|
2dc668ecaf
|
reduce # wandb calls
|
2024-07-02 10:12:06 -07:00 |
|
Nicklas Hansen
|
39be86fc52
|
update dockerfile
|
2024-07-02 10:12:06 -07:00 |
|