Commit 42d21bd
PPO + JAX + EnvPool + Atari (#227)
* PPO + jax + envpool + atari
* fix bug: only report metric when lives are used up
* pre-commit
* quick fix
* Quick refactor
* push changes
* pre-commit and use EnvPool's new API
* update envpool
* update docs
* update ppo benchmark script
* update docs
* use the latest envpool interface
* update envpool to the latest version
* update pyproject.toml
* update lock files
* Quick clarification
* Update docs
* remove non benchmarked script
* update docs
* revert poetry changes
* docs fix
* remove unnecessary code, add docs
* add a note on envpool
* update test cases
* explain `get_action_and_value`
* fix indent
* Fix weird error with `np.mean`. See below:
We got this message. See #227 (comment)
```
NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected.
```
* update docs
* pre-commit
* add note on `charts/avg_episodic_return`
* update reproducibility script
* add note on value function clipping

1 parent c20c799 commit 42d21bd
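
The `NotImplementedError` quoted in the commit message is the kind of failure raised when a logger that expects NumPy input receives a device-backed array instead. A minimal sketch of one way around it, assuming the fix is to move the value to the host and reduce it to a plain Python float before logging. `FakeDeviceArray` and `to_host_scalar` are hypothetical names used for illustration only and are not taken from this commit; in real code the input would be a JAX array, which `np.asarray` can also convert.

```python
import numpy as np


class FakeDeviceArray:
    """Hypothetical stand-in for a device-backed array (not a real JAX type)."""

    def __init__(self, values):
        self._values = list(values)

    def __array__(self, dtype=None, copy=None):
        # NumPy calls this hook when asked to convert the object to an ndarray.
        return np.array(self._values, dtype=dtype)


def to_host_scalar(value):
    """Convert an array-like to a host ndarray and return its mean as a float."""
    host_array = np.asarray(value, dtype=np.float64)
    return float(host_array.mean())


# The result is a built-in float, which scalar loggers generally accept.
avg_return = to_host_scalar(FakeDeviceArray([1.0, 2.0, 3.0]))
print(type(avg_return).__name__, avg_return)
```

Doing the conversion in one helper keeps the logging call sites free of array-type checks; whether the commit itself takes this route is not stated in the message.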
File tree
22 files changed, +157414 -32 lines changed
- .github/workflows
- benchmark
- cleanrl
- docs/rl-algorithms
- ppo/ppo_atari_envpool_xla_jax
- tests