Conversation
|
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
Owner
Author
Owner
Author
|
Preliminary docs are available at https://cleanrl-git-jax-ppo-envpool-atari-vwxyzjn.vercel.app/rl-algorithms/ppo/#ppo_atari_envpool_xla_jaxpy. |
We got this message. See #227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ```
Owner
Author
|
Hi @yooceii @kinalmehta, I have addressed most of your concerns. Please let me know if additional tweaks are needed. |
kinalmehta
approved these changes
Oct 6, 2022
Collaborator
kinalmehta
left a comment
There was a problem hiding this comment.
All the concerns seem to be addressed.
LGTM
Great Work!!
ludgerpaehler
pushed a commit
to ludgerpaehler/koopman-rl
that referenced
this pull request
Jan 13, 2026
* PPO + jax + envpool + atari * fix bug: only report metric when lifes are used up * pre-commit * quick fix * Quick refactor * push changes * pre-commit and use EnvPool's new API * update envpool * update docs * update ppo benchmark script * update docs * use the latest envpool interface * update envpool to the latest version * update pyproject.toml * update lock files * Quick clarification * Update docs * remove non benchmarked script * update docs * revert poetry changes * docs fix * remove uncessary code, add docs * add a note one envpool * update test cases * explain `get_action_and_value` * fix indent * Fix weird error with `np.mean`. See below: We got this message. See vwxyzjn#227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ``` * update docs * pre-commit * add note on `charts/avg_episodic_return` * update reproducibility script * add note on value function clipping
softwarecore1995
added a commit
to softwarecore1995/clean-rl
that referenced
this pull request
Feb 22, 2026
* PPO + jax + envpool + atari * fix bug: only report metric when lifes are used up * pre-commit * quick fix * Quick refactor * push changes * pre-commit and use EnvPool's new API * update envpool * update docs * update ppo benchmark script * update docs * use the latest envpool interface * update envpool to the latest version * update pyproject.toml * update lock files * Quick clarification * Update docs * remove non benchmarked script * update docs * revert poetry changes * docs fix * remove uncessary code, add docs * add a note one envpool * update test cases * explain `get_action_and_value` * fix indent * Fix weird error with `np.mean`. See below: We got this message. See vwxyzjn/cleanrl#227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ``` * update docs * pre-commit * add note on `charts/avg_episodic_return` * update reproducibility script * add note on value function clipping
softwarecore1995
added a commit
to softwarecore1995/clean-rl
that referenced
this pull request
Feb 22, 2026
We got this message. See vwxyzjn/cleanrl#227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ```
arjunmahesh1
pushed a commit
to arjunmahesh1/cleanrl
that referenced
this pull request
Mar 6, 2026
* PPO + jax + envpool + atari * fix bug: only report metric when lifes are used up * pre-commit * quick fix * Quick refactor * push changes * pre-commit and use EnvPool's new API * update envpool * update docs * update ppo benchmark script * update docs * use the latest envpool interface * update envpool to the latest version * update pyproject.toml * update lock files * Quick clarification * Update docs * remove non benchmarked script * update docs * revert poetry changes * docs fix * remove uncessary code, add docs * add a note one envpool * update test cases * explain `get_action_and_value` * fix indent * Fix weird error with `np.mean`. See below: We got this message. See vwxyzjn/cleanrl#227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ``` * update docs * pre-commit * add note on `charts/avg_episodic_return` * update reproducibility script * add note on value function clipping
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

Description
Types of changes
Checklist:
pre-commit run --all-filespasses (required).mkdocs serve.If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.
--capture-videoflag toggled on (required).mkdocs serve.width=500andheight=300).