1 | 1 | # CleanRL (Clean Implementation of RL Algorithms) |
2 | 2 |
3 | | -<img src=" |
4 | | -https://img.shields.io/github/license/vwxyzjn/cleanrl"> |
| 3 | + |
| 4 | +[<img src="https://img.shields.io/badge/license-MIT-blue">](https://github.com/vwxyzjn/cleanrl) |
5 | 5 | [](https://github.com/vwxyzjn/cleanrl/actions/workflows/tests.yaml) |
6 | 6 | [](https://github.com/vwxyzjn/cleanrl/actions/workflows/docs.yaml) |
7 | 7 | [<img src="https://img.shields.io/discord/767863440248143916?label=discord">](https://discord.gg/D6RCjA6sVT) |
8 | | -[<img src="https://badge.fury.io/py/cleanrl.svg">]( |
9 | | -https://pypi.org/project/cleanrl/) |
10 | 8 | [<img src="https://img.shields.io/youtube/channel/views/UCDdC6BIFRI0jvcwuhi3aI6w?style=social">](https://www.youtube.com/channel/UCDdC6BIFRI0jvcwuhi3aI6w/videos) |
11 | 9 | [](https://github.com/psf/black) |
12 | 10 | [](https://pycqa.github.io/isort/) |
@@ -78,7 +76,6 @@ poetry install -E atari |
78 | 76 | python cleanrl/dqn_atari.py --env-id BreakoutNoFrameskip-v4 |
79 | 77 | python cleanrl/c51_atari.py --env-id BreakoutNoFrameskip-v4 |
80 | 78 | python cleanrl/ppo_atari.py --env-id BreakoutNoFrameskip-v4 |
81 | | -python cleanrl/apex_dqn_atari.py --env-id BreakoutNoFrameskip-v4 |
82 | 79 |
83 | 80 | # NEW: 3-4x side-effects free speed up with envpool's atari (only available to linux) |
84 | 81 | poetry install -E envpool |
@@ -120,7 +117,6 @@ You may also use a prebuilt development environment hosted in Gitpod: |
120 | 117 | | | [`dqn_atari.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn_atari.py) | |
121 | 118 | | ✅ [Categorical DQN (C51)](https://arxiv.org/pdf/1707.06887.pdf) | [`c51.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/c51.py) | |
122 | 119 | | | [`c51_atari.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/c51_atari.py) | |
123 | | -| ✅ [Apex Deep Q-Learning (Apex-DQN)](https://arxiv.org/pdf/1803.00933.pdf) | [`apex_dqn_atari.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/apex_dqn_atari.py) | |
124 | 120 | | ✅ [Soft Actor-Critic (SAC)](https://arxiv.org/pdf/1812.05905.pdf) | [`sac_continuous_action.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/sac_continuous_action.py) | |
125 | 121 | | ✅ [Deep Deterministic Policy Gradient (DDPG)](https://arxiv.org/pdf/1509.02971.pdf) | [`ddpg_continuous_action.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ddpg_continuous_action.py) | |
126 | 122 | | ✅ [Twin Delayed Deep Deterministic Policy Gradient (TD3)](https://arxiv.org/pdf/1802.09477.pdf) | [`td3_continuous_action.py`](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/td3_continuous_action.py) | |