Skip to content
Merged
Show file tree
Hide file tree
Changes from 8 commits
Commits
Show all changes
20 commits
Select commit Hold shift + click to select a range
fec5537
examples/atari -> examples/atari/reproduction
keisuke-nakata Jun 19, 2019
0958798
examples/ale -> examples/atari
keisuke-nakata Jun 19, 2019
e50dc43
examples/mujoco -> examples/mujoco/reproduction
keisuke-nakata Jun 19, 2019
6cd7fb8
examples_tests/atari -> examples_tests/atari/reproduction
keisuke-nakata Jun 19, 2019
b0dc93e
examples_tests/ale -> examples_tests/atari
keisuke-nakata Jun 19, 2019
cd7180b
examples_test/mujoco -> exampes_tests/mujoco/reproduction
keisuke-nakata Jun 19, 2019
542fda6
fix directory names in examples/README.md
keisuke-nakata Jun 19, 2019
a9485a7
fix directory names in examples_tests scripts
keisuke-nakata Jun 19, 2019
4c84abe
Merge commit 'd8dd47dd2a077df9af6b5a320a6edb24307a696e' into rename-e…
keisuke-nakata Jun 25, 2019
019d6ab
Merge commit '861181a4a417469a55ee3bf9e3c4a1937d50e947' into rename-e…
keisuke-nakata Jun 28, 2019
935ed86
mv mujoco tasks formerly in `gym` to `mujoco`
keisuke-nakata Jun 28, 2019
626b33e
update readme
keisuke-nakata Jun 28, 2019
0fe53e4
mv test scripts
keisuke-nakata Jun 28, 2019
c880c2f
change directory for tests
keisuke-nakata Jun 28, 2019
156990e
Update examples_tests/mujoco/test_ppo_batch.sh
keisuke-nakata Jul 1, 2019
de52f96
Update examples_tests/mujoco/test_ddpg_batch.sh
keisuke-nakata Jul 1, 2019
b28ea54
Update examples/mujoco/README.md
keisuke-nakata Jul 1, 2019
64cfd08
Update examples/mujoco/README.md
keisuke-nakata Jul 1, 2019
0671f2a
Update examples/mujoco/README.md
keisuke-nakata Jul 1, 2019
16caced
Update examples_tests/atari/test_dqn_batch.sh
keisuke-nakata Jul 1, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions examples/README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
# Examples

- `ale`: examples for general Atari games
- `atari`: examples with benchmark scores for reproducing published results on Atari
- `atari`: examples for general Atari games
- `atari/reproduction`: examples with benchmark scores for reproducing published results on Atari
- `gym`: examples for OpenAI Gym environments
- `grasping`: examples for a Bullet-based robotic grasping environment
- `mujoco`: examples with benchmark scores for reproducing published results on MuJoCo tasks
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should mujoco/reproduction be in gym/reproduction? Or should we move gym tasks that use mujoco into mujoco @muupan ?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That is, should we have a mujoco and mujoco/reproduction directory, where mujoco tasks formerly in gym move to mujoco?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @prabhatnagarajan @muupan , I'll fix this.

- `mujoco/reproduction`: examples with benchmark scores for reproducing published results on MuJoCo tasks
- `quickstart`: a quickstart guide of ChainerRL
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
12 changes: 0 additions & 12 deletions examples_tests/ale/test_a2c.sh

This file was deleted.

14 changes: 0 additions & 14 deletions examples_tests/ale/test_a3c.sh

This file was deleted.

14 changes: 0 additions & 14 deletions examples_tests/ale/test_acer.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/ale/test_categorical_dqn.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/ale/test_dqn.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/ale/test_dqn_batch.sh

This file was deleted.

14 changes: 0 additions & 14 deletions examples_tests/ale/test_nsq.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/ale/test_ppo.sh

This file was deleted.

12 changes: 12 additions & 0 deletions examples_tests/atari/reproduction/test_dqn.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/reproduction/dqn
python examples/atari/reproduction/dqn/train_dqn.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/reproduction/dqn --eval-n-steps 200 --eval-interval 50 --n-best-episodes 1 --gpu $gpu
model=$(find $outdir/atari/reproduction/dqn -name "*_finish")
python examples/atari/reproduction/dqn/train_dqn.py --env PongNoFrameskip-v4 --demo --load $model --outdir $outdir/temp --eval-n-steps 200 --gpu $gpu
12 changes: 12 additions & 0 deletions examples_tests/atari/reproduction/test_iqn.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/reproduction/iqn
python examples/atari/reproduction/iqn/train_iqn.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/reproduction/iqn --eval-n-steps 200 --eval-interval 50 --n-best-episodes 1 --gpu $gpu
model=$(find $outdir/atari/reproduction/iqn -name "*_finish")
python examples/atari/reproduction/iqn/train_iqn.py --env PongNoFrameskip-v4 --demo --load $model --outdir $outdir/temp --eval-n-steps 200 --gpu $gpu
13 changes: 13 additions & 0 deletions examples_tests/atari/reproduction/test_rainbow.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)
echo "outdir: $outdir"

gpu="$1"

# atari/reproduction/rainbow
python examples/atari/reproduction/rainbow/train_rainbow.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/reproduction/rainbow --eval-n-steps 200 --eval-interval 50 --n-best-episodes 1 --gpu $gpu
model=$(find $outdir/atari/reproduction/rainbow -name "*_finish")
python examples/atari/reproduction/rainbow/train_rainbow.py --env PongNoFrameskip-v4 --demo --load $model --outdir $outdir/temp --eval-n-steps 200 --gpu $gpu
12 changes: 12 additions & 0 deletions examples_tests/atari/test_a2c.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/a2c
python examples/atari/train_a2c_ale.py --env PongNoFrameskip-v4 --steps 100 --update-steps 50 --outdir $outdir/atari/a2c
model=$(find $outdir/atari/a2c -name "*_finish")
python examples/atari/train_a2c_ale.py --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1
14 changes: 14 additions & 0 deletions examples_tests/atari/test_a3c.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/a3c (only for cpu)
if [[ $gpu -lt 0 ]]; then
python examples/atari/train_a3c_ale.py 4 --env PongNoFrameskip-v4 --steps 100 --outdir $outdir/atari/a3c
model=$(find $outdir/atari/a3c -name "*_finish")
python examples/atari/train_a3c_ale.py 4 --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp
fi
14 changes: 14 additions & 0 deletions examples_tests/atari/test_acer.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/acer (only for cpu)
if [[ $gpu -lt 0 ]]; then
python examples/atari/train_acer_ale.py 4 --env PongNoFrameskip-v4 --steps 100 --outdir $outdir/atari/acer
model=$(find $outdir/atari/acer -name "*_finish")
python examples/atari/train_acer_ale.py 4 --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp
fi
12 changes: 12 additions & 0 deletions examples_tests/atari/test_categorical_dqn.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/categorical_dqn
python examples/atari/train_categorical_dqn_ale.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/categorical_dqn --gpu $gpu
model=$(find $outdir/atari/categorical_dqn -name "*_finish")
python examples/atari/train_categorical_dqn_ale.py --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
4 changes: 2 additions & 2 deletions examples_tests/atari/test_dqn.sh
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,6 @@ outdir=$(mktemp -d)
gpu="$1"

# atari/dqn
python examples/atari/dqn/train_dqn.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/dqn --eval-n-steps 200 --eval-interval 50 --n-best-episodes 1 --gpu $gpu
python examples/atari/train_dqn_ale.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/dqn --gpu $gpu
model=$(find $outdir/atari/dqn -name "*_finish")
python examples/atari/dqn/train_dqn.py --env PongNoFrameskip-v4 --demo --load $model --outdir $outdir/temp --eval-n-steps 200 --gpu $gpu
python examples/atari/train_dqn_ale.py --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
12 changes: 12 additions & 0 deletions examples_tests/atari/test_dqn_batch.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/dqn batch
python examples/atari/train_dqn_batch_ale.py --env PongNoFrameskip-v4 --steps 100 --replay-start-size 50 --outdir $outdir/atari/dqn_batch --gpu $gpu
model=$(find $outdir/atari/dqn_batch -name "*_finish")
python examples/atari/train_dqn_batch_ale.py --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
12 changes: 0 additions & 12 deletions examples_tests/atari/test_iqn.sh

This file was deleted.

14 changes: 14 additions & 0 deletions examples_tests/atari/test_nsq.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/nsq (only for cpu)
if [[ $gpu -lt 0 ]]; then
python examples/atari/train_nsq_ale.py 4 --env PongNoFrameskip-v4 --steps 100 --outdir $outdir/atari/nsq
model=$(find $outdir/atari/nsq -name "*_finish")
python examples/atari/train_nsq_ale.py 4 --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp
fi
12 changes: 12 additions & 0 deletions examples_tests/atari/test_ppo.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# atari/ppo
python examples/atari/train_ppo_ale.py --env PongNoFrameskip-v4 --steps 100 --update-interval 50 --batchsize 16 --epochs 2 --outdir $outdir/atari/ppo --gpu $gpu
model=$(find $outdir/atari/ppo -name "*_finish")
python examples/atari/train_ppo_ale.py --env PongNoFrameskip-v4 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
13 changes: 0 additions & 13 deletions examples_tests/atari/test_rainbow.sh

This file was deleted.

12 changes: 12 additions & 0 deletions examples_tests/mujoco/reproduction/test_ddpg.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# mujoco/reproduction/ddpg (specify non-mujoco env to test without mujoco)
python examples/mujoco/reproduction/ddpg/train_ddpg.py --env Pendulum-v0 --gpu $gpu --steps 10 --replay-start-size 5 --batch-size 5 --outdir $outdir/mujoco/reproduction/ddpg
model=$(find $outdir/mujoco/reproduction/ddpg -name "*_finish")
python examples/mujoco/reproduction/ddpg/train_ddpg.py --env Pendulum-v0 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
12 changes: 12 additions & 0 deletions examples_tests/mujoco/reproduction/test_ppo.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# mujoco/reproduction/ppo (specify non-mujoco env to test without mujoco)
python examples/mujoco/reproduction/ppo/train_ppo.py --steps 10 --update-interval 5 --batch-size 5 --epochs 2 --outdir $outdir/mujoco/reproduction/ppo --env Pendulum-v0 --gpu $gpu
model=$(find $outdir/mujoco/reproduction/ppo -name "*_finish")
python examples/mujoco/reproduction/ppo/train_ppo.py --demo --load $model --eval-n-runs 1 --env Pendulum-v0 --outdir $outdir/temp --gpu $gpu
12 changes: 12 additions & 0 deletions examples_tests/mujoco/reproduction/test_td3.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#!/bin/bash

set -Ceu

outdir=$(mktemp -d)

gpu="$1"

# mujoco/reproduction/td3 (specify non-mujoco env to test without mujoco)
python examples/mujoco/reproduction/td3/train_td3.py --env Pendulum-v0 --gpu $gpu --steps 10 --replay-start-size 5 --batch-size 5 --outdir $outdir/mujoco/reproduction/td3
model=$(find $outdir/mujoco/reproduction/td3 -name "*_finish")
python examples/mujoco/reproduction/td3/train_td3.py --env Pendulum-v0 --demo --load $model --eval-n-runs 1 --outdir $outdir/temp --gpu $gpu
12 changes: 0 additions & 12 deletions examples_tests/mujoco/test_ddpg.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/mujoco/test_ppo.sh

This file was deleted.

12 changes: 0 additions & 12 deletions examples_tests/mujoco/test_td3.sh

This file was deleted.