Skip to content

Commit a599e11

Browse files
authored
Fix 3DBall PPO hard regression (#4133)
1 parent 78bb8c7 commit a599e11

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

config/ppo/3DBallHard.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ behaviors:
22
3DBallHard:
33
trainer_type: ppo
44
hyperparameters:
5-
batch_size: 1200
5+
batch_size: 120
66
buffer_size: 12000
77
learning_rate: 0.0003
88
beta: 0.001
@@ -17,10 +17,10 @@ behaviors:
1717
vis_encode_type: simple
1818
reward_signals:
1919
extrinsic:
20-
gamma: 0.995
20+
gamma: 0.99
2121
strength: 1.0
2222
keep_checkpoints: 5
23-
max_steps: 5000000
23+
max_steps: 500000
2424
time_horizon: 1000
2525
summary_freq: 12000
2626
threaded: true

0 commit comments

Comments
 (0)