
Improve StateActionQFunctions#172

Merged
toslunar merged 11 commits into chainer:master from muupan:improve-saqf
Nov 16, 2017
Conversation

@muupan
Member

@muupan muupan commented Nov 15, 2017

Please merge this after merging #171

  • Add nonlinearity and last_wscale arguments to StateActionQFunctions
  • Improve docstrings
  • Add tests for StateActionQFunctions. They do not actually check that each configuration is applied, but having them is better than having no tests.
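As a rough illustration of what the two new arguments do, here is a plain-Python sketch of a fully connected state-action Q-function (an assumption-laden stand-in, not the actual ChainerRL implementation, which builds on chainer.Chain; the class and parameter names mirror the PR but the internals here are illustrative):

```python
import math
import random

def relu(x):
    return [max(0.0, v) for v in x]

class FCSAQFunction:
    """Illustrative sketch of a fully connected (obs, action) -> Q network.

    `nonlinearity` is any parameter-free callable applied after each hidden
    layer; `last_wscale` scales the weight initialization of the last layer
    so initial Q estimates can be kept near zero.  NOT the real ChainerRL
    class, which is a chainer.Chain.
    """

    def __init__(self, n_dim_obs, n_dim_action, n_hidden_channels,
                 n_hidden_layers, nonlinearity=relu, last_wscale=1.0):
        self.nonlinearity = nonlinearity
        sizes = ([n_dim_obs + n_dim_action]
                 + [n_hidden_channels] * n_hidden_layers + [1])
        self.layers = []
        for i, (n_in, n_out) in enumerate(zip(sizes, sizes[1:])):
            # Only the final layer's init is scaled by last_wscale.
            scale = last_wscale if i == len(sizes) - 2 else 1.0
            w = [[random.gauss(0.0, scale / math.sqrt(n_in))
                  for _ in range(n_in)] for _ in range(n_out)]
            b = [0.0] * n_out
            self.layers.append((w, b))

    def __call__(self, obs, action):
        # Observation and action are concatenated at the input.
        h = list(obs) + list(action)
        for i, (w, b) in enumerate(self.layers):
            h = [sum(wi * hi for wi, hi in zip(row, h)) + bi
                 for row, bi in zip(w, b)]
            if i < len(self.layers) - 1:
                h = self.nonlinearity(h)
        return h[0]
```

With `last_wscale=0.0` the final layer starts at zero weights, so the initial Q output is exactly 0.0 regardless of the input.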

@muupan muupan changed the title Improve StateActionQFunctions [WIP] Improve StateActionQFunctions Nov 15, 2017
@muupan muupan changed the title [WIP] Improve StateActionQFunctions Improve StateActionQFunctions Nov 15, 2017
Member

@toslunar toslunar left a comment

Thanks. I left a few minor comments.

Nonlinearities with learnable parameters such as PReLU are not
supported.
last_wscale (float): Scale of weight initialization of the last layer.

Member


This empty line seems unnecessary.

self.nonlinearity = nonlinearity

super().__init__()
with self.init_scope():
Member


Could you add a comment that nonlinearity does not need to be passed to MLPBN because hidden_sizes is empty?

*testing.product({
'n_dim_obs': [1, 5],
'n_dim_action': [1, 3],
'n_hidden_layers': [0, 1, 2],
Member


'n_hidden_layers': [1, 2],

Member Author


n_hidden_layers=0 works except for LateAction architectures. I'll remove 0 for them.
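For context on the parameter grid quoted above: chainer's testing.product expands a dict of value lists into one configuration per combination. A minimal stdlib sketch of that behavior (assuming the real helper is an equivalent cross product):

```python
import itertools

def product(parameter):
    """Stand-in for chainer.testing.product (assumption: the real helper
    is a plain cross product): expand a dict mapping parameter names to
    lists of values into one dict per combination."""
    keys = sorted(parameter)
    return [dict(zip(keys, values))
            for values in itertools.product(*(parameter[k] for k in keys))]

# The grid from the quoted test expands to 2 * 2 * 3 = 12 configurations.
cases = product({
    'n_dim_obs': [1, 5],
    'n_dim_action': [1, 3],
    'n_hidden_layers': [0, 1, 2],
})
```

Dropping 0 from 'n_hidden_layers' for the late-action classes simply shrinks this grid to 8 configurations for them.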

@muupan
Member Author

muupan commented Nov 16, 2017

Thank you for your review! I fixed the points you mentioned.

@toslunar toslunar merged commit 5aa8d6e into chainer:master Nov 16, 2017
@muupan muupan added this to the v0.3 milestone Nov 30, 2017
