[bug-fix] Fix save/restore critic, add test #5062


Merged

ervteng merged 3 commits into main from develop-fix-resume on Mar 10, 2021

Conversation

@ervteng (Contributor) commented Mar 9, 2021

Proposed change(s)

Add the critic to the optimizer's list of modules so that it is saved and restored properly. The bug was introduced in #4939 and doesn't affect older releases of ML-Agents.
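For reference, a minimal sketch of the shape of this fix, not the actual diff: it assumes the optimizer exposes the get_modules() mapping mentioned later in this thread and that the model saver checkpoints every entry it returns. The class name SketchOptimizer, the attribute names, and the dictionary keys are illustrative.

```python
import torch
from torch import nn

# Minimal sketch of the shape of the fix (illustrative names, not the PR's code):
# the optimizer must list its critic among the modules handed to the model saver,
# otherwise the critic's weights never reach the checkpoint and a resumed run
# starts with a freshly initialized critic.
class SketchOptimizer:
    def __init__(self, actor: nn.Module, critic: nn.Module):
        self.critic = critic
        self.optimizer = torch.optim.Adam(
            list(actor.parameters()) + list(critic.parameters()), lr=3e-4
        )

    def get_modules(self):
        # Everything returned here is saved and restored with the checkpoint.
        return {
            "Optimizer:adam": self.optimizer,  # Adam moments / step counts
            "Optimizer:critic": self.critic,   # the fix: include the critic network
        }
```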

Types of change(s)

  • Bug fix
  • New feature
  • Code refactor
  • Breaking change
  • Documentation update
  • Other (please describe)

Checklist

  • Added tests that prove my fix is effective or that my feature works
  • Updated the changelog (if applicable)
  • Updated the documentation (if applicable)
  • Updated the migration guide (if applicable)

Other comments

@ervteng requested review from dongruoping and andrewcoh on March 9, 2021 at 16:19
@dongruoping (Contributor) left a comment:

Is this able to run on GPU? In _compare_two_policies we move the actors explicitly and I just saw that we also added self.actor.to(default_device()) to TorchPolicy. I didn't see that in optimizers and I wonder if we need to do the same thing as well.

model_saver.save_checkpoint("MockBrain", 2000)

# create a new optimizer and policy
optimizer2 = OptimizerClass(policy, trainer_settings)
Contributor (inline comment on the snippet above):

Is it intentional that optimizer2 is using policy, not policy2?

Contributor Author (reply):

No, this should be policy2 - good catch.
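To make the thread easier to follow, here is a rough sketch of how the corrected round-trip test could look. It assumes a TorchModelSaver-style API (register / initialize_or_load / save_checkpoint), a policy factory helper, and a comparison function like _compare_two_policies; the import path, helper names, and argument lists are assumptions for illustration, not the exact test added in this PR.

```python
from mlagents.trainers.model_saver.torch_model_saver import TorchModelSaver  # assumed import path


def check_optimizer_save_restore(
    OptimizerClass, trainer_settings, create_policy_mock, compare_two_policies, tmp_path
):
    # Build a policy/optimizer pair, register both, and write a checkpoint.
    policy = create_policy_mock(trainer_settings)
    optimizer = OptimizerClass(policy, trainer_settings)

    saver = TorchModelSaver(trainer_settings, str(tmp_path))
    saver.register(policy)
    saver.register(optimizer)  # with this PR, the critic is in optimizer.get_modules()
    saver.initialize_or_load()
    saver.save_checkpoint("MockBrain", 2000)

    # Create a fresh policy and optimizer, then restore the checkpoint into them.
    policy2 = create_policy_mock(trainer_settings)
    optimizer2 = OptimizerClass(policy2, trainer_settings)  # fixed: policy2, not policy

    saver2 = TorchModelSaver(trainer_settings, str(tmp_path), load=True)
    saver2.register(policy2)
    saver2.register(optimizer2)
    saver2.initialize_or_load()

    # If the critic was saved and restored correctly, the restored policy (and the
    # values produced by the restored critic) should match the originals.
    compare_two_policies(policy, policy2)
```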

@ervteng (Contributor Author) commented Mar 9, 2021

> Is this able to run on GPU? In _compare_two_policies we move the actors explicitly and I just saw that we also added self.actor.to(default_device()) to TorchPolicy. I didn't see that in optimizers and I wonder if we need to do the same thing as well.

I think I need to add this too, but the different optimizers have different components. I'll try iterating through get_modules() and moving each of those to the default device.

Edit: Never mind, this should work because the optimizer moves itself to the default device on initialization. So if it fails, it means the optimizer itself is broken.
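For completeness, a sketch of the iteration idea described above, which per the edit turned out to be unnecessary: walk get_modules() and move each torch module to the default device. The import path for default_device and the helper name are assumptions.

```python
import torch
from mlagents.torch_utils import default_device  # assumed import path


# Sketch of the approach considered above (ultimately not needed, since the
# optimizer already moves itself to the default device on initialization):
# walk whatever the optimizer exposes via get_modules() and move each
# torch.nn.Module onto the default device.
def move_optimizer_modules(optimizer) -> None:
    for module in optimizer.get_modules().values():
        if isinstance(module, torch.nn.Module):
            module.to(default_device())
        # torch.optim.Optimizer instances track device through their parameters,
        # so they need no explicit move here.
```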

@ervteng merged commit 78cb833 into main on Mar 10, 2021.
The delete-merged-branch bot deleted the develop-fix-resume branch on March 10, 2021 at 15:09.
The github-actions bot locked this conversation as resolved and limited it to collaborators on Mar 10, 2022.