support amp training for detection models #4933

xiaohu2015 · 2021-11-14T14:43:53Z

This PR addresses #4509.
Since AMP is already supported in the classification training reference, I have modified some files to support AMP training for detection models as well.

cc @datumbox
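
For readers unfamiliar with the pattern, here is a minimal sketch of an AMP-aware training step. It is not the code from this PR: `train_step`, the toy `nn.Linear` model, and the MSE loss are stand-ins chosen for illustration (the detection references compute a dictionary of losses instead). It only assumes the standard `torch.cuda.amp.autocast` and `torch.cuda.amp.GradScaler` APIs, and it falls back to full precision when CUDA is unavailable.

```python
import torch
from torch import nn


def train_step(model, optimizer, inputs, targets, scaler=None):
    """Run one optimization step; use mixed precision only when a scaler is given.

    Hypothetical helper for illustration, not a function from the references.
    """
    optimizer.zero_grad()
    # With enabled=False the autocast context does nothing, so a single code
    # path serves both the AMP and the full-precision case.
    with torch.cuda.amp.autocast(enabled=scaler is not None):
        loss = nn.functional.mse_loss(model(inputs), targets)

    if scaler is not None:
        # Scale the loss to avoid fp16 gradient underflow, then let the scaler
        # unscale the gradients and step the optimizer.
        scaler.scale(loss).backward()
        scaler.step(optimizer)
        scaler.update()
    else:
        loss.backward()
        optimizer.step()
    return loss.item()


# Toy setup (assumption): AMP is enabled only when a CUDA device is present.
use_amp = torch.cuda.is_available()
device = torch.device("cuda" if use_amp else "cpu")
model = nn.Linear(4, 1).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
scaler = torch.cuda.amp.GradScaler() if use_amp else None

inputs = torch.randn(8, 4, device=device)
targets = torch.randn(8, 1, device=device)
loss_value = train_step(model, optimizer, inputs, targets, scaler)
```

Passing `scaler=None` keeps the default behaviour unchanged, which is why the scaler is created only when AMP is requested.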

facebook-github-bot · 2021-11-14T14:43:59Z

💊 CI failures summary and remediations

As of commit bfc9225 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚


datumbox

Thanks for the PR @xiaohu2015!

There are some related linter issues, but overall it looks good.

@prabhat00155 could you also have a look, since you did the classification one?

references/detection/engine.py

references/detection/train.py

Co-authored-by: Vasilis Vryniotis <[email protected]>

xiaohu2015 · 2021-11-15T11:18:02Z

@datumbox Thanks. I have tested the AMP code, and it works well.

datumbox · 2021-11-15T11:47:19Z

@xiaohu2015 There are a couple more linter issues (whitespace). Have a look at the CI job; at the end it will show you the errors. For your convenience, here are the changes you need to make to keep it happy:

diff --git a/references/detection/train.py b/references/detection/train.py
index 5c50dcfa..ae13a32b 100644
--- a/references/detection/train.py
+++ b/references/detection/train.py
@@ -143,7 +143,7 @@ def get_args_parser(add_help=True):
 
     # Prototype models only
     parser.add_argument("--weights", default=None, type=str, help="the weights enum name to load")
-    
+
     # Mixed precision training parameters
     parser.add_argument("--amp", action="store_true", help="Use torch.cuda.amp for mixed precision training")
 
@@ -211,9 +211,9 @@ def main(args):
 
     params = [p for p in model.parameters() if p.requires_grad]
     optimizer = torch.optim.SGD(params, lr=args.lr, momentum=args.momentum, weight_decay=args.weight_decay)
-    
+
     scaler = torch.cuda.amp.GradScaler() if args.amp else None
-    
+
     args.lr_scheduler = args.lr_scheduler.lower()
     if args.lr_scheduler == "multisteplr":
         lr_scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=args.lr_steps, gamma=args.lr_gamma)

Other than that, the changes look good to me. :)

I'll leave it to @prabhat00155 to do the final checks on our side and merge when ready.
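
As a side note on the diff above: `--amp` is a `store_true` switch, so mixed precision stays off unless the flag is passed explicitly. The following is a minimal, standard-library-only sketch of that behaviour. The reduced `get_args_parser` here is a stand-in that keeps only the one argument, not the full parser from `train.py`.

```python
import argparse


def get_args_parser(add_help=True):
    """Reduced stand-in for the reference parser; only the --amp flag is kept."""
    parser = argparse.ArgumentParser(description="AMP flag sketch", add_help=add_help)
    # store_true makes the option default to False and flip to True when given.
    parser.add_argument("--amp", action="store_true", help="Use torch.cuda.amp for mixed precision training")
    return parser


# Without the flag, AMP is disabled; with it, AMP is enabled.
args_default = get_args_parser().parse_args([])
args_amp = get_args_parser().parse_args(["--amp"])
print(args_default.amp, args_amp.amp)
```

In the training script this boolean decides whether a `GradScaler` is created or `scaler` is left as `None`.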

prabhat00155

Thanks @xiaohu2015!

github-actions · 2021-11-15T22:04:43Z

Hey @prabhat00155!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

* support amp training
* support amp training
* support amp training
* Update references/detection/train.py
  Co-authored-by: Vasilis Vryniotis <[email protected]>
* Update references/detection/engine.py
  Co-authored-by: Vasilis Vryniotis <[email protected]>
* fix lint issues
  Co-authored-by: Vasilis Vryniotis <[email protected]>

Summary:
* support amp training
* support amp training
* support amp training
* Update references/detection/train.py
* Update references/detection/engine.py
* fix lint issues

Reviewed By: datumbox
Differential Revision: D32470476
fbshipit-source-id: d0ef0c561b4eed2d0cf654741bd2d108ce65411e
Co-authored-by: Vasilis Vryniotis <[email protected]>
Co-authored-by: Vasilis Vryniotis <[email protected]>
Co-authored-by: Vasilis Vryniotis <[email protected]>

xiaohu2015 added 2 commits November 14, 2021 22:35

support amp training

906a7d4

support amp training

87859b3

pytorch-probot bot added the ciflow/default label Nov 14, 2021

facebook-github-bot added the cla signed label Nov 14, 2021

support amp training

610f0d7

datumbox reviewed Nov 15, 2021

references/detection/engine.py

references/detection/train.py

datumbox requested a review from prabhat00155 November 15, 2021 11:09

xiaohu2015 and others added 2 commits November 15, 2021 19:12

Update references/detection/train.py

3fbf9d4

Co-authored-by: Vasilis Vryniotis <[email protected]>

Update references/detection/engine.py

1d6b607

Co-authored-by: Vasilis Vryniotis <[email protected]>

Merge branch 'main' into main

4a9545a

fix lint issues

bfc9225

prabhat00155 approved these changes Nov 15, 2021

prabhat00155 merged commit 59ec1df into pytorch:main Nov 15, 2021

prabhat00155 added the "module: reference scripts" and "enhancement" labels Nov 15, 2021

datumbox mentioned this pull request Dec 1, 2021

Replace uses of apex.amp with PyTorch's amp in references #4509

Closed

4 tasks
