
Merge different gpu backends with accelerator='gpu' #13642


Merged
justusschock merged 38 commits into master from merge_different_gpus on Jul 25, 2022

Conversation

@justusschock (Member) commented on Jul 13, 2022

What does this PR do?

Setting accelerator='gpu' no longer selects the GPUAccelerator directly (see #13636); instead, it dynamically chooses an available GPU backend (currently cuda or mps).

Fixes #13102
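
For illustration, a minimal sketch of the selection behavior this describes; the resolve_gpu_backend helper below is hypothetical and not Lightning's internal API:

```python
import torch
from pytorch_lightning import Trainer

# Hypothetical helper mirroring the dynamic selection described above:
# accelerator="gpu" now resolves to whichever GPU backend is available.
def resolve_gpu_backend() -> str:
    if torch.cuda.is_available():
        return "cuda"
    if torch.backends.mps.is_available():
        return "mps"
    raise RuntimeError("accelerator='gpu' requested, but neither CUDA nor MPS is available")

# On a given machine, both calls end up on the same backend:
trainer = Trainer(accelerator="gpu", devices=1)                   # dynamic choice
trainer = Trainer(accelerator=resolve_gpu_backend(), devices=1)   # explicit equivalent
```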

Does your PR introduce any breaking changes? If yes, please list them.

Yes. Together with #13636, this changes accelerator='gpu' to dynamically select among the available GPU backends.

Before submitting

  • Was this discussed/approved via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you list all the breaking changes introduced by this pull request?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or minor internal changes/refactors)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:

  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

cc @Borda @justusschock @kaushikb11 @awaelchli @akihironitta @rohitgr7

@justusschock justusschock changed the base branch from master to reroute_gpu_accelerator July 19, 2022 09:15
@rohitgr7 rohitgr7 force-pushed the reroute_gpu_accelerator branch from 8124087 to 9f19717 Compare July 19, 2022 10:20
@justusschock justusschock force-pushed the reroute_gpu_accelerator branch from 9f19717 to d630a2c Compare July 19, 2022 11:34
@rohitgr7 rohitgr7 force-pushed the reroute_gpu_accelerator branch from d630a2c to 3247b1f Compare July 19, 2022 11:45
@justusschock justusschock force-pushed the merge_different_gpus branch 2 times, most recently from d5d1b7a to d630a2c Compare July 19, 2022 12:51
Base automatically changed from reroute_gpu_accelerator to master July 19, 2022 17:06
@github-actions github-actions bot added the pl (Generic label for PyTorch Lightning package) label Jul 20, 2022
@carmocca carmocca added this to the pl:1.7 milestone Jul 20, 2022
@justusschock justusschock marked this pull request as ready for review July 21, 2022 17:34
@github-actions github-actions bot removed the pl (Generic label for PyTorch Lightning package) label Jul 21, 2022
@github-actions github-actions bot added the pl (Generic label for PyTorch Lightning package) label Jul 21, 2022
@mergify mergify bot added the ready (PRs ready to be merged) label Jul 21, 2022
codecov bot commented on Jul 22, 2022

Codecov Report

Merging #13642 (a0c76b9) into master (8d14554) will increase coverage by 27%.
The diff coverage is 100%.

❗ Current head a0c76b9 differs from pull request most recent head 92de866. Consider uploading reports for the commit 92de866 to get more accurate results

@@            Coverage Diff            @@
##           master   #13642     +/-   ##
=========================================
+ Coverage      49%      76%    +27%     
=========================================
  Files         328      327      -1     
  Lines       25552    25679    +127     
=========================================
+ Hits        12470    19538   +7068     
+ Misses      13082     6141   -6941     

@mergify mergify bot added the has conflicts label and removed the ready (PRs ready to be merged) label Jul 22, 2022
@mergify mergify bot added the ready (PRs ready to be merged) label and removed the has conflicts label Jul 25, 2022
@justusschock justusschock enabled auto-merge (squash) July 25, 2022 09:33
@carmocca (Contributor) commented:

@justusschock CI failures look real

@awaelchli (Contributor) commented:

Fixed it. After the changes we made for DDP fork (#13405), we should mock the device parser functions instead of mocking torch.cuda.device_count directly.
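
For reference, a hedged sketch of that mocking pattern; the module path and function name (pytorch_lightning.utilities.device_parser.num_cuda_devices) are assumptions about Lightning's internals at the time, not something confirmed in this thread:

```python
from unittest import mock

from pytorch_lightning import Trainer

# Assumed pattern: patch the device-parser helper introduced around #13405
# instead of torch.cuda.device_count, so tests exercise the same fork-safe
# indirection that the Trainer itself uses.
@mock.patch("pytorch_lightning.utilities.device_parser.num_cuda_devices", return_value=2)
def test_accelerator_gpu_resolves_dynamically(mock_num_devices):
    trainer = Trainer(accelerator="gpu", devices=2)
    assert mock_num_devices.called
```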

@justusschock justusschock merged commit 2278719 into master Jul 25, 2022
@justusschock justusschock deleted the merge_different_gpus branch July 25, 2022 14:46
Labels
accelerator: cuda (Compute Unified Device Architecture GPU), accelerator: mps (Apple Silicon GPU), pl (Generic label for PyTorch Lightning package), ready (PRs ready to be merged)
Development

Successfully merging this pull request may close these issues.

MPS (Mac M1) device support