Skip to content

DDP spawn no longer works in Jupyter environment #7550

@Borda

Description

@Borda

🐛 Bug

Running multi-GPU training does not update training progress bar, used in Grid Jupyterlab

Please reproduce using the BoringModel

https://github.com/Borda/kaggle_plant-pathology/blob/main/notebooks/Plant-Pathology-with-Lightning.ipynb

To Reproduce

just change

    gpus=2
    accelerator="ddp_spawn"
    precision=16,
    accumulate_grad_batches=8,
    val_check_interval=0.25,
    progress_bar_refresh_rate=1,
    weights_summary='top',

when I check GPU utilization it is running correctly

Expected behaviour

a properly updated progress bar

Environment

Note: Bugs with code are solved faster ! Colab Notebook should be made public !

cc @tchaton @rohitgr7 @akihironitta @awaelchli

Metadata

Metadata

Assignees

Labels

3rd partyRelated to a 3rd-partybugSomething isn't workingdistributedGeneric distributed-related topic

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions