Can't train in DDP mode after recent update #2405

## 🐛 Bug

After pulling the latest code, I found that DDP training gets stuck within the first few epochs.
I tested individual commits to find which one introduced the bug: commit `a3ecf0fd640465f9a7c009e81bcc5ecabf381004` (Mar 3) works well,
but after `git checkout` of commit `e931b9da33f45551928059b8d61bddd50e401e48` (Mar 4) the bug appears.
The bug is still present in the latest commit.
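To see which changes landed between the two commits named above, something like the following could be used. This is only a sketch: the helper name `commits_between` is hypothetical and not part of the repository, and it assumes it is run inside a clone.

```bash
# Hypothetical helper: list the commits that come after <good>, up to and including <bad>.
# Usage: commits_between <good> <bad>   (run inside a clone of the repository)
commits_between() {
    # "A..B" selects commits reachable from B but not from A.
    git log --oneline "$1..$2"
}
```

For example, `commits_between a3ecf0fd640465f9a7c009e81bcc5ecabf381004 e931b9da33f45551928059b8d61bddd50e401e48`. If the range contains more than one commit, `git bisect start <bad> <good>` can narrow it down further.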


## To Reproduce (REQUIRED)
`python3 -m torch.distributed.launch --nproc_per_node 4 train.py`

The training process hangs indefinitely until it is terminated manually.
It also keeps occupying GPU memory until the process is killed with `kill -9 xxxxx`.

![stuck](https://user-images.githubusercontent.com/5948604/110415966-daa79980-80cd-11eb-8e8d-f7f56c2c9cd5.png)
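To collect more detail on the hang without leaving the GPUs occupied, the run could be wrapped as sketched below. The assumptions are that GNU `timeout` is available (as on Ubuntu) and that the backend is NCCL, whose `NCCL_DEBUG=INFO` variable enables its logging. The helper name `run_ddp_with_timeout` and the time limit are hypothetical.

```bash
# Hypothetical helper: run a command with NCCL logging enabled and a hard time limit.
# Usage: run_ddp_with_timeout <limit> <command> [args...]
run_ddp_with_timeout() {
    local limit="$1"
    shift
    # NCCL_DEBUG=INFO asks NCCL to print its initialization and communication logs.
    # timeout sends SIGKILL when the limit expires, so a hung run ends with a
    # non-zero status instead of blocking forever.
    NCCL_DEBUG=INFO timeout -s KILL "$limit" "$@"
}
```

For example, `run_ddp_with_timeout 30m python3 -m torch.distributed.launch --nproc_per_node 4 train.py`. The 30-minute limit is arbitrary and should be longer than the time the first few epochs normally take. Worker processes that leave the launcher's process group may still need a manual `kill -9`.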


## Expected behavior
Rolling back to the older commit restores the expected behavior.
```bash
$ git checkout a3ecf0fd640465f9a7c009e81bcc5ecabf381004
$ python3 -m torch.distributed.launch --nproc_per_node 4 train.py
```
![worked well](https://user-images.githubusercontent.com/5948604/110415987-e004e400-80cd-11eb-8fea-b00a5305459e.png)


## Environment

 - OS: Ubuntu 20.04
 - GPU: 1080 Ti * 4
 - Python: 3.8
 - PyTorch: 1.7.1
 - CUDA: 11.1
 - Driver: 455.32

## Additional
The latest commit seems to work fine on 2 * 3090, but I'm not sure yet. I will run further tests on the 3090 and other GPUs.
