DDP and Gradient checkpointing

I don’t understand what the issue is. Why did your code hang - that is essential information to put in here. Did you try any of the following:

if none of them worked can you provide more details? In particular Your original post does not describe enough to know what the problem is. Things can hang for many reasons - especially in complicated multip processing code.