How do I combine torch.compile with cudnn.benchmark, torch.utils.checkpoint, and nn.DataParallel?

Does torch.compile already include these tricks (such as cudnn.benchmark and torch.utils.checkpoint), or do I still need to enable them myself?

Should I wrap the model with torch.compile first, or with nn.DataParallel first?

I can’t find this covered in the documentation.
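
For concreteness, here is a minimal sketch of the setup I have in mind. The `Net` model and the `build_model` helper are hypothetical names made up for this post, and the wrapping order (nn.DataParallel first, then torch.compile) is only my guess, which is exactly the part I am unsure about:

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

# Ask cuDNN to autotune convolution algorithms (only matters on CUDA).
torch.backends.cudnn.benchmark = True


class Net(nn.Module):
    """Hypothetical toy model, only to illustrate the question."""

    def __init__(self):
        super().__init__()
        self.block = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU())
        self.head = nn.Linear(8, 10)

    def forward(self, x):
        # Recompute the block's activations during backward instead of storing them.
        x = checkpoint(self.block, x, use_reentrant=False)
        return self.head(x.mean(dim=(2, 3)))


def build_model(use_compile=True):
    """Hypothetical helper; the wrapping order below is an assumption."""
    model = Net()
    if torch.cuda.is_available():
        model = model.cuda()
        if torch.cuda.device_count() > 1:
            model = nn.DataParallel(model)  # guess: wrap for multi-GPU first ...
    if use_compile:
        model = torch.compile(model)  # ... then compile the wrapped model?
    return model
```

I would call `build_model()` once and then train as usual. Is this order correct, and are the cudnn.benchmark line and the checkpoint call still needed once torch.compile is used?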

I would be very grateful if someone could answer these questions or tell me where to find the answers.