Optimising a huge number of parameters

There’s a tutorial for using torch.utils.checkpointing here. Try and follow that, and see if it’s applicable to your use case!