About torch.cuda.empty_cache()

cc @colesbury that might have a better idea how to do this?