From the documentation for the two versions of svd(), torch.svd() and torch.linalg.svd(), torch.svd() computes, by default, the “reduced” SVD:
If some is True (default), the method returns the reduced singular value decomposition.
while torch.linalg.svd() computes, by default, the full SVD:
The parameter full_matrices chooses between the full (default) and reduced SVD.
Given that your X is very large an exceedingly non-square, torch.svd()'s
default reduced SVD could well require much less memory. You might try U,S,V = torch.linalg.svd (X, full_matrices = False) and see if
that works for you.
As an aside, torch.svd() is deprecated in favor of torch.linalg.svd(),
so if you can get torch.linalg.svd() working, you should probably use it.