I’ve tested running singular value decomposition on a large random matrix on both mps and cpu. I used the frobenius norm of the original matrix subtracted from the matrix assembled from the SVD components and discovered that when the device is mps the norm is much much larger than the norm for cpu
CPU
MPS