Torch.sum does not benefit from parallelism?

@ptrblck Can you share some links etc that would explain the CUDA parallelization that you mentioned