The MSELoss documentation (link) seems to contain a mistake in this passage:
The **sum** operation still operates over all the elements, and divides by n.
The division by n can be avoided if one sets `reduction = 'sum'`.
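I think it should say “mean operation” instead of “sum operation”: summing over all the elements and then dividing by n gives the mean, and the second sentence already presents `reduction = 'sum'` as the way to skip that division.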
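
For what it's worth, here is a quick check of the two reductions. It is only a sketch: the tensors `pred` and `target` are made-up values, chosen so the arithmetic is easy to verify by hand (squared errors 1, 4, 9, 16).

```python
import torch
import torch.nn as nn

# Made-up example values: errors are 1, 2, 3, 4, so squared errors are 1, 4, 9, 16.
pred = torch.tensor([1.0, 2.0, 3.0, 4.0])
target = torch.zeros(4)

# Default reduction is 'mean': sum of squared errors divided by n.
loss_mean = nn.MSELoss(reduction="mean")(pred, target)

# reduction='sum': same sum of squared errors, without the division by n.
loss_sum = nn.MSELoss(reduction="sum")(pred, target)

print(loss_mean.item())  # 7.5  (30 / 4)
print(loss_sum.item())   # 30.0
```

So the operation that divides by n is the mean one, and `reduction = 'sum'` is what removes the division, which matches the suggested wording.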