Batchnormalization problem (multi-gpu)

Becasue for 1 * 1 perchannel tensor , the BN mean will always be zero