Batchnorm mean and variance calculated for each dimension?

Sunnydreamrain · November 9, 2020, 3:51am

Hi,

For batchnorm, the docs say "The mean and standard-deviation are calculated per-dimension over the mini-batches". But for BatchNorm1d, when the input is of size (N, C, L), it seems that N and L are merged and the mean/var are calculated per channel C. I checked the running mean/var, and they are of size C.
Is there any built-in way to compute the mean/var for each (C, L) position, while keeping the weight/bias per C only (shared over L)?
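
The behavior described above can be checked with a short sketch. The sizes N=8, C=4, L=10 are arbitrary values chosen for illustration; the manual computation assumes the layer is in training mode with its default affine initialization (weight 1, bias 0).

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
N, C, L = 8, 4, 10  # illustrative sizes, not from the original post
x = torch.randn(N, C, L)

bn = nn.BatchNorm1d(C)
bn.train()
y = bn(x)

# Statistics computed by hand over the N and L dimensions: one value per channel.
mean = x.mean(dim=(0, 2), keepdim=True)
var = x.var(dim=(0, 2), unbiased=False, keepdim=True)
y_manual = (x - mean) / torch.sqrt(var + bn.eps)

running_mean_shape = tuple(bn.running_mean.shape)
matches_manual = torch.allclose(y, y_manual, atol=1e-5)
print(running_mean_shape, matches_manual)
```

If the two outputs match, the layer is reducing over N and L jointly and keeping one statistic per channel.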

Thanks.

ptrblck · November 10, 2020, 6:38am

You could use nn.LayerNorm and specify the normalized_shape over which the mean and standard deviation should be calculated. However, I think you would need to set elementwise_affine=False and could instead apply a linear layer to the output using your desired shape.
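
A minimal sketch of this suggestion, under some assumptions: the sizes are illustrative, and a broadcast per-channel scale and shift (the hypothetical `weight` and `bias` parameters of shape (C, 1)) stands in for the linear layer mentioned above. With normalized_shape=[C, L], the statistics are computed over C and L separately for each sample.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
N, C, L = 8, 4, 10  # illustrative sizes
x = torch.randn(N, C, L)

# Normalize over the last two dimensions, without LayerNorm's own affine parameters.
ln = nn.LayerNorm([C, L], elementwise_affine=False)

# Hypothetical per-channel affine parameters, shared over L via broadcasting.
weight = nn.Parameter(torch.ones(C, 1))
bias = nn.Parameter(torch.zeros(C, 1))

y = ln(x) * weight + bias

per_sample_mean = y.mean(dim=(1, 2))  # one mean per sample
has_affine_params = len(list(ln.parameters())) > 0
print(tuple(y.shape), has_affine_params)
```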

Sunnydreamrain · November 11, 2020, 9:09am

Thanks. But LayerNorm computes the mean/var over the neurons, and they are computed in both training and test. I still want to compute the mean/var for each neuron over the batch, only during training, and keep them fixed for test.

ptrblck · November 11, 2020, 9:12am

In that case, your best bet might be to implement this type of normalization layer manually.
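
One way such a manual layer might look is sketched below. This is only an illustration, not a built-in PyTorch module: the class name `PerPositionBatchNorm1d` and its arguments are hypothetical, it assumes a fixed sequence length L, and it borrows the running-statistics convention of batchnorm (momentum update, unbiased variance stored for evaluation).

```python
import torch
import torch.nn as nn


class PerPositionBatchNorm1d(nn.Module):
    """Hypothetical layer: statistics per (C, L) over the batch, affine per C."""

    def __init__(self, num_channels, length, eps=1e-5, momentum=0.1):
        super().__init__()
        self.eps = eps
        self.momentum = momentum
        # Affine parameters are per channel only, shared over L.
        self.weight = nn.Parameter(torch.ones(num_channels))
        self.bias = nn.Parameter(torch.zeros(num_channels))
        # Running statistics are kept for every (C, L) position.
        self.register_buffer("running_mean", torch.zeros(num_channels, length))
        self.register_buffer("running_var", torch.ones(num_channels, length))

    def forward(self, x):  # x has shape (N, C, L)
        if self.training:
            # Reduce over the batch dimension only.
            mean = x.mean(dim=0)
            var = x.var(dim=0, unbiased=False)
            with torch.no_grad():
                n = x.shape[0]
                unbiased_var = var * n / max(n - 1, 1)
                self.running_mean.mul_(1 - self.momentum).add_(self.momentum * mean)
                self.running_var.mul_(1 - self.momentum).add_(self.momentum * unbiased_var)
        else:
            # Fixed statistics at test time.
            mean, var = self.running_mean, self.running_var
        x_hat = (x - mean) / torch.sqrt(var + self.eps)
        return x_hat * self.weight[:, None] + self.bias[:, None]


torch.manual_seed(0)
layer = PerPositionBatchNorm1d(num_channels=4, length=10)
x = torch.randn(8, 4, 10)

layer.train()
y_train = layer(x)  # uses batch statistics and updates the running buffers

layer.eval()
y_eval = layer(x)   # uses the stored running statistics
```

In training mode the output has zero mean over the batch at every (C, L) position, while in eval mode the stored buffers are used and no statistics are recomputed.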

Sunnydreamrain · November 12, 2020, 9:26am

Okay. Thanks anyway.