How does batch norm behave in pytorch depending if one is in training mode or eval/inference mode?