When i do not use the batch norm, the training time is slower than when i use. (2~3 times slower)
I wrote the code to print the log every 10 iterations.
In tranining phase, the speed of printing the log is much faster when using the batch norm.
Is this normal?