How to efficiently normalize a batch of tensor to [0, 1]

The performance drop in is in min() function
I read that it calls item(), and it is slow.