Is (ReLU + Softmax) in caffe same with CrossEntropy in Pytorch?

I am reproducing a network that implemented in caffe. The last layer of the nework is

(Caffe) block (n) --> BatchNorm --> ReLU --> SoftmaxWithLoss

I want to reproduce it in pytorch using CrossEntropy Loss. So, Is it right to remove ReLU layer before Softmax Loss because Cross Entropy aleady has it as

(Pytorch) block (n) --> BatchNorm --> CrossEntropyLoss

nn.CrossEntropyLoss is LogSoftMax + NLLLoss. So you should not remove the ReLU
See the first line here:

Thanks. But I removed it and it still work. I am wondering which one is correct remove or not remove?