NCCL backend hangs when training over infiniband

It might be a bios issue. I have a similar issue with 4Xv100 GPU that runs too slow. It turns out to be a bios setup problem. There is a bios adjustments which needs to be arranged as performance mode.
[V100 is too slow for training](http://v100 topic)