About the order of gradients computation

Rita · January 20, 2021, 9:51am

Hi,

I am wondering is it deterministic for the order how the gradients get computed if the model is fixed?
And if we started a distributed training job, are the orders on the computations nodes are all the same?

Thanks!