I noticed that Pytorch documentation suggests that the input dimensions of 3D images are (depth, height, width). In contrast, in TF documentation these dimensions are referred as (dim1, dim2, dim3). Therefore, maybe not but I was wondering if there is any difference (e.g., results, efficiency) in using (depth, height, width) vs. (height, width, depth), since the latter is more natural (at least for me).
Thanks.