(Search for random() on the page linked above and try executing it.)
I tried many ways to replace it with an equivalent but cannot find a substitute that lets the code run error-free. The tutorial is poorly written in that it does not say which library random() should come from.
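One possible reading, offered only as a sketch: the call may be meant to be the `Random` class from Python's standard-library `random` module rather than something from NumPy or PyTorch. That is an assumption, not something the page states at that point. The helper name `partition_indices` and the seed value below are hypothetical and chosen just for illustration.

```python
from random import Random  # assumption: stdlib random, not numpy or torch


def partition_indices(n_items, fractions, seed=1234):
    """Shuffle range(n_items) reproducibly, then split it by the given fractions."""
    rng = Random()
    rng.seed(seed)  # same seed on every rank gives the same shuffle
    indexes = list(range(n_items))
    rng.shuffle(indexes)

    partitions = []
    for frac in fractions:
        part_len = int(frac * n_items)
        partitions.append(indexes[:part_len])
        indexes = indexes[part_len:]
    return partitions


# Two equal shards, e.g. one per process when size == 2.
parts = partition_indices(10, [0.5, 0.5])
```

If this reading is right, each rank would pick `parts[rank]` as its share of the data; because the seed is fixed, every process computes the same split.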
That might have worked, but errors continue to occur with this sample code. I fixed a number of them and am now seeing:
init_process: rank, size: 0 , 2
init_process: rank, size: 1 , 2
run: rank/size: 1 / 2
run: rank/size: 0 / 2
GG: size/bsz: 2 / 64
GG: size/bsz: 2 / 64
Process Process-1:
Traceback (most recent call last):
  File "/home/miniconda3/envs/root-test/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/home/miniconda3/envs/root-test/lib/python3.9/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/root/extdir/gg/git/codelab/gpu/ml/pytorch/distributed/tutorials/2-distributed-data-parallelism/dist-app-training.py", line 160, in init_process
    fn(rank, size)
  File "/root/extdir/gg/git/codelab/gpu/ml/pytorch/distributed/tutorials/2-distributed-data-parallelism/dist-app-training.py", line 139, in run
    num_batches = ceil(len(train_set.dataset) / float(bsz))
NameError: name 'ceil' is not defined
Process Process-2:
Traceback (most recent call last):
  File "/home/miniconda3/envs/root-test/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/home/miniconda3/envs/root-test/lib/python3.9/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/root/extdir/gg/git/codelab/gpu/ml/pytorch/distributed/tutorials/2-distributed-data-parallelism/dist-app-training.py", line 160, in init_process
    fn(rank, size)
  File "/root/extdir/gg/git/codelab/gpu/ml/pytorch/distributed/tutorials/2-distributed-data-parallelism/dist-app-training.py", line 139, in run
    num_batches = ceil(len(train_set.dataset) / float(bsz))
NameError: name 'ceil' is not defined
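The NameError in both processes means `ceil` was never imported in dist-app-training.py. `ceil` lives in Python's standard-library `math` module, so adding `from math import ceil` at the top of the script should clear this particular error. A minimal sketch of the failing line with the import in place follows; the helper name `count_batches` and the dataset length of 60000 are illustrative assumptions, and only the batch size of 64 comes from the log above.

```python
from math import ceil  # the missing import behind the NameError


def count_batches(dataset_len, bsz):
    """Number of batches needed to cover dataset_len items, rounding up."""
    return ceil(dataset_len / float(bsz))


# bsz = 64 matches the "size/bsz: 2 / 64" line in the log;
# 60000 is only an example dataset length.
num_batches = count_batches(60000, 64)
print(num_batches)  # 938
```

Writing `math.ceil(...)` after a plain `import math` would work just as well; the `from math import ceil` form simply keeps line 139 of the script unchanged.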