The setup was working well before using multiple GPUs, and GPUs are able to communicate between devices.
if I do not give the environment variable CUDA_VISIBLE_DIVICES then it throws below error:
ERROR:root:Caught TypeError in replica 1 on device 1.
Original Traceback (most recent call last):
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/parallel_apply.py", line 64, in _worker
output = module(*input, **kwargs)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
TypeError: forward() missing 2 required positional arguments: 'input' and 'text'
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/site-packages/LPR/__init__.py", line 138, in feedforward
batch_plates = self.license_plate.fetch_plate_number(batch, self.long_size)
File "/opt/conda/lib/python3.8/site-packages/LPR/__init__.py", line 102, in fetch_plate_number
detected_text_dict = self.ocr.fetch_text(img_association[item])
File "/opt/conda/lib/python3.8/site-packages/LPR/OCR/__init__.py", line 172, in fetch_text
preds = self.model(image, text_for_pred, is_train=False) #torch.unsqueeze(image,0)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/data_parallel.py", line 171, in forward
outputs = self.parallel_apply(replicas, inputs, kwargs)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/data_parallel.py", line 181, in parallel_apply
return parallel_apply(replicas, inputs, kwargs, self.device_ids[:len(replicas)])
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/parallel_apply.py", line 89, in parallel_apply
output.reraise()
File "/opt/conda/lib/python3.8/site-packages/torch/_utils.py", line 543, in reraise
raise exception
TypeError: Caught TypeError in replica 1 on device 1.
Original Traceback (most recent call last):
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/parallel_apply.py", line 64, in _worker
output = module(*input, **kwargs)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
TypeError: forward() missing 2 required positional arguments: 'input' and 'text'
DEBUG:root:Entity ended here with no valid Licence plate detection and id is: my_id