I am trying to use Gloo for distributed training, but the script gets stuck while initializing the process group:
dist.init_process_group(backend="gloo", init_method="tcp://10.139.64.218:7077", rank=1, world_size=3)
I am running the scripts on a 3-node CPU cluster on Azure Databricks.
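For context, init_process_group blocks until all world_size ranks have joined, so a hang can mean either that some rank never started or that a node cannot reach the rank 0 address. A minimal sketch of a reachability check that could be run on the non-zero-rank nodes is below. It assumes rank 0 has already called init_process_group and is listening on the address from init_method; `can_connect` is a hypothetical helper written for this post, not part of torch.distributed.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical helper for diagnosis only; it is not a PyTorch API.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

Calling `can_connect("10.139.64.218", 7077)` from the rank 1 and rank 2 nodes while rank 0 is waiting would show whether the port is reachable. A False result would point toward a network or firewall issue rather than the Gloo backend itself, though this check alone cannot confirm the cause.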
Please help.