Torch assign independent tasks to different gpus

nikolaos_karaliolios · February 17, 2021, 10:05am

i have a script in which train two models in an independent way, on the same dataset and using the same initialization

pseudo code would read something like

initialize warm-up model
train and store warm-up model
load warm-up model and train using 1st strategy
load warm-up model and train using 2nd strategy
compare performances

is there a way to assign in the script the training to different GPUs so that steps 3&4 happen in parallel without calling a different .py file for each step?

janhenr · February 17, 2021, 10:23am

Hi,
I am not aware of any way to do this in PyTorch. However, it seems like your usecase is very easily fixed writing a bash script. You can use the same .py script using different arguments and call it in a bash script as such:

python train.py --warmup --model_out "weights.ckpt" # warmup
python train.py --strategy_1 --model_in "weights.ckpt" --model_out "strategy_1.ckpt"
python train.py --strategy_2 --model_in "weights.ckpt" --model_out "strategy_2.ckpt"
python train.py --compare --models_to_compare  "strategy_1.ckpt" "strategy_2.ckpt"

Then you just run the above script with bash run.sh.
Hope this helps!

nikolaos_karaliolios · February 17, 2021, 10:28am

that seems to be the only way to do it, but the problem is that i want to call the script many times and with different values of hyperparameters, so it gets way too complicated…

thanks!