Parallelize multiple gpu model.forward() calls

If I wanted to make multiple forward() calls of a model on the same GPU to obtain a speedup for training how could I do so?

hi, I meet the same issue. have you solved it ? can you tell me something about it.