ONNX export error | tensors from F.gelu() and F.layer_norm() stay on CUDA

Hi!

I already opened an issue on PyTorch’s GitHub, but thought someone here might have a solution in the meantime. The issue is explained in this link.

onnx.export | onnx | cuda | cpu | device