I am trying to run the UniLM fine-tuning code from https://github.com/microsoft/unilm, following the Abstractive Summarization - [Gigaword] (10K) section,
but while running it I encountered this error:
RuntimeError: CUDA out of memory. Tried to allocate 98.00 MiB (GPU 0; 10.92 GiB total capacity; 10.01 GiB already allocated; 67.56 MiB free; 10.29 GiB reserved in total by PyTorch)
Can anyone explain what is going on and, even better, help me solve it?
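
To make the numbers in the error easier to read, here is a small sketch that parses the message and does the arithmetic. It uses only the Python standard library; the helper name `parse_oom` is made up for illustration and is not part of UniLM or PyTorch.

```python
import re

# The error message, copied verbatim from the traceback.
ERROR = ("RuntimeError: CUDA out of memory. Tried to allocate 98.00 MiB "
         "(GPU 0; 10.92 GiB total capacity; 10.01 GiB already allocated; "
         "67.56 MiB free; 10.29 GiB reserved in total by PyTorch)")

UNIT_TO_MIB = {"MiB": 1.0, "GiB": 1024.0}

def parse_oom(message):
    """Hypothetical helper: return the sizes in the message, in MiB."""
    patterns = {
        "requested": r"Tried to allocate ([\d.]+) (MiB|GiB)",
        "total": r"([\d.]+) (MiB|GiB) total capacity",
        "allocated": r"([\d.]+) (MiB|GiB) already allocated",
        "free": r"([\d.]+) (MiB|GiB) free",
        "reserved": r"([\d.]+) (MiB|GiB) reserved",
    }
    sizes = {}
    for name, pattern in patterns.items():
        value, unit = re.search(pattern, message).groups()
        sizes[name] = float(value) * UNIT_TO_MIB[unit]
    return sizes

sizes = parse_oom(ERROR)
# Memory PyTorch has reserved from the GPU but not handed out to tensors.
cached_unused = sizes["reserved"] - sizes["allocated"]
# How much larger the request is than what is reported as free.
shortfall = sizes["requested"] - sizes["free"]
print(f"reserved but unallocated: {cached_unused:.2f} MiB")
print(f"request exceeds free memory by: {shortfall:.2f} MiB")
```

As far as I can tell, this means nearly all of the card's 10.92 GiB is already in use by the time the 98 MiB allocation is attempted, so the request cannot be satisfied from the 67.56 MiB that is still free.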