So I got this error message
If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
Reserved is 37 GB and allocate was 600 MB.
So imho this fits the >>
case?
I tried setting to 512 and 256 and 64 but I still get the same error.
Does anyone have experience on how to set this correctly?
Many thanks in advance!