I am running a model on UCF101, and I am encountering this error, after running the model for 8 iterations (even thought it changes in which iteration stops):
/opt/conda/conda-bld/pytorch_1503970438496/work/torch/lib/THCUNN/ClassNLLCriterion.cu:57: void cunn_ClassNLLCriterion_updateOutput_kernel(Dtype *, Dtype *, Dtype *, long *, Dtype *, int, int, int, int, long) [with Dtype = float, Acctype = float]: block: [0,0,0], thread: [7,0,0] Assertion t >= 0 && t < n_classes
failed.
THCudaCheck FAIL file=/opt/conda/conda-bld/pytorch_1503970438496/work/torch/lib/THCUNN/generic/ClassNLLCriterion.cu line=87 error=59 : device-side assert triggered
Traceback (most recent call last):
File βmain.pyβ, line 321, in
main()
File βmain.pyβ, line 158, in main
train(train_loader, model, optimizer, epoch, criterion)
File βmain.pyβ, line 201, in train
loss = criterion(output, target_var)
File β/home/josueortc/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.pyβ, line 224, in call
result = self.forward(*input, **kwargs)
File β/home/josueortc/anaconda3/lib/python3.6/site-packages/torch/nn/modules/loss.pyβ, line 482, in forward
self.ignore_index)
File β/home/josueortc/anaconda3/lib/python3.6/site-packages/torch/nn/functional.pyβ, line 746, in cross_entropy
return nll_loss(log_softmax(input), target, weight, size_average, ignore_index)
File β/home/josueortc/anaconda3/lib/python3.6/site-packages/torch/nn/functional.pyβ, line 672, in nll_loss
return _functions.thnn.NLLLoss.apply(input, target, weight, size_average, ignore_index)
File β/home/josueortc/anaconda3/lib/python3.6/site-packages/torch/nn/_functions/thnn/auto.pyβ, line 47, in forward
output, *ctx.additional_args)
RuntimeError: cuda runtime error (59) : device-side assert triggered at /opt/conda/conda-bld/pytorch_1503970438496/work/torch/lib/THCUNN/generic/Clas
I am not sure, what it means but based on nvidia-smi, the model is only occupying 2.5 GB of ram.