foxet
(Fan LIN)
September 29, 2017, 12:23pm
#1
wide resnet(last two layer changed)
orange curve: train set
blue cure: val set
only 650 images
axis x:epoch
axis y:accuracy
batch size:15 for both trainset/val set
image size:320
compared to train set. …why the variance of dev set behave so different from the train set, seems no convergence.
what can be the problem?
chenyuntc
(Yun Chen)
September 29, 2017, 2:36pm
#2
It’s hard to say. Do you forgot model.eval()
?
And how many imgs are used for validation?
foxet
(Fan LIN)
September 29, 2017, 2:44pm
#3
yes, i used model.eval()
and i use 100 images for test
ytoon
September 29, 2017, 2:49pm
#4
Hi, is your model trained from the scratch or finetune from the pre-trained model?
foxet
(Fan LIN)
September 29, 2017, 2:54pm
#5
from scratch…but i added two layers(average pool and maxpoll) in the end of the model , but i don’t think they matter
foxet
(Fan LIN)
September 29, 2017, 5:06pm
#6
loss curve
orange curve: train set
blue cure: val set
chenyuntc
(Yun Chen)
September 30, 2017, 1:01am
#7
The test Images is far from enough, if you don’t have enough images for training, I would suggest you using pretrained model from torchvision and finetune.
http://pytorch.org/docs/master/torchvision/models.html
foxet
(Fan LIN)
September 30, 2017, 1:05am
#8
yes,
the test images is small,but does it lead to bumpy curve?
chenyuntc
(Yun Chen)
September 30, 2017, 1:08am
#9
the bumpy test curve is caused by too few test images.
the gentle training accuracy curve is the result of overfitting.
foxet
(Fan LIN)
September 30, 2017, 1:10am
#10
ok ,thankyou , i will try to make the test set bigger