Image captioning with LSTM

lanka · May 5, 2019, 8:12am

hi,
can anyone explain me to LSTM image captioning training, suppose as an example single image has 5 image captions(all sentence are equal length). how do we train LSTM? do we need to train 5 times or only ones with a random sentence?
Thank you

vdw · May 6, 2019, 3:36am

I don’t think there’s any real difference. Once you do multiple epochs, the network basically will see every image-caption pair anyway.