Image captioning

raesol (reda) October 21, 2022, 9:48am 1

I was reading an image captioning paper and It got me thinking:

Do we need to do image detection, image classification and then captioning
We don’t need to do them separately?
Thank you for the insight in advance