Something about Image Captioning

Hello, everyone. I am freshman to image captioning, but I find it very interesting.But there is a problem puzzled me.Is there some relationships between image captionging and object detection.In other words, I wanna ask if the object detected can be applied to image captioning?

@Michael_Hsu: Hello, I am not sure if this is the right forum to ask this question since it is mainly for Pytorch related questions. But I would be glad to suggest some references that you may find interesting related to captioning and detection:

I hope this helps!

Do help me a lot! Thanks for your generious contribution!