Hi team,
I can’t seem to find the paper behind the Keypoint R-CNN implementation in torchvision
.
I’d like to better understand and study the architecture and I was looking for some support in the literature, please.
Is it, actually, just the Mask R-CNN paper ?
They mention Keypoints Detection in their work.
Thanks