As far as I know, COCO only provides annotations for 80 classes. My question is how was Faster R-CNN ResNet-50 FPN form torchvision trained with 91 classes?
I am guessing this is due to the 91 stuff categories the COCO dataset has.
Screenshot from the COCO website
Not 100% sure tho…