OBJECT DETECTION TUTORIAL - XML annotations

Hi

I am new to computer vision and would like to use Torchvision object detection fine tutorial to process my dataset which has 4 categories. Also the annotations are is xml format. I will use the pretrained model so I was wondering if you can advise what to do to get the label correctly.

Thanks