Keypoints detection program

Sylvain_Ard · May 24, 2022, 7:03pm

Hello,
I am looking for two keypoint detection programs:
1: one with one point per class
2: the other with any number of points per class
Both programs must accept large images (1500 x 1000 pixels) without resizing them before they enter the neural network.
Thank you
Best regards

Sylvain_Ard · May 28, 2022, 4:20pm

Does no one have an answer?

ATAboukhadra · May 29, 2022, 1:11pm

Hi,

You can use the Keypoint RCNN model from torchvision.

You can specify the number of keypoints that you want. If the number of keypoints varies, choose the maximum possible number and, when preparing the data, append zeros for any instance that has fewer keypoints than the maximum.
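A minimal sketch of that zero-padding step, assuming the target layout documented for torchvision's Keypoint RCNN, where each keypoint is `[x, y, visibility]` and `visibility=0` means the keypoint is not visible. The function name `pad_keypoints` and the sample coordinates are hypothetical.

```python
from typing import List, Sequence, Tuple


def pad_keypoints(
    points: Sequence[Tuple[float, float]], max_keypoints: int
) -> List[List[float]]:
    """Return exactly max_keypoints rows of [x, y, visibility]."""
    if len(points) > max_keypoints:
        raise ValueError("more annotated points than max_keypoints")
    # Real annotations are marked visible (visibility = 1).
    rows = [[float(x), float(y), 1.0] for x, y in points]
    # Missing slots are filled with zeros and marked not visible.
    rows += [[0.0, 0.0, 0.0] for _ in range(max_keypoints - len(points))]
    return rows


# Hypothetical instance with 2 annotated points and a maximum of 4.
padded = pad_keypoints([(120.0, 45.0), (310.5, 60.0)], max_keypoints=4)
```

Every instance then has the same number of rows, so the per-image targets can be stacked into a tensor of shape `[N, K, 3]`.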

You can also refer to this tutorial to understand more about how to work with the Mask RCNN architecture and how to prepare a dataset for it, including bounding boxes, which are necessary to train this model.

Sylvain_Ard · May 29, 2022, 2:08pm

Thank you very much! I will try it.

Sylvain_Ard · June 3, 2022, 12:29pm

I looked at your tutorial, but it only allows one point per class and per polygon, so it answers half of the question. For example, I want to detect the teeth of a leaf, so there are a variable number of points of the same class per polygon(=leaf).
Thank you.

Sylvain_Ard · June 5, 2022, 12:57pm

Do you know of a program that can do this?

ATAboukhadra · June 9, 2022, 8:44am

Hi,

Sorry for not replying earlier as I didn’t receive a notification.

one point per class and per polygon

What do you mean by one point per class? You can specify however many keypoints you want to estimate within each bounding box (in your case, a leaf) when you call the RCNN constructor. Of course, this only holds as long as your training data has the same nature as the test samples.

there are a variable number of points of the same class per polygon(=leaf)

As I suggested, you can assume that every leaf has the maximum possible number of teeth. Whenever that is not the case, i.e. the leaf has fewer teeth than the maximum, you can assume that the missing ones are there but not visible and duplicate the keypoints until you reach the maximum. At least, this approach is working fine for me.
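One possible reading of that duplication approach, as a sketch rather than the exact code used in this post: existing points are repeated cyclically to fill the missing slots, and the copies are marked not visible (`visibility=0`). The function name `pad_by_duplication` is hypothetical.

```python
from typing import List, Sequence, Tuple


def pad_by_duplication(
    points: Sequence[Tuple[float, float]], max_keypoints: int
) -> List[List[float]]:
    """Fill missing keypoint slots with non-visible copies of existing points."""
    if not points:
        raise ValueError("need at least one annotated point to duplicate")
    if len(points) > max_keypoints:
        raise ValueError("more annotated points than max_keypoints")
    # Original annotations keep visibility = 1.
    rows = [[float(x), float(y), 1.0] for x, y in points]
    i = 0
    while len(rows) < max_keypoints:
        # Cycle through the annotated points; copies get visibility = 0.
        x, y = points[i % len(points)]
        rows.append([float(x), float(y), 0.0])
        i += 1
    return rows


# Hypothetical leaf with 2 annotated teeth and a maximum of 5.
padded = pad_by_duplication([(10, 20), (30, 40)], max_keypoints=5)
```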

Sylvain_Ard · June 9, 2022, 9:26am

No, in the COCO keypoints format there is only one keypoint per keypoint name (= class) and per polygon. If I use keypoint names such as “teeth1”, “teeth2”, etc., it wouldn’t work, because teeth1 and teeth2 on two different leaves have no correlation.
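For reference, a sketch of what a COCO-style keypoint annotation with fixed, named slots looks like. In the COCO format each category lists its keypoint names once, and each annotation stores a flat `[x, y, v, ...]` list in that order, with `v=0` for an unlabeled slot and `v=2` for a labeled, visible point. The names `tooth_1`, `tooth_2`, ..., the value of `MAX_TEETH`, and the helper `make_annotation` are hypothetical.

```python
MAX_TEETH = 3  # hypothetical; a real dataset would use its true maximum

# One category ("leaf") with a fixed list of keypoint slots.
category = {
    "id": 1,
    "name": "leaf",
    "keypoints": [f"tooth_{i + 1}" for i in range(MAX_TEETH)],
    "skeleton": [],
}


def make_annotation(ann_id, image_id, bbox, teeth):
    """Build one COCO-style annotation; bbox is [x, y, width, height]."""
    if len(teeth) > MAX_TEETH:
        raise ValueError("more teeth than keypoint slots")
    flat = []
    for i in range(MAX_TEETH):
        if i < len(teeth):
            x, y = teeth[i]
            flat += [x, y, 2]  # labeled and visible
        else:
            flat += [0, 0, 0]  # empty slot: not labeled
    return {
        "id": ann_id,
        "image_id": image_id,
        "category_id": category["id"],
        "bbox": bbox,
        "area": bbox[2] * bbox[3],
        "iscrowd": 0,
        "num_keypoints": len(teeth),  # count of labeled keypoints
        "keypoints": flat,
    }


ann = make_annotation(1, 1, [50, 40, 400, 300], [(120, 45), (310, 60)])
```

The slot names carry no meaning of their own here: which tooth ends up in `tooth_1` depends only on the order in which the points are written.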

ATAboukhadra · June 9, 2022, 10:29am

OK, it’s a bit difficult for me to understand your point. Could you please describe the dataset that you have for training?
As I understand it, you have a set of RGB images that contain leaves, and each leaf has a number of keypoints (teeth) that varies from leaf to leaf, e.g. 20, 100, 150, 500 and so on. Each keypoint has x, y values that correspond to its pixel location in the image.
If you don’t have such a dataset, then you can’t train the Keypoint RCNN model to do this task, afaik.

Sylvain_Ard · June 9, 2022, 10:55am

Yes, I have such a dataset, but what do you think I should put in keypoints_names?

Sylvain_Ard · June 9, 2022, 12:21pm

What I mean is that I have only one keypoint_name, teeth (one class), and several keypoints. If I put teeth1, teeth2 and so on in the keypoint names, it can’t work.

ATAboukhadra · June 9, 2022, 1:38pm

Why do you need names?
Keypoint RCNN doesn’t care about names of keypoints as long as you formulate the data correctly. Please refer to this tutorial for more information. https://debuggercafe.com/human-pose-detection-using-pytorch-keypoint-rcnn/

Sylvain_Ard · June 9, 2022, 2:12pm

It does! For example:
This model has been pre-trained on the COCO Keypoint dataset. It outputs the keypoints for 17 human parts and body joints. They are: ‘nose’, ‘left_eye’, ‘right_eye’, ‘left_ear’, ‘right_ear’, ‘left_shoulder’, ‘right_shoulder’, ‘left_elbow’, ‘right_elbow’, ‘left_wrist’, ‘right_wrist’, ‘left_hip’, ‘right_hip’, ‘left_knee’, ‘right_knee’, ‘left_ankle’, ‘right_ankle’.

For example, left_eye is a keypoint_name.

ATAboukhadra · June 10, 2022, 8:48am

This is just to identify what every keypoint represents, for visualization purposes. It also helps to create edges between them. I train this model to identify 778 keypoints on the hand without knowing which is which. No need for names!

Sylvain_Ard · June 10, 2022, 9:00am

But in that case, how does it recognize which point has which name if it does not differentiate them?

Sylvain_Ard · June 10, 2022, 11:04am

I will test it. How many iterations are good for training, please?

ATAboukhadra · June 13, 2022, 8:26am

But in that case, how does it recognize which point has which name if it does not differentiate them?

It depends on the order of the points during training.
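Since the model tells keypoints apart only by their index, the annotation order has to follow some consistent rule. One possible convention, which is an assumption and not something stated in this thread, is to sort the points by angle around their centroid so that slot i has a comparable meaning from one leaf to the next. The function name `order_by_angle` is hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def order_by_angle(
    points: Sequence[Tuple[float, float]]
) -> List[Tuple[float, float]]:
    """Sort points by their angle around the centroid (ascending atan2)."""
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    return sorted(points, key=lambda p: math.atan2(p[1] - cy, p[0] - cx))


# Hypothetical points in arbitrary annotation order.
ordered = order_by_angle([(0, 1), (1, 0), (0, -1), (-1, 0)])
```

Whether an angular ordering is appropriate depends on the leaf shape; any rule works as long as it is applied identically to every training sample.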

How many iterations are good for training, please?

In my experience, very few epochs should be sufficient. 1 epoch can be enough; more than 5 could result in overfitting.

maog77 · December 20, 2023, 12:43pm

Hi, did you succeed in your aim? How did you create the JSON annotation file?

Sylvain_Ard · December 20, 2023, 12:57pm

Hi,
I succeeded with one keypoint per class using mmpose, but not with several keypoints per class.
For labeling, I used my own program: https://www.sylvain-ard.fr/Programmes/LabelingProgram%20Setup.exe

maog77 · December 20, 2023, 1:10pm

Thank you,
Do you mean one keypoint per class in the train dataset?
When you make the prediction do you find only one or all of them?