I have used deep learning to predict the 2D keypoints of a deformable object in two different views of the object video feeds.
How can I retrieve the 3D keypoints if I have 2D keypoints for each corresponding frame? Is there a framework that already does that? Looking for some expert advice here.