Hi PyTorch community,
I’d like to announce a working PyTorch implementation of the paper Unsupervised learning of Object Landmarks through Conditional Image Generation
This allows the generation of keypoints on 2D images, using unsupervised learning with no labels! Works for both still images and video. (Possibly easier/better on video).
If you are at all interested in unsupervised learning, it’s a great paper. One of the best papers on unsupervised learning IMHO, due to the simplicity of the technique.
Check it out at …
I also implemented transporter network from Unsupervised Learning of Object Keypoints for Perception and Control and can confirm that it also works very well on Atari images.
Here are a few of the results.