I have the data πββπΓπ with π=300,000 the number of data and π=2048 the number of variables. Of the π given I only have about 10,000 labels π¦β{0.1}} so for about 290,000 individuals, I donβt know the labels.
Is there a way to create a good classifier?
I think this problem is widely known as weakly or semi-supervised learning. I just canβt find adequate code on GitHub in pytorch , preferably quite simple and understandable.
Thank you in advance