Knowledge distillation on the BDD100K dataset using ResNet-50 as teacher and my own architecture as student


This is Aman Goyal. I am currently doing research at MSU on knowledge distillation. I have explored various methods but have not found one that performs distillation on the BDD100K detection dataset. I want to use ResNet-50 as the teacher and my own architecture as the student.
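
For reference, the generic soft-target distillation term (a temperature-softened KL divergence between teacher and student class scores) can be sketched as below. This is only a plain-Python illustration under stated assumptions: both models are assumed to produce class logits for the same box or region, the names `log_softmax` and `kd_loss` and the temperature of 4.0 are hypothetical choices, and it is not taken from any existing BDD100K pipeline. In practice it would be written in a deep learning framework and combined with the usual detection loss.

```python
import math

def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(z - peak) for z in scaled))
    return [z - log_norm for z in scaled]

def kd_loss(student_logits, teacher_logits, temperature=4.0):
    """KL(teacher || student) on softened class distributions.

    Assumption: both lists hold class logits for the same box/region.
    The T^2 factor keeps gradient magnitudes comparable across temperatures.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(
        math.exp(lt) * (lt - ls)
        for lt, ls in zip(log_p_teacher, log_p_student)
    )
    return (temperature ** 2) * kl

# Hypothetical logits for one predicted box over three classes.
teacher = [2.0, 0.5, -1.0]
student = [1.0, 1.0, 0.0]
loss = kd_loss(student, teacher)
```

The loss is zero when the student's logits match the teacher's and grows as the two softened distributions diverge.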

It would be great if anyone could guide me on this.

Thanks a lot