C++ Embedding layer: slow zero_grad() and backward() execution for a large vocabulary

How can I create a sparse embedding layer with the PyTorch C++ front end (libtorch)? With a large vocabulary, zero_grad() and backward() on the embedding weight are slow, so I'd like the embedding to produce sparse gradients instead of a dense gradient over the whole weight matrix.
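
For reference, this is roughly what I have in mind. It's only a minimal sketch, assuming that `torch::nn::EmbeddingOptions` exposes a `sparse(true)` flag like Python's `nn.Embedding(sparse=True)`, and that plain SGD (no momentum or weight decay) accepts the resulting sparse gradients; the vocabulary size, embedding dimension, and batch shape are made-up numbers for illustration:

```cpp
#include <torch/torch.h>
#include <iostream>

int main() {
  // Hypothetical sizes for illustration only.
  const int64_t vocab_size = 1000000;   // large vocabulary
  const int64_t embedding_dim = 128;

  // Embedding with sparse gradients: backward() should then only produce
  // gradient rows for the indices that were actually looked up, rather than
  // a dense vocab_size x embedding_dim gradient tensor.
  torch::nn::Embedding embedding(
      torch::nn::EmbeddingOptions(vocab_size, embedding_dim).sparse(true));

  // Plain SGD without momentum/weight decay, which I believe can consume
  // sparse gradients directly.
  torch::optim::SGD optimizer(embedding->parameters(),
                              torch::optim::SGDOptions(/*lr=*/0.1));

  // A small batch of token indices, shape [batch, seq_len].
  auto indices = torch::randint(vocab_size, {32, 16},
                                torch::TensorOptions().dtype(torch::kLong));

  optimizer.zero_grad();
  auto out = embedding->forward(indices);   // [32, 16, 128]
  auto loss = out.pow(2).mean();            // dummy loss just to get a gradient
  loss.backward();
  optimizer.step();

  // Check whether the weight gradient really came back as a sparse tensor.
  std::cout << "weight grad is sparse: "
            << embedding->weight.grad().is_sparse() << '\n';
  return 0;
}
```

My understanding is that with sparse gradients the cost of zero_grad() and of the update scales with the number of rows touched in the batch rather than with the full vocabulary, which is the whole point here, but I'd appreciate confirmation that this is the right way to set it up in the C++ API.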