Expert Parallelism and Expert Parallelism + Tensor Parallelism need

Guo_Danding · September 12, 2024, 1:19pm

Hi, I have no idea how to implement ep and ep+tp with current torch api. Is there any examples or tutorials? Is there anyone has tried it?

fegin · September 13, 2024, 4:43pm

If you mean MoE, yes it is possible to implement MoE with TP via DTensor. We are also planning to explore this parallelism combination but it is not ready yet.

lovanto · September 13, 2024, 4:44pm

Hi! If I’m not mistaken expert parallelism is implemented in GitHub - databricks/megablocks

Guo_Danding · September 14, 2024, 7:17am

Thanks. I will check it.

Guo_Danding · September 14, 2024, 7:21am

Yes, it is what i mean. Pytorch has great api to support TP. But i cannot find any tutorials helpful to implement EP or EP+TP. So i am confused.