Intuition behind adding MultiheadAttention block under activation.py

@ptrblck Why is the implementation of MultiheadAttention part of pytorch/torch/nn/modules/activation.py?
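
For context, a minimal sketch of how the class is used: regardless of which file it is defined in, it is exposed publicly as torch.nn.MultiheadAttention. The tensor shapes below assume the default batch_first=False layout of (seq_len, batch, embed_dim).

```python
import torch
import torch.nn as nn

# Defined in torch/nn/modules/activation.py, but imported via torch.nn
mha = nn.MultiheadAttention(embed_dim=16, num_heads=4)

query = torch.randn(5, 2, 16)        # (seq_len, batch, embed_dim)
key = value = torch.randn(5, 2, 16)

attn_output, attn_weights = mha(query, key, value)
print(attn_output.shape)             # torch.Size([5, 2, 16])
```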

I don’t know, but @zhangguanheng66 might know the reason as he has implemented it in this PR.
