PyTorch Forums
Given attention weights and pair of key/value, how to calculate query?
nlp
hadaev8
(Had)
August 18, 2020, 10:19pm
1
I mean pytorch default multi head attention layer.