nn.MultiHeadAttention attn_output_weights become the same during evaluation only

liyunhan1993 (云汉李) April 25, 2022, 4:34am 2

I have the same problem as yours but not the same, did you solve it?