nn.MultiHeadAttention attn_output_weights become the same during evaluation only

I have the same problem as yours but not the same, did you solve it?