How can I use attn_output_weights for next attention blocks?

jBloodless · August 25, 2022, 7:01pm

nn.MultiheadAttention able to return mysterious thing called attn_output_weights. My problem is to transfer results of MultiHeadAttention block to next iterations to keep attention results through them (frame by frame evaluation of audio, let’s say). How can I use those weights as input for nn.MultiheadAttention? Or I should use something different for this task?