nn.MultiheadAttention
able to return mysterious thing called attn_output_weights
. My problem is to transfer results of MultiHeadAttention block to next iterations to keep attention results through them (frame by frame evaluation of audio, let’s say). How can I use those weights as input for nn.MultiheadAttention
? Or I should use something different for this task?