nn.MultiheadAttention able to return mysterious thing called
attn_output_weights. My problem is to transfer results of MultiHeadAttention block to next iterations to keep attention results through them (frame by frame evaluation of audio, let’s say). How can I use those weights as input for
nn.MultiheadAttention? Or I should use something different for this task?