PyTorch Forums
Why F.scaled_dot_product_attention output in this case differs with normal attention
Juuso_Korhonen
(Juuso Korhonen)
May 22, 2024, 11:41am
4
What is that function reshape_batch_dim_to_heads?
show post in topic