I read that pytorch added memory-optimized algorithms like FlashAttention and Memory Efficient Attention
Does anyone know if pytorch will support Flash Attention or other memory-optimized algorithms in PyTorch Mobile later? maybe there will also be mobile GPU backend support compatibility?