Gaussian-weighted self-attention implementation

Superklez · April 14, 2021, 1:59am

How do I implement Gaussian-weighted self-attention in PyTorch? I would like to follow the attention mechanism proposed in T-GSA.
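For concreteness, here is a minimal single-head sketch of the kind of layer I have in mind. It is not taken from the T-GSA code. It assumes the Gaussian weight between frames i and j is exp(-(i - j)^2 / sigma^2) with one trainable sigma, and that this weight multiplies the scaled dot-product scores elementwise before the softmax. The class name `GaussianWeightedSelfAttention`, the `init_sigma` argument, and the single-head layout are hypothetical choices for illustration. Details such as multi-head handling, masking, or any rectification of the scores before weighting should be checked against the paper.

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class GaussianWeightedSelfAttention(nn.Module):
    """Single-head self-attention with a distance-based Gaussian weight (sketch)."""

    def __init__(self, d_model: int, init_sigma: float = 10.0):
        super().__init__()
        self.d_model = d_model
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        # Assumption: a single trainable width shared by all positions.
        self.sigma = nn.Parameter(torch.tensor(float(init_sigma)))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, time, d_model)
        num_frames = x.size(1)
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)

        # Scaled dot-product scores, shape (batch, time, time)
        scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(self.d_model)

        # Gaussian weight that decays with squared frame distance, shape (time, time)
        pos = torch.arange(num_frames, device=x.device, dtype=x.dtype)
        dist_sq = (pos[:, None] - pos[None, :]) ** 2
        gauss = torch.exp(-dist_sq / (self.sigma ** 2))

        # Assumption: weight the scores elementwise, then normalize over keys.
        weights = F.softmax(scores * gauss, dim=-1)
        return torch.matmul(weights, v)


attn = GaussianWeightedSelfAttention(d_model=8)
out = attn(torch.randn(2, 5, 8))  # (batch=2, time=5, d_model=8)
```

The output has the same shape as the input, and `sigma` receives gradients like any other parameter, so the attention width is learned together with the projections.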