Hi everyone,
I’ve been trying to find an implementation of the stochastic “hard” attention described in the seminal work of Xu et al. (Show, Attend and Tell), but so far I have only come across a TensorFlow implementation (https://github.com/uzaymacar/attention-mechanisms). I find it surprising that so few open-source implementations exist for such a widely known attention mechanism. Please let me know if I have missed something. Thank you!