Hello,
I am trying to sample k elements from a categorical distribution in a differential way, and i notice that F.gumbel_softmax(logit, tau=1, hard=True) can return a one-hot tensor, but how can i sample t times using the gumbel sofmax, like topk function in pytorch.
Thanks!