Visualize attention map from vision transformer

Hi all. I want to visualize attention map from vision transformer and understand important parts of the image that transformer model attended. Do you know any resource for visualize attention map from Swin transformer or some transformer architecture that have an image as output not for classification task.
Thanks for your participate.