Is attention useful for classification tasks?

Are attention modules like SCSE useful for classification?
I’ve seen them used in the decoder blocks of segmentation models, but not in classification models. I have a classification problem (not in the image domain), and no matter what model I use, I can’t get past ~65% accuracy. I was wondering whether self-attention would help, but I noticed that it doesn’t seem to be used anywhere in classification tasks. Any idea why?