(Up to date) Transformer + Classifier example?

By the way, this is an extension of someone’s suggestion here to try Transformers for a classification problem: Help improving sports prediction model - #5 by J_Johnson