Asking for advice on how to process very long input text(3000-5000 words per item)

Anyone know any solutions to process a large input src? I’m working on text summarization with transformer, the src input is 3000-5000 length and the output is around 1000. Model always OOM.

Thanks for your advice!