Signal data and transformers

Any suggestions or papers to implement transformer for signal data of length 5000 and 12 channels.
Thanks in advance.