Does anyone have suggestions or papers on implementing a transformer for multichannel signal data, where each input has 12 channels and a length of 5000? Thanks in advance.
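
To make the setup concrete, here is a minimal sketch of one common starting point, assuming PyTorch and a classification task (the question specifies neither). Self-attention cost grows quadratically with sequence length, so feeding all 5000 time steps as tokens is expensive. The sketch instead splits the signal into non-overlapping patches and embeds each patch as one token. The class name `PatchTransformer`, the patch length of 50, the model sizes, and `n_classes=5` are all hypothetical placeholders, not recommendations from any particular paper.

```python
import torch
import torch.nn as nn


class PatchTransformer(nn.Module):
    """Hypothetical patch-based transformer encoder for (batch, 12, 5000) signals."""

    def __init__(self, in_channels=12, seq_len=5000, patch_len=50,
                 d_model=128, n_heads=4, n_layers=4, n_classes=5):
        super().__init__()
        assert seq_len % patch_len == 0, "seq_len must be divisible by patch_len"
        n_patches = seq_len // patch_len  # 5000 / 50 = 100 tokens
        # A strided convolution embeds each non-overlapping patch as one token.
        self.patch_embed = nn.Conv1d(in_channels, d_model,
                                     kernel_size=patch_len, stride=patch_len)
        # Learned positional embedding, one vector per patch position.
        self.pos_embed = nn.Parameter(torch.zeros(1, n_patches, d_model))
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=n_heads,
                                           dim_feedforward=4 * d_model,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        # Assumed task: classification into n_classes (placeholder).
        self.head = nn.Linear(d_model, n_classes)

    def forward(self, x):
        # x: (batch, channels, time)
        tokens = self.patch_embed(x).transpose(1, 2)  # (batch, n_patches, d_model)
        tokens = tokens + self.pos_embed
        encoded = self.encoder(tokens)
        return self.head(encoded.mean(dim=1))  # mean-pool over patches


model = PatchTransformer()
x = torch.randn(2, 12, 5000)  # dummy batch of two signals
logits = model(x)
print(logits.shape)
```

With a patch length of 50 the encoder sees 100 tokens per signal instead of 5000. The patch length trades temporal resolution against attention cost, so it is probably the first thing to tune for a given dataset.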