Empty batch recieved

finetuning gptj on 20newsgroup with dataparallel.