Using DataLoader for .txt files?

Lady_Hangaku · March 28, 2018, 11:33am

Hello!

I’m trying to get into using PyTorch for text classification. I’d like to use DataLoader as a simpler way of handling my training and test data.
Right now my data is laid out like this:

Training-Data
   -> NO (Label of all the text files in this folder)
      -> a.txt (Whole text)
      -> b.txt
          ...
   -> NW
   -> SO
   -> SW

Test-Data
   (Same as Training-Data)
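
For reference, a layout like this could be read without converting anything to .csv. The following is only a minimal sketch: the class name `TextFolderDataset` and the UTF-8 encoding are assumptions, not something from my setup. It is written in plain Python and only implements `__len__` and `__getitem__`, which is the interface a map-style dataset needs for `torch.utils.data.DataLoader`.

```python
import os


class TextFolderDataset:
    """Hypothetical map-style dataset: one sample per .txt file,
    labelled by the name of the folder that contains it."""

    def __init__(self, root):
        # Sub-folder names (e.g. NO, NW, SO, SW) become the class labels.
        self.classes = sorted(
            d for d in os.listdir(root)
            if os.path.isdir(os.path.join(root, d))
        )
        self.class_to_idx = {name: i for i, name in enumerate(self.classes)}

        # Collect (file path, label index) pairs; files are read lazily later.
        self.samples = []
        for name in self.classes:
            folder = os.path.join(root, name)
            for fname in sorted(os.listdir(folder)):
                if fname.endswith(".txt"):
                    path = os.path.join(folder, fname)
                    self.samples.append((path, self.class_to_idx[name]))

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, idx):
        path, label = self.samples[idx]
        # Assumption: the files are UTF-8 encoded.
        with open(path, encoding="utf-8") as f:
            return f.read(), label
```

Each item is a `(whole_text, label_index)` pair, so the text still has to be tokenized and turned into numbers before a model can use it.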

I want to classify each whole text with its correct label. The texts all have different lengths, so I also need to pad the shorter ones.
Does anyone know how to do this? Or should I just convert everything to .csv files? I’m clueless…
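
To make the padding part concrete, here is a rough sketch of what I mean, assuming each text has already been turned into a list of integer token ids (that step is not shown). The function name `pad_collate` and the padding id `0` are assumptions. It is kept in plain Python; the lists would still need to be converted to tensors, and PyTorch also ships `torch.nn.utils.rnn.pad_sequence` for the same job.

```python
def pad_collate(batch, pad_id=0):
    """Pad every sequence in a batch to the length of the longest one.

    batch: list of (token_ids, label) pairs, token_ids being lists of ints.
    Returns (padded_ids, original_lengths, labels) as plain Python lists.
    """
    # The longest text in this batch decides the padded length.
    max_len = max(len(ids) for ids, _ in batch)

    # Append pad_id to the end of every shorter sequence.
    padded = [list(ids) + [pad_id] * (max_len - len(ids)) for ids, _ in batch]

    # Keep the true lengths so padding can be ignored or masked later.
    lengths = [len(ids) for ids, _ in batch]
    labels = [label for _, label in batch]
    return padded, lengths, labels
```

A function like this could be handed to the loader through its `collate_fn` argument, e.g. `DataLoader(dataset, batch_size=32, collate_fn=pad_collate)`, so that padding happens per batch instead of for the whole corpus at once.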

Thank you for your help!