I am trying to use the SNLI Classifier (https://github.com/pytorch/examples/tree/master/snli) on a different QA dataset, TrecQA, as a baseline model. I am having trouble importing the dataset.
The task is more or less the same (premise -> question, hypothesis -> answer) and there are 2 labels instead of 3.
This dataset has 4 files for each of train/dev/test set:
ids.txt, questions.txt, answers.txt, labels.txt.
How do I import the dataset in train, dev, set splits and build the vocabulary like they do in the SNLI example: https://github.com/pytorch/examples/blob/master/snli/train.py
Some help will be much appreciated. Thank you!