Train,test validation

SHASHANK_KUMAR_MISHR · October 19, 2019, 7:40am

I have a data frame in which two columns are there text_data and distractor. How to divide in train,test and validation in pytorch.

ptrblck · October 21, 2019, 11:40am

If you’ve already created a Dataset instance for your data frame, you could use torch.utils.data.random_split to create non-overlapping Datasets.
Alternatively, you could also split the data before creating the Datasets e.g. via sklearn.model_selection.train_test_split or just by manually indexing the data frame.