Seeding everything to get the same masked words


I am using a few different libraries (torch, hf transformers, lightning) so please feel free to let me know if this is outside the main torch forum’s wheelhouse.

I have two datasets, for which I want to create two dataloaders that and want to mask the same tokens in each.

I tried L.seed_everything() and was unable to get what I want.

Does anyone have any suggestions? My code is here: Google Colab