I'm currently working on a classification transformer using the IMDB sentiment dataset.
I'm pretty unsure whether my model is actually working, because when I remove the transformer blocks the loss is only about 8% higher than with them.
I'm using the pretrained 100-dimensional GloVe word embedding vectors; that's why my hidden dimension is also only 100.
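Roughly like this, in case it matters (a simplified sketch of the idea, not my exact code; the file name, vocab dict, and unknown-token handling are placeholders):

```python
# Sketch of loading pretrained GloVe vectors into an embedding layer.
# The file path and vocab format are assumptions, not the actual setup.
import numpy as np
import torch
import torch.nn as nn

EMBED_DIM = 100  # matches glove.6B.100d

def build_embedding(vocab, glove_path="glove.6B.100d.txt"):
    """Builds an nn.Embedding initialised with GloVe vectors.

    vocab: dict mapping token -> row index.
    Tokens missing from GloVe keep a small random vector.
    """
    matrix = np.random.normal(scale=0.1, size=(len(vocab), EMBED_DIM))
    with open(glove_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            token, vec = parts[0], parts[1:]
            if token in vocab:
                matrix[vocab[token]] = np.asarray(vec, dtype=np.float32)
    weights = torch.tensor(matrix, dtype=torch.float32)
    # freeze=False lets the vectors be fine-tuned; set True to keep them fixed
    return nn.Embedding.from_pretrained(weights, freeze=False)
```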
From what I've seen, your MultiHeadAttention and EncoderLayer implementations should work. Could you show a continuous representation of both losses (i.e. the loss curves over training), please?
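Something like this is what I mean (a minimal sketch; the loss values here are dummy numbers purely to make the example runnable, not real results):

```python
# Sketch: record the mean training loss each epoch for both variants,
# then plot the two curves together for comparison.
import matplotlib.pyplot as plt

losses_with_blocks = [0.62, 0.48, 0.41, 0.37, 0.35]      # placeholder values
losses_without_blocks = [0.65, 0.55, 0.50, 0.47, 0.46]   # placeholder values

plt.plot(losses_with_blocks, label="with transformer blocks")
plt.plot(losses_without_blocks, label="without transformer blocks")
plt.xlabel("epoch")
plt.ylabel("mean train loss")
plt.legend()
plt.title("Loss curves for both model variants")
plt.show()
```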
For that I need to train the model again. I can send you the losses later.
But I forgot to mention that the model with the transformer blocks seems to overfit, because the validation accuracy was around 80% in each of the 5 epochs. I think that's because I also used the fairly small dimension of the pretrained GloVe word vectors (100) as hid_dim.
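Maybe I should add a simple early-stopping check on the validation accuracy; a rough sketch of what I have in mind (the patience value and the accuracy numbers are just illustrative):

```python
# Sketch of a simple early-stopping check on validation accuracy.
def should_stop(val_accs, patience=2):
    """Stop if validation accuracy hasn't improved for `patience` epochs."""
    if len(val_accs) <= patience:
        return False
    best = max(val_accs[:-patience])
    return all(acc <= best for acc in val_accs[-patience:])

# Example: validation accuracy plateauing around 80% like I described
print(should_stop([0.78, 0.80, 0.80, 0.79, 0.80], patience=2))  # True
```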
I've just seen that you only stack two encoder layers, which might be too few. Regarding your hidden dimension: shouldn't it be hid_dim = embed_dim, since using different dimensions would complicate things? You could also try the 300-dimensional GloVe vectors, for instance.
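If you did want embed_dim and hid_dim to differ, though, the usual trick is a single linear projection after the embedding. Here is a rough sketch of the idea; I'm using nn.TransformerEncoderLayer as a stand-in for your EncoderLayer, positional encoding is omitted, and all hyperparameter values are illustrative assumptions, not recommendations:

```python
# Sketch: a linear projection decouples the 100-d GloVe embeddings from hid_dim,
# so hid_dim does not have to equal embed_dim.
import torch
import torch.nn as nn

class SketchClassifier(nn.Module):
    def __init__(self, vocab_size, embed_dim=100, hid_dim=256,
                 n_layers=4, n_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.proj = nn.Linear(embed_dim, hid_dim)   # bridges the two dims
        # stand-in for your EncoderLayer (positional encoding omitted for brevity)
        layer = nn.TransformerEncoderLayer(d_model=hid_dim, nhead=8,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.fc = nn.Linear(hid_dim, n_classes)

    def forward(self, tokens):                      # tokens: [batch, seq_len]
        x = self.proj(self.embedding(tokens))       # [batch, seq_len, hid_dim]
        x = self.encoder(x)
        return self.fc(x.mean(dim=1))               # mean-pool over the sequence

# dummy usage with made-up vocab size and batch shape
model = SketchClassifier(vocab_size=25000)
logits = model(torch.randint(0, 25000, (8, 128)))   # [8, 2]
```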