Add additional data such as word features to word embeddings

Hi everyone, I’m using pretrained word embeddings for an NMT task. My idea is to use word features such as named entities to improve NMT quality. Is it possible to concatenate a one-hot vector (for the named-entity tag) to my pretrained word embedding and use that for my NMT model?
For example:
Given sentence: My name is James .
NE annotated sentence: My|O name|O is|O James|PERSON .|O
With James|PERSON, I will concatenate the one-hot vector of the PERSON tag (e.g. [1,0,0,0]) to the word embedding vector of “James” (e.g. [4,5,6]), so the result is [4,5,6,1,0,0,0].
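
In code, the single-vector version of what I mean would be something like this (variable names are just for illustration):

import torch

james = torch.tensor([4., 5., 6.])       # word embedding of "James"
person = torch.tensor([1., 0., 0., 0.])  # one-hot vector for the PERSON tag
combined = torch.cat([james, person])    # tensor([4., 5., 6., 1., 0., 0., 0.])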

Sure, you can concatenate vectors. For example, say you have

  • embed with embed.shape = (batch_size, seq_len, embed_dim)
  • custom with custom.shape = (batch_size, seq_len, custom_dim)

You can do:

X = torch.cat([embed, custom], 2)

Then X.shape = (batch_size, seq_len, embed_dim+custom_dim)
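
If it helps, here is a self-contained sketch of the whole pipeline with one-hot NE features. All sizes are placeholders, and F.one_hot stands in for however you actually build your tag vectors:

import torch
import torch.nn as nn
import torch.nn.functional as F

batch_size, seq_len = 2, 5          # placeholder sizes
vocab_size, embed_dim = 1000, 300   # placeholder sizes
num_tags = 4                        # e.g. O, PERSON, LOCATION, ORG

embedding = nn.Embedding(vocab_size, embed_dim)  # load your pretrained weights here

word_ids = torch.randint(0, vocab_size, (batch_size, seq_len))
tag_ids = torch.randint(0, num_tags, (batch_size, seq_len))

embed = embedding(word_ids)                    # (batch_size, seq_len, embed_dim)
custom = F.one_hot(tag_ids, num_tags).float()  # (batch_size, seq_len, num_tags)

X = torch.cat([embed, custom], 2)
print(X.shape)  # torch.Size([2, 5, 304])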


Yes, you can do that. Instead of a one-hot vector, you can also add an extra learned embedding of a suitable dimension for the tags:

word_embedding = nn.Embedding(vocab_size, dim)
tag_embedding = nn.Embedding(num_tags, extended_dim)

embed = word_embedding(word_ids)    # (batch_size, seq_len, dim)
extended = tag_embedding(tag_ids)   # (batch_size, seq_len, extended_dim)

final = torch.cat([embed, extended], 2)

final is then the vector that carries the named-entity embeddings as well. 🙂
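
One difference from the one-hot approach above: a learned tag embedding is trained jointly with the rest of the model and can pick up similarities between tags, whereas a one-hot vector stays fixed. With only a handful of tags, a small extended_dim (say 4 to 8; that number is just an illustration) is typically enough.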


Hello, does this guarantee that your custom embeddings will train?

If you use standard PyTorch blocks (contained in the nn module), they will. If you create tensors yourself that should be updated by autograd, you need to wrap them in nn.Parameter. In any case, you can check whether a parameter is actually being updated by inspecting your_custom_parameter.grad after calling loss.backward(): if it is non-zero, the optimizer will update it.
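
A minimal sketch of that check (the parameter and shapes below are made up for illustration):

import torch
import torch.nn as nn

# A custom tensor that should be trained must be wrapped in nn.Parameter
your_custom_parameter = nn.Parameter(torch.randn(4, 3))

x = torch.randn(2, 4)
loss = (x @ your_custom_parameter).sum()  # toy loss involving the parameter
loss.backward()

print(your_custom_parameter.grad)  # non-zero -> the optimizer will update it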
