Softmax vs. Negative Sampling

Dear Community,

Negative Sampling was introduced as a substitute for the expensive softmax function in the word2vec model (https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf).

One question, though: if we had unlimited processing power, would we still use Negative Sampling?
Is it actually better to update only a few weights per step (NS) rather than all of the output weights (softmax)?
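To make the contrast concrete, here is a minimal NumPy sketch of a single gradient step (the vocab size, dimensions, and variable names are placeholders I made up, not anything from the paper): with the full softmax every row of the output matrix receives a gradient, while with negative sampling only the positive word and k sampled negatives do.

```python
# Sketch: dense softmax gradient vs. sparse negative-sampling gradient.
import numpy as np

V, d, k = 10_000, 100, 5              # vocab size, embedding dim, negatives
rng = np.random.default_rng(0)
W_out = rng.normal(0, 0.01, (V, d))   # output embeddings, one row per word
h = rng.normal(0, 0.01, d)            # hidden/context vector for one example
target = 42                           # index of the true context word

# --- Full softmax: gradient is dense over all V rows of W_out ---
logits = W_out @ h
p = np.exp(logits - logits.max()); p /= p.sum()
p[target] -= 1.0                      # dL/dlogits = softmax - one_hot
grad_full = np.outer(p, h)            # shape (V, d): every row gets updated

# --- Negative sampling: only 1 positive + k negative rows are touched ---
# (word2vec actually samples negatives from a unigram^(3/4) distribution;
#  uniform sampling here just keeps the sketch short)
negatives = rng.choice(V, size=k)
rows = np.concatenate(([target], negatives))
labels = np.array([1.0] + [0.0] * k)
scores = W_out[rows] @ h
sig = 1.0 / (1.0 + np.exp(-scores))
grad_sparse = np.outer(sig - labels, h)   # shape (k+1, d): k+1 rows updated

print(f"softmax touches {V} rows; negative sampling touches {k + 1}")
```

So the question is whether that dense update buys you anything once compute is free, or whether the sparse one is just as good (or even acts as a useful regularizer).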

Would love to hear your thoughts.

Yours,
Luca