Hi everyone. I work in ad-tech, and we’ve started investigating user embeddings using a Sequence-of-Events (SoE) approach - where embeddings are built not on aggregated features, but directly from raw user events.
We’ve already found a couple of promising papers, some of them are even with an open source PyTorch implementation (e.g. CoLES). But it’s still hard for us to determine whether this approach will scale well to our use case (we handle hundreds of millions of users daily).
With this message, I would like to kindly ask anyone familiar with this topic to share suggestions - links to papers, web pages, approaches, relevant topics, GitHub repositories, anything. We want to build this the right way.
Thanks in advance.