Industry prefers fast models for real-time inference. Academics favor larger, more complex models.
Industry paper architectures are highly bespoke to their software applications. Academics are limited to MovieLens, Amazon Reviews, or very-low-sample survey data.
Transformers (BERT4Rec) still dominate session-based topics, but their results do not carry over across different datasets.
No one knows how to evaluate embeddings. Dimensionality reduction does not work.

The ratio of Applied Scientists/Data Scientists to Software Engineers ranges from 1:1 at the lowest to around 8:1 for most companies.
RecSys/Search & Discovery/Data Science teams are usually built on top of, and supported by, data engineering and platform engineering teams.
Even split between data scientists embedded in product teams led by a PM and RecSys/Search & Discovery/Data Science teams led by a tech lead.

Big skew in infrastructure maturity between smaller and bigger companies. Few build in-house tools. Most buy tools and stitch them together into an in-house platform.
Low adoption of NVIDIA's in-house frameworks.
It is still common to run batch jobs over Spark/Ray on an hourly/daily/weekly basis for candidate generation and persist the results in Redis.
Lots of interest in streaming/real-time recommendation, but the challenge is support in applications and data engineering. AdTech companies don't face such issues.
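
As a rough illustration of the batch pattern noted above (precompute candidates on a schedule, persist them in a key-value store, serve by lookup), here is a minimal sketch. It is not any particular company's pipeline: the co-occurrence scoring, the `cands:` key prefix, the function names, and the toy sessions are all assumptions made for the example, and a plain `dict` stands in for Redis so the snippet runs without a server.

```python
import json
from collections import Counter, defaultdict
from itertools import permutations


def generate_candidates(sessions, top_k=2):
    """Batch step: count item co-occurrences within sessions, keep top_k per item."""
    co_counts = defaultdict(Counter)
    for items in sessions:
        # Every ordered pair of distinct items seen in the same session.
        for a, b in permutations(set(items), 2):
            co_counts[a][b] += 1
    return {
        item: [
            other
            # Highest count first; ties broken alphabetically for determinism.
            for other, _ in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
        ]
        for item, counts in co_counts.items()
    }


def persist(store, candidates, prefix="cands:"):
    """Write each candidate list under its own key, serialized as JSON."""
    for item, cands in candidates.items():
        store[prefix + item] = json.dumps(cands)


def lookup(store, item, prefix="cands:"):
    """Serving step: one key lookup, no model inference at request time."""
    raw = store.get(prefix + item)
    return json.loads(raw) if raw is not None else []


# Hypothetical toy data; a real job would read sessions from Spark/Ray output.
sessions = [["a", "b", "c"], ["a", "b"], ["b", "c"], ["a", "d"]]
store = {}  # stand-in for Redis (assumption for this sketch)
persist(store, generate_candidates(sessions))
print(lookup(store, "a"))
```

Serving only reads precomputed lists, which is why this pattern stays popular despite the interest in streaming: freshness is bounded by the batch schedule, but request-time latency is a single key read.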

Tutorials:

Video: [:movie_camera:](https://vimeo.com/749421236) Code: :floppy_disk: Event: DLI Event – NVIDIA. Event code: DLI_XLAB_SR22