Neil Shah
Director of Research, Senior Principal Scientist at Snap.
Bellevue, WA
neil at nshah dot net
nshah at snap dot com
I currently lead a team of scientists, engineers, interns, and collaborators on fundamental and applied research around modeling users, content, and their interactions at scale. I am broadly interested in advancing the state-of-the-art across machine learning technologies underpinning this, including graph and sequential representation learning, generative recommendation, and large language and foundation models for content and user understanding. At Snap, my team’s work has led to multiple step-function research platform capabilities and 85+ launches with topline business impact across our Growth, Content, Ads, Lenses, and Safety ML surfaces.
Prior to Snap, I got my PhD in the Computer Science Department at Carnegie Mellon University, where I worked on modeling and discovery of various abuse vectors in large online platforms. I was fortunate to have been advised by Christos Faloutsos. Earlier, I received my B.S. in Computer Science from the Department of Computer Science at North Carolina State University. There, I worked with Nagiza Samatova on reduction, indexing, and storage systems for large-scale scientific data.
news
| Apr 28, 2026 | Three papers accepted at ACL 2026 in San Diego on training-free LLM embeddings, sparse attention, and collaborative memory for agentic recommendation. |
|---|---|
| Apr 27, 2026 | Two papers accepted at SIGIR 2026 in Melbourne! New work on multimodal generative retrieval with vision-language semantic IDs, and an industry paper on deploying semantic IDs for recommendation at Snapchat. |
| Feb 28, 2026 | Sharing a new preprint on the use of plain transformers as scalable and powerful link predictors on graphs. |
| Dec 28, 2025 | Sharing a new preprint on hierarchical token prepending, a new training-free method for getting strong LLM embeddings for retrieval. |
| Nov 28, 2025 | Sharing a new preprint on model-scaling behavior in generative recommendation methods, which shows scaling limitations in existing semantic ID-based methods. |
| Oct 28, 2025 | Excited to share two new works at CIKM 2025 on generative recommendation, covering the newest open-source reproducibility tooling (GRID) and meta-item embeddings for cold-start learning. |
| Oct 27, 2025 | Excited to share our new work at LoG 2025 on GNN distillation to MLPs, which shows that stronger models aren’t always stronger teachers. |
| May 28, 2025 | We have several works at KDD 2025 on graph neural networks (GiGL, our library to scale GNNs at Snap, and a corresponding tutorial), and recommendation systems (improved self-attention for cross-domain recommendation, and optimization in collaborative filtering)! |
selected publications
A curated cross-section of my work across graph machine learning, recommendation systems, and trust & safety. See publications for the full list, or Google Scholar for citations.
- SIGIR: Semantic IDs for Recommender Systems at Snapchat: Use Cases, Technical Challenges, and Design Choices. In ACM SIGIR Conference on Research and Development in Information Retrieval, 2026
- WSDM: Sequential Data Augmentation for Generative Recommendation. In ACM International Conference on Web Search and Data Mining, 2026
- CIKM: Generative Recommendation with Semantic IDs: A Practitioner’s Handbook. In ACM International Conference on Information and Knowledge Management, 2025
- KDD: GiGL: Large-Scale Graph Neural Networks at Snapchat. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2025
- SIRIP: Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat. In ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
- SIRIP: Embedding-based Retrieval in Friend Recommendation. In ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
- ICLR: MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization. In International Conference on Learning Representations, 2023
- ICLR: Graph-less Neural Networks: Teaching Old MLPs new Tricks via Distillation. In International Conference on Learning Representations, 2022
- WWW: Graph Neural Networks for Friend Ranking in Large-scale Social Platforms. In The Web Conference, 2021
- DSAA: SliceNDice: Mining Suspicious Multi-attribute Entity Groups with Multi-view Graphs. In IEEE International Conference on Data Science and Advanced Analytics, 2019
- KDD: Modeling Dwell Time Engagement on Visual Multimedia. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019
- WWW: FLOCK: Combating Astroturfing on Livestreaming Platforms. In ACM World Wide Web Conference, 2017
- KDD: FRAUDAR: Bounding Graph Fraud in the Face of Camouflage. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2016