Neil Shah

Director of Research, Senior Principal Scientist at Snap.

prof_pic.jpg

Bellevue, WA

neil at nshah dot net

nshah at snap dot com

I currently lead a team of scientists, engineers, interns, and collaborators on fundamental and applied research around modeling users, content, and their interactions at scale. I am broadly interested in advancing the state-of-the-art across machine learning technologies underpinning this, including graph and sequential representation learning, generative recommendation, and large language and foundation models for content and user understanding. At Snap, my team’s work has led to multiple step-function research platform capabilities and 85+ launches with topline business impact across our Growth, Content, Ads, Lenses, and Safety ML surfaces.

Prior to Snap, I got my PhD in the Computer Science Department at Carnegie Mellon University, where I worked on modeling and discovery of various abuse vectors in large online platforms. I was fortunate to have been advised by Christos Faloutsos. Earier, I received my B.S. in Computer Science from the Department of Computer Science at North Carolina State University. There, I worked with Nagiza Samatova on reduction, indexing, and storage systems for large-scale scientific data.

news

Apr 28, 2026 Three papers accepted at ACL 2026 in San Diego on training-free LLM embeddings, sparse attention, and collaborative memory for agentic recommendation.
Apr 27, 2026 Two papers accepted at SIGIR 2026 in Melbourne! New work on multimodal generative retrieval with vision-language semantic IDs, and an industry paper on deploying semantic IDs for recommendation at Snapchat.
Feb 28, 2026 Sharing a new preprint on the use of plain transformers as scalable and powerful link predictors on graphs.
Dec 28, 2025 Sharing a new preprint on hierarchical token prepending, a new training-free method for getting strong LLM embeddings for retrieval.
Nov 28, 2025 Sharing a new preprint on model-scaling behavior in generative recommendation methods, which shows scaling limitations in existing semantic ID-based methods.
Oct 28, 2025 Excited to share two new works at CIKM 2025 on generative recommendation, covering the newest open-source reproducibility tooling (GRID) and meta-item embeddings for cold-start learning.
Oct 27, 2025 Excited to share our new work at LoG 2025 on GNN distillation to MLPs, which shows that stronger models aren’t always stronger teachers.
May 28, 2025 We have several works at KDD 2025 on graph neural networks (GiGL, our library to scale GNNs at Snap, and a corresponding tutorial), and recommendation systems (improved self-attention for cross-domain recommendation, and optimization in collaborative filtering)!

selected publications

A curated cross-section of my work across graph machine learning, recommendation systems, and trust & safety. See publications for the full list, or Google Scholar for citations.

  1. ACL
    ding2026hierarchical.png
    Hierarchical Token Prepending: Enhancing Information Flow in Decoder-based LLM Embeddings
    Xueying Ding, Xingyue Huang, Clark Ju, Liam Collins, Yozen Liu, Leman Akoglu, Neil Shah, and Tong Zhao
    In Annual Meeting of the Association for Computational Linguistics, 2026
  2. SIGIR
    ju2026semanticidsf.png
    Semantic IDs for Recommender Systems at Snapchat: Use Cases, Technical Challenges, and Design Choices
    Clark Mingxuan Ju, Tong Zhao, Leonardo Neves, Liam Collins, Bhuvesh Kumar, Jiwen Ren, Lili Zhang, Wenfeng Zhuo, and 10 more authors
    In ACM SIGIR Conference on Research and Development in Information Retrieval, 2026
  3. WSDM
    lee2026sequentialda.png
    Sequential Data Augmentation for Generative Recommendation
    Geon Lee, Bhuvesh Kumar, Clark Ju, Tong Zhao, Kijung Shin, Neil Shah, and Liam Collins
    In ACM International Conference on Web Search and Data Mining, 2026
  4. preprint
    liu2025understandin.png
    Understanding Generative Recommendation with Semantic IDs from a Model Scaling View
    Jingzhe Liu, Liam Collins, Jiliang Tang, Tong Zhao, Neil Shah, and Clark Ju
    arXiv preprint, 2025
  5. CIKM
    ju2025generativere.png
    Generative Recommendation with Semantic IDs: A Practitioner’s Handbook
    Clark Ju, Liam Collins, Leonardo Neves, Bhuvesh Kumar, Louis Wang, Tong Zhao, and Neil Shah
    In ACM International Conference on Information and Knowledge Management, 2025
  6. KDD
    zhao2025gigllargesca.png
    GiGL: Large-Scale Graph Neural Networks at Snapchat
    Tong Zhao, Yozen Liu, Matthew Kolodner, Kyle Montemayor, Elham Ghazizadeh, Ankit Batra, Zihao Fan, Xiaobin Gao, and 7 more authors
    In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2025
  7. SIRIP
    ju2025learninguniv.png
    Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat
    Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Ching Dou, Yang Zhou, and 6 more authors
    In ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
  8. preprint
    han2025retrievalaug.png
    Retrieval-Augmented Generation with Graphs (GraphRAG)
    Haoyu Han, Yu Wang, Harry Shomer, Kai Guo, Jiayuan Ding, Yongjia Lei, Mahantesh Halappanavar, Ryan A. Rossi, and 10 more authors
    arXiv preprint, 2025
  9. ICML
    llaga.png
    LLaGA: Large Language and Graph Assistant
    Runjin Chen, Tong Zhao, Ajay Kumar Jaiswal, Neil Shah, and Zhangyang Wang
    In International Conference on Machine Learning, 2024
  10. NeurIPS
    heart-gnn.png
    Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New Benchmarking
    Juanhui Li, Harry Shomer, Haitao Mao, Shenglai Zeng, Yao Ma, Neil Shah, Jiliang Tang, and Dawei Yin
    In Conference on Neural Information Processing Systems, 2023
  11. SIRIP
    friend-rec-ebr-arch.png
    Embedding-based Retrieval in Friend Recommendation
    Jiahui Shi, Vivek Chaurasiya, Yozen Liu, Shubham Vij, Yan Wu, Satya Kanduri, Neil Shah, Peicheng Yu, and 4 more authors
    In ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
  12. ICLR
    mlpinit.png
    MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization
    Xiaotian Han, Tong Zhao, Yozen Liu, Xia Hu, and Neil Shah
    In International Conference on Learning Representations, 2023
  13. ICLR
    graphless-nn.png
    Graph-less Neural Networks: Teaching Old MLPs new Tricks via Distillation
    Shichang Zhang, Yozen Liu, Yizhou Sun, and Neil Shah
    In International Conference on Learning Representations, 2022
  14. ICLR
    gcond.png
    Graph Condensation for Graph Neural Networks
    Wei Jin, Lingxiao Zhao, Shichang Zhang, Yozen Liu, Jiliang Tang, and Neil Shah
    In International Conference on Learning Representations, 2022
  15. ICLR
    from-stars-to-subgraphs-gnn.png
    From Stars to Subgraphs: Uplifting any GNN with Local Structure Awareness
    Lingxiao Zhao, Wei Jin, Leman Akoglu, and Neil Shah
    In International Conference on Learning Representations, 2022
  16. ICLR
    is-homophily-a-necessity.png
    Is Homophily a Necessity for Graph Neural Networks?
    Yao Ma, Xiaorui Liu, Neil Shah, and Jiliang Tang
    In International Conference on Learning Representations, 2022
  17. CIKM
    graph-signal-denoising.png
    A Unified View on Graph Neural Networks as Graph Signal Denoising
    Yao Ma, Xiaorui Liu, Tong Zhao, Yozen Liu, Jiliang Tang, and Neil Shah
    In ACM International Conference on Information and Knowledge Management, 2021
  18. WWW
    grafrank.png
    Graph Neural Networks for Friend Ranking in Large-scale Social Platforms
    Aravind Sankar, Yozen Liu, Jun Yu, and Neil Shah
    In The Web Conference, 2021
  19. AAAI
    gaug.png
    Data Augmentation for Graph Neural Networks
    Tong Zhao, Yozen Liu, Leonardo Neves, Oliver Woodford, Meng Jiang, and Neil Shah
    In AAAI Conference on Artificial Intelligence, 2021
  20. DSAA
    slicendice.png
    SliceNDice: Mining Suspicious Multi-attribute Entity Groups with Multi-view Graphs
    Hamed Nilforoshan and Neil Shah
    In IEEE International Conference on Data Science and Advanced Analytics, 2019
  21. KDD
    modeling-dwell-time.png
    Modeling Dwell Time Engagement on Visual Multimedia
    Hemank Lamba and Neil Shah
    In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019
  22. false-information-survey.png
    False Information on Web and Social Media: A Survey
    Srijan Kumar and Neil Shah
    In Social Media Analytics: Advances and Applications, CRC Press 2018, 2018
  23. WWW
    broadcast-divergence.png
    FLOCK: Combating Astroturfing on Livestreaming Platforms
    Neil Shah
    In ACM World Wide Web Conference, 2017
  24. KDD
    fraud-in-camouflage.png
    FRAUDAR: Bounding Graph Fraud in the Face of Camouflage
    Bryan Hooi, Hyun Ah Song, Alex Beutel, Neil Shah, Kijung Shin, and Christos Faloutsos
    In ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2016