Generative Recommendation: A Survey of Models, Systems, and Industrial Advances

Abstract

The rapid advancement of large language models (LLMs) has sparked a paradigm shift in recommender system design, transforming traditional discriminative ranking architectures into unified generative frameworks. Conventional multi-stage cascading architectures, comprising retrieval, ranking, and auction, have achieved remarkable industrial success but remain limited by semantic gaps, stage inconsistencies, and feature fragmentation. In contrast, emerging Generative Recommendation Systems (GRS) formulate recommendation as a sequence generation task, leveraging transformer-based architectures and tokenized item representations to unify retrieval, ranking, and reasoning within a single generative backbone.

This survey provides the first comprehensive synthesis of recent industrial progress in generative recommendation. We categorize over twenty state-of-the-art systems along four complementary dimensions: modeling paradigm (encoder-only, decoder-only, encoder–decoder), functional scope (retrieval, ranking, end-to-end frameworks), representation space (semantic ID–based, dense, and hybrid representations), and training and alignment objectives (no alignment, or alignment via reinforcement learning and preference optimization). We further summarize emerging research on scaling laws for generative recommenders and analyze their implications for efficiency, generalization, and model–data scaling behavior.

Our analysis reveals three converging trends: (1) the unification of retrieval and ranking under shared generative architectures; (2) the integration of preference-aligned, reward-driven learning objectives; and (3) the rapid adoption of multimodal and cross-domain foundation models. Finally, we identify open challenges, including latency–scalability trade-offs, robustness under distribution shifts, interpretability of generative reasoning, and multimodal integration, and we propose a forward-looking roadmap to guide future research and industrial deployment of next-generation generative recommender systems.
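
To make the sequence-generation formulation above concrete, the sketch below shows how items can be tokenized into short hierarchical semantic IDs and how a decoder can autoregressively generate the next item's code. It is a minimal illustration rather than any surveyed system's implementation: the codebook sizes, the deterministic toy tokenizer, and the `score_next_token` stub are assumptions standing in for a learned quantizer (e.g., an RQ-VAE) and a transformer decoder.

```python
# Minimal sketch of generative recommendation over semantic IDs.
# Assumptions for illustration: toy codebooks, a deterministic stand-in
# tokenizer, and a stub scorer in place of a transformer decoder.

import random

CODEBOOK_LEVELS = 3   # codewords per semantic ID
CODEBOOK_SIZE = 8     # vocabulary size per level (toy scale)


def tokenize_item(item_id: int) -> tuple[int, ...]:
    """Map an item to a hierarchical semantic ID (stand-in for a learned quantizer)."""
    rng = random.Random(item_id)  # deterministic toy code per item
    return tuple(rng.randrange(CODEBOOK_SIZE) for _ in range(CODEBOOK_LEVELS))


def flatten_history(history: list[int]) -> list[int]:
    """Concatenate the semantic-ID tokens of the user's interaction history."""
    return [tok for item in history for tok in tokenize_item(item)]


def score_next_token(context: list[int], level: int, candidate: int) -> float:
    """Stub for next-token logits; a real system scores these with a decoder."""
    return random.Random(hash((tuple(context), level, candidate))).random()


def generate_next_item(history: list[int]) -> tuple[int, ...]:
    """Greedy autoregressive decoding of the next item's semantic ID."""
    context = flatten_history(history)
    generated: list[int] = []
    for level in range(CODEBOOK_LEVELS):
        best = max(range(CODEBOOK_SIZE),
                   key=lambda c: score_next_token(context + generated, level, c))
        generated.append(best)
    return tuple(generated)


if __name__ == "__main__":
    user_history = [101, 205, 307]  # toy item IDs from past interactions
    print("history tokens:", flatten_history(user_history))
    print("generated semantic ID:", generate_next_item(user_history))
```

In deployed systems, decoding is typically constrained to valid item codes (for example, with a prefix trie over known semantic IDs) so that every generated sequence maps back to a real catalogue item.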
