Intent-Driven Semantic ID Generation for Grounded Conversational News Recommendation

Hongyang Su; Beibei Kong; Lei Cheng; Chengxiang Zhuo; Zang Li; Chenyun Yu

Intent-Driven Semantic ID Generation for Grounded Conversational News Recommendation

Hongyang Su, Beibei Kong, Lei Cheng, Chengxiang Zhuo, Zang Li, Chenyun Yu

Abstract

Conversational news recommendation requires grounding each suggestion in a rapidly evolving article corpus while addressing implicit user intents that lack explicit retrievable keywords. To characterize this scenario, we identify 6 intent types from production dialogues: five are implicit and pose fundamental challenges to standard RAG pipelines, forming a critical retrieve-first bottleneck. To address these issues, we introduce intent-driven Semantic ID (SID) generation under a Generate-then-Match paradigm. With two-stage training that consists of multi-task SID alignment and GPT-4 Chain-of-Thought distillation, an LLM maps diverse intents to hierarchical SID prefixes, which are then fuzzy-matched to the current news pool to guarantee fully grounded recommendations. Profile-Aware Dual-Signal Reasoning (PADR) further enables cold-start users to obtain valid recommendations using only profiles. On a mainstream Chinese news platform, our 7B model achieves 0% hallucination and 12.4% L1 match in the 152K open-generation SID space (4 × random baseline). It matches GPT-4+Hybrid RAG on L1 while surpassing it on finer-grained metrics (L2 2 ×, Category +1.2pp) at ∼ 100 × lower cost. Cold-start users, where existing baselines score 0%, achieve 18.0% L1 (6 × random), the highest among all user groups.

Anthology ID:: 2026.acl-industry.130
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yunyao Li, Georg Rehm, Mei Tu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1898–1916
Language:
URL:: https://aclanthology.org/2026.acl-industry.130/
DOI:
Bibkey:
Cite (ACL):: Hongyang Su, Beibei Kong, Lei Cheng, Chengxiang Zhuo, Zang Li, and Chenyun Yu. 2026. Intent-Driven Semantic ID Generation for Grounded Conversational News Recommendation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 1898–1916, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Intent-Driven Semantic ID Generation for Grounded Conversational News Recommendation (Su et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-industry.130.pdf

PDF Cite Search Fix data