AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration

Minjiang Huang; Jipeng Qiang; Yi Zhu; Chaowei Zhang; Xiangyu Zhao; Kui Yu

doi:10.18653/v1/2025.acl-demo.21

AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration

Minjiang Huang, Jipeng Qiang, Yi Zhu, Chaowei Zhang, Xiangyu Zhao, Kui Yu

Abstract

Audiobook interpretations are attracting increasing attention, as they provide accessible and in-depth analyses of books that offer readers practical insights and intellectual inspiration. However, their manual creation process remains time-consuming and resource-intensive. To address this challenge, we propose AI4Reading, a multi-agent collaboration system leveraging large language models (LLMs) and speech synthesis technology to generate podcast-like audiobook interpretations. The system is designed to meet three key objectives: accurate content preservation, enhanced comprehensibility, and a logical narrative structure. To achieve these goals, We develop a framework composed of 11 specialized agents—including topic analysts, case analysts, editors, a narrator, and proofreaders—that work in concert to explore themes, extract real-world cases, refine content organization, and synthesize natural spoken language. By comparing expert interpretations with our system’s output, the results show that although AI4Reading still has a gap in speech generation quality, the generated interpretative scripts are simpler and more accurate. The code of AI4Reading is publicly accessible , with a demonstration video available .

Anthology ID:: 2025.acl-demo.21
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Pushkar Mishra, Smaranda Muresan, Tao Yu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 211–220
Language:
URL:: https://aclanthology.org/2025.acl-demo.21/
DOI:: 10.18653/v1/2025.acl-demo.21
Bibkey:
Cite (ACL):: Minjiang Huang, Jipeng Qiang, Yi Zhu, Chaowei Zhang, Xiangyu Zhao, and Kui Yu. 2025. AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 211–220, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration (Huang et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-demo.21.pdf

PDF Cite Search Fix data