Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain

Yuyang Li; Pjm Kerbusch; Rhr Pruim; Tobias Käfer

doi:10.18653/v1/2025.naacl-industry.61

Abstract

Airports ranked in the top 20 by annual passenger volume are highly dynamic environments with thousands of flights daily, and they aim to increase their degree of automation. To contribute to this, we implemented a Conversational AI system that enables airport staff to communicate with flight information systems. The system not only answers standard airport queries but also resolves airport terminology, jargon, and abbreviations, and handles dynamic questions that involve reasoning. We built three Retrieval-Augmented Generation (RAG) methods: traditional RAG, SQL RAG, and Knowledge Graph-based RAG (Graph RAG). Experiments showed that traditional RAG achieved 84.84% accuracy using BM25 + GPT-4 but occasionally produced hallucinations, which pose a risk to airport safety. In contrast, SQL RAG and Graph RAG achieved 80.85% and 91.49% accuracy respectively, with significantly fewer hallucinations. Moreover, Graph RAG was especially effective for questions that involved reasoning. Based on these observations, we recommend SQL RAG and Graph RAG for airport environments, because they produce fewer hallucinations and can handle dynamic questions.
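The abstract names BM25 as the retriever in the traditional RAG setup. As a rough illustration of that retrieval step only, the sketch below scores a few passages with a standard Okapi BM25 formula and assembles a prompt for a language model. It is not the authors' implementation: the passages, flight numbers, function names, prompt wording, and the parameter values `k1=1.5` and `b=0.75` are all assumptions made for this example, and the call to GPT-4 is left out.

```python
import math
from collections import Counter

# Hypothetical passages; not taken from the paper's data.
DOCS = [
    "flight XY101 departs from gate D4 at 10:15",
    "flight XY202 arrives at gate B7 at 11:40",
    "baggage belt 12 is assigned to flight XY202",
]


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplifying assumption)."""
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one Okapi BM25 score per document for the query."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            # Smoothed IDF variant that stays positive for common terms.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def retrieve(query, docs, top_k=1):
    """Return the top_k documents ranked by BM25 score."""
    scores = bm25_scores(query, docs)
    ranked = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    return [docs[i] for i in ranked[:top_k]]


def build_prompt(query, passages):
    """Assemble a prompt that restricts the model to the retrieved context."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )
```

Calling `retrieve("which gate does flight XY101 depart from", DOCS)` ranks the first passage highest, and `build_prompt` wraps it into the text that would be sent to the generator. Because the generator can still go beyond the retrieved context, a setup like this does not by itself rule out the hallucinations the abstract reports.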

Anthology ID: 2025.naacl-industry.61
Volume: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track)
Month: April
Year: 2025
Address: Albuquerque, New Mexico
Editors: Weizhu Chen, Yi Yang, Mohammad Kachuee, Xue-Yong Fu
Venue: NAACL
Publisher: Association for Computational Linguistics
Pages: 794–808
URL: https://aclanthology.org/2025.naacl-industry.61/
DOI: 10.18653/v1/2025.naacl-industry.61
Cite (ACL): Yuyang Li, Pjm Kerbusch, Rhr Pruim, and Tobias Käfer. 2025. Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track), pages 794–808, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal): Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain (Li et al., NAACL 2025)
PDF: https://aclanthology.org/2025.naacl-industry.61.pdf
