LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Xiaonan Li; Changtai Zhu; Linyang Li; Zhangyue Yin; Tianxiang Sun; Xipeng Qiu (邱锡鹏)

doi:10.18653/v1/2024.naacl-long.305

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

Abstract

Verifiable generation aims to let the large language model (LLM) generate text with supporting documents, which enables the user to flexibly verify the answer and makes the LLM’s output more reliable. Retrieval plays a crucial role in verifiable generation. Specifically, the retrieved documents not only supplement knowledge to help the LLM generate correct answers, but also serve as supporting evidence for the user to verify the LLM’s output. However, the widely used retrievers become the bottleneck of the entire pipeline and limit the overall performance. Their capabilities are usually inferior to LLMs since they often have much fewer parameters than the large language model and have not been demonstrated to scale well to the size of LLMs. If the retriever does not correctly find the supporting documents, the LLM can not generate the correct and verifiable answer, which overshadows the LLM’s remarkable abilities. To address these limitations, we propose **LLatrieval** (**L**arge **La**nguage Model Verified Re**trieval**),where the LLM updates the retrieval result until it verifies that the retrieved documents can sufficiently support answering the question. Thus, the LLM can iteratively provide feedback to retrieval and facilitate the retrieval result to fully support verifiable generation. Experiments on ALCE show that LLatrieval significantly outperforms extensive baselines and achieves state-of-the-art results.

Anthology ID:: 2024.naacl-long.305
Volume:: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Kevin Duh, Helena Gomez, Steven Bethard
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5453–5471
Language:
URL:: https://aclanthology.org/2024.naacl-long.305/
DOI:: 10.18653/v1/2024.naacl-long.305
Bibkey:
Cite (ACL):: Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, and Xipeng Qiu. 2024. LLatrieval: LLM-Verified Retrieval for Verifiable Generation. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 5453–5471, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: LLatrieval: LLM-Verified Retrieval for Verifiable Generation (Li et al., NAACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.naacl-long.305.pdf
Video:: https://aclanthology.org/2024.naacl-long.305.mp4

PDF Cite Search Video Fix data