Auto-ACE: An Automatic Answer Correctness Evaluation Method for Conversational Question Answering

Zhixin Bai, Bingbing Wang, Bin Liang, Ruifeng Xu


Abstract
Conversational question answering aims to respond to questions based on relevant contexts and previous question-answer history. Existing studies typically use ground-truth answers in history, leading to the inconsistency between the training and inference phases. However, in real-world scenarios, progress in question answering can only be made using predicted answers. Since not all predicted answers are correct, indiscriminately using all predicted answers for training introduces noise into the model. To tackle these challenges, we propose an automatic answer correctness evaluation method named **Auto-ACE**. Specifically, we first construct an Att-BERT model which employs attention weight to the BERT model, so as to bridge the relation between the current question and the question-answer pair in history. Furthermore, to reduce the interference of the irrelevant information in the predicted answer, A-Scorer, an answer scorer is designed to evaluate the confidence of the predicted answer. We conduct a series of experiments on QuAC and CoQA datasets, and the results demonstrate the effectiveness and practicality of our proposed Auto-ACE framework.
Anthology ID:
2024.sighan-1.9
Volume:
Proceedings of the 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Kam-Fai Wong, Min Zhang, Ruifeng Xu, Jing Li, Zhongyu Wei, Lin Gui, Bin Liang, Runcong Zhao
Venues:
SIGHAN | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
80–87
Language:
URL:
https://aclanthology.org/2024.sighan-1.9
DOI:
Bibkey:
Cite (ACL):
Zhixin Bai, Bingbing Wang, Bin Liang, and Ruifeng Xu. 2024. Auto-ACE: An Automatic Answer Correctness Evaluation Method for Conversational Question Answering. In Proceedings of the 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10), pages 80–87, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Auto-ACE: An Automatic Answer Correctness Evaluation Method for Conversational Question Answering (Bai et al., SIGHAN-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.sighan-1.9.pdf