@inproceedings{gao-etal-2024-reprohum,
title = "{R}epro{H}um {\#}0087-01: A Reproduction Study of the Human Evaluation of the Coverage of Fact Checking Explanations",
author = "Gao, Mingqi and
Ruan, Jie and
Wan, Xiaojun",
editor = "Balloccu, Simone and
Belz, Anya and
Huidrom, Rudali and
Reiter, Ehud and
Sedoc, Joao and
Thomson, Craig",
booktitle = "Proceedings of the Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024",
month = may,
year = "2024",
address = "Torino, Italia",
publisher = "ELRA and ICCL",
url = "https://aclanthology.org/2024.humeval-1.25",
pages = "269--273",
abstract = "We present a reproduction study of the human evaluation of the coverage of fact checking explanations conducted by Atanasova et al. (2020), as a team in Track B of ReproNLP 2024. The setup of our reproduction study is almost the same as the original study, with some necessary modifications to the evaluation guideline and annotation interface. Our reproduction achieves a higher IAA of 0.20 compared to the original study{'}s 0.12, but discovers a mismatch between the IAA calculated by us with the raw annotation in the original study and the IAA reported in the original paper. Additionally, our reproduction results on the ranks of three types of explanations are drastically different from the original experiment, rendering that one important conclusion in the original paper cannot be confirmed at all. The case study illustrates that the annotators in the reproduction study may understand the quality criterion differently from the annotators in the original study.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="gao-etal-2024-reprohum">
<titleInfo>
<title>ReproHum #0087-01: A Reproduction Study of the Human Evaluation of the Coverage of Fact Checking Explanations</title>
</titleInfo>
<name type="personal">
<namePart type="given">Mingqi</namePart>
<namePart type="family">Gao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jie</namePart>
<namePart type="family">Ruan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xiaojun</namePart>
<namePart type="family">Wan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2024-05</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024</title>
</titleInfo>
<name type="personal">
<namePart type="given">Simone</namePart>
<namePart type="family">Balloccu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Anya</namePart>
<namePart type="family">Belz</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Rudali</namePart>
<namePart type="family">Huidrom</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ehud</namePart>
<namePart type="family">Reiter</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Joao</namePart>
<namePart type="family">Sedoc</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Craig</namePart>
<namePart type="family">Thomson</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>ELRA and ICCL</publisher>
<place>
<placeTerm type="text">Torino, Italia</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
    <abstract>We present a reproduction study of the human evaluation of the coverage of fact checking explanations conducted by Atanasova et al. (2020), as a team in Track B of ReproNLP 2024. The setup of our reproduction study is almost the same as that of the original study, with some necessary modifications to the evaluation guidelines and annotation interface. Our reproduction achieves a higher IAA of 0.20 than the original study’s 0.12, but we discover a mismatch between the IAA we calculate from the raw annotations of the original study and the IAA reported in the original paper. Additionally, our reproduction results on the rankings of the three types of explanations differ drastically from those of the original experiment, so one important conclusion of the original paper cannot be confirmed at all. The case study illustrates that the annotators in the reproduction study may understand the quality criterion differently from the annotators in the original study.</abstract>
<identifier type="citekey">gao-etal-2024-reprohum</identifier>
<location>
<url>https://aclanthology.org/2024.humeval-1.25</url>
</location>
<part>
<date>2024-05</date>
<extent unit="page">
<start>269</start>
<end>273</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T ReproHum #0087-01: A Reproduction Study of the Human Evaluation of the Coverage of Fact Checking Explanations
%A Gao, Mingqi
%A Ruan, Jie
%A Wan, Xiaojun
%Y Balloccu, Simone
%Y Belz, Anya
%Y Huidrom, Rudali
%Y Reiter, Ehud
%Y Sedoc, Joao
%Y Thomson, Craig
%S Proceedings of the Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024
%D 2024
%8 May
%I ELRA and ICCL
%C Torino, Italia
%F gao-etal-2024-reprohum
%X We present a reproduction study of the human evaluation of the coverage of fact checking explanations conducted by Atanasova et al. (2020), as a team in Track B of ReproNLP 2024. The setup of our reproduction study is almost the same as that of the original study, with some necessary modifications to the evaluation guidelines and annotation interface. Our reproduction achieves a higher IAA of 0.20 than the original study’s 0.12, but we discover a mismatch between the IAA we calculate from the raw annotations of the original study and the IAA reported in the original paper. Additionally, our reproduction results on the rankings of the three types of explanations differ drastically from those of the original experiment, so one important conclusion of the original paper cannot be confirmed at all. The case study illustrates that the annotators in the reproduction study may understand the quality criterion differently from the annotators in the original study.
%U https://aclanthology.org/2024.humeval-1.25
%P 269-273
Markdown (Informal)
[ReproHum #0087-01: A Reproduction Study of the Human Evaluation of the Coverage of Fact Checking Explanations](https://aclanthology.org/2024.humeval-1.25) (Gao et al., HumEval-WS 2024)