Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign

Karën Fort, Claire François, Olivier Galibert, Maha Ghribi


Abstract
This article details work aimed at evaluating the quality of the manual annotation of gene renaming couples in scientific abstracts, a task that generates sparse annotations. To evaluate these annotations, we compare the results obtained using the commonly advocated inter-annotator agreement coefficients such as S, κ and π, the lesser-known R, the weighted coefficients κω and α, as well as the F-measure and the SER. We analyze to what extent they are relevant for our data. We then study the bias introduced by prevalence by changing the way the contingency table is built. Finally, we propose an original way to synthesize the results by computing distances between categories, based on the produced annotations.
Anthology ID:
L12-1310
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1474–1480
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/549_Paper.pdf
Cite (ACL):
Karën Fort, Claire François, Olivier Galibert, and Maha Ghribi. 2012. Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1474–1480, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign (Fort et al., LREC 2012)