Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models Ruibo Wang author Jihong Li author 2019-07 text Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics Anna Korhonen editor David Traum editor Lluís Màrquez editor Association for Computational Linguistics Florence, Italy conference publication wang-li-2019-bayes 10.18653/v1/P19-1405 https://aclanthology.org/P19-1405/ 2019-07 4135 4145