ColumbiaNLP at SemEval-2019 Task 8: The Answer is Language Model Fine-tuning

Tuhin Chakrabarty, Smaranda Muresan


Abstract
Community Question Answering forums are very popular nowadays, as they represent an effective means for communities to share information around particular topics. But the information shared on these forums is often not authentic. This paper presents the ColumbiaNLP submission for SemEval-2019 Task 8: Fact-Checking in Community Question Answering Forums. We show how fine-tuning a language model on a large unannotated corpus of old threads from the Qatar Living forum helps us to classify question types (factual, opinion, socializing) and to judge the factuality of answers on the shared task labeled data from the same forum. Our system finished 4th and 2nd on Subtasks A (question type classification) and B (answer factuality prediction), respectively, based on the official metric of accuracy.
Anthology ID:
S19-2200
Volume:
Proceedings of the 13th International Workshop on Semantic Evaluation
Month:
June
Year:
2019
Address:
Minneapolis, Minnesota, USA
Editors:
Jonathan May, Ekaterina Shutova, Aurelie Herbelot, Xiaodan Zhu, Marianna Apidianaki, Saif M. Mohammad
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
1144–1148
URL:
https://aclanthology.org/S19-2200
DOI:
10.18653/v1/S19-2200
Cite (ACL):
Tuhin Chakrabarty and Smaranda Muresan. 2019. ColumbiaNLP at SemEval-2019 Task 8: The Answer is Language Model Fine-tuning. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 1144–1148, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
Cite (Informal):
ColumbiaNLP at SemEval-2019 Task 8: The Answer is Language Model Fine-tuning (Chakrabarty & Muresan, SemEval 2019)
PDF:
https://aclanthology.org/S19-2200.pdf