Lee at SemEval-2020 Task 5: ALBERT Model Based on the Maximum Ensemble Strategy and Different Data Sampling Methods for Detecting Counterfactual Statements

Junyi Li, Yuhang Wu, Bin Wang, Haiyan Ding


Abstract
This article describes the system submitted to SemEval 2020 Task 5: Modelling Causal Reasoning in Language: Detecting Counterfactuals. In this task, we only participate in the subtask A which is detecting counterfactual statements. In order to solve this sub-task, first of all, because of the problem of data balance, we use the undersampling and oversampling methods to process the data set. Second, we used the ALBERT model and the maximum ensemble method based on the ALBERT model. Our methods achieved a F1 score of 0.85 in subtask A.
Anthology ID:
2020.semeval-1.86
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Editors:
Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:
SemEval
SIG:
SIGLEX
Publisher:
International Committee for Computational Linguistics
Note:
Pages:
664–669
Language:
URL:
https://aclanthology.org/2020.semeval-1.86
DOI:
10.18653/v1/2020.semeval-1.86
Bibkey:
Cite (ACL):
Junyi Li, Yuhang Wu, Bin Wang, and Haiyan Ding. 2020. Lee at SemEval-2020 Task 5: ALBERT Model Based on the Maximum Ensemble Strategy and Different Data Sampling Methods for Detecting Counterfactual Statements. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 664–669, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
Lee at SemEval-2020 Task 5: ALBERT Model Based on the Maximum Ensemble Strategy and Different Data Sampling Methods for Detecting Counterfactual Statements (Li et al., SemEval 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.semeval-1.86.pdf
Data
MultiNLISNLI