Projection of Argumentative Corpora from Source to Target Languages

Ahmet Aker, Huangpan Zhang


Abstract
Argumentative corpora are costly to create and are available in only few languages with English dominating the area. In this paper we release the first publicly available Mandarin argumentative corpus. The corpus is created by exploiting the idea of comparable corpora from Statistical Machine Translation. We use existing corpora in English and manually map the claims and premises to comparable corpora in Mandarin. We also implement a simple solution to automate this approach with the view of creating argumentative corpora in other less-resourced languages. In this way we introduce a new task of multi-lingual argument mapping that can be evaluated using our English-Mandarin argumentative corpus. The preliminary results of our automatic argument mapper mirror the simplicity of our approach, but provide a baseline for further improvements.
Anthology ID:
W17-5108
Volume:
Proceedings of the 4th Workshop on Argument Mining
Month:
September
Year:
2017
Address:
Copenhagen, Denmark
Venues:
ArgMining | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
67–72
Language:
URL:
https://aclanthology.org/W17-5108
DOI:
10.18653/v1/W17-5108
Bibkey:
Cite (ACL):
Ahmet Aker and Huangpan Zhang. 2017. Projection of Argumentative Corpora from Source to Target Languages. In Proceedings of the 4th Workshop on Argument Mining, pages 67–72, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal):
Projection of Argumentative Corpora from Source to Target Languages (Aker & Zhang, 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-5108.pdf
Attachment:
 W17-5108.Attachment.txt