An Arabic-Moroccan Darija Code-Switched Corpus

Younes Samih, Wolfgang Maier


Abstract
In this paper, we describe our effort in the development and annotation of a large-scale corpus containing code-switched data. Until recently, very limited effort has been devoted to developing computational approaches or even basic linguistic resources to support research into the processing of Moroccan Darija.
Anthology ID:
L16-1658
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
4170–4175
URL:
https://aclanthology.org/L16-1658
Cite (ACL):
Younes Samih and Wolfgang Maier. 2016. An Arabic-Moroccan Darija Code-Switched Corpus. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4170–4175, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
An Arabic-Moroccan Darija Code-Switched Corpus (Samih & Maier, LREC 2016)
PDF:
https://aclanthology.org/L16-1658.pdf