Arabic to English Person Name Transliteration using Twitter

Hamdy Mubarak, Ahmed Abdelali


Abstract
Social media outlets are providing new opportunities for harvesting valuable resources. We present a novel approach for mining data from Twitter for the purpose of building transliteration resources and systems. Such resources are crucial in translation and retrieval tasks. We demonstrate the benefits of the approach on Arabic to English transliteration. The contribution of this approach includes the size of data that can be collected and exploited within the span of a limited time; the approach is very generic and can be adopted to other languages and the ability of the approach to cope with new transliteration phenomena and trends. A statistical transliteration system built using this data improved a comparable system built from Wikipedia wikilinks data.
Anthology ID:
L16-1054
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
351–355
Language:
URL:
https://aclanthology.org/L16-1054
DOI:
Bibkey:
Cite (ACL):
Hamdy Mubarak and Ahmed Abdelali. 2016. Arabic to English Person Name Transliteration using Twitter. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 351–355, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Arabic to English Person Name Transliteration using Twitter (Mubarak & Abdelali, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1054.pdf