GLÀFF, a Large Versatile French Lexicon

Nabil Hathout, Franck Sajous, Basilio Calderone


Abstract
This paper introduces GLAFF, a large-scale versatile French lexicon extracted from Wiktionary, the collaborative online dictionary. GLAFF contains, for each entry, inflectional features and phonemic transcriptions. It distinguishes itself from the other available French lexicons by its size, its potential for constant updating and its copylefted license. We explain how we have built GLAFF and compare it to other known resources in terms of coverage and quality of the phonemic transcriptions. We show that its size and quality are strong assets that could allow GLAFF to become a reference lexicon for French NLP and linguistics. Moreover, other derived lexicons can easily be based on GLAFF to satisfy specific needs of various fields such as psycholinguistics.
Anthology ID:
L14-1469
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1007–1012
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/58_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Nabil Hathout, Franck Sajous, and Basilio Calderone. 2014. GLÀFF, a Large Versatile French Lexicon. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1007–1012, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
GLÀFF, a Large Versatile French Lexicon (Hathout et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/58_Paper.pdf