FLELex: a graded lexical resource for French foreign learners

Thomas François, Nùria Gala, Patrick Watrin, Cédrick Fairon


Abstract
In this paper we present FLELex, the first graded lexicon for French as a foreign language (FFL) that reports word frequencies by difficulty level, following the CEFR scale. It was built from a tagged corpus of 777,000 words drawn from available textbooks and simplified readers intended for FFL learners. Our goal is to make this resource freely available to the community for a variety of purposes: assessing the lexical difficulty of a text, selecting simpler words within text simplification systems, and serving as a dictionary in assistive tools for writing.
Anthology ID:
L14-1083
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3766–3773
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1108_Paper.pdf
Cite (ACL):
Thomas François, Nùria Gala, Patrick Watrin, and Cédrick Fairon. 2014. FLELex: a graded lexical resource for French foreign learners. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3766–3773, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
FLELex: a graded lexical resource for French foreign learners (François et al., LREC 2014)