CEPLEXicon ― A Lexicon of Child European Portuguese

Ana Lúcia Santos, Maria João Freitas, Aida Cardoso


Abstract
CEPLEXicon (version 1.1) is a child lexicon resulting from the automatic tagging of two child corpora: the corpus Santos (Santos, 2006; Santos et al., 2014) and the corpus Child ― Adult Interaction (Freitas et al., 2012), which integrates information from the corpus Freitas (Freitas, 1997). This lexicon includes spontaneous speech produced by seven children (1;02.00 to 3;11.12) during approximately 86 hours of child-adult interaction. The automatic tagging comprised the lemmatization and morphosyntactic classification of the speech produced by the seven children included in the two child corpora; the lexicon contains information pertaining to lemmas and syntactic categories, as well as the absolute number of occurrences and frequencies in three age intervals: < 2 years; ≥ 2 years and < 3 years; ≥ 3 years. The information included in this lexicon and the format in which it is presented enable research in different areas and allow researchers to obtain measures of lexical growth. CEPLEXicon is available through the ELRA catalogue.
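The three age intervals and the per-interval counts mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline and does not reflect the actual CEPLEXicon file format: the input layout (tuples of age, lemma, category), the category tags, the interval labels, the sample tokens, and the reading of "frequencies" as relative frequency within an interval are all assumptions made for illustration. Ages are assumed to follow the years;months.days notation used in the abstract (e.g. 1;02.00).

```python
from collections import Counter, defaultdict


def parse_age(age):
    """Parse an age written as years;months.days, e.g. '1;02.00'."""
    years, rest = age.split(";")
    months, days = rest.split(".")
    return int(years), int(months), int(days)


def age_interval(age):
    """Map an age to one of the three intervals named in the abstract.

    The labels '<2', '2-3' and '>=3' are hypothetical names for
    < 2 years; >= 2 years and < 3 years; >= 3 years.
    """
    years, _, _ = parse_age(age)
    if years < 2:
        return "<2"
    if years < 3:
        return "2-3"
    return ">=3"


def lexical_counts(tokens):
    """Count (lemma, category) occurrences per age interval.

    `tokens` is an iterable of (age, lemma, category) tuples; this
    layout is an assumption, not the lexicon's real format.
    """
    counts = defaultdict(Counter)
    for age, lemma, category in tokens:
        counts[age_interval(age)][(lemma, category)] += 1
    return counts


def relative_frequencies(counter):
    """Relative frequency of each entry within one interval (assumed definition)."""
    total = sum(counter.values())
    return {key: n / total for key, n in counter.items()}


# Invented sample tokens, for illustration only.
sample = [
    ("1;02.00", "mamã", "N"),
    ("1;10.15", "querer", "V"),
    ("2;04.03", "querer", "V"),
    ("2;04.03", "bola", "N"),
    ("3;11.12", "querer", "V"),
]

counts = lexical_counts(sample)
freqs = relative_frequencies(counts["2-3"])
# One simple lexical-growth measure: distinct (lemma, category) types per interval.
types_per_interval = {interval: len(c) for interval, c in counts.items()}
```

Counting distinct types per interval is only one possible growth measure; the same per-interval counters could equally feed token totals or category-specific counts.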
Anthology ID:
L16-1216
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1360–1364
URL:
https://aclanthology.org/L16-1216
Cite (ACL):
Ana Lúcia Santos, Maria João Freitas, and Aida Cardoso. 2016. CEPLEXicon ― A Lexicon of Child European Portuguese. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1360–1364, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
CEPLEXicon ― A Lexicon of Child European Portuguese (Santos et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1216.pdf