Personalized Text Retrieval for Learners of Chinese as a Foreign Language

Chak Yan Yeung, John Lee


Abstract
This paper describes a personalized text retrieval algorithm that helps language learners select the most suitable reading material in terms of vocabulary complexity. The user first rates their knowledge of a small set of words, chosen by a graph-based active learning model. The system trains a complex word identification model on this set, and then applies the model to find texts that contain the desired proportion of new, challenging, and familiar vocabulary. In an evaluation on learners of Chinese as a foreign language, we show that this algorithm is effective in identifying simpler texts for low-proficiency learners, and more challenging ones for high-proficiency learners.
Anthology ID:
C18-1292
Volume:
Proceedings of the 27th International Conference on Computational Linguistics
Month:
August
Year:
2018
Address:
Santa Fe, New Mexico, USA
Editors:
Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3448–3455
Language:
URL:
https://aclanthology.org/C18-1292
DOI:
Bibkey:
Cite (ACL):
Chak Yan Yeung and John Lee. 2018. Personalized Text Retrieval for Learners of Chinese as a Foreign Language. In Proceedings of the 27th International Conference on Computational Linguistics, pages 3448–3455, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
Personalized Text Retrieval for Learners of Chinese as a Foreign Language (Yeung & Lee, COLING 2018)
Copy Citation:
PDF:
https://aclanthology.org/C18-1292.pdf