Takashi Omoto


1994

pdf bib
Application of Corpora in Second Language Learning: The Problem of Collocational Knowledge Acquisition
Kenji Kita | Takashi Omoto | Yoneo Yano | Yasuhiko Kato
Second Workshop on Very Large Corpora

While corpus-based studies are now becoming a new methodology in natural language processing, second language learning offers one interesting potential application. In this paper, we are primarily concerned with the acquisition of collocational knowledge from corpora for use in language learning. First we discuss the importance of collocational knowledge in second language learning, and then take up two measures, mutual information and cost criteria, for automatically identifying or extracting collocations from corpora. Comparitive experiments are made between the two measures using both Japanese and English corpora. In our experiments, the cost criteria measure proved more effective in extracting interesting collocations such as fundamental idiomatic expressions and phrases.