pdf bibEstimating the Likelihood of Words Being Known with Corpus Analysis and K-Means Clustering AlgorithmTong Zhu | Derek Irwin | Yanhui Zhang | Renjie Wu | Xiaoyi JiangProceedings of the 37th Pacific Asia Conference on Language, Information and Computation