Title: Estimating the Likelihood of Words Being Known with Corpus Analysis and K-Means Clustering Algorithm
Authors: Tong Zhu, Derek Irwin, Yanhui Zhang, Renjie Wu, Xiaoyi Jiang
Date: 2023-12
Type: text (conference publication)
Published in: Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
Editors: Chu-Ren Huang, Yasunari Harada, Jong-Bok Kim, Si Chen, Yu-Yin Hsu, Emmanuele Chersoni, Pranav A, Winnie Huiheng Zeng, Bo Peng, Yuxi Li, Junlin Li
Publisher: Association for Computational Linguistics
Location: Hong Kong, China
Identifier: zhu-etal-2023-estimating
URL: https://aclanthology.org/2023.paclic-1.53/
Pages: 535-542