A compression based algorithm for Chinese word segmentation W J Teahan author Yingying Wen author Rodger McNab author Ian H Witten author 2000 text journal article Computational Linguistics continuing MIT Press Cambridge, MA periodical academic journal teahan-etal-2000-compression https://aclanthology.org/J00-3004/ 2000 26 3 375 394