Dong-Hee Lim and Seung-Shik Kang. 2005. Data-driven Language Independent Word Segmentation Using Character-Level Information. In Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing. Citation key: lim-kang-2005-data. https://aclanthology.org/I05-3024/