One Tokenization per Source Jin Guo author 1998 text COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics conference publication guo-1998-one https://aclanthology.org/C98-1073/ 1998