Tokenization: Returning to a Long Solved Problem — A Survey, Contrastive Experiment, Recommendations, and Toolkit — Rebecca Dridan author Stephan Oepen author 2012-07 text Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) Haizhou Li editor Chin-Yew Lin editor Miles Osborne editor Gary Geunbae Lee editor Jong C Park editor Association for Computational Linguistics Jeju Island, Korea conference publication dridan-oepen-2012-tokenization https://aclanthology.org/P12-2074/ 2012-07 378 382