Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models Orevaoghene Ahia author Sachin Kumar author Hila Gonen author Jungo Kasai author David Mortensen author Noah Smith author Yulia Tsvetkov author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication ahia-etal-2023-languages 10.18653/v1/2023.emnlp-main.614 https://aclanthology.org/2023.emnlp-main.614/ 2023-12 9904 9923