Leonidas Gee, Leonardo Rigutini, Marco Ernandes, and Andrea Zugarini. 2023. Multi-word Tokenization for Sequence Compression. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, edited by Mingxuan Wang and Imed Zitouni, pages 612-621, Singapore, December 2023. Association for Computational Linguistics. Citation key: gee-etal-2023-multi. DOI: 10.18653/v1/2023.emnlp-industry.58. URL: https://aclanthology.org/2023.emnlp-industry.58/