Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, and Heuiseok Lim. 2024. Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2024, edited by Lun-Wei Ku, Andre Martins, and Vivek Srikumar, pages 2287-2303, Bangkok, Thailand, August 2024. Association for Computational Linguistics. Conference publication. Citation key: lee-etal-2024-length. DOI: 10.18653/v1/2024.findings-acl.135. URL: https://aclanthology.org/2024.findings-acl.135/