Title: A Statistical Extension of Byte-Pair Encoding
Authors: David Vilar, Marcello Federico
Date: 2021-08
Type: text (conference publication)
In: Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)
Editors: Marcello Federico, Alex Waibel, Marta R Costa-jussà, Jan Niehues, Sebastian Stuker, Elizabeth Salesky
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand (online)
Pages: 263-275
Citation key: vilar-federico-2021-statistical
DOI: 10.18653/v1/2021.iwslt-1.31
URL: https://aclanthology.org/2021.iwslt-1.31/