Title: Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Authors: Insoo Chung, Byeongwook Kim, Yoonjung Choi, Se Jung Kwon, Yongkweon Jeon, Baeseong Park, Sangha Kim, Dongsoo Lee
Date: 2020-11
Type: text (conference publication)
Published in: Findings of the Association for Computational Linguistics: EMNLP 2020
Editors: Trevor Cohn, Yulan He, Yang Liu
Publisher: Association for Computational Linguistics
Location: Online
ID: chung-etal-2020-extremely
DOI: 10.18653/v1/2020.findings-emnlp.433
URL: https://aclanthology.org/2020.findings-emnlp.433/
Pages: 4812-4826