TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference Deming Ye author Yankai Lin author Yufei Huang author Maosong Sun author 2021-06 text Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Kristina Toutanova editor Anna Rumshisky editor Luke Zettlemoyer editor Dilek Hakkani-Tur editor Iz Beltagy editor Steven Bethard editor Ryan Cotterell editor Tanmoy Chakraborty editor Yichao Zhou editor Association for Computational Linguistics Online conference publication ye-etal-2021-tr 10.18653/v1/2021.naacl-main.463 https://aclanthology.org/2021.naacl-main.463/ 2021-06 5798 5809