Hierarchical Transformers Are More Efficient Language Models Piotr Nawrot author Szymon Tworkowski author MichaƂ Tyrolski author Lukasz Kaiser author Yuhuai Wu author Christian Szegedy author Henryk Michalewski author 2022-07 text Findings of the Association for Computational Linguistics: NAACL 2022 Marine Carpuat editor Marie-Catherine de Marneffe editor Ivan Vladimir Meza Ruiz editor Association for Computational Linguistics Seattle, United States conference publication nawrot-etal-2022-hierarchical 10.18653/v1/2022.findings-naacl.117 https://aclanthology.org/2022.findings-naacl.117/ 2022-07 1559 1571