Mask Attention Networks: Rethinking and Strengthen Transformer Zhihao Fan author Yeyun Gong author Dayiheng Liu author Zhongyu Wei author Siyuan Wang author Jian Jiao author Nan Duan author Ruofei Zhang author Xuanjing Huang author 2021-06 text Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Kristina Toutanova editor Anna Rumshisky editor Luke Zettlemoyer editor Dilek Hakkani-Tur editor Iz Beltagy editor Steven Bethard editor Ryan Cotterell editor Tanmoy Chakraborty editor Yichao Zhou editor Association for Computational Linguistics Online conference publication fan-etal-2021-mask 10.18653/v1/2021.naacl-main.135 https://aclanthology.org/2021.naacl-main.135/ 2021-06 1692 1701