Scheduled DropHead: A Regularization Method for Transformer Models Wangchunshu Zhou author Tao Ge author Furu Wei author Ming Zhou author Ke Xu author 2020-11 text Findings of the Association for Computational Linguistics: EMNLP 2020 Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication zhou-etal-2020-scheduled 10.18653/v1/2020.findings-emnlp.178 https://aclanthology.org/2020.findings-emnlp.178/ 2020-11 1971 1980