Predicting Attention Sparsity in Transformers Marcos Treviso author António Góis author Patrick Fernandes author Erick Fonseca author Andre Martins author 2022-05 text Proceedings of the Sixth Workshop on Structured Prediction for NLP Andreas Vlachos editor Priyanka Agrawal editor André Martins editor Gerasimos Lampouras editor Chunchuan Lyu editor Association for Computational Linguistics Dublin, Ireland conference publication treviso-etal-2022-predicting 10.18653/v1/2022.spnlp-1.7 https://aclanthology.org/2022.spnlp-1.7/ 2022-05 67 81