Title: Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses
Authors: Donato Crisostomi, Andrea Caciolai, Alessandro Pedrani, Kay Rottmann, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi
Published in: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
Editors: Sunayana Sitaram, Beata Beigman Klebanov, Jason D Williams
Publisher: Association for Computational Linguistics
Location: Toronto, Canada
Date: 2023-07
Resource type: text
Genre: conference publication
Pages: 235-247
Identifier: crisostomi-etal-2023-mitigating
DOI: 10.18653/v1/2023.acl-industry.23
URL: https://aclanthology.org/2023.acl-industry.23/