Image and Text: Fighting the same Battle? Super Resolution Learning for Imbalanced Text Classification

Romain Meunier, Benamara Farah, Véronique Moriceau, Patricia Stolf


Abstract
In this paper, we propose SRL4NLP, a new approach for data augmentation by drawing an analogy between image and text processing: Super-resolution learning. This method is based on using high-resolution images to overcome the problem of low resolution images. While this technique is a common usage in image processing when images have a low resolution or are too noisy, it has never been used in NLP. We therefore propose the first adaptation of this method for text classification and evaluate its effectiveness on urgency detection from tweets posted in crisis situations, a very challenging task where messages are scarce and highly imbalanced. We show that this strategy is efficient when compared to competitive state-of-the-art data augmentation techniques on several benchmarks datasets in two languages.
Anthology ID:
2023.findings-emnlp.718
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10707–10720
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.718
DOI:
10.18653/v1/2023.findings-emnlp.718
Bibkey:
Cite (ACL):
Romain Meunier, Benamara Farah, Véronique Moriceau, and Patricia Stolf. 2023. Image and Text: Fighting the same Battle? Super Resolution Learning for Imbalanced Text Classification. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 10707–10720, Singapore. Association for Computational Linguistics.
Cite (Informal):
Image and Text: Fighting the same Battle? Super Resolution Learning for Imbalanced Text Classification (Meunier et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.718.pdf