Generative Data Augmentation for Aspect Sentiment Quad Prediction

An Wang, Junfeng Jiang, Youmi Ma, Ao Liu, Naoaki Okazaki


Abstract
Aspect sentiment quad prediction (ASQP) analyzes the aspect terms, opinion terms, sentiment polarity, and aspect categories in a text. One challenge in this task is the scarcity of data owing to the high annotation cost. Data augmentation techniques are commonly used to address this issue. However, existing approaches simply rewrite texts in the training data, restricting the semantic diversity of the generated data and impairing the quality due to the inconsistency between text and quads. To address these limitations, we augment quads and train a quads-to-text model to generate corresponding texts. Furthermore, we designed novel strategies to filter out low-quality data and balance the sample difficulty distribution of the augmented dataset. Empirical studies on two ASQP datasets demonstrate that our method outperforms other data augmentation methods and achieves state-of-the-art performance on the benchmarks. The source code will be released upon acceptance.
Anthology ID:
2023.starsem-1.12
Volume:
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Alexis Palmer, Jose Camacho-collados
Venue:
*SEM
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
128–140
Language:
URL:
https://aclanthology.org/2023.starsem-1.12
DOI:
10.18653/v1/2023.starsem-1.12
Bibkey:
Cite (ACL):
An Wang, Junfeng Jiang, Youmi Ma, Ao Liu, and Naoaki Okazaki. 2023. Generative Data Augmentation for Aspect Sentiment Quad Prediction. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), pages 128–140, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Generative Data Augmentation for Aspect Sentiment Quad Prediction (Wang et al., *SEM 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.starsem-1.12.pdf