UMUTeam at SemEval-2023 Task 3: Multilingual transformer-based model for detecting the Genre, the Framing, and the Persuasion Techniques in Online News

Ronghao Pan, José Antonio García-Díaz, Miguel Ángel Rodríguez-García, Rafael Valencia-García


Abstract
In this manuscript, we describe the participation of the UMUTeam in SemEval-2023 Task 3, a shared task on detecting different aspects of news articles and other web documents, such as document category, framing dimensions, and persuasion technique in a multilingual setup. The task has been organized into three related subtasks, and we have been involved in the first two. Our approach is based on a fine-tuned multilingual transformer-based model that uses the dataset of all languages at once and a sentence transformer model to extract the most relevant chunk of a text for subtasks 1 and 2. The input data was truncated to 200 tokens with 50 overlaps using the sentence-transformer model to obtain the subset of text most related to the articles’ titles. Our system has performed good results in subtask 1 in most languages, and in some cases, such as French and German, we have archived first place in the official leader board. As for task 2, our system has also performed very well in all languages, ranking in all the top 10.
Anthology ID:
2023.semeval-1.83
Volume:
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
609–615
Language:
URL:
https://aclanthology.org/2023.semeval-1.83
DOI:
10.18653/v1/2023.semeval-1.83
Bibkey:
Cite (ACL):
Ronghao Pan, José Antonio García-Díaz, Miguel Ángel Rodríguez-García, and Rafael Valencia-García. 2023. UMUTeam at SemEval-2023 Task 3: Multilingual transformer-based model for detecting the Genre, the Framing, and the Persuasion Techniques in Online News. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 609–615, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
UMUTeam at SemEval-2023 Task 3: Multilingual transformer-based model for detecting the Genre, the Framing, and the Persuasion Techniques in Online News (Pan et al., SemEval 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.semeval-1.83.pdf