Marco Madeddu
2024
Exploring YouTube Comments Reacting to Femicide News in Italian
Chiara Ferrando
|
Marco Madeddu
|
Viviana Patti
|
Mirko Lai
|
Sveva Pasini
|
Giulia Telari
|
Beatrice Antola
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
In recent years, the Gender Based Violence (GBV) has become an important issue in modern society and a central topic in different research areas due to its alarming spread. Several Natural Language Processing (NLP) studies, concerning Hate Speech directed against women, have focused on slurs or incel communities. The main contribution of our work is the creation of the first dataset on social media comments to GBV, in particular to a femicide event. Our dataset, named GBV-Maltesi, contains 2,934 YouTube comments annotated following a new schema that we developed in order to study GBV and misogyny with an intersectional approach. During the experimental phase, we trained models on different corpora for binary misogyny detection and found that datasets that mostly include explicit expressions of misogyny are an easier challenge, compared to more implicit forms of misogyny contained in GVB-Maltesi.
GFG - Gender-Fair Generation: A CALAMITA Challenge
Simona Frenda
|
Andrea Piergentili
|
Beatrice Savoldi
|
Marco Madeddu
|
Martina Rosola
|
Silvia Casola
|
Chiara Ferrando
|
Viviana Patti
|
Matteo Negri
|
Luisa Bentivogli
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Gender-fair language aims at promoting gender equality by using terms and expressions that include all identities and avoid reinforcing gender stereotypes. Implementing gender-fair strategies is particularly challenging in heavily gender-marked languages, such as Italian. To address this, the Gender-Fair Generation challenge intends to help shift toward gender-fair language in written communication. The challenge, designed to assess and monitor the recognition and generation of gender-fair language in both mono- and cross-lingual scenarios, includes three tasks: (1) the detection of gendered expressions in Italian sentences, (2) the reformulation of gendered expressions into gender-fair alternatives, and (3) the generation of gender-fair language in automatic translation from English to Italian. The challenge relies on three different annotated datasets: the GFL-it corpus, which contains Italian texts extracted from administrative documents provided by the University of Brescia; GeNTE, a bilingual test set for gender-neutral rewriting and translation built upon a subset of the Europarl dataset; and Neo-GATE, a bilingual test set designed to assess the use of non-binary neomorphemes in Italian for both fairformulation and translation tasks. Finally, each task is evaluated with specific metrics: average of F1-score obtained by means of BERTScore computed on each entry of the datasets for task 1, an accuracy measured with a gender-neutral classifier, and a coverage-weighted accuracy for tasks 2 and 3.
Search
Fix data
Co-authors
- Chiara Ferrando 2
- Viviana Patti 2
- Beatrice Antola 1
- Luisa Bentivogli 1
- Silvia Casola 1
- show all...