Lucia Passaro
Also published as: Lucia C. Passaro
2025
Sparse Autoencoders Find Partially Interpretable Features in Italian Small Language Models
Alessandro Bondielli | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Alessandro Bondielli | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models
Martina Miliani | Serena Auriemma | Alessandro Bondielli | Emmanuele Chersoni | Lucia C. Passaro | Irene Sucameli | Alessandro Lenci
Findings of the Association for Computational Linguistics: ACL 2025
Martina Miliani | Serena Auriemma | Alessandro Bondielli | Emmanuele Chersoni | Lucia C. Passaro | Irene Sucameli | Alessandro Lenci
Findings of the Association for Computational Linguistics: ACL 2025
Large Language Models (LLMs) are increasingly used in tasks requiring interpretive and inferential accuracy. In this paper, we introduce ExpliCa, a new dataset for evaluating LLMs in explicit causal reasoning. ExpliCa uniquely integrates both causal and temporal relations presented in different linguistic orders and explicitly expressed by linguistic connectives. The dataset is enriched with crowdsourced human acceptability ratings. We tested LLMs on ExpliCa through prompting and perplexity-based metrics. We assessed seven commercial and open-source LLMs, revealing that even top models struggle to reach 0.80 accuracy. Interestingly, models tend to confound temporal relations with causal ones, and their performance is also strongly influenced by the linguistic order of the events. Finally, perplexity-based scores and prompting performance are differently affected by model size.
MAIA: A Benchmark for Multimodal AI Assessment
Davide Testa | Giovanni Bonetta | Raffaella Bernardi | Alessandro Bondielli | Alessandro Lenci | Alessio Miaschi | Lucia Passaro | Bernardo Magnini
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Davide Testa | Giovanni Bonetta | Raffaella Bernardi | Alessandro Bondielli | Alessandro Lenci | Alessio Miaschi | Lucia Passaro | Bernardo Magnini
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Leveraging LLMs to Build a Semi-synthetic Dataset for Legal Information Retrieval: A Case Study on the Italian Civil Code and GPT4-O
Mattia Proietti | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Mattia Proietti | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark
Davide Testa | Giovanni Bonetta | Raffaella Bernardi | Alessandro Bondielli | Alessandro Lenci | Alessio Miaschi | Lucia Passaro | Bernardo Magnini
Findings of the Association for Computational Linguistics: EMNLP 2025
Davide Testa | Giovanni Bonetta | Raffaella Bernardi | Alessandro Bondielli | Alessandro Lenci | Alessio Miaschi | Lucia Passaro | Bernardo Magnini
Findings of the Association for Computational Linguistics: EMNLP 2025
We introduce MAIA (Multimodal AI Assessment), a native-Italian benchmark designed for fine-grained investigation of the reasoning abilities of visual language models on videos. MAIA differs from other available video benchmarks for its design, its reasoning categories, the metric it uses, and the language and culture of the videos. MAIA evaluates Vision Language Models (VLMs) on two aligned tasks: a visual statement verification task, and an open-ended visual question-answering task, both on the same set of video-related questions. It considers twelve reasoning categories that aim to disentangle language and vision relations by highlighting the role of the visual input. Thanks to its carefully taught design, it evaluates VLMs’ consistency and visually grounded natural language comprehension and generation simultaneously through an aggregated metric revealing low results that highlight models’ fragility. Last but not least, the video collection has been carefully selected to reflect the Italian culture, and the language data are produced by native-speakers.Data available at *[GitHub](https://github.com/Caput97/MAIA-Multimodal_AI_Assessment.git).*
LLMs Struggle on Explicit Causality in Italian
Alessandro Bondielli | Martina Miliani | Luca Paglione | Serena Auriemma | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Alessandro Bondielli | Martina Miliani | Luca Paglione | Serena Auriemma | Lucia Passaro | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
2024
VeryfIT - Benchmark of Fact-Checked Claims for Italian: A CALAMITA Challenge
Jacopo Gili | Viviana Patti | Lucia Passaro | Tommaso Caselli
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
Jacopo Gili | Viviana Patti | Lucia Passaro | Tommaso Caselli
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
Achieving factual accuracy is a known pending issue for language models. Their design centered around the interactive component of user interaction and the extensive use of “spontaneous” training data, has made them highly adept at conversational tasks but not fully reliable in terms of factual correctness. VeryfIT addresses this issue by evaluating the in-memory factual knowledge of language models on data written by professional fact-checkers, posing it as a true or false question.Topics of the statements vary but most are in specific domains related to the Italian government, policies, and social issues. The task presents several challenges: extracting statements from segments of speeches, determining appropriate contextual relevance both temporally and factually, and ultimately verifying the accuracy of the statements.
Women’s Professions and Targeted Misogyny Online
Alessio Cascione | Aldo Cerulli | Marta Marchiori Manerba | Lucia C. Passaro
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
Alessio Cascione | Aldo Cerulli | Marta Marchiori Manerba | Lucia C. Passaro
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
With the increasing popularity of social media platforms, the dissemination of misogynistic content has become more prevalent and challenging to address. In this paper, we investigate the phenomenon of online misogyny on Twitter through the lens of hurtfulness, qualifying its different manifestation considering the profession of the targets of misogynistic attacks.By leveraging manual annotation and a BERTweet model trained for fine-grained misogyny identification, we find that specific types of misogynistic speech are more intensely directed towards particular professions: derailing discourse predominantly targets authors and cultural figures, while dominance-oriented speech and sexual harassment are mainly directed at politicians and athletes. Additionally, we use the HurtLex lexicon and ItEM to assign hurtfulness scores to tweets based on different hate speech categories. Our analysis reveals that these scores align with the profession-based distribution of misogynistic speech, highlighting the targeted nature of such attacks.
2023
Check-IT!: A Corpus of Expert Fact-checked Claims for Italian
Jacopo Gili | Lucia Passaro | Tommaso Caselli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Jacopo Gili | Lucia Passaro | Tommaso Caselli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Challenging Specialized Transformers on Zero-shot Classification
Serena Auriemma | Mauro Madeddu | Martina Miliani | Alessandro Bondielli | Alessandro Lenci | Lucia Passaro
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Serena Auriemma | Mauro Madeddu | Martina Miliani | Alessandro Bondielli | Alessandro Lenci | Lucia Passaro
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
2022
Bias Discovery within Human Raters: A Case Study of the Jigsaw Dataset
Marta Marchiori Manerba | Riccardo Guidotti | Lucia Passaro | Salvatore Ruggieri
Proceedings of the 1st Workshop on Perspectivist Approaches to NLP @LREC2022
Marta Marchiori Manerba | Riccardo Guidotti | Lucia Passaro | Salvatore Ruggieri
Proceedings of the 1st Workshop on Perspectivist Approaches to NLP @LREC2022
Understanding and quantifying the bias introduced by human annotation of data is a crucial problem for trustworthy supervised learning. Recently, a perspectivist trend has emerged in the NLP community, focusing on the inadequacy of previous aggregation schemes, which suppose the existence of single ground truth. This assumption is particularly problematic for sensitive tasks involving subjective human judgments, such as toxicity detection. To address these issues, we propose a preliminary approach for bias discovery within human raters by exploring individual ratings for specific sensitive topics annotated in the texts. Our analysis’s object consists of the Jigsaw dataset, a collection of comments aiming at challenging online toxicity identification.
2021
GQA-it: Italian Question Answering on Image Scene Graphs
Danilo Croce | Lucia C. Passaro | Alessandro Lenci | Roberto Basili
Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021)
Danilo Croce | Lucia C. Passaro | Alessandro Lenci | Roberto Basili
Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021)
2020
FRAQUE: a FRAme-based QUEstion-answering system for the Public Administration domain
Martina Miliani | Lucia C. Passaro | Alessandro Lenci
Proceedings of the 1st Workshop on Language Technologies for Government and Public Administration (LT4Gov)
Martina Miliani | Lucia C. Passaro | Alessandro Lenci
Proceedings of the 1st Workshop on Language Technologies for Government and Public Administration (LT4Gov)
In this paper, we propose FRAQUE, a question answering system for factoid questions in the Public administration domain. The system is based on semantic frames, here intended as collections of slots typed with their possible values. FRAQUE queries unstructured textual data and exploits the potential of different approaches: it extracts pattern elements from texts which are linguistically analyzed through statistical methods.FRAQUE allows Italian users to query vast document repositories related to the domain of Public Administration. Given the statistical nature of most of its components such as word embeddings, the system allows for a flexible domain and language adaptation process. FRAQUE’s goal is to associate questions with frames stored into a Knowledge Graph along with relevant document passages, which are returned as the answer.
“Voices of the Great War”: A Richly Annotated Corpus of Italian Texts on the First World War
Federico Boschetti | Irene De Felice | Stefano Dei Rossi | Felice Dell’Orletta | Michele Di Giorgio | Martina Miliani | Lucia C. Passaro | Angelica Puddu | Giulia Venturi | Nicola Labanca | Alessandro Lenci | Simonetta Montemagni
Proceedings of the Twelfth Language Resources and Evaluation Conference
Federico Boschetti | Irene De Felice | Stefano Dei Rossi | Felice Dell’Orletta | Michele Di Giorgio | Martina Miliani | Lucia C. Passaro | Angelica Puddu | Giulia Venturi | Nicola Labanca | Alessandro Lenci | Simonetta Montemagni
Proceedings of the Twelfth Language Resources and Evaluation Conference
“Voices of the Great War” is the first large corpus of Italian historical texts dating back to the period of First World War. This corpus differs from other existing resources in several respects. First, from the linguistic point of view it gives account of the wide range of varieties in which Italian was articulated in that period, namely from a diastratic (educated vs. uneducated writers), diaphasic (low/informal vs. high/formal registers) and diatopic (regional varieties, dialects) points of view. From the historical perspective, through a collection of texts belonging to different genres it represents different views on the war and the various styles of narrating war events and experiences. The final corpus is balanced along various dimensions, corresponding to the textual genre, the language variety used, the author type and the typology of conveyed contents. The corpus is fully annotated with lemmas, part-of-speech, terminology, and named entities. Significant corpus samples representative of the different “voices” have also been enriched with meta-linguistic and syntactic information. The layer of syntactic annotation forms the first nucleus of an Italian historical treebank complying with the Universal Dependencies standard. The paper illustrates the final resource, the methodology and tools used to build it, and the Web Interface for navigating it.
2019
Text Frame Detector: Slot Filling Based On Domain Knowledge Bases
Martina Miliani | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
Martina Miliani | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
2018
CoreNLP-it: A UD Pipeline for Italian based on Stanford CoreNLP
Alessandro Bondielli | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018)
Alessandro Bondielli | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018)
2017
Emo2Val: Inferring Valence Scores from fine-grained Emotion Values
Alessandro Bondielli | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Alessandro Bondielli | Lucia C. Passaro | Alessandro Lenci
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
INFORMed PA: A NER for the Italian Public Administration Domain
Lucia C. Passaro | Alessandro Lenci | Anna Gabbolini
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Lucia C. Passaro | Alessandro Lenci | Anna Gabbolini
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
2016
Evaluating Context Selection Strategies to Build Emotive Vector Space Models
Lucia C. Passaro | Alessandro Lenci
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Lucia C. Passaro | Alessandro Lenci
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
In this paper we compare different context selection approaches to improve the creation of Emotive Vector Space Models (VSMs). The system is based on the results of an existing approach that showed the possibility to create and update VSMs by exploiting crowdsourcing and human annotation. Here, we introduce a method to manipulate the contexts of the VSMs under the assumption that the emotive connotation of a target word is a function of both its syntagmatic and paradigmatic association with the various emotions. To study the differences among the proposed spaces and to confirm the reliability of the system, we report on two experiments: in the first one we validated the best candidates extracted from each model, and in the second one we compared the models’ performance on a random sample of target words. Both experiments have been implemented as crowdsourcing tasks.
Search
Fix author
Co-authors
- Alessandro Lenci 15
- Alessandro Bondielli 8
- Martina Miliani 6
- Serena Auriemma 3
- Raffaella Bernardi 2
- Giovanni Bonetta 2
- Tommaso Caselli 2
- Jacopo Gili 2
- Bernardo Magnini 2
- Marta Marchiori Manerba 2
- Alessio Miaschi 2
- Davide Testa 2
- Roberto Basili 1
- Federico Boschetti 1
- Alessio Cascione 1
- Aldo Cerulli 1
- Emmanuele Chersoni 1
- Danilo Croce 1
- Irene De Felice 1
- Felice Dell’Orletta 1
- Michele Di Giorgio 1
- Anna Gabbolini 1
- Riccardo Guidotti 1
- Nicola Labanca 1
- Mauro Madeddu 1
- Simonetta Montemagni 1
- Luca Paglione 1
- Viviana Patti 1
- Mattia Proietti 1
- Angelica Puddu 1
- Stefano Dei Rossi 1
- Salvatore Ruggieri 1
- Irene Sucameli 1
- Giulia Venturi 1