Vito Pirrelli


Assessing Reading Literacy of Bulgarian Pupils with Finger–tracking
Alessandro Lento | Andrea Nadalini | Marcello Ferro | Claudia Marzi | Vito Pirrelli | Tsvetana Dimitrova | Hristina Kukova | Valentina Stefanova | Maria Todorova | Svetla Koeva
Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024)

The paper reports on the first steps in developing a time-stamped multimodal dataset of reading data by Bulgarian children. Data are being collected, structured and analysed by means of ReadLet, an innovative infrastructure for multimodal language data collection that uses a tablet as a reader’s front-end. The overall goal of the project is to quantitatively analyse the reading skills of a sample of early Bulgarian readers collected over a two-year period, and compare them with the reading data of early readers of Italian, collected using the same protocol. We illustrate design issues of the experimental protocol, as well as the data acquisition process and the post-processing phase of data annotation/augmentation. To evaluate the potential and usefulness of the Bulgarian dataset for reading research, we present some preliminary statistical analyses of our recently collected data. They show robust convergence trends between Bulgarian and Italian early reading development stages.

Comparative Evaluation of Computational Models Predicting Eye Fixation Patterns During Reading: Insights from Transformers and Simpler Architectures
Alessandro Lento | Andrea Nadalini | Nadia Khlif | Vito Pirrelli | Claudia Marzi | Marcello Ferro
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)

Eye tracking data during reading provides significant insights into the cognitive processes underlying language comprehension. It allows for the estimation of lexical, contextual, and higher-level structural effects on word identification through metrics such as fixation duration. Despite advancements in psycholinguistic experiments that have elucidated these effects, the extent to which computational models can predict gaze patterns remains unclear. Recent developments in computational modeling, particularly the use of pre-trained transformer language models, have shown promising results in mirroring human reading behaviors. However, previous studies have not adequately compared these models to alternative architectures or considered various input features comprehensively. This paper addresses these gaps by replicating prior findings on English data, critically evaluating performance metrics, and proposing a stricter accuracy measurement method. Furthermore, it compares different computational models, demonstrating that simpler architectures can achieve results comparable to or better than transformers. The study also emphasizes the significance of individual differences in reading behavior, presenting challenges for simulating natural reading tasks.

ReadLet: A Dataset for Oral, Visual and Tactile Text Reading Data of Early and Mature Readers
Marcello Ferro | Claudia Marzi | Andrea Nadalini | Loukia Taxitari | Alessandro Lento | Vito Pirrelli
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

The paper presents the design and construction of a time-stamped multimodal dataset for reading research, including multiple time-aligned temporal signals elicited with four experimental trials of connected text reading by both child and adult readers. We present the experimental protocols, as well as the data acquisition process and the post-processing phase of data annotation/augmentation. To evaluate the potential and usefulness of a time-aligned multimodal dataset for reading research, we present a few statistical analyses showing the correlation and complementarity of multimodal time-series of reading data, as well as some results of modelling adults’ reading data by integrating different modalities. The total dataset size amounts to about 2.5 GByte in compressed format.


NLP-based Assessment of Reading Efficiency in Early Grade Children
Vito Pirrelli
Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018)

Assessing reading skills is a laborious and time-consuming task, which requires monitoring a variety of interlocked abilities, ranging from accurate word rendering, reading fluency and lexical access, to linguistic comprehension, and interpretation, management and inference of complex events in working memory. No existing software, to our knowledge, is able to cover and integrate reading performance monitoring, instant feedback, personalised potentiation, intelligent decision support for teachers and speech therapists, and assessment of response to intervention. NLP and ICT technologies can make such an ambitious platform an achievable target.

Evaluating Inflectional Complexity Crosslinguistically: a Processing Perspective
Claudia Marzi | Marcello Ferro | Ouafae Nahli | Patrizia Belik | Stavros Bompolas | Vito Pirrelli
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)


The PAISÀ Corpus of Italian Web Texts
Verena Lyding | Egon Stemle | Claudia Borghetti | Marco Brunello | Sara Castagnoli | Felice Dell’Orletta | Henrik Dittmann | Alessandro Lenci | Vito Pirrelli
Proceedings of the 9th Web as Corpus Workshop (WaC-9)


Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
Claudia Marzi | Marcello Ferro | Claudia Caudai | Vito Pirrelli
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

The lexicon is the store of words in long-term memory. Any attempt at modelling lexical competence must take issues of string storage seriously. In the present contribution, we discuss a few desiderata that any biologically-inspired computational model of the mental lexicon has to meet, and detail a multi-task evaluation protocol for their assessment. The proposed protocol is applied to a novel computational architecture for lexical storage and acquisition, the "Topological Temporal Hebbian SOMs" (T2HSOMs), which are grids of topologically organised memory nodes with dedicated sensitivity to time-bound sequences of letters. These maps can provide a rigorous and testable conceptual framework within which to provide a comprehensive, multi-task protocol for testing the performance of Hebbian self-organising memories, and a comprehensive picture of the complex dynamics between lexical processing and the acquisition of morphological structure.


Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora
Alessandro Lenci | Barbara McGillivray | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

In this paper, we report experiments on the unsupervised automatic acquisition of Italian and English verb subcategorization frames (SCFs) from general and domain corpora. The proposed technique operates on syntactically shallow-parsed corpora on the basis of a limited number of search heuristics not relying on any previous lexico-syntactic knowledge about SCFs. Although preliminary, the reported results are in line with state-of-the-art lexical acquisition systems. The issue of whether verbs sharing similar SCF distributions also share similar semantic properties is explored by clustering verbs that share frames with the same distribution using the Minimum Description Length (MDL) principle. First experiments in this direction were carried out on Italian verbs, with encouraging results.


Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Felice Dell’Orletta | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The paper reports on a detailed quantitative analysis of distributional language data of both Italian and Czech, highlighting the relative contribution of a number of distributed grammatical factors to sentence-based identification of subjects and direct objects. The work is based on a Maximum Entropy model of stochastic resolution of conflicting grammatical constraints, and is demonstrably capable of putting explanatory theoretical accounts to the challenging test of an extensive, usage-based empirical verification.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Roberto Bartolini | Caterina Caracciolo | Emiliano Giovanetti | Alessandro Lenci | Simone Marchi | Vito Pirrelli | Chiara Renso | Laura Spinsanti
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

In this paper we present an original approach to natural language query interpretation which has been implemented within the FuLL (Fuzzy Logic and Language) Italian project of BC S.r.l. In particular, we discuss here the creation of linguistic and ontological resources, together with the exploitation of existing ones, for natural language-driven database access and retrieval. Both the database and the queries we experiment with are Italian, but the methodology we broach naturally extends to other languages.

Probing the Space of Grammatical Variation: Induction of Cross-Lingual Grammatical Constraints from Treebanks
Felice Dell’Orletta | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006


Climbing the Path to Grammar: A Maximum Entropy Model of Subject/Object Learning
Felice Dell’Orletta | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Workshop on Psychocomputational Models of Human Language Acquisition


Semantic Mark-up of Italian Legal Texts Through NLP-based Techniques
Roberto Bartolini | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli | Claudia Soria
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Hybrid Constraints for Robust Parsing: First Experiments and Evaluation
Roberto Bartolini | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Non-locality all the way through: Emergent Global Constraints in the Italian Morphological Lexicon
Vito Pirrelli | Basilio Calderone | Ivan Herreros | Michele Virgilio
Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology


Advanced Tools for the Study of Natural Interactivity
Claudia Soria | Niels Ole Bernsen | Niels Cadée | Jean Carletta | Laila Dybkjær | Stefan Evert | Ulrich Heid | Amy Isard | Mykola Kolodnytsky | Christoph Lauer | Wolfgang Lezius | Lucas P.J.J. Noldus | Vito Pirrelli | Norbert Reithinger | Andreas Vögele
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

The Lexicon-Grammar Balance in Robust Parsing of Italian
Roberto Bartolini | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

Grammar and Lexicon in the Robust Parsing of Italian towards a Non-Naïve Interplay
Roberto Bartolini | Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli
COLING-02: Grammar Engineering and Evaluation


Learning Word Clusters from Data Types
Paolo Allegrini | Simonetta Montemagni | Vito Pirrelli
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics

Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation
Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli | Claudia Soria
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation
Paolo Allegrini | Simonetta Montemagni | Vito Pirrelli
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)


A recognition-based meta-scheme for dialogue acts annotation
Claudia Soria | Vito Pirrelli
Towards Standards and Tools for Discourse Tagging

FAME: a Functional Annotation Meta-scheme for multi-modal and multi-lingual Parsing Evaluation
Alessandro Lenci | Simonetta Montemagni | Vito Pirrelli | Claudia Soria
Computer Mediated Language Assessment and Evaluation in Natural Language Processing


Augmenting WordNet-like lexical resources with distributional evidence. An application-oriented perspective
Simonetta Montemagni | Vito Pirrelli
Usage of WordNet in Natural Language Processing Systems


Inferring Semantic Similarity from Distributional Evidence: an Analogy-based Approach to Word Sense Disambiguation
Stefano Federici | Simonetta Montemagni | Vito Pirrelli
Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications


Monotonic Paradigmatic Schemata in Italian Verb Inflection
Vito Pirrelli | Marco Battista
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics

Resolving syntactic ambiguities with lexico-semantic patterns: an analogy-based approach
Simonetta Montemagni | Stefano Federici | Vito Pirrelli
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics


“Derivational” Paradigms in Morphonology
Vito Pirrelli | Stefano Federici
COLING 1994 Volume 1: The 15th International Conference on Computational Linguistics