Vincenzo Lombardo

2017

Metadata Annotation for Dramatic Texts
Vincenzo Lombardo | Rossana Damiano | Antonio Pizzo
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)

2010

pdf bib abs

As the interest of the NLP community grows to develop several treebanks also for languages other than English, we observe efforts towards evaluating the impact of different annotation strategies used to represent particular languages or with reference to particular tasks. This paper contributes to the debate on the influence of resources used for the training and development on the performance of parsing systems. It presents a comparative analysis of the results achieved by three different dependency parsers developed and tested with respect to two treebanks for the Italian language, namely TUT and ISST--TANL, which differ significantly at the level of both corpus composition and adopted dependency representations.

2008

pdf bib abs

The EVALITA 2007 Parsing Task has been the first contest among parsing systems for Italian. It is the first attempt to compare the approaches and the results of the existing parsing systems specific for this language using a common treebank annotated using both a dependency and a constituency-based format. The development data set for this parsing competition was taken from the Turin University Treebank, which is annotated both in dependency and constituency format. The evaluation metrics were those standardly applied in CoNLL and PARSEVAL. The results of the parsing results are very promising and higher than the state-of-the-art for dependency parsing of Italian. An analysis of such results is provided, which takes into account other experiences in treebank-driven parsing for Italian and for other Romance languages (in particular, the CoNLL X & 2007 shared tasks for dependency parsing). It focuses on the characteristics of data sets, i.e. type of annotation and size, parsing paradigms and approaches applied also to languages other than Italian.

pdf bib abs

EVALITA 2007, the first edition of the initiative devoted to the evaluation of Natural Language Processing tools for Italian, provided a shared framework where participants systems had the possibility to be evaluated on five different tasks, namely Part of Speech Tagging (organised by the University of Bologna), Parsing (organised by the University of Torino), Word Sense Disambiguation (organised by CNR-ILC, Pisa), Temporal Expression Recognition and Normalization (organised by CELCT, Trento), and Named Entity Recognition (organised by FBK, Trento). We believe that the diffusion of shared tasks and shared evaluation practices is a crucial step towards the development of resources and tools for Natural Language Processing. Experiences of this kind, in fact, are a valuable contribution to the validation of existing models and data, allowing for consistent comparisons among approaches and among representation schemes. The good response obtained by EVALITA, both in the number of participants and in the quality of results, showed that pursuing such goals is feasible not only for English, but also for other languages.

2006

pdf bib abs

Comparing linguistic information in treebank annotations
Cristina Bosco | Vincenzo Lombardo
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The paper investigates the issue of portability of methods and results over treebanks in different languages and annotation formats. In particular, it addresses the problem of converting an Italian treebank, the Turin University Treebank (TUT), developed in dependency format, into the Penn Treebank format, in order to possibly exploit the tools and methods already developed and compare the adequacy of information encoding in the two formats. We describe the procedures for converting the two annotation formats and we present an experiment that evaluates some linguistic knowledge extracted from the two formats, namely sub-categorization frames.

Integration of syntactic and lexical information in a hierarchical dependency grammar
Cristina Barbero | Leonardo Lesmo | Vincenzo Lombardo
Processing of Dependency-Based Grammars

pdf bib

Formal Aspects and Parsing Issues of Dependency Theory
Vincenzo Lombardo | Leonardo Lesmo
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

1996

pdf bib

An Earley-type recognizer for dependency grammar
Vincenzo Lombardo | Leonardo Lesmo
COLING 1996 Volume 2: The 16th International Conference on Computational Linguistics

1995

pdf bib abs

A Practical Dependency Parser
Vincenzo Lombardo | Leonardo Lesmo
Proceedings of the Fourth International Workshop on Parsing Technologies

The working assumption is that cognitive modeling of NLP and engineering solutions to free text parsing can converge to optimal parsing. The claim of the paper is that the methodology to achieve such a result is to develop a concrete environment with a flexible parser, that allows the testing of various psycholinguistic strategies on real texts. In this paper we outline a flexible parser based on a dependency grammar.

1992

pdf bib

Incremental Dependency Parsing
Vincenzo Lombardo
30th Annual Meeting of the Association for Computational Linguistics

pdf bib

The Assignment of Grammatical Relations in Natural Language Processing
Leonardo Lesmo | Vincenzo Lombardo
COLING 1992 Volume 4: The 14th International Conference on Computational Linguistics