Andrea Cimino
2021
Audience Engagement Prediction in Guided Tours through Multimodal Features
Andrea Amelio Ravelli | Andrea Cimino | Felice Dell’Orletta
Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021)
Andrea Amelio Ravelli | Andrea Cimino | Felice Dell’Orletta
Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021)
2020
Profiling-UD: a Tool for Linguistic Profiling of Texts
Dominique Brunato | Andrea Cimino | Felice Dell’Orletta | Giulia Venturi | Simonetta Montemagni
Proceedings of the Twelfth Language Resources and Evaluation Conference
Dominique Brunato | Andrea Cimino | Felice Dell’Orletta | Giulia Venturi | Simonetta Montemagni
Proceedings of the Twelfth Language Resources and Evaluation Conference
In this paper, we introduce Profiling–UD, a new text analysis tool inspired to the principles of linguistic profiling that can support language variation research from different perspectives. It allows the extraction of more than 130 features, spanning across different levels of linguistic description. Beyond the large number of features that can be monitored, a main novelty of Profiling–UD is that it has been specifically devised to be multilingual since it is based on the Universal Dependencies framework. In the second part of the paper, we demonstrate the effectiveness of these features in a number of theoretical and applicative studies in which they were successfully used for text and author profiling.
A Machine Learning approach for Sentiment Analysis for Italian Reviews in Healthcare
Luca Bacco | Andrea Cimino | Luca Paulon | Mario Merone | Felice Dell’Orletta
Proceedings of the Seventh Italian Conference on Computational Linguistics (CLiC-it 2020)
Luca Bacco | Andrea Cimino | Luca Paulon | Mario Merone | Felice Dell’Orletta
Proceedings of the Seventh Italian Conference on Computational Linguistics (CLiC-it 2020)
2019
Quanti anni hai? Age Identification for Italian
Aleksandra Maslennikova | Paolo Labruna | Andrea Cimino | Felice Dell’Orletta
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
Aleksandra Maslennikova | Paolo Labruna | Andrea Cimino | Felice Dell’Orletta
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
2018
Sentences and Documents in Native Language Identification
Andrea Cimino | Felice Dell’Orletta | Dominique Brunato | Giulia Venturi
Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018)
Andrea Cimino | Felice Dell’Orletta | Dominique Brunato | Giulia Venturi
Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018)
2017
Identifying Predictive Features for Textual Genre Classification: the Key Role of Syntax
Andrea Cimino | Martijn Wieling | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Andrea Cimino | Martijn Wieling | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Stacked Sentence-Document Classifier Approach for Improving Native Language Identification
Andrea Cimino | Felice Dell’Orletta
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications
Andrea Cimino | Felice Dell’Orletta
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications
In this paper, we describe the approach of the ItaliaNLP Lab team to native language identification and discuss the results we submitted as participants to the essay track of NLI Shared Task 2017. We introduce for the first time a 2-stacked sentence-document architecture for native language identification that is able to exploit both local sentence information and a wide set of general-purpose features qualifying the lexical and grammatical structure of the whole document. When evaluated on the official test set, our sentence-document stacked architecture obtained the best result among all the participants of the essay track with an F1 score of 0.8818.
2016
PaCCSS-IT: A Parallel Corpus of Complex-Simple Sentences for Automatic Text Simplification
Dominique Brunato | Andrea Cimino | Felice Dell’Orletta | Giulia Venturi
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
Dominique Brunato | Andrea Cimino | Felice Dell’Orletta | Giulia Venturi
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
2014
Assessing the Readability of Sentences: Which Corpora and Features?
Felice Dell’Orletta | Martijn Wieling | Giulia Venturi | Andrea Cimino | Simonetta Montemagni
Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications
Felice Dell’Orletta | Martijn Wieling | Giulia Venturi | Andrea Cimino | Simonetta Montemagni
Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications
T2K^2: a System for Automatically Extracting and Organizing Knowledge from Texts
Felice Dell’Orletta | Giulia Venturi | Andrea Cimino | Simonetta Montemagni
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Felice Dell’Orletta | Giulia Venturi | Andrea Cimino | Simonetta Montemagni
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
In this paper, we present T2K^2, a suite of tools for automatically extracting domain―specific knowledge from collections of Italian and English texts. T2K^2 (Text―To―Knowledge v2) relies on a battery of tools for Natural Language Processing (NLP), statistical text analysis and machine learning which are dynamically integrated to provide an accurate and incremental representation of the content of vast repositories of unstructured documents. Extracted knowledge ranges from domain―specific entities and named entities to the relations connecting them and can be used for indexing document collections with respect to different information types. T2K^2 also includes linguistic profiling functionalities aimed at supporting the user in constructing the acquisition corpus, e.g. in selecting texts belonging to the same genre or characterized by the same degree of specialization or in monitoring the added value of newly inserted documents. T2K^2 is a web application which can be accessed from any browser through a personal account which has been tested in a wide range of domains.