2020
pdf
bib
abs
Keyphrase Generation with GANs in Low-Resources Scenarios
Giuseppe Lancioni
|
Saida S.Mohamed
|
Beatrice Portelli
|
Giuseppe Serra
|
Carlo Tasso
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing
Keyphrase Generation is the task of predicting Keyphrases (KPs), short phrases that summarize the semantic meaning of a given document. Several past studies provided diverse approaches to generate Keyphrases for an input document. However, all of these approaches still need to be trained on very large datasets. In this paper, we introduce BeGanKP, a new conditional GAN model to address the problem of Keyphrase Generation in a low-resource scenario. Our main contribution relies in the Discriminator’s architecture: a new BERT-based module which is able to distinguish between the generated and humancurated KPs reliably. Its characteristics allow us to use it in a low-resource scenario, where only a small amount of training data are available, obtaining an efficient Generator. The resulting architecture achieves, on five public datasets, competitive results with respect to the state-of-the-art approaches, using less than 1% of the training data.
2018
pdf
bib
abs
Predicting the Usefulness of Amazon Reviews Using Off-The-Shelf Argumentation Mining
Marco Passon
|
Marco Lippi
|
Giuseppe Serra
|
Carlo Tasso
Proceedings of the 5th Workshop on Argument Mining
Internet users generate content at unprecedented rates. Building intelligent systems capable of discriminating useful content within this ocean of information is thus becoming a urgent need. In this paper, we aim to predict the usefulness of Amazon reviews, and to do this we exploit features coming from an off-the-shelf argumentation mining system. We argue that the usefulness of a review, in fact, is strictly related to its argumentative content, whereas the use of an already trained system avoids the costly need of relabeling a novel dataset. Results obtained on a large publicly available corpus support this hypothesis.
2017
pdf
bib
abs
Exploiting and Evaluating a Supervised, Multilanguage Keyphrase Extraction pipeline for under-resourced languages
Marco Basaldella
|
Muhammad Helmy
|
Elisa Antolli
|
Mihai Horia Popescu
|
Giuseppe Serra
|
Carlo Tasso
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
This paper evaluates different techniques for building a supervised, multilanguage keyphrase extraction pipeline for languages which lack a gold standard. Starting from an unsupervised English keyphrase extraction pipeline, we implement pipelines for Arabic, Italian, Portuguese, and Romanian, and we build test collections for languages which lack one. Then, we add a Machine Learning module trained on a well-known English language corpus and we evaluate the performance not only over English but on the other languages as well. Finally, we repeat the same evaluation after training the pipeline over an Arabic language corpus to check whether using a language-specific corpus brings a further improvement in performance. On the five languages we analyzed, results show an improvement in performance when using a machine learning algorithm, even if such algorithm is not trained and tested on the same language.
2016
pdf
bib
abs
Evaluating anaphora and coreference resolution to improve automatic keyphrase extraction
Marco Basaldella
|
Giorgia Chiaradia
|
Carlo Tasso
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
In this paper we analyze the effectiveness of using linguistic knowledge from coreference and anaphora resolution for improving the performance for supervised keyphrase extraction. In order to verify the impact of these features, we define a baseline keyphrase extraction system and evaluate its performance on a standard dataset using different machine learning algorithms. Then, we consider new sets of features by adding combinations of the linguistic features we propose and we evaluate the new performance of the system. We also use anaphora and coreference resolution to transform the documents, trying to simulate the cohesion process performed by the human mind. We found that our approach has a slightly positive impact on the performance of automatic keyphrase extraction, in particular when considering the ranking of the results.
1991
pdf
bib
Teaching the English Tense: Integrating Naive and Formal Grammars in an Intelligent Tutor for Foreign Language Teaching
Danilo Fum
|
Bruno Pani
|
Carlo Tasso
Fifth Conference of the European Chapter of the Association for Computational Linguistics
1989
pdf
bib
Tense Generation in an Intelligent Tutor for Foreign Language Teaching: Some Issues in the Design of the Verb Expert
Danilo Fum
|
Paolo Giangrandi
|
Carlo Tasso
Fourth Conference of the European Chapter of the Association for Computational Linguistics
1988
pdf
bib
A Distributed Multi-Agent Architecture for Natural Language Processing
Danilo Fum
|
Giovanni Guida
|
Carlo Tasso
Coling Budapest 1988 Volume 2: International Conference on Computational Linguistics
1986
pdf
bib
Tailoring Importance Evaluation to Reader’s Goals: A Contribution to Descriptive Text Summarization
Danilo Fum
|
Giovanni Guida
|
Carlo Tasso
Coling 1986 Volume 1: The 11th International Conference on Computational Linguistics
1985
pdf
bib
A Rule-Based Approach to Evaluating Importance in Descriptive Texts
Danilo Fum
|
Giovanni Guida
|
Carlo Tasso
Second Conference of the European Chapter of the Association for Computational Linguistics
1983
pdf
bib
IR-NLI : An Expert Natural Language Interface to Online Data Bases
Giovanni Guida
|
Carlo Tasso
First Conference on Applied Natural Language Processing
1982
pdf
bib
Forward and Backward Reasoning in Automatic Abstracting
Danilo Fum
|
Giovanni Guida
|
Carlo Tasso
Coling 1982: Proceedings of the Ninth International Conference on Computational Linguistics