Massimiliano Ciaramita


2022

pdf bib
Decoding a Neural Retriever’s Latent Space for Query Suggestion
Leonard Adolphs | Michelle Chen Huebscher | Christian Buck | Sertan Girgin | Olivier Bachem | Massimiliano Ciaramita | Thomas Hofmann
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

Neural retrieval models have superseded classic bag-of-words methods such as BM25 as the retrieval framework of choice. However, neural systems lack the interpretability of bag-of-words models; it is not trivial to connect a query change to a change in the latent space that ultimately determines the retrieval results. To shed light on this embedding space, we learn a “query decoder” that, given a latent representation of a neural search engine, generates the corresponding query. We show that it is possible to decode a meaningful query from its latent representation and, when moving in the right direction in latent space, to decode a query that retrieves the relevant paragraph. In particular, the query decoder can be useful to understand “what should have been asked” to retrieve a particular paragraph from the collection. We employ the query decoder to generate a large synthetic dataset of query reformulations for MSMarco, leading to improved retrieval performance. On this data, we train a pseudo-relevance feedback (PRF) T5 model for the application of query suggestion that outperforms both query reformulation and PRF information retrieval baselines.

2015

pdf bib
A Computationally Efficient Algorithm for Learning Topical Collocation Models
Zhendong Zhao | Lan Du | Benjamin Börschinger | John K Pate | Massimiliano Ciaramita | Mark Steedman | Mark Johnson
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)

pdf bib
Using Entity Information from a Knowledge Base to Improve Relation Extraction
Lan Du | Anish Kumar | Mark Johnson | Massimiliano Ciaramita
Proceedings of the Australasian Language Technology Association Workshop 2015

2011

pdf bib
Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition
Stefan Rüd | Massimiliano Ciaramita | Jens Müller | Hinrich Schütze
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

pdf bib
Learning to Rank Answers to Non-Factoid Questions from Web Collections
Mihai Surdeanu | Massimiliano Ciaramita | Hugo Zaragoza
Computational Linguistics, Volume 37, Issue 2 - June 2011

2010

pdf bib
Learning Dense Models of Query Similarity from User Click Logs
Fabio De Bona | Stefan Riezler | Keith Hall | Massimiliano Ciaramita | Amaç Herdaǧdelen | Maria Holmqvist
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

pdf bib
Adaptive Parameters for Entity Recognition with Perceptron HMMs
Massimiliano Ciaramita | Olivier Chapelle
Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing

pdf bib
Instance Sense Induction from Attribute Sets
Ricardo Martin-Brualla | Enrique Alfonseca | Marius Pasca | Keith Hall | Enrique Robledo-Arnuncio | Massimiliano Ciaramita
Coling 2010: Posters

2009

pdf bib
Company-Oriented Extractive Summarization of Financial News
Katja Filippova | Mihai Surdeanu | Massimiliano Ciaramita | Hugo Zaragoza
Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)

pdf bib
Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries
Enrique Alfonseca | Massimiliano Ciaramita | Keith Hall
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing

pdf bib
The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages
Jan Hajič | Massimiliano Ciaramita | Richard Johansson | Daisuke Kawahara | Maria Antònia Martí | Lluís Màrquez | Adam Meyers | Joakim Nivre | Sebastian Padó | Jan Štěpánek | Pavel Straňák | Mihai Surdeanu | Nianwen Xue | Yi Zhang
Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task

2008

pdf bib
DeSRL: A Linear-Time Semantic Role Labeling System
Massimiliano Ciaramita | Giuseppe Attardi | Felice Dell’Orletta | Mihai Surdeanu
CoNLL 2008: Proceedings of the Twelfth Conference on Computational Natural Language Learning

pdf bib
Learning to Rank Answers on Large Online QA Collections
Mihai Surdeanu | Massimiliano Ciaramita | Hugo Zaragoza
Proceedings of ACL-08: HLT

pdf bib
Semantically Annotated Snapshot of the English Wikipedia
Jordi Atserias | Hugo Zaragoza | Massimiliano Ciaramita | Giuseppe Attardi
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

This paper describes SW1, the first version of a semantically annotated snapshot of the English Wikipedia. In recent years Wikipedia has become a valuable resource for both the Natural Language Processing (NLP) community and the Information Retrieval (IR) community. Although NLP technology for processing Wikipedia already exists, not all researchers and developers have the computational resources to process such a volume of information. Moreover, the use of different versions of Wikipedia processed differently might make it difficult to compare results. The aim of this work is to provide easy access to syntactic and semantic annotations for researchers of both NLP and IR communities by building a reference corpus to homogenize experiments and make results comparable. These resources, a semantically annotated corpus and a “entity containment” derived graph, are licensed under the GNU Free Documentation License and available from http://www.yr-bcn.es/semanticWikipedia

pdf bib
Supersense Tagger for Italian
Davide Picca | Alfio Massimiliano Gliozzo | Massimiliano Ciaramita
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

In this paper we present the procedure we followed to develop the Italian Super Sense Tagger. In particular, we adapted the English SuperSense Tagger to the Italian Language by exploiting a parallel sense labeled corpus for training. As for English, the Italian tagger uses a fixed set of 26 semantic labels, called supersenses, achieving a slightly lower accuracy due to the lower quality of the Italian training data. Both taggers accomplish the same task of identifying entities and concepts belonging to a common set of ontological types. This parallelism allows us to define effective methodologies for a broad range of cross-language knowledge acquisition tasks

2007

pdf bib
UNT-Yahoo: SuperSenseLearner: Combining SenseLearner with SuperSense and other Coarse Semantic Features
Rada Mihalcea | Andras Csomai | Massimiliano Ciaramita
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)

pdf bib
Tree Revision Learning for Dependency Parsing
Giuseppe Attardi | Massimiliano Ciaramita
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference

pdf bib
Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information
Massimiliano Ciaramita | Giuseppe Attardi
Proceedings of the Tenth International Conference on Parsing Technologies

pdf bib
Multilingual Dependency Parsing and Domain Adaptation using DeSR
Giuseppe Attardi | Felice Dell’Orletta | Maria Simi | Atanas Chanev | Massimiliano Ciaramita
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

2006

pdf bib
A Figure of Merit for the Evaluation of Web-Corpus Randomness
Massimiliano Ciaramita | Marco Baroni
11th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Broad-Coverage Sense Disambiguation and Information Extraction with a Supersense Sequence Tagger
Massimiliano Ciaramita | Yasemin Altun
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing

2004

pdf bib
Multi-component Word Sense Disambiguation
Massimiliano Ciaramita | Mark Johnson
Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text

2003

pdf bib
Supersense Tagging of Unknown Nouns in WordNet
Massimiliano Ciaramita | Mark Johnson
Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing

2002

pdf bib
Boosting automatic lexical acquisition with morphological information
Massimiliano Ciaramita
Proceedings of the ACL-02 Workshop on Unsupervised Lexical Acquisition

2000

pdf bib
Explaining away ambiguity: Learning verb selectional preference with Bayesian networks
Massimiliano Ciaramita | Mark Johnson
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics