2008
More Semantic Links in the SIMPLE-CLIPS Database
Nilda Ruimy, Antonio Toral
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Notwithstanding its acknowledged richness, the SIMPLE semantic model does not offer the representational vocabulary for encoding some conceptual links holding between events and their participants and among co-participants in events. Although critical for boosting performance in many NLP application tasks, such deep lexical information is therefore only partially encoded in the SIMPLE-CLIPS Italian semantic database. This paper reports on the enrichment of the SIMPLE relation set with additional expressive means, namely semantic relations borrowed from the EuroWordNet model, and on their implementation in the SIMPLE-CLIPS lexicon. The way this type of information was originally expressed in the database is described, and the borrowed descriptive vocabulary is presented. Strategies that exploit the source lexicon data were adopted to induce new information: a wide range of semantic, but also syntactic, information was investigated to single out word senses that are candidates for linking by the new relations. The enrichment of the lexicon with the 5,000 new relations instantiated so far has therefore been carried out as a largely automated, low-effort and cost-free process, with no heavy human intervention. The redundancy introduced by this extension of information is being addressed by the implementation of inheritance in the SIMPLE-CLIPS database (Del Gratta et al., 2008).
Simple-Clips ongoing research: more information with less data by implementing inheritance
Riccardo Del Gratta, Nilda Ruimy, Antonio Toral
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
This paper presents the application of inheritance to the formal taxonomy (is-a) of SIMPLE-CLIPS, a semantically rich Language Resource based on the Generative Lexicon theory. The aim is to lighten the representation of its semantic layer by reducing the number of encoded relations. The impact of introducing inheritance on space occupancy is first estimated, predicting a significant space reduction of 22%. This is corroborated by its actual application, which reduces the number of explicitly encoded relations in this lexicon by 18.4%. We then study the issues that inheritance poses to Language Resources and discuss sensible solutions to each of them, with examples. Finally, we present a discussion on the application of inheritance, from which two side benefits arise: consistency enhancement and inference capabilities.
Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet
Adriana Roventini, Nilda Ruimy
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
In the last few years, due to the increasing importance of the web, both computational tools and resources need to be more and more visible and easily accessible to a vast community of scholars, students and researchers. Furthermore, high-quality lexical resources are crucially required for a wide range of HLT-NLP applications, including word sense disambiguation. Vast and consistent electronic lexical resources do exist, which can be further enhanced and enriched through their linking and integration. An ILC project dealing with the linking of two large lexical semantic resources for the Italian language, namely ItalWordNet and PAROLE-SIMPLE-CLIPS, fits this trend. Concrete entities were already linked, and this paper addresses the semi-automatic mapping of events and abstract entities. The lexical models of the two resources, the mapping strategy and the tool that was implemented for this purpose are briefly outlined. Special focus is put on the results of the linking process: figures are reported and examples are given which illustrate the linking and harmonization of the resources as well as cases of discrepancy, mainly due to the different underlying semantic models.
2007
Mapping Concrete Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet: Methodology and Results
Adriana Roventini, Nilda Ruimy, Rita Marinelli, Marisa Ulivieri, Michele Mammini
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions
2006
Merging two Ontology-based Lexical Resources
Nilda Ruimy
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
ItalWordNet (IWN) and PAROLE/SIMPLE/CLIPS (PSC), the two largest electronic, general-purpose lexical resources for the Italian language, present many compatible aspects, although they are based on two different lexical models, each with its own underlying principles and peculiarities. Such compatibility prompted us to study the feasibility of semi-automatically linking and eventually merging the two lexicons. To this end, the ontologies on which the two lexicons are structured were mapped, and the sets of semantic relations used to relate lexical units were compared. An overview of this preliminary phase is provided in this paper. The linking methodology and related problematic issues are described. Beyond the advantage to the end user of having access to more exhaustive and in-depth lexical information that combines the potential and most outstanding features of the two lexical models, the resulting benefits and enhancements for the two resources are illustrated, which support the soundness of this linking and merging initiative.
Structuring a Domain Vocabulary in a General Knowledge Environment
Nilda Ruimy
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
The study reported here investigates the extent to which the conceptual and representational tools provided by a lexical model designed for the semantic representation of general language may suit the requirements of knowledge modelling in a domain-specific perspective. A general linguistic ontology and a set of semantic links, which allow word senses to be classified, described and interconnected, play a central role in structuring and representing such knowledge. The health and medicine vocabulary has been taken as a case study for this investigation.
2004
Semi-Automatic Derivation of a French Lexicon from CLIPS
Nilda Ruimy, Pierrette Bouillon, Bruno Cartoni
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
2002
CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data
Nilda Ruimy, Monica Monachini, Raffaella Distante, Elisabetta Guazzini, Stefano Molino, Marisa Ulivieri, Nicoletta Calzolari, Antonio Zampolli
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)
2000
SIMPLE: A General Framework for the Development of Multilingual Lexicons
Nuria Bel, Federica Busa, Nicoletta Calzolari, Elisabetta Gola, Alessandro Lenci, Monica Monachini, Antoine Ogonowski, Ivonne Peters, Wim Peters, Nilda Ruimy, Marta Villegas, Antonio Zampolli
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)
Multilingual Linguistic Resources: From Monolingual Lexicons to Bilingual Interrelated Lexicons
Marta Villegas, Nuria Bel, Alessandro Lenci, Nicoletta Calzolari, Nilda Ruimy, Antonio Zampolli, Teresa Sadurní, Joan Soler
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)