Representing the Translation Relation in a Bilingual Wordnet

Jyrki Niemi, Krister Lindén


Abstract
This paper describes how translations are represented in the Finnish wordnet, FinnWordNet (FiWN), and how the FiWN database was constructed. FiWN was created by translating all the word senses of the Princeton WordNet (PWN) into Finnish and by joining the translations with the semantic and lexical relations of PWN, extracted into a relational database format. The approach naturally resulted in a translation relation between PWN and FiWN. Unlike in many other multilingual wordnets, the translation relation in FiWN is not primarily on the synset level but on the level of individual word senses, which allows more precise translation correspondences. The word-sense-level relation can easily be projected into a synset-level translation relation, which is used for linking with other wordnets, for example, via Core WordNet. Synset-level translations are also used as a default in the absence of word-sense translations. The FiWN data in the relational database can be converted to other formats. In the PWN database format, translations are attached to source-language words, which allows a Web search interface to double as a bilingual dictionary. Another representation encodes the translation relation as a finite-state transducer.
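
The projection from word-sense-level translations to synset-level translations, with the synset level serving as a default, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the FiWN implementation: the dictionary layout, the synset identifiers, the function names `project_to_synsets` and `translations_for`, and the sample entries are all hypothetical and are not taken from the FiWN database.

```python
from collections import defaultdict

# Hypothetical sense-level translation relation (illustrative data only):
# (synset id, English word) -> list of Finnish translations.
sense_translations = {
    ("syn-dog-1", "dog"): ["koira"],
    ("syn-dog-1", "domestic dog"): ["kesykoira"],
    ("syn-cat-1", "cat"): ["kissa"],
}


def project_to_synsets(sense_map):
    """Collapse sense-level translations into a synset-level relation."""
    synset_map = defaultdict(list)
    for (synset_id, _word), targets in sense_map.items():
        for target in targets:
            # Keep each translation once per synset, in first-seen order.
            if target not in synset_map[synset_id]:
                synset_map[synset_id].append(target)
    return dict(synset_map)


def translations_for(synset_id, word, sense_map, synset_map):
    """Prefer a word-sense translation; otherwise use the synset default."""
    if (synset_id, word) in sense_map:
        return sense_map[(synset_id, word)]
    return synset_map.get(synset_id, [])


synset_translations = project_to_synsets(sense_translations)
```

In this sketch, a word that belongs to a synset but has no translation of its own (for instance a hypothetical `("syn-dog-1", "Canis familiaris")` lookup) receives every translation collected for that synset, which mirrors the default behaviour the abstract describes.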
Anthology ID:
L12-1054
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2439–2446
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/194_Paper.pdf
Cite (ACL):
Jyrki Niemi and Krister Lindén. 2012. Representing the Translation Relation in a Bilingual Wordnet. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2439–2446, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Representing the Translation Relation in a Bilingual Wordnet (Niemi & Lindén, LREC 2012)