2022
pdf
bib
abs
Morpholex Turkish: A Morphological Lexicon for Turkish
Bilge Arican
|
Aslı Kuzgun
|
Büşra Marşan
|
Deniz Baran Aslan
|
Ezgi Saniyar
|
Neslihan Cesur
|
Neslihan Kara
|
Oguzhan Kuyrukcu
|
Merve Ozcelik
|
Arife Betul Yenice
|
Merve Dogan
|
Ceren Oksal
|
Gökhan Ercan
|
Olcay Taner Yıldız
Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference
MorphoLex is a study in which root, prefix and suffixes of words are analyzed. With MorphoLex, many words can be analyzed according to certain rules and a useful database can be created. Due to the fact that Turkish is an agglutinative language and the richness of its language structure, it offers different analyzes and results from previous studies in MorphoLex. In this study, we revealed the process of creating a database with 48,472 words and the results of the differences in language structure.
pdf
bib
abs
Time Travel in Turkish: WordNets for Modern Turkish
Ceren Oksal
|
Hikmet N. Oguz
|
Mert Catal
|
Nurkay Erbay
|
Ozgecan Yuzer
|
Ipek B. Unsal
|
Oguzhan Kuyrukcu
|
Arife B. Yenice
|
Aslı Kuzgun
|
Büşra Marşan
|
Ezgi Sanıyar
|
Bilge Arican
|
Merve Dogan
|
Özge Bakay
|
Olcay Taner Yıldız
Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference
Wordnets have been popular tools for providing and representing semantic and lexical relations of languages. They are useful tools for various purposes in NLP studies. Many researches created WordNets for different languages. For Turkish, there are two WordNets, namely the Turkish WordNet of BalkaNet and KeNet. In this paper, we present new WordNets for Turkish each of which is based on one of the first 9 editions of the Turkish dictionary starting from the 1944 edition. These WordNets are historical in nature and make implications for Modern Turkish. They are developed by extending KeNet, which was created based on the 2005 and 2011 editions of the Turkish dictionary. In this paper, we explain the steps in creating these 9 new WordNets for Turkish, discuss the challenges in the process and report comparative results about the WordNets.
pdf
bib
abs
WordNet and Wikipedia Connection in Turkish WordNet KeNet
Merve Doğan
|
Ceren Oksal
|
Arife Betül Yenice
|
Fatih Beyhan
|
Reyyan Yeniterzi
|
Olcay Taner Yıldız
Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference
This paper aims to present WordNet and Wikipedia connection by linking synsets from Turkish WordNet KeNet with Wikipedia and thus, provide a better machine-readable dictionary to create an NLP model with rich data. For this purpose, manual mapping between two resources is realized and 11,478 synsets are linked to Wikipedia. In addition to this, automatic linking approaches are utilized to analyze possible connection suggestions. Baseline Approach and ElasticSearch Based Approach help identify the potential human annotation errors and analyze the effectiveness of these approaches in linking. Adopting both manual and automatic mapping provides us with an encompassing resource of WordNet and Wikipedia connections.