Gülşen Eryiğit - ACL Anthology

Gülşen Eryiğit

Also published as: Gülşen Eryiǧit, Gulsen Eryigit

2026

MWE-2026 Shared Task: AdMIRe 2 Advancing Multimodal Idiomaticity Representation
Doğukan Arslan | Rodrigo Wilkens | Wei He | Dilara Torunoglu Selamet | Thomas Pickard | Aline Villavicencio | Adriana Silvina Pagano | Gülşen Eryiğit
Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026)

Idiomatic expressions present a unique challenge in NLP, as their meanings are often not directly inferable from their constituent words. Despite recent advancements in large language models, idiomaticity remains a significant obstacle to robust semantic representation. We present datasets and task results for MWE-2026 Shared Task 2: Advancing Multimodal Idiomaticity Representation 2 (AdMIRe 2), which challenges the community to assess and improve models’ ability to interpret idiomatic expressions in multimodal contexts across multiple languages. Participants competed in an image ranking task in which, for each item, systems receive a context sentence containing a potentially idiomatic expression (PIE) and five candidate images. Participating systems are required to predict the sentence type (i.e., idiomatic vs. literal) for the given context and rank the images by how well they depict the intended meaning in that context. Among the participating systems the most effective methods include pipelines utilizing closed-source commercial models such as Gemini 2.5 and GPT-5, and employing chain-of-thought reasoning strategies. Methods to mitigate language models’ bias towards literal interpretations and ensembles to smooth out variance were common.

CorefInst: Leveraging LLMs for Multilingual Coreference Resolution
Tuğba Pamay Arslan | Emircan Erol | Gülşen Eryiğit
Transactions of the Association for Computational Linguistics, Volume 14

Coreference Resolution (CR) is a crucial yet challenging task in natural language understanding, often constrained by task-specific architectures and encoder-based language models that demand extensive training and lack adaptability. This study introduces the first multilingual CR methodology which leverages decoder-only LLMs to handle both overt and zero mentions. The article explores how to model the CR task for LLMs via five different instruction sets using a controlled inference method. The approach is evaluated across three LLMs: Llama 3.1, Gemma 2, and Mistral 0.3. The results indicate that LLMs, when instruction-tuned with a suitable instruction set, can surpass state-of-the-art task-specific architectures. Specifically, our best model, a fully fine-tuned Llama 3.1 for multilingual CR, outperforms the leading multilingual CR model (i.e., Corpipe 24 single stage variant) by 2 percentage points on average across all languages in the CorefUD v1.2 dataset collection.

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 5: Industry Track)
Yevgen Matusevych | Gülşen Eryiğit | Nikolaos Aletras
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 5: Industry Track)

ITUNLP at MWE-2026 AdMIRe 2: A Zero-Shot LLM Pipeline for Multimodal Idiom Understanding and Ranking
Atakan Site | Oğuz Ali Arslan | Gülşen Eryiğit
Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026)

This paper presents our system for AdMIRe 2 (Advancing Multimodal Idiomaticity Representation), a shared task on multilingual multimodal idiom understanding. The task focuses on ranking images according to how well they depict the literal or idiomatic usage of potentially idiomatic expressions (PIEs) in context, across 15 languages and two tracks: a text-only track, and a multimodal track that uses both images and captions. To tackle both tracks, we propose a hybrid zero-shot pipeline built on large vision–language models (LVLMs). Our system employs a chain-of-thought prompting scheme that first classifies each PIE usage as literal or idiomatic and then ranks candidate images by their alignment with the inferred meaning. A primary–fallback routing mechanism increases robustness to safety-filter refusals, while lightweight post-processing recovers consistent rankings from imperfect model outputs. Without any task-specific fine-tuning, our approach achieves 55.9% Top-1 Accuracy in the text-only track and 60.1% in the multimodal (text+image) track, ranking first overall on the official leaderboard. These results suggest that carefully designed zero-shot LVLM pipelines can provide strong baselines for multilingual multimodal idiomaticity benchmarks.

2025

ITU NLP at TSAR 2025 Shared Task: A Three-Stage Prompting Approach for CEFR-Oriented Text Simplification
Kutay Arda Dinç | Fatih Bektaş | Gülşen Eryiğit
Proceedings of the Fourth Workshop on Text Simplification, Accessibility and Readability (TSAR 2025)

Automatic Text Simplification (TS) makes complex texts more accessible but often lacks control over target readability levels. We propose a lightweight, prompt-based approach to English TS that explicitly aligns outputs with CEFR proficiency standards. Our method employs a three-stage pipeline, guided by rule-informed prompts inspired by expert strategies. In the TSAR 2025 Shared Task, our system achieved competitive performance, with stronger results at B1 level and challenges at A2 level due to over-simplification. These findings highlight the promise of prompt-based CEFR-oriented simplification and the need for more flexible constraint design.

Findings of the UniDive 2025 shared task on multilingual Morpho-Syntactic Parsing
Omer Goldman | Leonie Weissweiler | Kutay Acar | Diego Alves | Anna Baczkowska | Gülşen Eryiğit | Lenka Krippnerová | Adriana Pagano | Tanja Samardžić | Luigi Talamo | Alina Wróblewska | Daniel Zeman | Joakim Nivre | Reut Tsarfaty
Proceedings of The UniDive 2025 Shared Task on Multilingual Morpho-Syntactic Parsing

This paper details the findings of the 2025 UniDive shared task on multilingual morphosyntactic parsing. It introduces a new representation in which morphology and syntax are modelled jointly to form dependency trees of contentful elements, each characterized by features determined by grammatical words and morphemes. This schema allows bypassing the theoretical debate over the definition of “words” and it encourages development of parsers for typologically diverse languages. The data for the task, spanning 9 languages, was annotated based on existing Universal Dependencies (UD) treebanks that were adapted to the new format. We accompany the data with a new metric, MSLAS, that combines syntactic LAS with F1 over grammatical features. The task received two submissions, which together with three baselines give a detailed view on the ability of multi-task encoder models to cope with the task at hand. The best performing system, UM, achieved 78.7 MSLAS macro-averaged over all languages, improving by 31.4 points over the few-shot prompting baseline.

ITUNLP at SemEval-2025 Task 8: Question-Answering over Tabular Data: A Zero-Shot Approach using LLM-Driven Code Generation
Atakan Site | Emre Erdemir | Gülşen Eryiğit
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)

This paper presents our system for SemEval-2025 Task 8: DataBench, Question-Answering over Tabular Data. The primary objective of this task is to perform question answering on given tabular datasets from diverse domains, under two subtasks: DataBench QA (Subtask I) and DataBench Lite QA (Subtask II). To tackle both subtasks, we developed a zero-shot solution with a particular emphasis on leveraging Large Language Model (LLM)-based code generation. Specifically, we proposed a Python code generation framework, utilizing state-of-the-art open-source LLMs to generate executable Pandas code via optimized prompting strategies. Our experiments reveal that different LLMs exhibit varying levels of effectiveness in Python code generation. Additionally, results show that Python code generation achieves superior performance in tabular question answering compared to alternative approaches. Although our ranking among zero-shot systems is unknown at the time of this paper’s submission, our system achieved eighth place in Subtask I and sixth place in Subtask II among the 30 systems that outperformed the baseline in the open-source models category.

Using LLMs to Advance Idiom Corpus Construction
Doğukan Arslan | Hüseyin Anıl Çakmak | Gülşen Eryiğit | Joakim Nivre
Proceedings of the 21st Workshop on Multiword Expressions (MWE 2025)

Idiom corpora typically include both idiomatic and literal examples of potentially idiomatic expressions, but creating such corpora traditionally requires substantial expert effort and cost. In this article, we explore the use of large language models (LLMs) to generate synthetic idiom corpora as a more time- and cost-efficient alternative. We evaluate the effectiveness of synthetic data in training task-specific models and in testing GPT-4 in a few-shot prompting setting using synthetic data for idiomaticity detection. Our findings reveal that although models trained on synthetic data perform worse than those trained on human-generated data, synthetic data generation offers considerable advantages in terms of cost and time. Specifically, task-specific idiomaticity detection models trained on synthetic data outperform the general-purpose LLM that generated the data when evaluated in a zero-shot setting, achieving an average improvement of 11 percentage points across four languages. Moreover, synthetic data enhances the LLM’s performance, enabling it to match the task-specific models trained with synthetic data when few-shot prompting is applied.

Typology-aware Multilingual Morphosyntactic Parsing with Functional Node Filtering
Kutay Acar | Gülşen Eryiğit
Proceedings of The UniDive 2025 Shared Task on Multilingual Morpho-Syntactic Parsing

This paper presents a system for the UniDive Morphosyntactic Parsing (MSP) Shared Task, where it ranked second overall among participating teams. The task introduces a morphosyntactic representation that jointly models syntactic dependencies and morphological features by treating content-bearing elements as graph nodes and encoding functional elements as feature annotations, posing challenges for conventional parsers and necessitating more flexible, linguistically informed approaches. The proposed system combines a typology-aware, multitask parser with a multilingual content/function classifier to handle structural variance across languages. The architecture uses adapter modules and language embeddings to encode typological information. Evaluations across 9 typologically varied languages confirm that the system can accurately replicate both universal and language-specific morphosyntactic patterns.

2024

This paper presents the objectives, organization and activities of the UniDive COST Action, a scientific network dedicated to universality, diversity and idiosyncrasy in language technology. We describe the objectives and organization of this initiative, the people involved, the working groups and the ongoing tasks and activities. This paper is also an open call for participation addressed to new members and countries.

2023

Neural End-to-End Coreference Resolution using Morphological Information
Tuğba Pamay Arslan | Kutay Acar | Gülşen Eryiğit
Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution

In morphologically rich languages, words consist of morphemes containing deeper information in morphology, and thus such languages may necessitate the use of morpheme-level representations as well as word representations. This study introduces a neural multilingual end-to-end coreference resolution system by incorporating morphological information in transformer-based word embeddings on the baseline model. This proposed model participated in the Sixth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2023). Including morphological information explicitly into the coreference resolution improves the performance, especially in morphologically rich languages (e.g., Catalan, Hungarian, and Turkish). The introduced model outperforms the baseline system by 2.57 percentage points on average by obtaining 59.53% CoNLL F-score.

Incorporating Dropped Pronouns into Coreference Resolution: The case for Turkish
Tuğba Pamay Arslan | Gülşen Eryiğit
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop

Representation of coreferential relations is a challenging and actively studied topic for pro-drop and morphologically rich languages (PD-MRLs) due to dropped pronouns (e.g., null subjects and omitted possessive pronouns). These phenomena require a representation scheme at the morphology level and enhanced evaluation methods. In this paper, we propose a representation & evaluation scheme to incorporate dropped pronouns into coreference resolution and validate it on the Turkish language. Using the scheme, we extend the annotations on the only existing Turkish coreference dataset, which originally did not contain annotations for dropped pronouns. We provide publicly available pre and post processors to enhance the prominent CoNLL coreference scorer also to cover coreferential relations arising from dropped pronouns. As a final step, the paper reports the first neural Turkish coreference resolution results in the literature. Although validated on Turkish, the proposed scheme is language-independent and may be used for other PD-MRLs.

Towards Automatic Grammatical Error Type Classification for Turkish
Harun Uz | Gülşen Eryiğit
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop

Automatic error type classification is an important process in both learner corpora creation and evaluation of large-scale grammatical error correction systems. Rule-based classifier approaches such as ERRANT have been widely used to classify edits between correct-erroneous sentence pairs into predefined error categories. However, the used error categories are far from being universal, yielding many language-specific variants of ERRANT. In this paper, we discuss the applicability of the previously introduced grammatical error types to an agglutinative language, Turkish. We suggest changes on current error categories and discuss a hierarchical structure to better suit the inflectional and derivational properties of this morphologically highly rich language. We also introduce ERRANT-TR, the first automatic error type classification toolkit for Turkish. ERRANT-TR currently uses a rule-based error type classification pipeline which relies on word level morphological information. Due to unavailability of learner corpora in Turkish, the proposed system is evaluated on a small set of 106 annotated sentences and its performance is measured as 77.04% F0.5 score. The next step is to use ERRANT-TR for the development of a Turkish learner corpus.

2022

AMR Alignment for Morphologically-rich and Pro-drop Languages
K. Elif Oral | Gülşen Eryiğit
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop

Alignment between concepts in an abstract meaning representation (AMR) graph and the words within a sentence is one of the important stages of AMR parsing. Although there exist high performing AMR aligners for English, unfortunately, these are not well suited for many languages where many concepts appear from morpho-semantic elements. For the first time in the literature, this paper presents an AMR aligner tailored for morphologically-rich and pro-drop languages by experimenting on the Turkish language being a prominent example of this language group. Our aligner focuses on the meaning considering the rich Turkish morphology and aligns AMR concepts that emerge from morphemes using a tree traversal approach without additional resources or rules. We evaluate our aligner over a manually annotated gold data set in terms of precision, recall and F1 score. Our aligner outperforms the Turkish adaptations of the previously proposed aligners for English and Portuguese by an F1 score of 0.87 and provides a relative error reduction of up to 76%.

2020

Substituto – A Synchronous Educational Language Game for Simultaneous Teaching and Crowdsourcing
Marianne Grace Araneta | Gülşen Eryiğit | Alexander König | Ji-Ung Lee | Ana Luís | Verena Lyding | Lionel Nicolas | Christos Rodosthenous | Federico Sangati
Proceedings of the 9th Workshop on NLP for Computer Assisted Language Learning

Constructing Multimodal Language Learner Texts Using LARA: Experiences with Nine Languages
Elham Akhlaghi | Branislav Bédi | Fatih Bektaş | Harald Berthelsen | Matthias Butterweck | Cathy Chua | Catia Cucchiarin | Gülşen Eryiğit | Johanna Gerlach | Hanieh Habibi | Neasa Ní Chiaráin | Manny Rayner | Steinþór Steingrímsson | Helmer Strik
Proceedings of the Twelfth Language Resources and Evaluation Conference

LARA (Learning and Reading Assistant) is an open source platform whose purpose is to support easy conversion of plain texts into multimodal online versions suitable for use by language learners. This involves semi-automatically tagging the text, adding other annotations and recording audio. The platform is suitable for creating texts in multiple languages via crowdsourcing techniques that can be used for teaching a language via reading and listening. We present results of initial experiments by various collaborators where we measure the time required to produce substantial LARA resources, up to the length of short novels, in Dutch, English, Farsi, French, German, Icelandic, Irish, Swedish and Turkish. The first results are encouraging. Although there are some startup problems, the conversion task seems manageable for the languages tested so far. The resulting enriched texts are posted online and are freely available in both source and compiled form.

2019

Extracting Complex Relations from Banking Documents
Berke Oral | Erdem Emekligil | Seçil Arslan | Gülşen Eryiğit
Proceedings of the Second Workshop on Economics and Natural Language Processing

In order to automate banking processes (e.g. payments, money transfers, foreign trade), we need to extract banking transactions from different types of mediums such as faxes, e-mails, and scanners. Banking orders may be considered as complex documents since they contain quite complex relations compared to traditional datasets used in relation extraction research. In this paper, we present our method to extract intersentential, nested and complex relations from banking orders, and introduce a relation extraction method based on a maximal clique factorization technique. We demonstrate an 11% error reduction over previous methods.

Towards Turkish Abstract Meaning Representation
Zahra Azin | Gülşen Eryiğit
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop

Using rooted, directed and labeled graphs, Abstract Meaning Representation (AMR) abstracts away from syntactic features such as word order and does not annotate every constituent in a sentence. AMR has been specified for English and was not supposed to be an Interlingua. However, several studies strived to overcome divergences in the annotations between English AMRs and those of their target languages by refining the annotation specification. Following this line of research, we have started to build the first Turkish AMR corpus by hand-annotating 100 sentences of the Turkish translation of the novel “The Little Prince” and comparing the results with the English AMRs available for the same corpus. The next step is to prepare the Turkish AMR annotation specification for training future annotators.

2018

Detecting Code-Switching between Turkish-English Language Pair
Zeynep Yirmibeşoğlu | Gülşen Eryiğit
Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text

Code-switching (usage of different languages within a single conversation context in an alternative manner) is a highly increasing phenomenon in social media and colloquial usage which poses different challenges for natural language processing. This paper introduces the first study for the detection of Turkish-English code-switching and also a small test data collected from social media in order to smooth the way for further studies. The proposed system using character level n-grams and conditional random fields (CRFs) obtains 95.6% micro-averaged F1-score on the introduced test data set.

2017

Survey: Multiword Expression Processing: A Survey
Mathieu Constant | Gülşen Eryiǧit | Johanna Monti | Lonneke van der Plas | Carlos Ramisch | Michael Rosner | Amalia Todirascu
Computational Linguistics, Volume 43, Issue 4 - December 2017

Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word boundaries that are both idiosyncratic and pervasive across different languages. The structure of linguistic processing that depends on the clear distinction between words and phrases has to be re-thought to accommodate MWEs. The issue of MWE handling is crucial for NLP applications, where it raises a number of challenges. The emergence of solutions in the absence of guiding principles motivates this survey, whose aim is not only to provide a focused review of MWE processing, but also to clarify the nature of interactions between MWE processing and downstream applications. We propose a conceptual framework within which challenges and research contributions can be positioned. It offers a shared understanding of what is meant by “MWE processing,” distinguishing the subtasks of MWE discovery and identification. It also elucidates the interactions between MWE processing and two use cases: Parsing and machine translation. Many of the approaches in the literature can be differentiated according to how MWE processing is timed with respect to underlying use cases. We discuss how such orchestration choices affect the scope of MWE-aware systems. For each of the two MWE processing subtasks and for each of the two use cases, we conclude on open issues and research perspectives.

2016

TGB at SemEval-2016 Task 5: Multi-Lingual Constraint System for Aspect Based Sentiment Analysis
Fatih Samet Çetin | Ezgi Yıldırım | Can Özbey | Gülşen Eryiğit
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

Universal Dependencies for Turkish
Umut Sulubacak | Memduh Gokirmak | Francis Tyers | Çağrı Çöltekin | Joakim Nivre | Gülşen Eryiğit
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank.

2015

Annotation and Extraction of Multiword Expressions in Turkish Treebanks
Gülşen Eryiǧit | Kübra Adali | Dilara Torunoğlu-Selamet | Umut Sulubacak | Tuğba Pamay
Proceedings of the 11th Workshop on Multiword Expressions

Transition-based Dependency DAG Parsing Using Dynamic Oracles
Alper Tokgöz | Gülşen Eryiǧit
Proceedings of the ACL-IJCNLP 2015 Student Research Workshop

The Annotation Process of the ITU Web Treebank
Tuğba Pamay | Umut Sulubacak | Dilara Torunoğlu-Selamet | Gülşen Eryiğit
Proceedings of the 9th Linguistic Annotation Workshop

Using Finite State Transducers for Helping Foreign Language Learning
Hasan Kaya | Gülşen Eryiğit
Proceedings of the 2nd Workshop on Natural Language Processing Techniques for Educational Applications

2014

ITU Turkish NLP Web Service
Gülşen Eryiğit
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

A Cascaded Approach for Social Media Text Normalization of Turkish
Dilara Torunoǧlu | Gülşen Eryiǧit
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM)

Vowel and Diacritic Restoration for Social Media Texts
Kübra Adali | Gülşen Eryiǧit
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM)

2013

TURKSENT: A Sentiment Annotation Tool for Social Media
Gülşen Eryiǧit | Fatih Samet Çetin | Meltem Yanık | Tanel Temel | İlyas Çiçekli
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse

Representation of Morphosyntactic Units and Coordination Structures in the Turkish Dependency Treebank
Umut Sulubacak | Gülşen Eryiğit
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages

2012

Disambiguating Main POS tags for Turkish
Razieh Ehsani | Muzaffer Ege Alper | Gülşen Eryiğit | Eşref Adali
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing (ROCLING 2012)

The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish
Gülşen Eryiğit
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

The studies on dependency parsing of Turkish so far gave their results on the Turkish Dependency Treebank. This treebank consists of sentences where gold standard part-of-speech tags are manually assigned to each word and the words forming multi word expressions are also manually determined and combined into single units. For the first time, we investigate the results of parsing Turkish sentences from scratch and observe the accuracy drop at the end of processing raw data. We test one state-of-the-art morphological analyzer together with two different morphological disambiguators. We show separately the accuracy drop due to the automatic morphological processing and the drop due to the lack of multi word unit extraction. For this purpose, we use and present a new version of the Turkish Treebank where we detached the multi word expressions (MWEs) into multiple tokens and manually annotated the missing part-of-speech tags of these new tokens.

Initial Explorations on using CRFs for Turkish Named Entity Recognition
Gökhan Akın Şeker | Gülşen Eryiğit
Proceedings of COLING 2012

Word Alignment for English-Turkish Language Pair
Mehmet Talha Çakmak | Süleyman Acar | Gülşen Eryiğit
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

Word alignment is an important step for machine translation systems. Although the alignment performance between grammatically similar languages is reported to be very high in many studies, the case is not the same for language pairs from different language families. In this study, we are focusing on English-Turkish language pairs. Turkish is a highly agglutinative language with a very productive and rich morphology whereas English has a very poor morphology when compared to this language. As a result of this, one Turkish word is usually aligned with several English words. The traditional models which use word-level alignment approaches generally fail in such circumstances. In this study, we evaluate a Giza++ system by splitting the words into their morphological units (stem and suffixes) and compare the model with the traditional one. For the first time, we evaluate the performance of our aligner on gold standard parallel sentences rather than in a real machine translation system. Our approach reduced the alignment error rate by 40% relative. Finally, a new test corpus of 300 manually aligned sentences is released together with this study.

2011

Multiword Expressions in Statistical Dependency Parsing
Gülşen Eryiğit | Tugay İlbay | Ozan Arkan Can
Proceedings of the Second Workshop on Statistical Parsing of Morphologically Rich Languages

2008

Dependency Parsing of Turkish
Gülşen Eryiğit | Joakim Nivre | Kemal Oflazer
Computational Linguistics, Volume 34, Number 3, September 2008

Erratum: Dependency Parsing of Turkish
Gülşen Eryiğit | Joakim Nivre | Kemal Oflazer
Computational Linguistics, Volume 34, Number 4, December 2008

2007

Single Malt or Blended? A Study in Multilingual Parser Optimization
Johan Hall | Jens Nilsson | Joakim Nivre | Gülşen Eryiǧit | Beáta Megyesi | Mattias Nilsson | Markus Saers
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

ITU Treebank Annotation Tool
Gülşen Eryiǧit
Proceedings of the Linguistic Annotation Workshop

2006

Statistical Dependency Parsing for Turkish
Gülşen Eryiǧit | Kemal Oflazer
11th Conference of the European Chapter of the Association for Computational Linguistics

Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines
Joakim Nivre | Johan Hall | Jens Nilsson | Gülşen Eryiǧit | Svetoslav Marinov
Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X)

Co-authors

Tuğba Pamay Arslan 3

Doğukan Arslan 2

Fatih Bektaş 2

Carlos Ramisch 2

Alina Wróblewska 2

Fatih Samet Çetin 2

Mohammad AL-Smadi 1

Süleyman Acar 1

Elham Akhlaghi 1

Mahmoud Al-Ayyoub 1

Nikolaos Aletras 1

Muzaffer Ege Alper 1

Ion Androutsopoulos 1

Marianna Apidianaki 1

Marianne Grace Araneta 1

Seçil Arslan 1

Oğuz Ali Arslan 1

Verginica Barbu Mititelu 1

Anabela Barreiro 1

Harald Berthelsen 1

Matthias Butterweck 1

Branislav Bédi 1

Anna Bączkowska 1

Olesea Caftanatov 1

Ozan Arkan Can 1

Ilyas Cicekli 1

Cagri Coltekin 1

Matthieu Constant 1

Catia Cucchiarin 1

Orphee De Clercq 1

Kutay Arda Dinç 1

Kaja Dobrovoljc 1

Razieh Ehsani 1

Erdem Emekligil 1

Dimitrios Galanis 1

Johanna Gerlach 1

Bruno Guillaume 1

Memduh Gökırmak 1

Hanieh Habibi 1

Veronique Hoste 1

Salud María Jiménez-Zafra 1

Evgeny Kotelnikov 1

Lenka Krippnerová 1

Alexander König 1

Natalia Loukachevitch 1

Verena Lyding 1

Suresh Manandhar 1

Svetoslav Marinov 1

Stella Markantonatou 1

Yevgen Matusevych 1

Beáta Megyesi 1

Johanna Monti 1

Lionel Nicolas 1

Mattias Nilsson 1

Neasa Ní Chiaráin 1

Atul Kr. Ojha 1

Adriana Silvina Pagano 1

Adriana Pagano 1

Harris Papageorgiou 1

Thomas Pickard 1

Maria Pontiki 1

Bing Qin (秦兵) 1

Christos Rodosthenous 1

Michael Rosner 1

Tanja Samardzic 1

Federico Sangati 1

Steinþór Steingrímsson 1

Xavier Tannier 1

Amalia Todirascu 1

Alper Tokgöz 1

Francis Tyers 1

Aline Villavicencio 1

Abigail Walsh 1

Leonie Weissweiler 1

Rodrigo Wilkens 1

Beata Wójtowicz 1

Meltem Yanık 1

Zeynep Yirmibeşoğlu 1

Ezgi Yıldırım 1

Marie-Catherine de Marneffe 1

Lonneke van der Plas 1

Mehmet Talha Çakmak 1

Hüseyin Anıl Çakmak 1

Gökhan Akın Şeker 1

Venues