Aarne Ranta

2021

Grammar-Based Concept Alignment for Domain-Specific Machine Translation
Arianna Masciolini | Aarne Ranta
Proceedings of the Seventh International Workshop on Controlled Natural Language (CNL 2020/21)

pdf bib

Traitement Automatique des Langues, Volume 62, Numéro 3 : Diversité Linguistique [Linguistic Diversity in Natural Language Processing]
Aarne Ranta | Cyril Goutte
Traitement Automatique des Langues, Volume 62, Numéro 3 : Diversité Linguistique [Linguistic Diversity in Natural Language Processing]

pdf bib

Linguistic Diversity in Natural Language Processing
Aarne Ranta | Cyril Goutte
Traitement Automatique des Langues, Volume 62, Numéro 3 : Diversité Linguistique [Linguistic Diversity in Natural Language Processing]

2020

pdf bib abs

Abstract Syntax as Interlingua: Scaling Up the Grammatical Framework from Controlled Languages to Robust Pipelines
Aarne Ranta | Krasimir Angelov | Normunds Gruzitis | Prasanth Kolachina
Computational Linguistics, Volume 46, Issue 2 - June 2020

Abstract syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the consistency of hand-annotated corpora, and to pipe analyses into high-precision semantic back ends. This article gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF.

2019

pdf bib

Grammatical Framework: an Interlingual Grammar Formalism
Aarne Ranta
Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing

pdf bib abs

Bootstrapping UD treebanks for Delexicalized Parsing
Prasanth Kolachina | Aarne Ranta
Proceedings of the 22nd Nordic Conference on Computational Linguistics

Standard approaches to treebanking traditionally employ a waterfall model (Sommerville, 2010), where annotation guidelines guide the annotation process and insights from the annotation process in turn lead to subsequent changes in the annotation guidelines. This process remains a very expensive step in creating linguistic resources for a target language, necessitates both linguistic expertise and manual effort to develop the annotations and is subject to inconsistencies in the annotation due to human errors. In this paper, we propose an alternative approach to treebanking—one that requires writing grammars. This approach is motivated specifically in the context of Universal Dependencies, an effort to develop uniform and cross-lingually consistent treebanks across multiple languages. We show here that a bootstrapping approach to treebanking via interlingual grammars is plausible and useful in a process where grammar engineering and treebanking are jointly pursued when creating resources for the target language. We demonstrate the usefulness of synthetic treebanks in the task of delexicalized parsing. Our experiments reveal that simple models for treebank generation are cheaper than human annotated treebanks, especially in the lower ends of the learning curves for delexicalized parsing, which is relevant in particular in the context of low-resource languages.

Chinese in the Grammatical Framework: Grammar, Translation, and Other Applications Anonymous
Aarne Ranta | Yan Tian | Haiyan Qiao
Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing

pdf bib

Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
Emily M. Bender | Lori Levin | Stefan Müller | Yannick Parmentier | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib

Grammar Engineering for a Customer: a Case Study with Five Languages
Aarne Ranta | Christina Unger | Daniel Vidal Hussey
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib

A Cloud-Based Editor for Multilingual Grammars
Thomas Hallgren | Ramona Enache | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib

Formalising the Swedish Constructicon in Grammatical Framework
Normunds Gruzitis | Dana Dannélls | Benjamin Lyngfelt | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

2014

pdf bib

Speech-Enabled Hybrid Multilingual Translation for Mobile Devices
Krasimir Angelov | Björn Bringert | Aarne Ranta
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib abs

Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium
Grégoire Détrez | Víctor M. Sánchez-Cartagena | Aarne Ranta
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

In this paper, we describe two methods developed for sharing linguistic data between two free and open source rule based machine translation systems: Apertium, a shallow-transfer system; and Grammatical Framework (GF), which performs a deeper syntactic transfer. In the first method, we describe the conversion of lexical data from Apertium to GF, while in the second one we automatically extract Apertium shallow-transfer rules from a GF bilingual grammar. We evaluated the resulting systems in a English-Spanish translation context, and results showed the usefulness of the resource sharing and confirmed the a-priori strong and weak points of the systems involved.

pdf bib

Types and Records for Predication
Aarne Ranta
Proceedings of the EACL 2014 Workshop on Type Theory and Natural Language Semantics (TTNLS)

pdf bib

Developing an interlingual translation lexicon using WordNets and Grammatical Framework
Shafqat Mumtaz Virk | K.V.S. Prasad | Aarne Ranta | Krasimir Angelov
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing