2002
pdf
bib
abs
Using word formation rules to extend MT lexicons
Claudia Gdaniec
|
Esmé Manandise
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: Technical Papers
In the IBM LMT Machine Translation (MT) system, a built-in strategy provides lexical coverage of a particular subset of words that are not listed in its bilingual lexicons. The recognition and coding of these words and their transfer generation is based on a set of derivational morphological rules. A new utility extends unfound words of this type in an LMT-compatible format in an auxiliary bilingual lexical file to be subsequently merged into the core lexicons. What characterizes this approach is the use of morphological, semantic, and syntactic features for both analysis and transfer. The auxiliary lexical file (ALF) has to be revised before a merge into the core lexicons. This utility integrates a linguistics-based analysis and transfer rules with a corpus-based method of verifying or falsifying linguistic hypotheses against extensive document translation, which in addition yields statistics on frequencies of occurrence as well as local context.
2001
pdf
bib
abs
Derivational morphology to the rescue: how it can help resolve unfound words in MT
Claudia Gdaniec
|
Esmé Manandise
|
Michael C. McCord
Proceedings of Machine Translation Summit VIII
Machine Translation (MT) systems that process unrestricted text should be able to deal with words that are not found in the MT lexicon. Without some kind of recognition, the parse may be incomplete, there is no transfer for the unfound word, and tests for transfers for surrounding words will often fail, resulting in poor translation. Interestingly, not much has been published on unfound- word guessing in the context of MT although such work has been going on for other applications. In our work on the IBM MT system, we implemented a far-reaching strategy for recognizing unfound words based on rules of word formation and for generating transfers. What distinguishes our approach from others is the use of semantic and syntactic features for both analysis and transfer, a scoring system to assign levels of confidence to possible word structures, and the creation of transfers in the transformation component. We also successfully applied rules of derivational morphological analysis to non-derived unfound words.
2000
bib
MTranslatability
Arendse Bernth
|
Claudia Gdaniec
Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Tutorial Descriptions
1998
pdf
bib
abs
Lexical choice and syntactic generation in a transfer system: transformations in the new LMT English-German system
Claudia Gdaniec
Proceedings of the Third Conference of the Association for Machine Translation in the Americas: Technical Papers
This paper argues that, contrary to received wisdom in the MT research community, a transfer system such as LMT is well suited to deal with most of the problems that MT faces. It may in fact be superior to other approaches in that it can handle target surface-structure constraints, variation of syntactic patterns, discourse-structure constraints, and stylistic preference. The paper describes the linguistic issues involved in LMT’s English⇒German transformational component, its interaction with the lexical transfer component, and types of transformations. It identifies context-dependent and context-independent transformations and among the context-dependent ones, it differentiates between those that are triggered by instructions in the lexicon, by semantic category, by syntactic context, and by setting of stylistic preference. The paper concludes with some examples of divergence between English and German and shows how LMT handles them.
1996
pdf
bib
Evolution of the Logos grammar: system design and development methodology
Patricia Schmid
|
Claudia Gdaniec
Conference of the Association for Machine Translation in the Americas
1995
pdf
bib
Constituent Shifts in the Logos English-German System
Claudia Gdaniec
|
Patricia Schmid
Proceedings of the Sixth Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages
1994
pdf
bib
The Logos Translatability Index
Claudia Gdaniec
Proceedings of the First Conference of the Association for Machine Translation in the Americas
1991
pdf
bib
A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars
E. Black
|
S. Abney
|
D. Flickenger
|
C. Gdaniec
|
R. Grishman
|
P. Harrison
|
D. Hindle
|
R. Ingria
|
F. Jelinek
|
J. Klavans
|
M. Liberman
|
M. Marcus
|
S. Roukos
|
B. Santorini
|
T. Strzalkowski
Speech and Natural Language: Proceedings of a Workshop Held at Pacific Grove, California, February 19-22, 1991