Yiorgos Tambouratzis


2005

pdf bib
Monolingual Corpus-based MT Using Chunks
Stella Markantonatou | Sokratis Sofianopoulos | Vassiliki Spilioti | Yiorgos Tambouratzis | Marina Vassiliou | Olga Yannoutsou | Nikos Ioannou
Workshop on example-based machine translation

In the present article, a hybrid approach is proposed for implementing a machine translation system using a large monolingual corpus coupled with a bilingual lexicon and basic NLP tools. In the first phase of the METIS system, a source language (SL) sentence, after being tagged, lemmatised and translated by a flat lemma-to-lemma lexicon, was matched against a tagged and lemmatised target language (TL) corpus using a pattern matching algorithm. In the second phase, translations are generated by combining sub-sentential structures. In this paper, the main features of the second phase are discussed while the system architecture and the corresponding translation approach are presented. The proposed methodology is illustrated with examples of the translation process.