Johann Roturier

The ACCEPT post-editing environment: a flexible and customisable online tool to perform and analyse machine translation post-editing
Johann Roturier | Linda Mitchell | David Silva
Proceedings of the 2nd Workshop on Post-editing Technology and Practice

pdf bib

Quality Estimation-guided Data Selection for Domain Adaptation of SMT
Pratyush Banerjee | Raphael Rubino | Johann Roturier | Josef van Genabith
Proceedings of Machine Translation Summit XIV: Papers

2012

pdf bib

pdf bib

Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of COLING 2012

pdf bib

Domain Adaptation in SMT of User-Generated Forum Content Guided by OOV Word Reduction: Normalization and/or Supplementary Data
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of the 16th Annual Conference of the European Association for Machine Translation

pdf bib

Evaluation of Machine-Translated User Generated Content: A pilot study based on User Ratings
Linda Mitchell | Johann Roturier
Proceedings of the 16th Annual Conference of the European Association for Machine Translation

pdf bib abs

A Detailed Analysis of Phrase-based and Syntax-based MT: The Search for Systematic Differences
Rasoul Samad Zadeh Kaljahi | Raphael Rubino | Johann Roturier | Jennifer Foster
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers

This paper describes a range of automatic and manual comparisons of phrase-based and syntax-based statistical machine translation methods applied to English-German and English-French translation of user-generated content. The syntax-based methods underperform the phrase-based models and the relaxation of syntactic constraints to broaden translation rule coverage means that these models do not necessarily generate output which is more grammatical than the output produced by the phrase-based models. Although the systems generate different output and can potentially be fruitfully combined, the lack of systematic difference between these models makes the combination task more challenging.

pdf bib abs

Using Automatic Machine Translation Metrics to Analyze the Impact of Source Reformulations
Johann Roturier | Linda Mitchell | Robert Grabowski | Melanie Siegel
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers

This paper investigates the usefulness of automatic machine translation metrics when analyzing the impact of source reformulations on the quality of machine-translated user generated content. We propose a novel framework to quickly identify rewriting rules which improve or degrade the quality of MT output, by trying to rely on automatic metrics rather than human judgments. We find that this approach allows us to quickly identify overlapping rules between two language pairs (English- French and English-German) and specific cases where the rules’ precision could be improved.

2011

pdf bib

Domain Adaptation in Statistical Machine Translation of User-Forum Data using Component Level Mixture Modelling
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of Machine Translation Summit XIII: Papers

pdf bib

Evaluation of MT Systems to Translate User Generated Content
Johann Roturier | Anthony Bensadoun
Proceedings of Machine Translation Summit XIII: Papers

pdf bib

Qualitative Analysis of Post-Editing for High Quality Machine Translation
Frédéric Blain | Jean Senellart | Holger Schwenk | Mirko Plitt | Johann Roturier
Proceedings of Machine Translation Summit XIII: Papers

pdf bib abs

In this paper, we provide a description of the Dublin City University’s (DCU) submissions in the IWSLT 2011 evaluationcampaign.1 WeparticipatedintheArabic-Englishand Chinese-English Machine Translation(MT) track translation tasks. We use phrase-based statistical machine translation (PBSMT) models to create the baseline system. Due to the open-domain nature of the data to be translated, we use domain adaptation techniques to improve the quality of translation. Furthermore, we explore target-side syntactic augmentation for an Hierarchical Phrase-Based (HPB) SMT model. Combinatory Categorial Grammar (CCG) is used to extract labels for target-side phrases and non-terminals in the HPB system. Combining the domain adapted language models with the CCG-augmented HPB system gave us the best translations for both language pairs providing statistically significant improvements of 6.09 absolute BLEU points (25.94% relative) and 1.69 absolute BLEU points (15.89% relative) over the unadapted PBSMT baselines for the Arabic-English and Chinese-English language pairs, respectively.

2010

pdf bib abs

Source Text Characteristics and Technical and Temporal Post-Editing Effort: What is Their Relationship
Midori Tatsumi | Johann Roturier
Proceedings of the Second Joint EM+/CNGL Workshop: Bringing MT to the User: Research on Integrating MT in the Translation Industry

This paper focuses on the relationship between source text characteristics (ambiguity, complexity and style compliance) and machine-translation post-editing effort (both temporal and technical). Post-editing data is collected in a traditional translation environment and subsequently plotted against textual scores produced by a range of systems. Our findings show some strong correlation between ambiguity and complexity scores and technical post-editing effort, as well as moderate correlation between one of the style guide compliance scores and temporal post-editing effort.

pdf bib

TMX Markup: A Challenge When Adapting SMT to the Localisation Environment
Jinhua Du | Johann Roturier | Andy Way
Proceedings of the 14th Annual Conference of the European Association for Machine Translation

pdf bib abs

Improving the Post-Editing Experience using Translation Recommendation: A User Study
Yifan He | Yanjun Ma | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers

We report findings from a user study with professional post-editors using a translation recommendation framework (He et al., 2010) to integrate Statistical Machine Translation (SMT) output with Translation Memory (TM) systems. The framework recommends SMT outputs to a TM user when it predicts that SMT outputs are more suitable for post-editing than the hits provided by the TM. We analyze the effectiveness of the model as well as the reaction of potential users. Based on the performance statistics and the users’ comments, we find that translation recommendation can reduce the workload of professional post-editors and improve the acceptance of MT in the localization industry.

2009

bib

Postediting Machine Translation Output Guidelines
Sharon O’Brien | Johann Roturier | Giselle de Almeida
Proceedings of Machine Translation Summit XII: Tutorials

pdf bib

Deploying Novel MT Technology to Raise the Bar for Quality at Symantec: Key Advantages and Challenge
Johann Roturier | Symantec
Proceedings of Machine Translation Summit XII: Plenaries

2007

pdf bib

How portable are controlled language rules? A comparison of two empirical MT studies
Sharon O’Brien | Johann Roturier
Proceedings of Machine Translation Summit XI: Papers

2004

pdf bib

Assessing a set of Controlled Language Rules: Can They Improve the Performance of Commercial Machine Translation Systems
Johann Roturier
Proceedings of Translating and the Computer 26

Venues

WMT2

JEC1

TC1

WS1

Fix author

Johann Roturier

2019

2015

2014

2013

2012

2011

2010

2009

2007

2004

Co-authors

Venues