Johann Roturier


2019

pdf bib
Bootstrapping a Natural Language Interface to a Cyber Security Event Collection System using a Hybrid Translation Approach
Johann Roturier | Brian Schlatter | David Silva Schlatter
Proceedings of Machine Translation Summit XVII: Translator, Project and User Tracks

2015

pdf bib
Foreebank: Syntactic Analysis of Customer Support Forums
Rasoul Kaljahi | Jennifer Foster | Johann Roturier | Corentin Ribeyre | Teresa Lynn | Joseph Le Roux
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing

2014

pdf bib
The ACCEPT Portal: An Online Framework for the Pre-editing and Post-editing of User-Generated Content
Violeta Seretan | Johann Roturier | David Silva | Pierrette Bouillon
Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation

pdf bib
Syntax and Semantics in Quality Estimation of Machine Translation
Rasoul Kaljahi | Jennifer Foster | Johann Roturier
Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation

pdf bib
Quality Estimation of English-French Machine Translation: A Detailed Study of the Role of Syntax
Rasoul Kaljahi | Jennifer Foster | Johann Roturier | Raphael Rubino
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers

pdf bib
Semantic Role Labelling with minimal resources: Experiments with French
Rasoul Kaljahi | Jennifer Foster | Johann Roturier
Proceedings of the Third Joint Conference on Lexical and Computational Semantics (*SEM 2014)

pdf bib
Using the ACCEPT framework to conduct an online community-based translation evaluation study
Linda Mitchell | Johann Roturier | David Silva
Proceedings of the 17th Annual conference of the European Association for Machine Translation

2013

pdf bib
Parser Accuracy in Quality Estimation of Machine Translation: A Tree Kernel Approach
Rasoul Samad Zadeh Kaljahi | Jennifer Foster | Raphael Rubino | Johann Roturier | Fred Hollowood
Proceedings of the Sixth International Joint Conference on Natural Language Processing

pdf bib
Estimating the Quality of Translated User-Generated Content
Raphael Rubino | Jennifer Foster | Rasoul Samad Zadeh Kaljahi | Johann Roturier | Fred Hollowood
Proceedings of the Sixth International Joint Conference on Natural Language Processing

pdf bib
DCU-Symantec at the WMT 2013 Quality Estimation Shared Task
Raphael Rubino | Joachim Wagner | Jennifer Foster | Johann Roturier | Rasoul Samad Zadeh Kaljahi | Fred Hollowood
Proceedings of the Eighth Workshop on Statistical Machine Translation

pdf bib
Quality Estimation-guided Data Selection for Domain Adaptation of SMT
Pratyush Banerjee | Raphael Rubino | Johann Roturier | Josef van Genabith
Proceedings of Machine Translation Summit XIV: Papers

pdf bib
Community-based post-editing of machine-translated content: monolingual vs. bilingual
Linda Mitchell | Johann Roturier | Sharon O’Brien
Proceedings of the 2nd Workshop on Post-editing Technology and Practice

pdf bib
The ACCEPT post-editing environment: a flexible and customisable online tool to perform and analyse machine translation post-editing
Johann Roturier | Linda Mitchell | David Silva
Proceedings of the 2nd Workshop on Post-editing Technology and Practice

2012

pdf bib
DCU-Symantec Submission for the WMT 2012 Quality Estimation Task
Raphael Rubino | Jennifer Foster | Joachim Wagner | Johann Roturier | Rasul Samad Zadeh Kaljahi | Fred Hollowood
Proceedings of the Seventh Workshop on Statistical Machine Translation

pdf bib
Translation Quality-Based Supplementary Data Selection by Incremental Update of Translation Models
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of COLING 2012

pdf bib
Evaluation of Machine-Translated User Generated Content: A pilot study based on User Ratings
Linda Mitchell | Johann Roturier
Proceedings of the 16th Annual conference of the European Association for Machine Translation

pdf bib
Domain Adaptation in SMT of User-Generated Forum Content Guided by OOV Word Reduction: Normalization and/or Supplementary Data
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of the 16th Annual conference of the European Association for Machine Translation

pdf bib
Using Automatic Machine Translation Metrics to Analyze the Impact of Source Reformulations
Johann Roturier | Linda Mitchell | Robert Grabowski | Melanie Siegel
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers

This paper investigates the usefulness of automatic machine translation metrics when analyzing the impact of source reformulations on the quality of machine-translated user generated content. We propose a novel framework to quickly identify rewriting rules which improve or degrade the quality of MT output, by trying to rely on automatic metrics rather than human judgments. We find that this approach allows us to quickly identify overlapping rules between two language pairs (English- French and English-German) and specific cases where the rules’ precision could be improved.

pdf bib
A Detailed Analysis of Phrase-based and Syntax-based MT: The Search for Systematic Differences
Rasoul Samad Zadeh Kaljahi | Raphael Rubino | Johann Roturier | Jennifer Foster
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers

This paper describes a range of automatic and manual comparisons of phrase-based and syntax-based statistical machine translation methods applied to English-German and English-French translation of user-generated content. The syntax-based methods underperform the phrase-based models and the relaxation of syntactic constraints to broaden translation rule coverage means that these models do not necessarily generate output which is more grammatical than the output produced by the phrase-based models. Although the systems generate different output and can potentially be fruitfully combined, the lack of systematic difference between these models makes the combination task more challenging.

2011

pdf bib
Qualitative Analysis of Post-Editing for High Quality Machine Translation
Frédéric Blain | Jean Senellart | Holger Schwenk | Mirko Plitt | Johann Roturier
Proceedings of Machine Translation Summit XIII: Papers

pdf bib
Evaluation of MT Systems to Translate User Generated Content
Johann Roturier | Anthony Bensadoun
Proceedings of Machine Translation Summit XIII: Papers

pdf bib
Domain Adaptation in Statistical Machine Translation of User-Forum Data using Component Level Mixture Modelling
Pratyush Banerjee | Sudip Kumar Naskar | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of Machine Translation Summit XIII: Papers

pdf bib
The DCU machine translation systems for IWSLT 2011
Pratyush Banerjee | Hala Almaghout | Sudip Naskar | Johann Roturier | Jie Jiang | Andy Way | Josef van Genabith
Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign

In this paper, we provide a description of the Dublin City University’s (DCU) submissions in the IWSLT 2011 evaluationcampaign.1 WeparticipatedintheArabic-Englishand Chinese-English Machine Translation(MT) track translation tasks. We use phrase-based statistical machine translation (PBSMT) models to create the baseline system. Due to the open-domain nature of the data to be translated, we use domain adaptation techniques to improve the quality of translation. Furthermore, we explore target-side syntactic augmentation for an Hierarchical Phrase-Based (HPB) SMT model. Combinatory Categorial Grammar (CCG) is used to extract labels for target-side phrases and non-terminals in the HPB system. Combining the domain adapted language models with the CCG-augmented HPB system gave us the best translations for both language pairs providing statistically significant improvements of 6.09 absolute BLEU points (25.94% relative) and 1.69 absolute BLEU points (15.89% relative) over the unadapted PBSMT baselines for the Arabic-English and Chinese-English language pairs, respectively.

2010

pdf bib
TMX Markup: A Challenge When Adapting SMT to the Localisation Environment
Jinhua Du | Johann Roturier | Andy Way
Proceedings of the 14th Annual conference of the European Association for Machine Translation

pdf bib
Source Text Characteristics and Technical and Temporal Post-Editing Effort: What is Their Relationship
Midori Tatsumi | Johann Roturier
Proceedings of the Second Joint EM+/CNGL Workshop: Bringing MT to the User: Research on Integrating MT in the Translation Industry

This paper focuses on the relationship between source text characteristics (ambiguity, complexity and style compliance) and machine-translation post-editing effort (both temporal and technical). Post-editing data is collected in a traditional translation environment and subsequently plotted against textual scores produced by a range of systems. Our findings show some strong correlation between ambiguity and complexity scores and technical post-editing effort, as well as moderate correlation between one of the style guide compliance scores and temporal post-editing effort.

pdf bib
Improving the Post-Editing Experience using Translation Recommendation: A User Study
Yifan He | Yanjun Ma | Johann Roturier | Andy Way | Josef van Genabith
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers

We report findings from a user study with professional post-editors using a translation recommendation framework (He et al., 2010) to integrate Statistical Machine Translation (SMT) output with Translation Memory (TM) systems. The framework recommends SMT outputs to a TM user when it predicts that SMT outputs are more suitable for post-editing than the hits provided by the TM. We analyze the effectiveness of the model as well as the reaction of potential users. Based on the performance statistics and the users’ comments, we find that translation recommendation can reduce the workload of professional post-editors and improve the acceptance of MT in the localization industry.

2009

pdf bib
Deploying Novel MT Technology to Raise the Bar for Quality at Symantec: Key Advantages and Challenge
Johann Roturier | Symantec
Proceedings of Machine Translation Summit XII: Plenaries

bib
Postediting Machine Translation Output Guidelines
Sharon O’Brien | Johann Roturier | Giselle de Almeida
Proceedings of Machine Translation Summit XII: Tutorials

2007

pdf bib
How portable are controlled language rules? A comparison of two empirical MT studies
Sharon O’Brien | Johann Roturier
Proceedings of Machine Translation Summit XI: Papers

2004

pdf bib
Assessing a set of Controlled Language Rules: Can They Improve the Performance of Commercial Machine Translation Systems
Johann Roturier
Proceedings of Translating and the Computer 26