Index to statistical and computational aspects since 2010

Machine Translation Archive

Index to statistical and computational aspects

Publications since 2010

For other periods go to: publications 2005-2009; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

[To return to home page click here]

Algorithms see Programming and algorithms

Bayesian methods

(2014) Abdullah Alrajeh & Mahesan Niranjan: Bayesian reordering model with feature selection. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.477-485. [PDF, 356KB]

(2014) Baskaran Sankaran & Anoop Sarkar: Bayesian iterative-cascade framework for hierarchical phrase-based translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.15-27. [PDF, 728KB]

(2013) Daniel Beck, Kashif Shah, Trevor Cohn & Lucia Specia: SHEF-Lite: when less is more for translation quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.337-342. [PDF, 330KB]

(2013) Victor Chahuneau, Noah A.Smith, & Chris Dyer: Knowledge-rich morphological priors for Bayesian language models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1206-1215. [PDF, 208KB]

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

(2013) Trevor Cohn & Gholamreza Haffari: An infinite hierarchical Bayesian model of phrasal translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.780-790. [PDF, 432KB]

(2013) Yang Feng & Trevor Cohn: A Markov model of machine translation using non-parametric Bayesian inference. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.333-342. [PDF, 312KB]

(2013) Yarin Gal & Phil Blunsom: A systematic Bayesian treatment of the IBM alignment models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.969-977. [PDF, 290KB]

(2013) Jesús González-Rubio & Francisco Casacuberta: Improving the minimum Bayes’ risk combination of machine translation systems. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 311KB]

(2013) Aaron Li-Feng Han, Yi Lu, Derek F. Wong, Lidia S. Chao, Liangye He & Junwen Xing: Quality estimation for machine translation using the joint method of evaluation criteria and statistical modeling. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.365-372. [PDF, 309KB]

(2013) Tingting Li, Tiejun Zhao, Andrew Finch, & Chunyue Zhang: A tightly-coupled unsupervised clustering and bilingual alignment model for transliteration. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.393-398. [PDF, 698KB]

(2013) Sujith Ravi: Scalable decipherment for machine translation via hash sampling. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.362-271. [PDF, 276KB]

(2013) Markus Saers & Dekai Wu: Bayesian induction of bracketing inversion transduction grammars. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1158-1166. [PDF, 483KB]

(2013) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Scalable variational inference for extracting hierarchical phrase-based translation rules. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.438-446. [PDF, 310KB]

(2013) Kashif Shah, Trevor Cohn, & Lucia Specia: An investigation on the effectiveness of features for translation quality estimation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.167-174. [PDF, 523KB]

(2013) Katsuhito Sudoh, Graham Neubig, Kevin Duh, & Hajime Tsukada: NTT-NAIST SMT systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 91KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: EBMT system of Kyoto University in OLYMPICS task at IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.96-101. [PDF, 771KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li & Ming Zhou: Learning translation consensus with structured label propagation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.302-310. [PDF, 204KB]

(2012) Andrew Finch, Paul Dixon, & Eiichiro Sumita: Rescoring a phrase-based machine transliteration system with recurrent neural network language models. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.47-51. [PDF, 188KB]

(2012) Coşkun Mermer, Hamza Kaya, İlknur Durgar El-Kahlout, & Mehmet Uğur Doğan: The TÜBİTAK statistical machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.144-148. [PDF,630KB]

(2012) Darcey Riley & Daniel Gildea: Improving the IBM alignment models using variational Bayes. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.306-310. [PDF, 103KB]

(2012) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Compact rule extraction for hierarchical phrase-based translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 342KB]

(2012) Hiroaki Shimizu, Masao Utiyama, Eiichiro Sumita, & Satoshi Nakamura: Minimum Bayes-risk decoding extended with similar examples: NAIST-NCT at IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.117-120. [PDF, 604KB]

(2012) Zhaopeng Tu, Yang Liu, Yifan He, Josef van Genabith, Qun Liu, & Shouxun Lin: Combining multiple alignments to improve machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1249-1260. [PDF, 178KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Kevin Duh, Katsuhito Sudoh, Tomoharu Iwata, & Hajime Tsukada: Alignment inference and Bayesian adaptation for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.114-121. [PDF, 117KB]

(2011) Kevin Duh, Katsuhito Sudoh, Xianchao Wu, Hajime Tsukada, & Masaaki Nagata: Generalized minimum Bayes risk system combination. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1356-1360. [PDF, 162KB]

(2011) Jason Eisner & Markus Dreyer: A non-parametric Bayesian approach to inflectional morphology [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; presentation by Jason Eisner: 7 slides [PDF of PPT, 100KB]; presentation by Markus Dreyer: 200 slides [PDF of PPT, 11706KB]

(2011) Andrew Finch, Paul Dixon, & Eiichiro Sumita: Integrating models derived from non-parametric Bayesian co-segmentation into a statistical machine transliteration system. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.23-27. [PDF, 266KB]

(2011) Takaaki Fukunishi, Seiichi Yamamoto, Andrew Finch, & Eiichiro Sumita: Using features from a bilingual alignment model in transliteration mining. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.49-57. [PDF, 660KB]

(2011) Jesús González-Rubio, Alfons Juan, & Francisco Casacuberta: Minimum Bayes-risk system combination. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1268-1277. [PDF, 174KB]

(2011) Jesús González-Rubio & Francisco Casacuberta: The UPV-PRHLT combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.140-144. [PDF, 143KB]

(2011) Yun Huang, Min Zhang, & Chew Lim Tan: Nonparametric Bayesian machine transliteration with asynchronous adaptor grammars. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.534-539. [PDF, 162KB]

(2011) Zhifei Li, Jason Eisner, Ziyuan Wang, Sanjeev Khudanpur, & Brian Roark: Minimum imputed risk: unsupervised discriminative training for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.920-929. [PDF, 236KB]

(2011) Coşkun Mermer & Murat Saraçlar: Bayesian word alignment for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.182-187. [PDF, 140KB]

(2011) Peter Nabende: Mining transliterations from Wikipedia using dynamic Bayesian networks. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.385-391. [PDF, 580KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Michael Paul, Andrew Finch, Paul R.Dixon, & Eiichiro Sumita: Dialect translation: integrating Bayesian co-segmentation models with pivot-based SMT. [EM NLP 2011] DIALECTS2011: Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, Edinburgh, Scotland, UK, July 31, 2011; pp.1-9. [PDF, 210KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Abhishek Arun, Barry Haddow, & Philipp Koehn: A unified approach to minimum risk training and decoding. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 365-374. [PDF, 225KB]

(2010) Graeme Blackwood, Adrià de Gispert, & William Byrne: Efficient path counting transducers for minimum Bayes-risk decoding of statistical machine translation lattices. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.27-32. [PDF, 288KB]

(2010) Graeme Blackwood, Adrià de Gispert & William Byrne: Fluency constraints for minimum Bayes-risk decoding of statistical machine translation lattices. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.71-79. [PDF, 547KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Fast consensus hypothesis regeneration for machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 11-16. [PDF, 120KB]

(2010) Jinhua Du, Pavel Pecina, & Andy Way: An augmented three-pass system combination framework: DCU combination system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 290-295. [PDF, 550KB]

(2010) Nan Duan, Mu Li, Dongdong Zhang, & Ming Zhou: Mixture model-based minimum Bayes risk decoding using multiple machine translation systems. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.313-321. [PDF, 372KB]

(2010) Vladimir Eidelman, Chris Dyer, & Philip Resnik: The University of Maryland statistical machine translation system for the Fifth Workshop on Machine Translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 72-76. [PDF, 115KB]

(2010) Andrew Finch & Eiichiro Sumita: A Bayesian model of bilingual segmentation for transliteration. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.259-266. [PDF, 511KB]

(2010) Christian Hardmeier, Arianna Bisazza, & Marcello Federico: FBK at WMT 2010: word lattices for morphological reduction and chunk-based reordering. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 88-92. [PDF, 132KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme Blackwood, Jamie Brunning, & William Byrne: The CUED HiFST system for the WMT10 translation shared task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 155-160. [PDF, 97KB]

(2010) Germán Sanchis-Trilles & Francisco Casacuberta: Log-linear weight optimisation via Bayesian adaptation in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1077-1085. [PDF, 225KB]

(2010) Sami Virpioja, Jaako Väyrynen, André Mansikkaniemi, & Mikko Kurimo: Applying morphological decomposition to statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 195-200. [PDF, 116KB]

Boltzmann machines

(2012) Jan Niehues & Alex Waibel: Continuous space language models using restricted Boltzmann machines. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.164-170. [PDF, 675KB]

Bootstrapping

(2014) Krasimir Angelov: Bootstrapping open-source English-Bulgarian computational dictionary. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1018-1023. [PDF, 158KB]

(2013) Hien Vu Huy, Phuong-Thai Nguyen, Tung-Lam Nguyen, & M.L.Nguyen: Bootstrapping phrase-based statistical machine translation via WSD integration. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1042-1046. [PDF, 231KB]

(2013) Taesung Lee & Seung-won Hwang: Bootstrapping entity translation on weakly comparable corpora. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.631-640. [PDF, 624KB]

(2013) Katsuhito Sudoh, Shinsuke Mori & Masaaki Nagata: Noise-aware character alignment for bootstrapping statistical machine transliteration from bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.204-209. [PDF, 617KB]

(2013) Kristina Toutanova & Byung-Gyu Ahn: Learning non-linear features for machine translation using gradient boosting machines. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.406-411. [PDF, 725KB]

(2013) Ivan Vulić & Marie-Francine Moens: A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else). [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 261KB]

(2013) Chao Zhang, Shiqi Zhao, & Haifeng Wang: Bootstrapping large-scale named entities using URL-text hybrid patterns. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.293-301. [PDF, 249KB]

(2012) Kurt Eberle, Bogdan Babych, Johanna Geiß, Mireia Ginestí-Rosell, Anthony Hartley, Reinhard Rapp, Serge Sharoff, & Martin Thomas: Design of a hybrid high quality machine translation system. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.101-112. [PDF, 381KB]

(2012) Masood Ghayoomi: From grammar rule extraction to treebanking: a bootstrapping approach. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1912-1919. [PDF, 1203KB]

(2012) Jie Jiang, Andy Way, Nelson Ng, Rejwanul Haque, Mike Dillinger, & Jun Lu: Monolingual data optimisation for bootstrapping SMT engines. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.17-26. [PDF, 736KB]

(2012) Santanu Pal & Sivaji Bandyopadhyay: Bootstrapping method for chunk alignment in phrase based SMT. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 93-100. [PDF, 246KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.249-256. [PDF, 354KB]; presentation, 12 slides [PDF]

(2011) Mitesh M.Khapra, Salil Joshi, Arindam Chatterjee, & Pushpak Bhattacharyya: Together we can: bilingual bootstrapping for WSD. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.561-569. [PDF, 137KB]

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

(2011) Marco Turchi, Vanni Zavarella, & Hristo Tanev: Pattern learning for event extraction using monolingual statistical machine translation. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.371-377. [PDF, 148KB]

(2010) Lieve Macken & Walter Daelemans: A chunk-driven bootstrapping approach to extracting translation patterns [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Manny Rayner, Paula Estrella, & Pierrette Bouillon: A bootstrapped interlingua-based SMT architecture. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 30 slides [PDF, 737KB]

Character coding

(2013) Xiaodong Shi, Yidong Chen, & Xiuping Huang: Key problems in conversion from simplified to traditional Chinese characters. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.287-293. [PDF, 273KB]

Chart parsing

(2012) Kenneth Heafield: Chart-based decoding. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 77 slides [PDF of PPT, 1257KB]

(2012) Hwidong Na & Jong-Hyeok Lee: Forest-to-string translation using binarized dependency forest for IWSLT 2012 OLYMPICS task. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.130-135. [PDF, 787KB]

(2010) Martin Čmejrek & Bowen Zhou: Two methods for extending hierarchical rules from the bilingual chart parsing. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.180-188. [PDF, 200KB]

(2010) Petr Homola & Vladislav Kuboň: Exploiting charts in the MT between related languages [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

Cloud computing

(2014) Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John McCrae, Philipp Cimiano, & Roberto Navigli: Representing multilingual data as linked data: the case of BabelNet 2.0. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.401-408. [PDF, 674KB]

(2014) EU-Bridge – Bridges across the language divide. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.54. [PDF, 311KB]

(2013) Andrejs Vasiljevs & Tatiana Gornostay: TaaS – terminology as a service. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.445. [PDF, 288KB]

(2012) Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen, & Matthias Zumpe: A distributed resource repository for cloud-based machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2207,2213. [PDF, 441KB]

(2012) Hans Uszkoreit: The META-NET strategic research agenda: overview, preparation, dissemination. META-FORUM, Brussels, June 19-21, 2012; 28 slides. [PDF of PPT, 3012KB]

(2012) Andrejs Vasiļjevs, Raivis Skadiņš, & Jörg Tiedemann: LetsMT!: a cloud-based platform for do-it-yourself machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.43-48. [PDF, 309KB]

(2012) Andrejs Vasiļjevs: LetsMT! – platform to drive development and application of statistical machine translation. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”, LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.20-23. [PDF, 444KB]

(2012) Andrejs Vasiļjevs: Priority theme 1: Translation in the cloud. META-FORUM, Brussels, June 19-21, 2012; 10 slides. [PDF of PPT, 611KB]

(2012) LetsMT!: do-it-yourself machine translation factory on the cloud. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.100. [PDF, 156KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Jörg Tiedemann: LetsMT!: cloud-based platform for building user tailored machine translation engines. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.507-511. [PDF, 211KB]

(2010) Graham Wilcock: Cloud computing for the humanities. Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 28 slides [PDF of PPT, 2265KB]

(2011) LetsMT! Platform for online sharing of training data and building user tailored MT. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.337. [PDF, 66KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

Computational complexity (see also Modularity)

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Nikola Ljubešić & Antonio Toral: caWaC – a web corpus of Catalan and its application to language modeling an machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1728-1732. [PDF, 138KB]

(2012) Vladimir Eidelman: Optimization strategies for online large-margin learning in machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.480-489. [PDF, 396KB]

(2012) Antonio Toral, Marc Poch, Pavel Pecina, & Gregor Thurmair: Efficiency-based evaluation of aligners for industrial applications. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.57-60. [PDF, 210KB]

(2012) Wenduan Xu & Philipp Koehn: Extending Hiero decoding in Moses with cube growing. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.133-142. [PDF, 184KB]

(2011) Kenneth Heafield: KenLM: faster and smaller language model queries. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.187-197. [PDF, 169KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

Conditional random fields

(2013) Eunah Cho, Than-Le Ha, & Alex Waibel: CRF-based disfluency detection using semantic features for German to English spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 473KB]

(2013) Patrick Lehnen, Jan-Thorsten Peter, Joern Wuebker, Stephan Peitz, & Hermann Ney: (Hidden) conditional random fields using intermediate classes for statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.151-158. [PDF, 588KB]

(2013) Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, & Peter Clark: A lightweight and high performance monolingual word aligner. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.702-707. [PDF, 231KB]

(2012) Waleed Ammar, Chris Dyer, & Noah A.Smith: Transliteration by sequence labeling with lattice encodings and reranking. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.66-70. [PDF, 188KB]

(2012) Joseph Z.Chang, Jason S.Chang, & Jyh-Shing Roger Jang: Learning to find translations and transliterations on the web. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.130-134. [PDF, 270KB]

(2012) Chan-Hung Kuo, Shih-Hung Liu, Mike Tian-Jian Jiang, Cheng-Wei Lee, & Wen-Lian Hsu: Cost-benefit analysis of two-stage conditional random fields based English-to-Chinese machine transliteration. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.76-80. [PDF, 121KB]

(2011) Wilker Aziz, Miguel Rios, & Lucia Specia: Improving chunk-based semantic role labelling with lexical features. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.226-232. [PDF, 99KB]

(2011) Thomas Lavergne, Josep Maria Crego, Alexandre Allauzen, & François Yvon: From n-gram-based to CRF-based translation models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.542-553. [PDF, 197KB]

(2011) Mike Tian-Jian Jiang, Chan-Hung Kuo, & Wen-Lai Hsu: English-toChinese machine transliteration using accessor variety features of source graphemes. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.86-90. [PDF, 282KB]

(2011) Ying Qin & Guohua Chen: Forward-backward machine transliteration between English and Chinese based on combined CRFs. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.82-85. [PDF, 168KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Dong Yang, Paul Dixon, & Sadaoki Furui: Jointly optimizing a two-step conditional random field model for machine transliteration and its fast decoding algorithm. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.275-280. [PDF, 115KB]

Confusion networks

(2015) Markus Müller, Thai-Son Nguyen, Matthias Sperber, Kevin Kilgour, Sebastian Stüker, & Alex Waibel: The 2015 KIT IWSLT speech-to-text systems for English and German. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.70-75. [PDF, 2.1MB]

(2013) Alexandra Birch, Nadir Durrani, & Philipp Koehn: English SLT and MT system description for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 9pp. [PDF, 225KB]

(2013) Shahab Jalalvand & Daniele Falavigna: Parameter optimization for iterative confusion network decoding in weather-domain speech recognition. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 631KB]

(2013) Kevin Kilgour, Christian Mohr, Michael Heck, Quoc Bao Nguyen, Van Huy Nguyen, Evgeniy Shin, Igor Tseyzer, Jonas Gehring, Markus Müller, Matthias Sperber, Sebastian Stüker & Alex Waibel: The 2013 KIT IWSLT speech-to-text systems for German and English. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 86KB]

(2013) Tian Xia, Zongcheng Ji, Shaodan Zhai, Yidong Chen, Qun Liu, & Shaojun Wang: Improving alignment of system combination by using multi-objective optimization. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.535-544. [PDF, 204KB]

(2013) Junguo Zhu, Muyun Yang, Sheng Li, & Tiejun Zhao: Repairing incorrect translation with examples. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.967-971. [PDF, 523KB]

(2012) Zeeshan Ahmed, Jie Jiang, Julie Carson-Berndsen, Peter Cahill, & Andy Way: Hierarchical phrase-based MT for phonetic representation-based speech translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 246KB]

(2012) Li Maoxi & Wang Mingwen: Confusion network based system combination for Chinese translation output: word-level or character-level? COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.17-24. [PDF, 282KB]

(2012) Antti-Veikko I.Rosti, Xiaodong He, Damianos Karakos, Gregor Leusch, Yuan Cao, Markus Freitag, Spyros Matsoukas, Hermann Ney, Jason R.Smith & Bing Zang: Review of hypothesis alignment algorithms for MT system combination via confusion network decoding. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.191-199. [PDF, 150KB]

(2011) Loïc Barrault: MANY improvements for WMT’11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.135-139. [PDF, 151KB]

(2011) LoïcBarrault & Patrik Lambert: Machine translation system combination with MANY for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.89-92. [PDF, 344KB]

(2011) Jie Jiang, Jinhua Du, & Andy Way: Incorporating source-language paraphrases into phrase-based SMT with confusion networks. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.31-40. [PDF, 1325KB]

(2011) Tsuyoshi Okita & Josef van Genabith: DCU confusion network-based system combination for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.93-98. [PDF, 377KB]

(2011) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Expected BLEU training for graphs: BBN system description for WMT11 system combination task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.159-165. [PDF, 147KB]

(2011) Taro Watanabe & Eiichiro Sumita: Machine translation system combination by confusion forest. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1249-1257. [PDF, 451KB]

(2011) Daguang Xu, Yuan Cao, & Damianos Karakos: Description of the JHU system combination scheme for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.171-176. [PDF, 123KB]

(2010) Loïc Barrault: Open source machine translation system combination. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.147-155. [PDF, 163KB]

(2010) Loïc Barrault: MANY: open source MT system combination at WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 271-275. [PDF, 101KB]

(2010) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Statistical machine translation of texts with misspelled words. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.412-419. [PDF, 160KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, & Jason Eisner: Unsupervised discriminative language model training for machine translation using simulated confusion sets. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.656-664. [PDF, 348KB]

(2010) Sushant Narsale: JHU system combination scheme for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 311-314. [PDF, 99KB]

(2010) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: BBN system description for WMT10 system combination task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 321-326. [PDF, 102KB]

Continuous space

(2015) Walid Aransa, Holger Schwenk, & Loïc Barrault: Improving continuous space language models auxiliary features. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.151-158. [PDF, 2.9MB]

(2013) Nal Kalchbrenner & Phil Blunsom: Recurrent continuous translation models. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1700-1709. [PDF, 274KB]

(2013) Rui Wang, Masao Utiyama, Isao Goto, Eiichiro Sumita, Hai Zhao, & Bao-Liang Lu: Converting continuous-space language models into n-gram language models for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.845-850. [PDF, 571KB]

(2012) Holger Schwenk, Anthony Rousseau, & Mohammed Attik: Large, pruned or continuous space language models on a GPU for statistical machine translation. NAACL-HLT Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, Montréal, Canada, June 8, 2012; pp.11-19. [PDF, 198KB]

(2012) Holger Schwenk: Continuous space translation models for phrase-based statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1071-1080. [PDF, 124KB]

(2010) Holger Schwenk: Continuous-space language models for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.137-146. [PDF, 151KB]

Customization and implementation (see also Integration, System development)

(2014) John Tinsley: From the lab to the market: commercialising MT research. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.105-130. [PDF, 10882KB]

(2014) Alex Yanishevsky: Tools-driven content curation and engine tuning. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp. 131-151. [PDF, 8307KB]

(2013) Sigrun Gehrig: IBM Germany using RMT. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.399. [PDF, 166KB]

(2013) Sergio Penkale & Andy Way: Tailor-made quality-controlled translation. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 7pp. [PDF, 281KB]; presentation, 24 slides [PDF of PPT, 869KB]

(2013) Bruno Pouliquen, Cecilia Elizalde, Marcin Junczys-Dowmunt, Christophe Mazenc, & José García-Verdugo: Large-scale multiple language translation accelerator at the United Nations. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.345-352. [PDF, 566KB]

(2013) Lori Thicke: The industrial process for quality machine translation. Journal of Specialised Translation, issue 19, January 2013; pp.8-18. [PDF, 303KB]

(2013) Lori Thicke: A technology agnostic approach to machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.309-311. [PDF, 356KB]

(2013) Chris Wendt & Federico Garcea: Customization and collaboration to enhance MT for a knowledge base online portal. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.401-405. [PDF, 636KB]

(2012) Claire Jaja, Douglas M.Briesch, Jamal Laoudi, & Claire R.Voss: Assessing divergence measures for automated document routing in an adaptive MT system. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3963-3970. [PDF, 722KB]

(2012) Maxim Khalilov & Rahzeb Choudury: Building English-Chinese and Chinese-English MT engines for the computer software domain. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.7-11. [PDF, 193KB]

(2012) Katrin Kirchhoff, Daniel Capurro, & Anne Turner: Evaluating user preferences in machine translation using conjoint analysis. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.119-126. [PDF, 223KB]

(2012) Craig Plesco & Nestor Rychtyckyj: Machine translation as a global enterprise at Ford. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 211KB]

(2012) Laura Ramírez Polo: Use and evaluation of controlled languages in industrial environments and feasibility study for the implementation of machine translation. PhD thesis, Universidad de Valencia, June 2012; 504pp. [PDF, 6696KB]

(2012) Mark Seligman & Mike Dillinger: Spoken language translation: three business opportunities. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 56KB]

(2011) Ryamond Flournoy: MT use within the enterprise: encouraging adoption via a unified MT API. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.234-238. [PDF, 42KB]

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

(2011) Carol Van Ess-Dykema: An effective model for insertion of translation technologies into US government translation environments. Translating and the Computer 33, 17-18 November 2011, London; 11pp. [PDF, 253KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Inguna Skadiņa: Towards application of user-tailored machine translation in localization. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.23-31. [PDF, 1896KB]

(2011) Andy Way, Kenny Holden, Lee Ball, & Gavin Wheeldon: SmartMATE: online self-serve access to state-of-the-art SMT. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.43-52. [PDF, 1175KB]

(2010) Sven C.Andrä & Jörg Schütz: Effectual MT within a translation workflow panopticon. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 169KB]

(2010) Olga Beregovaya & Alex Yanishevsky: PROMT at PayPal: enterprise-scale MT deployment for financial industry content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 253KB]

(2010) Heidi Depraetere & Pablo Vazquez: Machine translation engine selection in the enterprise. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 4pp. [PDF, 480KB]; presentation: 14 slides [PDF, 1631KB]

(2010) Fred Hollowood: 2010 the industrialisation of MT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 8pp. [PDF, 224KB]

(2010) Michael Ladwig: Language NOW. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Jenny Lu: Machine translation (MT) @ CA Technologies: where can MT be best successful and what are the best MT engines for various languages. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 108KB]

(2010) Olivier Pasteur: Challenges at the World Trade Organization: evaluation and implementation of a statistical machine translation system. Translating and the Computer 32, 18-19 November 2010, London; 7pp. [PDF, 419KB]

(2010) Kirti Vashee & Rustin Gibbs: Scenarios for customizing an SMT engine based on availability of data. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 153KB]

(2010) E.Yuste, M.Herranz, A.-L.Lagarda, L.Tarazón, I.Sánchez-Cortina, & F.Casacuberta: PangeaMT – putting open standards to work…well. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 480KB]

Dynamic programming

(2011) Alexander M.Rush & Michael Collins: Exact decoding of syntactic translation models through Lagrangian relaxation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.72-82. [PDF, 203KB]

Feedback see Learning and training

File interchange and exchange

(2014) Andrzej Zydroń: The dos and don’ts of XML document localization. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.137-144. [PDF, 239KB]

(2013) Eric Joanis, Darlene Stewart, Samuel Larkin & Roland Kuhn: Transferring markup tags in statistical machine translation: a two-stream approach. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 73-81. [PDF, 599KB]

(2013) Felix Sasaki: Metadata for the multilingual web: introducing internationalization tag set (ITS) 2.0. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.423. [PDF, 205KB]

(2013) Andrzej Zydroń: Using Excel as an XLIFF editor: You cannot be serious! [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 4pp. [PDF, 370KB]; presentation, 11 slides [PDF of PPT, 601KB]

(2012) Nicola Cancedda: Private access to phrase tables for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.23-27. [PDF, 135KB]

(2012) Alan K.Melby & Tyler A.Snow: Linport as a standard for interoperability between translation systems. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 16pp. [PDF, 382KB], presentation: 25 slides [PDF, 1488KB]

(2012) Stelios Piperidis: META-SHARE: the open exchange platform: overview – current state – towards v3.0. META-FORUM, Brussels, June 19-21, 2012; 28 slides. [PDF of PPT, 2925KB]

(2011) Sven Christian Andrä & Jörg Schütz: The semantically-enriched translation interoperability protocol. [IJCNLP 2011] Proceedings of Workshop on Language Resources, Technology and Services in the Sharing Paradigm, Chiang Mai, Thailand, November 12, 2011; pp.24-31. [PDF, 218KB]

(2011) Seiji Okura, Yuji Yamamoto, Hajime Ito, Michael Kato, Miwako Shimazu, & Francis Bond: UTX 1.11, a simple and open user dictionary/terminology standard, and its effectiveness with multiple MT systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.587-594. [PDF, 95KB]

(2011) Stelios Piperidis: META-SHARE: an open resource exchange infrastructure for stimulating research and innovation. META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 37 slides [PDF of PPT, 5302KB]

(2010) Jinhua Du, Johann Roturier, & Andy Way: TMX markup: a challenge when adapting SMT to the localisation environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 656KB]

(2010) Stelios Piperidis: META-SHARE: the open resource exchange facility. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 32 slides [PDF of PPT, 6471KB]

Finite-state methods (see also Statistical analysis)

(2014) Jonathan North Washington, Ilnar Salimzyanov, & Francis M.Tyers: Finite-state morphological transducers for three Kypchak languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3378-3385. [PDF, 1727KB]

(2013) Olatz Perez-de-Viñaspre, Maite Oronoz, Manex Agirrezabal & Mikel Lersundi: A finite state approach to translate SNOMED CT terms into Basque using medical prefixes and suffixes. Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, St Andrews, Scotalnd, 15-17 July 2013; pp.99-103. [PDF, 169KB]

(2013) Juan Pino, Aurelien Waite, Tong Xiao, Adrià de Gispert, Federico Flego & William Byrne: The University of Cambridge Russian-English system at WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.200-205. [PDF, 169KB]

(2013) Sakriani Sakti, Keigo Kubo, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: The NAIST English speech recognition system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 109KB]

(2012) Tina Bögel: Urdu-Roman transliteration via finite state transducers. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.25-29. [PDF, 147KB]

(2012) Victor Chahuneau, Noah A.Smith, & Chris Dyer: pycdec: a Python interface to cdec. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.51-61. [PDF, 280KB]

(2012) Jorge González: A finite-state approach to phrase-based statistical machine translation. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.90-98. [PDF, 253KB]

(2012) Aingeru Mayor, Mans Hulden, & Gorka Labaka: Developing an open-source FST grammar for verb chain transfer in a Spanish-Basque MT system. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.65-69. [PDF, 157KB]

(2012) Jyrki Niemi & Krister Lindén: Representing the translation relation in a bilingual wordnet. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2439-2446. [PDF, 365KB]

(2012) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Finite-state acoustic and translation model composition in statistical speech translation: empirical assessment. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.99-107. [PDF, 211KB]

(2012) Tommi A.Pirinen & Francis M.Tyers: Compiling Apertium morphological dictionaries with HFST and using them in HFST applications. [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.25-28. [PDF, 120KB]

(2012) Daniel Quernheim & Kevin Knight: Towards probabilistic acceptors and transducers for feature structures. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.76-85. [PDF, 180KB]

(2012) Markus Saers, Karteek Addanki, & Dekai Wu: From finite-state to inversion transductions: toward unsupervised bilingual grammar induction. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2325-2339. [PDF, 222KB]

(2012) Francis M.Tyers, Felipe Sánchez-Martínez, & Mikel L.Forcada: Flexible finite-state lexical selection for rule-based machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.213-220. [PDF, 211KB]

(2012) Aurelien Waite, Graeme Blackwood, & William Byrne: Lattice-based minimum error rate training using weighted finite-state transducers with tropical polynomial weights. FSMNLP 2012: Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, July 23-25, 2012, University of the Basque Country, Donostia-San Sebastián; pp.116-125. [PDF, 277KB]

(2012) Mengqiu Wang & Christopher D.Manning: Probabilistic finite state machines for regression-based MT evaluation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.984-994. [PDF, 613KB]

(2012) Mengqiu Wang & Christopher D.Manning: SPEDE: probabilistic edit distance metrics for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.76-83. [PDF, 583KB]

(2012) Jonathan North Washington, Mirlan Ipasov, & Francis M.Tyers: A finite-state morphological transducer for Kyrgyz. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.934-940. [PDF, 402KB]

(2011) Gonzalo Iglesias: Hierarchical phrase-based translation with weighted finite-state transducers. Best thesis award (2010) presented at [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 31 May 2011, Leuven, Belgium; 24 slides [PDF,

(2011) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Steps taken in Spanish-Basque speech translation using stochastic finite-state transducers. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 90KB]

(2011) Markus Saers & Dekai Wu: Linear transduction grammars and zipper finite-state transducers. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.640-647. [PDF, 144KB]

(2011) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jesús González-Rubio, Jorge González, & Francisco Casacuberta: Bilingual segmentation for phrasetable pruning in statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.257-264. [PDF, 374KB]; presentation, 15 slides [PDF]

(2010) Bill Byrne: Hierarchical phrase-based translation with weighted finite state transducers. Invited talk at: 7th International Workshop on Spoken Language Translation, 2 December 2010, Paris, France; 46 slides. [PDF of PPT, 1266KB]

(2010) Markus Dreyer: A non-parametric model for the discovery of inflectional paradigms from plain text using graphical models over strings. Dissertation defense, November 5, 2010, Johns Hopkins University; presentation: 200 slides [PDF of PPT, 11706KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Marc Dymetman & Nicola Cancedda: Intersecting hierarchical and phrase-based models of translation: formal aspects and algorithms. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.1-9. [PDF, 354KB]

(2010) Adrià de Gispert, Gonzalo Iglesias, Graeme Blackwood, Eduardo R.Banga, & William Byrne: Hierarchical phrase-based translation with weighted finite-state transducers and shallow-n grammars. Computational Linguistics 36 (3), pp. 505-533 [PDF, 672KB]

(2010) Raquel Justo, Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Hierarchical finite-state models for speech translation using categorization of phrases [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 57KB]

(2010) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Finite-state scriptural translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.791-800. [PDF, 580KB]

(2010) M.G.Abbas Malik, Christian Boitet, Pushpak Bhattacharyya & Laurent Besacier: Weak translation problems – a case study of scriptural translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 241KB]

(2010) Sara Noeman & Amgad Madkour: Language independent transliteration mining system using finite state automata framework. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.57-61. [PDF, 120KB]

(2010) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Potential scope of a fully-integrated architecture for speech translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 448KB]

Graphs

(2014) Germán Sánchis-Trilles, Daniel Ortiz-Martínez, & Francisco Casacuberta: Efficient wordgraph for interactive translation prediction. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.27-34. [PDF-450KB]

(2013) Lei Cui, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Bilingual data cleaning for SMT using graph-based random walk. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.340-345. [PDF, 259KB]

(2013) Qun Liu, Zhaopeng Tu, & Shouxun Lin: A novel graph-based compact representation of word alignment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.358-363. [PDF, 211KB]

(2013) Majid Razmara, Maryam Siahbani, Gholamreza Haffari, & Anoop Sarkar: Graph propagation for paraphrasing out-of-vocabulary words in statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1105-1115. [PDF, 325KB]

(2013) Hendra Setiawan, Bowen Zhou & Bing Xiang: Anchor graph: global reordering contexts for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.501-512. [PDF, 242KB]

(2012) Meritxell Gonzàlez, Jesús Giménez & Lluís Màrquez: A graphical interface for MT evaluation and error analysis. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.139-144. [PDF, 321KB]

(2012) Bevan Jones, Jacob Andreas, Daniel Bauer, Karl Moritz Hermann, & Kevin Knight: Semantics-based machine translation with hyperedge replacement grammars. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1359-1376. [PDF, 169KB]

(2012) Daniele Pighin, Lluís Màrquez, & Lluís Formiga: The FAUST corpus of adequacy assessments for real-world machine translation output. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.29-35. [PDF, 389KB]

(2012) Daniele Pighin, Lluís Formiga, & Lluís Màrquez: A graph-based strategy to streamline translation quality assessments. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 243KB]

(2012) Akihiro Tamura, Taro Watanabe, & Eiichiro Sumita: Bilingual lexicon extraction from comparable corpora using label propagation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.24-36. [PDF, 347KB]

(2011) Dipanjan Das & Slav Petrov: Unsupervised part-of-speech tagging with bilingual graph-based projections. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.600-609. [PDF, 225KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Yupeng Liu, Tiejun Zhao, & Sheng Li: Hypergraph training and decoding of system combination in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.570-577 [PDF, 235KB]

(2010) Diptesh Chatterjee, Sudeshna Sarkar, & Arpit Mishra: Co-occurrence graph based iterative bilingual lexicon extraction from comparable corpora. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.35-42. [PDF, 618KB]

(2010) Christian Scheible: Sentiment translation through lexicon induction. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.25-30. [PDF, 119KB]

(2010) Christian Scheible, Florian Laws, Lukas Michelbacher, & Hinrich Schütze: Sentiment translation through multi-edge graphs. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1104-1112. [PDF, 236KB]

(2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

Hardware see Computer equipment and accessories

Hidden Markov models [see also Markov random fields]

(2013) Elif Eyigöz, Daniel Gildea & Kemal Oﬂazer: Multi-rate HMMs for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.494-502. [PDF, 570KB]

(2013) Shuhei Kondo, Kevin Duh & Yuji Matsumoto: Hidden Markov tree model for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.503-511. [PDF, 531KB]

(2012) Elena Irimia: Experimenting with extracting lexical dictionaries from comparable corpora for English-Romanian language pair. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”, LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.49-55. [PDF, 738KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) Ashish Vaswani, Haitao Mi, Liang Huang & David Chiang: Rule Markov models for fast tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.856-864. [PDF, 122KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.101-109. [PDF, 191KB]

(2010) Peter Nabende: Mining transliterations from Wikipedia using pair HMMs. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.76-80. [PDF, 145KB]

(2010) Shaojun Zhao & Daniel Gildea: A fast fertility Hidden Markov model for word alignment using MCMC. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.596-605. [PDF, 163KB]

Human-computer communication

(2011) Anne H.Schneider & Saturnino Luz: Speaker alignment in synthesised, machine translation communication. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.254-260. [PDF, 231KB]

Human Language Technology see Language technology

Implementation of MT see Customization and implementation

Information theory

(2013) Manaal Faruqui & Chris Dyer: An information theoretic approach to bilingual word clustering. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.777-783. [PDF, 263KB]

(2010) Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väyrynen, & Kimmo Kettunen: Evaluating machine translations using mNCD. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.80-85. [PDF, 145KB]

(2010) Marcus Dobrinkat, Jaako Väyrynen, Tero Tapiovaara, & Kimmo Kettunen: Normalized compression distance based measures for MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 343-348. [PDF, 118KB]

Integration of systems

(2014) Joanna Drugan: Top-down or bottom-up: what do industry approaches to translation quality mean for effective integration of standards and tools? Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.109-117. [PDF, 327KB]

(2013) Aswarth Dara, Sandipan Dandapat, Declan Groves, & Josef van Genabith: TMTprime: a recommender system for MT and TM integration. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Demonstration Session, 10-12 June 2013, Atlanta, Georgia; pp.10-13. [PDF, 346KB]

(2013) Mirela-Ştefania Duma & Cristina Vertan: Integration of machine translation in on-line multilingual applications – domain adaptation. Translation: Computation, Corpora, Cognition 3 (1), June 2013; pp.67-74. [PDF, 395KB]

(2013) Adrian Laurenzi, Megumu Brownstein, Anne M.Turner, & Katrin Kirchhoff: Integrated post-editing and translation management for lay user communities. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 27-34. [PDF, 357KB]

(2013) Kun Wang, Chengqing Zong, & Key-Yih Su: Integrating translation memory into phrase-based machine translation during decoding. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.11-21. [PDF, 1776KB]

(2012) Svetla Koeva: ATLAS – human language technologies integrated within a multilingual web content management system. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.72-76. [PDF, 331KB]

(2012) Cristina Vertan: Two approaches for integrating translation and retrieval in real applications. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 59-64. [PDF, 226KB]

(2010) Michael Paul, Andrew Finch, & Eiichiro Sumita: Integration of multiple bilingually-learned segmentation schemes into statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 400-408. [PDF, 256KB]

(2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

Interoperability

(2013) Pedro Luís Díez Orzas: ITS 2.0 automated translation of natural language content: multilingual web linguistic technology. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 9pp. [PDF, 417KB]; presentation, 42 slides [PDF of PPT, 4089KB]

(2013) Fredrik Estreen: Overview of the XLIFF 2.0 specification [abstract]. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; presentation [PDF of defective PPT, 254KB]

(2012) Oliver Kutz, Christoph Lange, Till Mossakowski, C.Maria Keet, Fabian Neuhaus, & Michael Gruninger: The Babel of the Semantic Web tongues – in search of the Rosetta stone of interoperability. Semantic Web Conference 2012; 6pp. [PDF, 813KB]

Lattices and networks

(2014) Jorge Gracia, Elena Montiel-Ponsoda, Daniel Vila-Suero, & Guadalupe Aguado-de-Cea: Enabling language resources to expose translations as linked data on the Web. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.409-413. [PDF, 346KB]

(2014) Bushra Jawaid & Ondřej Bojar: Two-step machine translation with lattices. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.682-686. [PDF, 67KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2013) Michael Auli, Michel Galley, Chris Quirk, & Geoffrey Zweig: Joint language and translation modeling with recurrent neural networks. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 308KB]

(2012) Ulrich Germann: Syntax-aware phrase-based statistical machine translation: system description. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.292-297. [PDF, 94KB]

(2012) Takanori Kusumoto & Tomoyosi Akiba: Statistical machine translation without a source-side parallel corpus using word lattice and phrase extension. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3929-3932. [PDF, 352KB]

(2012) Wei-Yun Ma & Kathleen McKeown: Phrase-level system combination for machine translation based on target-to-target decoding. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 111KB]

(2012) Artem Sokolov, Guillaume Wisniewski, & François Yvon: Computing lattice BLEU oracle scores for machine translation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.120-129. [PDF, 258KB]

(2012) Joern Wuebker & Hermann Ney: Phrase model training for statistical machine translation with word lattices of preprocessing alternatives. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.450-459. [PDF, 160KB]

(2011) Chris Dyer: Integrating morphology in probabilistic translation models [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 106 slides [PDF of PPT, 1823KB]

(2011) Rashmi Gangadharaiah: Reducing asymmetry between language-pairs to improve alignment and translation quality. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1346-1350. [PDF, 239KB]

(2011) Teresa Herrmann, Mohammed Mediani, Jan Niehues, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.379-385. [PDF, 106KB]

(2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

(2010) Samidh Chatterjee & Nicola Cancedda: Minimum error rate training by sampling the translation lattice. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.606-615. [PDF, 248KB]

(2010) Jinhua Du, Jie Jiang, & Andy Way: Facilitating translation using source language paraphrase lattices. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.420-429. [PDF, 766KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Susan Howlett & Mark Dras: Dual-path phrase-based statistical machine translation. Australasian Language Technology Workshop 2010 (ALTA Workshop 2010), 9-10 December 2008, Melbourne, Australia; pp.32-40. [PDF, 633KB]

(2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Improved phrase-based SMT with syntactic reordering patterns learned from lattice scoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 257KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Source-side syntactic reordering patterns with functional words for improved phrase-based SMT. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.19-27. [PDF, 208KB]

(2010) Samuel Larkin, Boxing Chen, George Foster, Ulrich Germann, Eric Joanis, Howard Johnson, & Roland Kuhn: Lessons from NRC’s Portage system at WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 127-132. [PDF, 124KB]

(2010) Takashi Onishi, Masao Utiyama & Eiichiro Sumita: Paraphrase lattice for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.1-5. [PDF, 419KB]

(2010) Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Joint tokenization and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1200-1208. [PDF, 342KB]

Learning and training

(2015) Jinhua Du, Ankit Srivastava, Andy Way, Alfredo Maldonado-Guerra, & David Lewis: An empirical study of segment prioritization for incrementally retrained post-editing-based SMT. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.172-185. [PDF, 788KB]

(2015) William Lewis, Christian Federmann, & Ying Xin: Applying cross-entropy difference for selecting parallel training data from publicly available sources for conversational machine translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.126-134. [PDF, 3.5MB]

(2015) Hideya Mino, Andrew Finch, & Eiichiro Sumita: Learning bilingual phrase representations with recurrent neural networks. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.44-55. [PDF, 499KB]

(2015) Artem Sokolov, Stefan Riezler, & Tanguy Urvoy: Bandit structured prediction for learning from partial feedback in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.160-171. [PDF, 611KB]

(2015) Chaochao Wang, Deyi Xiong, Min Zhang, & Chunyu Kit: Learning bilingual distributed phrase represenations for statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.32-43. [PDF, 668KB]

(2015) Feifei Zhai & Liang Huang: A pilot study towards end-to-end MT training. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.56-65. [PDF, 640KB]

(2014) Eleftherios Avramidis: Efforts on machine learning over human-mediated translation edit rate. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.302-306. [PDF, 331KB]

(2014) Burak Aydın & Arzucan Özgür: Expanding machine translation training data with an out-of-domain corpus using language modeling based vocabulary saturation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.180-192. [PDF, 523KB]

(2014) Marine Carpuat, Cyril Goutte, & George Foster: Linear mixture models for robust machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.499-509. [PDF, 329KB]

(2014) Mara Chinea Rios, Germán Sanchis-Trilles, Daniel Ortiz-Martínez, & Francisco Casacuberta: Online optimisation of log-linear weights in interactive machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3556-3559. [PDF, 201KB]

(2014) Marta R.Costa-jussà, Parth Gupta, Rafael E.Banchs & Paolo Rosso: English-to-Hindi system descrption for WMT 2014: deep source-context features for Moses. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.79-83. [PDF, 296KB]

(2014) José G.C. de Souza, Marco Turchi, Matteo Negri: Towards a combination of online and multitask learning for MT quality estimation: a preliminary study. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp.9-19. [PDF, 282KB]

(2014) Quoc Kanh Do, Teresa Herrmann, Jan Niehues, Alexandre Allauzen, François Yvon & Alex Waibel: The KIT-LIMSI translation system for WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.84-89. [PDF, 378KB]

(2014) Spence Green, Daniel Cer, & Christopher D.Manning: An empirical comparison of features and tuning for phrase-based machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.466-476. [PDF, 507KB]

(2014) Varvara Logacheva & Lucia Specia: A quality-based active sample selection strategy for statistical machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2690-2695. [PDF, 230KB]

(2014) Prashant Mathur, Mauro Cettolo, Marcello Federico, & José G.C.de Souza: Online multi-user adaptive statistical machine translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 152-165. [PDF, 657KB]

(2014) Prashant Mathur, Mauro Cettolo: Optimized MT online learning in computer assisted translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp. 32-41. [PDF, 254KB]

(2014) Jan Niehues, Alexander Allauzen, François Yvon, & Alex Waibel: Combining techniques from different NN-based language models for machine translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.222-233. [PDF, 683KB]

(2014) Michel Simard: Clean data for training statistical MT: the case of MT contamination. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.69-82. [PDF, 533KB]

(2014) Karan Singla, David Orrego-Carmona, Ashleigh Rhea Gonzales, Michael Carl, & Srinivas Bangalore: Predicting post-editor profiles from the translation process. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp51-60. [PDF, 191KB]

(2014) Xingyi Song, Lucia Specia, & Trevor Cohn: Data selection for discriminative training in statistical machine translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.45-52. [PDF, 415KB]

(2014) Miloš Stanojević & Khalil Sima’an: BEER: Better evaluation as ranking. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.414-419. [PDF, 291KB]

(2014) Ventsislav Zhechev: Term translation central: up-to-date MT without frequent retraining. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.152-159. [PDF, 432KB]

(2014) PEDAL: Post-editing with dynamic active learning. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; p.56. [PDF, 271KB]

(2013) Wilker Aziz, Marc Dymetman & Sriram Venkatapathy: Investigations in exact inference for hierarchical translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.472-483. [PDF, 581KB]

(2013) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Cache-based online adaptation for machine translation enhanced computer assisted translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.35-42. [PDF, 471KB]

(2013) Ergun Bicici: Feature decay algorithms for fast deployment of accurate statistical machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.78-84. [PDF, 182KB]

(2013) Martin Čmejrek, Haitao Mi, & Bowen Zhou: Flexible and efficient hypergarph interactions for joint hierarchical and forest-to-string decoding. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.545-555. [PDF, 195KB]

(2013) Trevor Cohn & Lucia Specia: Modelling annotator bias with multi-task Gaussian processes: an application to machine translation quality estimation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp. 32-42. [PDF, 265KB]

(2013) Lei Cui, Xilun Chen, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Multi-domain adaptation for SMT using multi-task learning. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1055-1065. [PDF, 302KB]

(2013) Chris Dyer, Victor Chahuneau, & Noah A.Smith: A simple, fast, and effective reparameterization of IBM Model 2. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.644-648. [PDF, 184KB]

(2013) Sajad Ebrahimi, Kourosh Meshgi, Shahram Khadivi, & Mohammad Ebrahim Shiri Ahmad Abady: Meta-level statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1151-1157. [PDF, 264KB]

(2013) İlknur Durgar El-Kahlout & Coşkun Mermer: TÜBİTAK-BİLGEM German-English machine translation systems for W13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.109-113. [PDF, 100KB]

(2013) Vladimir Eidelman, Yuval Marton, & Philip Resnik: Online relative margin mazimization for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1116-1126. [PDF, 380KB]

(2013) Vladimir Eidelman, Ke Wu, Ferhan Ture, Philip Resnik & Jimmy Lin: Towards efﬁcient large-scale feature-rich statistical machine translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.128-133. [PDF, 342KB]

(2013) Seppo Enarvi & Mikko Kurimo: Studies on training text selection for conversational Finnish language modeling. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 210KB]

(2013) Jeffrey Flanigan, Chris Dyer, & Jaime Carbonell: Large-scale discriminative training for statistical machine translation using held-out line search. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.248-258. [PDF, 174KB]

(2013) George Foster, Boxing Chen, & Roland Kuhn: Simulating discriminative training for linear mixture adaptation in statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.183-190. [PDF, 556KB]

(2013) Jianfeng Gao & Xiadong He: Training MRF-based phrase translation models using gradient ascent. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.450-459. [PDF, 212KB]

(2013) Spence Green, Sida Wang, Daniel Cer, & Christopher D.Manning: Fast and adaptive online training of feature-rich translation models. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.311-321. [PDF, 363KB]

(2013) Spence Green, Daniel Cer, Kevin Reschke, Rob Voigt, John Bauer, Sida Wang, Natalia Silveira, Julia Neidert & Christopher D. Manning: Feature-rich phrase-based translation: Stanford University’s submission to the WMT 2013 translation task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.148-153. [PDF, 272KB]

(2013) Barry Haddow: Applying pairwise ranked optimisation to improve the interpolation of translation models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.342-347. [PDF, 128KB]

(2013) Michael Heck, Sebastian Stüker, Sakriani Sakti, Alex Waibel, & Satoshi Nakamura: Incremental unsupervised training for university lecture recognition. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 162KB]

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2013) William Lewis & Sauleh Eetemadi: Dramatically reducing training data size through vocabulary saturation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.281-291. [PDF, 3943KB]

(2013) Lemao Liu, Tiejun Zhao, Taro Watanabe, & Eiichiro Sumita: Tuning SMT with a large number of features via online feature grouping. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.279-285. [PDF, 153KB]; revised, 7pp. [PDF]

(2013) Chi-kiu Lo, Karteek Addanki, Markus Saers & Dekai Wu: Improving machine translation by training against an automatic semantic frame based evaluation metric. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.375-381. [PDF, 370KB]

(2013) Saab Mansour & Hermann Ney: Phrase training based adaptation for statistical machine translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.649-654. [PDF, 111KB]

(2013) Benjamin Marie & Aurélien Max: A study in greedy oracle improvement of translation hypotheses. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 400KB]

(2013) Prashant Mathur, Mauro Cettolo & Marcello Federico: Online learning approaches in computer assisted translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.301-308. [PDF, 274KB]

(2013) Evgeny Matusov & Gregor Leusch: Omniﬂuent^TM English-to-French and Russian-to-English systems for the 2013 Workshop on Statistical Machine Translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.158-163. [PDF, 136KB]

(2013) Shachar Mirkin & Nicola Cancedda: Assessing quick update methods of statistical translation models. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 605KB]

(2013) Hwidong Na & Jong-Hyeok Lee: A discriminative reordering parser for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 105KB]

(2013) Preslav Nakov, Fahad Al Obaidli, Francisco Guzmán & Stephan Vogel: Parameter optimization for statistical machine translation: it pays to learn from hard examples. Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.504-510. [PDF, 124KB]

(2013) Jan Niehues & Alex Waibel: An MT error-driven discriminative word lexicon using sentence structure features. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.512-520. [PDF, 189KB]

(2013) Xiaojun Quan, Chunyu Kit, & Yan Song: Non-monotonic sentence alignment via semisupervised learning. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.622-630. [PDF, 348KB]

(2013) Majid Razmara & Anoop Sarkar: Stacking for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.334-339. [PDF, 284KB]

(2013) Annette Rios & Anne Göhring: Machine learning disambiguation of Quechua verb morphology. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.13-18. [PDF, 226KB]

(2013) Martha Alicia Rocha & Joan Andreu Sánchez: Towards the supervised machine translation: real word alignments and translations in a multi-task active learning process. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.239-247. [PDF, 632KB]

(2013) Markus Saers, Karteek Addanki & Dekai Wu: Segmenting vs.chunking rules: unsupervised ITG induction via minim conditional description length. Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.584-592. [PDF, 388KB]

(2013) Markus Saers & Dekai Wu: Unsupervised learning of bilingual categories in inversion transduction grammar induction. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 223KB]

(2013) Thomas Schoenemann: Training nondeficient variants of IBM-3 and IBM-4 for word alignment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.22-31. [PDF, 300KB]

(2013) Hendra Setiawan & Bowen Zhou: Discriminative training of 150 million translation parameters and its application to pruning. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.335-341. [PDF, 219KB]

(2013) Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, & Satoshi Nakamura: Constructing a speech translation system using simultaneous interpretation data. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 1041KB]

(2013) Patrick Simianer, Laura Jehl, & Stefan Riezler: The Heidelberg University machine translation systems for IWSLT2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 224KB]

(2013) Patrick Simianer & Stefan Riezler: Multi-task learning for improved discriminative training in SMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.292-300. [PDF, 267KB]

(2013) Ekaterina Stambolieva: Learning comparable corpora from latent semantic analysis simplified document space. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.129-137. [PDF, 531KB]

(2013) Oscar Täckström, Ryan McDonald, & Joakim Nivre: Target language adaptation of discriminative transfer parsers. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1061-1071. [PDF, 258KB]

(2013) Antonio Toral: Hybrid selection of language model training data using linguistic information and perplexity. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.8-12. [PDF, 307KB]

(2013) Dan Tufiş, Radu Ion, Ştefan Daniel Dumitrescu, & Dan Ştefănescu: Wikipedia as an SMT training corpus. Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.702-709. [PDF, 220KB]

(2013) Ashish Vaswani, Yinggong Zhao, Victoria Fossum, & David Chiang: Decoding with large-scale neural language models improves translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1387-1392. [PDF, 107KB]

(2013) Svitlana Volkova, Theresa Wilson, & David Yarowsky: Exploring sentiment in social media: bootstrapping subjectivity clues from multilingual Twitter streams. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.505-510. [PDF, 317KB]

(2013) Longyue Wang, Derek F.Wong, Lidia S.Chao, Junwen Xing, Yi Lu, & Isabel Trancoso: Edit distance: a new data selection criterion for domain adaptation in SMT. Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.727-732. [PDF, 231KB]

(2013) K.Wäschle, P.Simianer, N.Bertoldi, S.Riezler, & M.Federico: Generative and discriminative methods for online adaptation in SMT. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.11-18. [PDF, 420KB]

(2013) Krzysztof Wolk & Krzysztof Marasek: Polish-English speech statistical machine translation systems for the IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 511KB]

(2013) Dekai Wu, Karteek Addanki, & Markus Saers: Modeling hip hop challenge-response lyrics as machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.109-116. [PDF, 408KB]

(2013) Joern Wuebker, Stefan Peitz, Felix Rietig & Hermann Ney: Improving statistical machine translation with word class models. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1377-1381. [PDF, 125KB]

(2013) Joern Wuebker & Hermann Ney: Length-incremental phrase training for SMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.309-319. [PDF, 249KB]

(2013) Wenduan Xu, Yue Zhang, Philip Williams, & Philipp Koehn: Learning to prune: context-sensitive pruning for syntactic MT. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.352-357. [PDF, 334KB]

(2013) Heng Yu, Liang Huang, Haitao Mi, & Kai Zhou: Max-violation perceptron and forced decoding for scalable MT training. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1112-1123. [PDF, 374KB]

(2013) Rabih Zbib, Gretchen Markiewicz, Spyros Matsoukas, Richard Schwartz, & John Makhoul: Systematic comparison of professional and crowdsourced reference translations for machine translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.612-616. [PDF, 89KB]

(2013) Conghui Zhu, Taro Watanabe, Eiichiro Sumita, & Tiejun Zhao: Hierarchical phrase table combination for machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.802-810. [PDF, 1464KB]

(2012) Dia Abu Zeina, Moustafa Elshafei, Husni Al-Muhtaseb, & Wasfi Al-Khatib: Rescoring N-best hypotheses for Arabic speech recognition: a syntax-mining approach. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.57-64. [PDF, 846KB]

(2012) Eleftherios Avramidis: Quality estimation for machine translation output using linguistic analysis and decoding features. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.84-90. [PDF, 111KB]

(2012) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Translation quality-based supplementary data selection by incremental update of translation models. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.149-165. [PDF, 149KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Yuan Cao & Sanjeev Khudanpur: Sample selection for large-scale MT discriminative training. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 117KB]

(2012) Jinying Chen, Jacob Devlin, Huaigu Cao, Rohit Prasad, & Premkumar Natarajan: Automatic tune set generation for machine translation with limited in-domain data. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.161-168. [PDF, 287KB]

(2012) Nan Duan, Mu Li, & Ming Zhou: Forced derivation tree based model training to statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.445-454. [PDF, 761KB]

(2012) Chris Dyer: Discriminative training of language models. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 1322KB]

(2012) Christian Federmann: Can machine learning algorithms improve phrase selection in hybrid machine translation? EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.113-118. [PDF, 200KB]

(2012) Christian Federmann: Hybrid machine translation using joint, binarised feature vectors. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 198KB]

(2012) Guillem Gascó, Martha-Alicia Rocha, Germán Sanchis-Trilles, Jesús Andrés-Ferrer, & Francisco Casacuberta: Does more data always yield better translations? [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.152-161. [PDF, 162KB]

(2012) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: Active learning for interactive machine translation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 245-254. [PDF,

(2012) Barry Haddow & Philipp Koehn: Analysing the effect of out-of-domain data on SMT systems. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.422-432. [PDF, 155KB]

(2012) Xiaodong He & Li Deng: Maximum expected BLEU training of phrase and lexicon translation models. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.292-301. [PDF, 8163KB]

(2012) Matthias Huck & Hermann Ney: Pivot lightly-supervised training for statistical machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 158KB]

(2012) Sabine Hunsicker, Chen Yu, & Christian Federmann: Machine learning for hybrid machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.312-316. [PDF, 131KB]

(2012) Jie Jiang, David Kolovratnik, & Ian Johnson: Building Moses training pipelines with Arrows. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 7 slides [PDF of PPT, 119KB]

(2012) Mahdi Khademian, Kaveh Taghipour, Saab Mansour, & Shahram Khadivi: A holistic approach to bilingual sentence fragment extraction from comparable corpora. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.4073-4079. [PDF, 416KB]

(2012) Philipp Koehn & Barry Haddow: Toward effective use of training data in statistical machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.317-321. [PDF, 89KB]

(2012) Prasanth Kolachina, Nicola Cancedda, Marc Dymetman, & Sriram Venkatapathy: Prediction of learning curves in machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.22-30. [PDF, 368KB]

(2012) Hyoung-Gyu Lee & Hae-Chang Rim: Decoder-based discriminative training of phrase segmentation for statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.611-620. [PDF, 237KB]

(2012) Lemao Liu, Hailong Cao, Taro Watanabe, Tiejun Zhao, Mo Yu, & CongHui Zhu: Locally training the log-linear model for SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.402-411. [PDF, 289KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li, & Ming Zhou: Re-training monolingual parser bilingually for syntactic SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.854-862. [PDF, 708KB]

(2012) Evgeny Matusov: Incremental re-training of a hybrid English-French MT system with customer translation memory data. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 8pp. [PDF, 116KB]

(2012) Mohammed Mediani, Jan Niehues, & Alex Waibel: Parallel phrase scoring for extra-large corpora. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.87-98. [PDF, 142KB]

(2012) Smruthi Mukund & Rohini K.Srihari: Analyzing Urdu social media for sentiments using transfer learning with controlled translations. Proceedings of the 2012 Workshop on Language in Social Media (LSM 2012), Montréal, Canada, June 7, 2012; pp.1-8. [PDF, 393KB]

(2012) Vassilina Nikoulina, Agnes Sandor, & Marc Dymetman: Hybrid adaptation of named entity recognition for statistical machine translation. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.1-15. [PDF, 198KB]

(2012) Malte Nuhn, Arne Mauser, & Hermann Ney: Deciphering foreign language by combining language models and context vectors. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.156-164. [PDF, 198KB]

(2012) Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, & Josef van Genabith: Domain adaptation of statistical machine translation using web-crawled resources: a case study. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.145-152. [PDF, 201KB]

(2012) Stephan Peitz, Simon Wiesler, Markus Nußbaum-Thom, & Hermann Ney: Spoken language translation using automatically transcribed text in training. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.276-283. [PDF, 623KB]

(2012) Juan Pino, Aurelien Waite, & William Byrne: Simple and efficient model filtering in statistical machine translation. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.5-24. [PDF, 172KB]

(2012) Ariya Rastrow, Mark Dredze, & Sanjeev Khudanpur: Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.175-183. [PDF, 419KB]

(2012) Avneesh Saluja, Ian Lane, & Ying Zhang: Machine translation with binary feedback: a large-margin approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 322KB]

(2012) Holger Schwenk: Continuous space translation models for phrase-based statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1071-1080. [PDF, 124KB]

(2012) Patrick Simianer, Stefan Riezler, & Chris Dyer: Joint feature selection in distributed stochastic learning for large-scale discriminative training in SMT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.11-21. [PDF, 281KB]

(2012) Artem Sokolov, Guillaume Wisniewski & François Yvon: Non-linear n-best list reranking with few features. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 326KB]

(2012) Hong Sun & Ming Zhou: Joint learning of a dual SMT system for paraphrase generation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.38-42. [PDF, 121KB]

(2012) Dan Tufiş & Ştefan Daniel Dumitrescu: Cascaded phrase-based statistical machine translation systems. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.129-136. [PDF, 252KB]

(2012) Marco Turchi, Cyril Goutte, & Nello Cristianini: Learning machine translation from in-domain and out-of-domain data. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.305-312. [PDF, 160KB]

(2012) Joern Wuebker, Mei-Yuh Hwang, & Chris Quirk: Leave-one-out phrase model training for large-scale deployment. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.460-467. [PDF, 223KB]

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

(2012) Yue Zhang & Joakim Nivre: Analyzing the effect of global learning and beam-search on transition-based dependency parsing. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1391-1400. [PDF, 282KB]

(2012) Jingbo Zhu, Tong Xiao, & Chunliang Zhang: Learning better rule extraction with translation span alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.280-284. [PDF, 235KB]

(2011) Vamshi Ambati, Sanjika Hewavitharana, Stephan Vogel, & Jaime Carbonell: Active learning with multiple annotations for comparable data classification task. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.69-77. [PDF, 201KB]

(2011) Vamshi Ambati, Stephan Vogel, & Jaime Carbonell: Multi-strategy approaches to active learning for statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.122-129. [PDF, 432KB]

(2011) Ergun Biçici & Deniz Yuret: Instance selection for machine translation using feature decay algorithms. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.272-283. [PDF, 244KB]

(2011) Ergun Biçici & Deniz Yuret: RegMT system for machine translation, system combination, and evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.323-329. [PDF, 168KB]

(2011) Ondřej Bojar & Aleš Tamchyna: Forms wanted: training SMT on monolingual data. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 75KB]; presentation: 20 slides [PDF of PPT, 111KB]

(2011) Ondřej Bojar & Aleš Tamchyna: Improving translation model by monolingual data . [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.330-336. [PDF, 107KB]

(2011) David Chiang, Steve DeNeefe, & Michael Pust: Two easy improvements to lexical weighting. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.455-460. [PDF, 310KB]

(2011) Miquel Esplà-Gomis, Victor M.Sánchez-Cartagena, & Juan Antonio Pérez-Ortiz: Enlarging monolingual dictionaries for machine translation with active learning and non-expert users. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.339-346. [PDF, 148KB]

(2011) Miquel Esplà-Gomis, Víctor M.Sánchez-Cartegna & Juan Antonio Pérez-Ortiz: Multimodal building of monolingual dictionaries for machine translation by non-expert users. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.147-154. [PDF, 163KB]

(2011) Qin Gao, Will Lewis, Chris Quirk, & Mei-Yuh Hwang: Incremental training and intentional over-fitting of word alignment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.106-113. [PDF, 157KB]

(2011) Monica Gavrila & Cristina Vertan: Training data in statistical machine translation – the more, the better? [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.551-556. [PDF, 282KB]

(2011) Barry Haddow, Abhishek Arun, & Philipp Koehn: SampleRank training for phrase-based machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.261-271. [PDF, 194KB]

(2011) Carlos A.Henríquez Q., José B.Mariño, & Rafael E.Banchs: Deriving translation units using small additional corpora. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.121-128. [PDF, 438KB]

(2011) Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno, & Hideto Kazawa: Training a parser for machine translation reordering. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.183-192. [PDF, 192KB]

(2011) Canasai Kruengkrai, Thatsanee Charoenporn, & Virach Sornlertlamvanich: Simple discriminative training for machine transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.28-31. [PDF, 514KB]

(2011) Patrik Lambert, Holger Schwenk, Christophe Servan, & Sadaf Abdul-Rauf: Investigations on translation model adaptation using monolingual data. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.284-293. [PDF, 124KB]

(2011) Gennadi Lembersky, Noam Ordan, & Shuly Wintner: Language models for machine translation: original vs. translated texts. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.363-374. [PDF, 288KB]

(2011) Gennadi Lembersky, Noam Ordan, & Shuly Wintner: Language models for machine translation: original vs. translated texts. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 112KB]; presentation: 40 slides [PDF of PPT, 382KB]

(2011) Véronique López-Ludeña & Rubén San-Segundo: UPM system for the translation task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.420-425. [PDF, 160KB]

(2011) Yanjun Ma, Yifan He, Andy Way, & Josef van Genabith: Consistent translation using discriminative learning: a trnslation memory-inspired approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1239-1248. [PDF, 193KB]

(2011) Ryan McDonald, Slav Petrov, & Keith Hall: Multi-source transfer of delexicalized dependency parsers. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.62-72. [PDF, 284KB]

(2011) Daniel Ortiz-Martínez, Luis A.Leiva, Vicent Alabau, Ismael García-Varea, & Francisco Casacuberta: An interactive machine translation system with online learning. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.68-73. [PDF, 508KB]

(2011) Aaron B.Phillips & Ralf D.Brown: Training machine translation with a second-order Taylor approximation of weighted translation instances. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.40-47. [PDF, 198KB]

(2011) Marius Popescu: Studying translationese at the character level. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.634-639. [PDF, 121KB]

(2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS project. Tralogy, Paris, 3-4 March 2011; 17pp. [PDF, 169KB]

(2011) Marion Potet, Emmanuelle Esperança-Rodier, Hervé Blanchon, & Laurent Besacier: Preliminary experiments on using users’ post-editions to enhance a SMT system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.161-168. [PDF, 346KB]

(2011) Spencer Rarrick, Chris Quirk, & Will Lewis: MT detection in web-scraped parallel corpora. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.422-429. [PDF, 323KB]

(2011) Jeffrey Rueppel, Li Jiang, Gong Yu, & Ray Flournoy: AIR-based light clients for supporting Moses engine training. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.503-506. [PDF, 352KB]

(2011) Rico Sennrich: Combining multi-engine machine translation and online learning through dynamic phrase tables. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.89-96. [PDF, 337KB]; presentation, 22 slides [PDF]

(2011) Kashif Shah, Loïc Barrault, & Holger Schwenk: Parametric weighting of parallel data for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1323-1331. [PDF, 754KB]

(2011) Patrick Simianer, Katharina Wäschle, & Stefan Riezler: Multi-task minimum error rate training for SMT. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.99-108. [PDF, 138KB]; presentation, 45 slides [PDF of PPT, 295KB]

(2011) Nadi Tomeh, Marco Turchi, Guillaume Wisinewski, Alexandre Allauzen, & François Yvon: How good are your phrases? Assessing phrase quality with single class classification. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.261-268. [PDF, 691KB]

(2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

(2011) Ashish Venugopal, Jakob Uszkoreit, David Talbot, Franz J.Och, & Juri Ganitkevitch: Watermarking the outputs of structured prediction with an application in statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1363-1372. [PDF, 293KB]

(2011) Xinyan Xiao, Yang Liu, Qun Liu, & Shouxun Lin: Fast generation of translation forest for large-scale SMT discriminative training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.880-888. [PDF, 818KB]

(2011) Daqi Zheng, Yifan He, Yang Liu & Qun Liu: Maximum rank correlation training for statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.48-55. [PDF, 184KB]

(2011) PRESEMT: pattern recognition-based statistically enhanced MT. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.353. [PDF, 98KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Vamshi Ambati, Stephan Vogel & Jaime Carbonell: Active learning-based elicitation for semi-supervised word alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.365-370. [PDF, 182KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: Discriminative sample selection for statistical machine translation. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.626-635. [PDF, 329KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: A semi-supervised batch-mode active learning strategy for improved statistical machine translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.126-134. [PDF, 256KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Ergun Biçici & Deniz Yuret: L₁ regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Bucking the trend: large-scale cost-focused active learning for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.854-864. [PDF, 489KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: A fully unsupervised approach for mining parallel data from comparable corpora. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 856KB]; presentation: 30 slides [PDF, 2621KB]

(2010) Kevin Duh, Katsuhito Sudoh, & Hajime Tsukada: Analysis of translation model adaptation in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.243-250. [PDF, 556KB]

(2010) Kevin Duh, Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki, & Masaaki Nagata: N-best reranking by multitask learning. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 375-383. [PDF, 184KB]

(2010) Qin Gao & Stephan Vogel: Training phrase-based machine translation models on the cloud. Open source machine translation toolkit Chaski. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.37-46. [PDF, 158KB]

(2010) Dmitriy Genzel: Automatically learning source-side reordering rules for large scale machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.376-384. [PDF, 187KB]

(2010) Daniel Hardt & Jakob Elming: Incremental re-training for post-editing SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 191KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Learning phrase boundaries for hierarchical phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.383-390. [PDF, 378KB]

(2010) Fei Huang & Bing Xiang: Feature-rich discriminative phrase rescoring for SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.492-500. [PDF, 260KB]

(2010) Shujian Huang, Kangxi Li, Xinyu Dai, & Jiajun Chen: Improving word alignment by semi-supervised ensemble. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.135-143. [PDF, 585KB]

(2010) Cong Hui, Hai Zhao, Yan Song, & Bao-Liang Lu: An empirical study on development set selection strategy for machine translation learning. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 67-71. [PDF, 447KB]

(2010) Ann Irvine, Chris Callison-Burch, & Alexandre Klementiev: Transliterating from all languages. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 282KB]

(2010) Iustina Ilisei, Diana Inkpen, Gloria Corpas Pastor, & Ruslan Mitkov: Identification of translational sublanguage: a machine learning approach [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 57KB]

(2010) Prasanth Kolachina, Sriram Venkatapathy, Srinivas Bangalore, Sudheer Kolachina, & Avinesh PVS: Phrase-based decoding using a discriminative model. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.34-42. [PDF, 184KB]

(2010) Le Hai Son, Alexandre Allauzen, Guillaume Wisniewski, & François Yvon: Training continuous space language models: some practical issues. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.778-788. [PDF, 311KB]

(2010) Abby Levenberg, Chris Callison-Burch, & Miles Osborne: Stream-based translation models for statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.394-402. [PDF, 184KB]

(2010) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Ziyuan Wang, Jonathan Weese, & Omar F.Zaidan: Joshua 2.0: a toolkit for parsing-based machine translation with syntax, semirings, discriminative training and other goodies. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 101KB]

(2010) Frank Liberato, Behrang Mohit, & Rebecca Hwa: Improving phrase-based translation with prototypes of short phrases. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.301-304. [PDF, 106KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Pascual Martínez Gómez, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, & Germán Sanchis-Trilles: A deterministic annealing-based training algorithm for statistical machine translation models. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 712KB]

(2010) Robert C.Moore & William Lewis: Intelligent selection of language model training data. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.220-224. [PDF, 176KB]

(2010) Markos Mylonakis & Khalil Sima’an: Learning probabilistic synchronous CFGs for phrase-based translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.117-125. [PDF, 151KB]

(2010) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Online learning for interactive statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.546-554. [PDF, 284KB]

(2010) Germán Sanchis-Trilles & Mauro Cettolo: Online language model adaptation via n-gram mixtures for statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]; presentation: 22 slides [PDF, 241KB]

(2010) Lei Shi, Rada Mihalcea, & Mingjun Tian: Cross language text classification by model translation and semi-supervised learning. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1057-1067. [PDF, 525KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

(2010) Liang Tian, Fai Wong, & Sam Chao: An improvement of translation quality with adding key-words in parallel corpus. Ninth International Conference on Machine Learning and Cybernetics Qingdao, China, 11-14 July 2010; pp. 1273-1278. [PDF, 226KB]

(2010) Nadi Tomeh, Alexandre Allauzen, Guillaume Wisniewski & François Yvon: Refining word alignment with discriminative training. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 203KB]

(2010) Wei Wang, Jonathan May, Kevin Knight, & Daniel Marcu: Re-structuring, re-labeling, and re-aligning for syntax-based machine translation. Computational Linguistics 36 (2), pp. 247-277 [PDF, 984KB]

(2010) Joern Wuebker, Arne Mauser, & Hermann Ney: Training phrase translation models with leaving-one-out. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.475-484. [PDF, 298KB]

Margin induced relaxed algorithm [MIRA]

(2013) Ming Tan, Tian Xia, Shaojun Wang, & Bowen Zhou: A corpus level MIRA tuning strategy for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.851-856. [PDF, 171KB]

(2011) Mridul Gupta, Sanjika Hewavitharana, & Stephan Vogel: Extending a probabilistic phrase alignment approach for SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.175-182. [PDF, 573KB]

(2011) Eva Hasler, Barry Haddow & Philipp Koehn: Margin infused relaxed algorithm for Moses. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.69-78. [PDF, 137KB]; presentation, 38 slides [PDF of PPT, 597KB]

(2011) Shujie Liu, Chi-Ho Li, & Ming Zhou: A unified SMT framework combining MIRA and MERT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.181-188. [PDF, 233KB]

Markov random fields

(2013) Xiaoning Zhu, Zhongjun He, Hua Wu, Haifeng Wang, Conghui Zhu, & Tiejun Zhao: Improving pivot-based statistical machine translation using random walk. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.524-534. [PDF, 291KB]

Maximum entropy methods

(2014) Judit Ács: Pivot-based multilingual dictionary building using Wiktionary. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1938-1942. [PDF, 81KB]

(2013) Yang Liu: A shift-reduce parsing algorithm for phrase-based string-to-dependency translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1-10. [PDF, 290KB]

(2013) Alex Rudnick & Michael Gasser: Lexical selection for hybrid MT with sequence labeling. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.102-108. [PDF, 251KB]

(2013) Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, & Alex Waibel: Maximum entropy language modeling for Russian ASR. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 167KB]

(2012) Amittai Axelrod, QingJun Li, & William D.Lewis: Applications of data selection via cross-entropy difference for real-world statistical machine translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.201-208. [PDF, 610KB]

(2012) Joseph Max Kaufmann: JMaxAlign: a maximum entropy parallel sentence alignment tool. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 277-287. [PDF, 342KB]

(2012) Xinxin Li, Xuan Wang, & Wingwei Liao: Simple maximum entropy models for multilingual coreference resolution. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.83-87. [PDF, 205KB]

(2012) Wang Ling, João Graça, Isabel Trancoso, & Alan Black: Entropy-based pruning for phrase-based machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.962-971. [PDF, 212KB]

(2011) Felipe Sánchez-Martínez: Choosing the best machine translation system to translate a sentence by using only source-language information. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.97-104. [PDF, 338KB] ; presentation, 32 slides [PDF]

(2010) Niyu Ge: A direct syntax-driven reordering model for phrase-based machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.849-857. [PDF, 161KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Extending the hierachical phrase based model with maximum entropy based BTG. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 104KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

(2010) Marcin Junczys-Dowmunt: A maximum entropy approach to syntactic translation rule filtering [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2011) Li Zezhong, Hideto Ikeda, & Nguyen Thanh Hung: Exploiting multi-features for word alignment in patent MT. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.86-90. [PDF, 238KB]

(2010) David Mareček, Martin Popel, & Zdeněk Žabokrtský: Maximum entropy translation model in dependency-based MT framework. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 201-206. [PDF, 254KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Discriminative induction of sub-tree alignment using limited labeled data. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1047-1055. [PDF, 370KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Error detection for statistical machine translation using linguistic features. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.604-611. [PDF, 520KB]

Minimum Bayes risk see Bayesian methods

MIRA see Margin induced relaxed algorithm

Mixture modelling

(2013) Rico Sennrich, Holger Schwenk & Walid Aransa: A multi-domain translation model framework for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.832-840. [PDF, 1387KB]

(2012) Majid Razmara, George Foster, Baskaran Sankaran, & Anoop Sarkar: Mixing multiple translation models in statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.940-949. [PDF, 229KB]

(2012) Rico Sennrich: Mixture-modeling with unsupervised clusters for domain adaptation in statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.185-192. [PDF, 217KB]

(2011) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in statistical machine translation of user-forum data using component-level mixture modelling. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.285-292. [PDF, 137KB]

(2011) Bing Xiang & Abraham Ittycheriah: Discriminative feature-tied mixture modeling for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.424-428. [PDF, 85KB]

Networks see Continuous space, Lattices and networks, Semantic analysis and representation

Neural networks

(2015) Miguel Ángel Del Agua Teba, Adrià Agusti Martinez Villaronga, Santiago Piqueras Gozalbes, Adrià Giménez Pastor, José Alberto Sanchis Navarro, Jorge Civera Saiz, & Alfons Juan Císcar: The MLLP ASR systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 39-44. [PDF, 2.1MB]

(2015) Kevin Kilgour & Alex Waibel: Multifeature modular deep neural network acoustic models. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.159-166. [PDF, 2.6MB]

(2015) Minh-Thang Luong & Christopher Manning: Stanford neural machine translation systems for spoken language domains. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.76-79. [PDF, 1.2MB]

(2015) Xiaoguang Hu, Wei Li, Xiang Lan, Hua Wu, & Haifeng Wang: Improved beam search with constrained softmax for NMT. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.297-309. [PDF, 1,178KB]

(2015) Markus Müller & Alex Waibel: Using language adaptive deep neural networks for improved multilingual speech recognition. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.167-172. [PDF, 1.8MB]

(2015) Andrzej Zydroń: Neocortical computing: next generation machine translation. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.102-105. [PDF, 80KB]

(2013) Heike Adel, Ngoc Thang Vu, & Tanja Schultz: Combination of recurrent neural networks and factored language models for code-switching language modeling. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.206-211. [PDF, 210KB]

(2013) Peter Bell, Fergus McInnes, Siva Reddy Gangireddy, Mark Sinclair, Alexandra Birch, & Steve Renals: The UEDIN English ASR system for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 173KB]

(2013) Kevin Duh, Graham Neubig, Katsuhito Sudoh, & Hajime Tsukada: Adapation data selection using neural language models: experiments in machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.678-683. [PDF, 215KB]

(2013) Chien-Lin Huang, Paul R.Dixon, Shigeki Matsuda, Youzheng Wu, Xugang Lu, Masahiro Saiko, & Chiori Hori: The NICT ASR system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 225KB]

2013) Lemao Liu, Taro Watanabe, Eiichiro Sumita, & Tiejun Zhao: Additive neural networks for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.791-801. [PDF, 261KB]

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2013) Will Y.Zou, Richard Socher, Daniel Cer, & Christopher D.Manning: Bilingual word embeddings for phrase-based machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1393-1398. [PDF, 337KB]

(2012) Christian Buck: Black box features for the WMT 2012 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.91-95. [PDF, 115KB]

(2012) Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurélien Max, Artem Sokolov, Guillaume Wisniewski, & François Yvon: LIMSI @ WMT’12. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.330-337. [PDF, 189KB]

(2012) Tsuyoshi Okita: Neural probabilistic language model for system combination. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.65-75. [PDF, 150KB]

(2012) Reinhard Rapp, Serge Sharoff, & Bogdan Babych: Identifying word translations from comparable documents without a seed lexicon. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.460-466. [PDF, 329KB]

(2012) Holger Schwenk: Continuous space translation models for phrase-based statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1071-1080. [PDF, 124KB]

(2012) Youzheng Wu, Xugang Lu, Hitoshi Yamamoto, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored language model based on recurrent neural network. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2835-2850. [PDF, 3816KB]

(2012) Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored recurrent neural network language model in TED lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.222-228. [PDF, 645KB]

(2012) Dong Yu: Who can understand your speech better – deep neural network of Gaussian mixture model? [Abstract of] Keynote speech at: IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; p.9. [PDF, 546KB]

(2011) Francisco Zamora-Martínez & M.J.Castro-Bleda: CEU-UPV English-Spanish system for WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.490-495. [PDF, 364KB]

(2010) Francisco Zamora-Martínez, María José Castro-Bleda, & Holger Schwenk: N-gram-based machine translation enhanced with neural networks for the French-English BTEC-IWSLT’10 task. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.45-52. [PDF, 545KB]

(2010) Francisco Zamora-Martínez & Germán Sanchis-Trilles: UCH-UPV English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 207-211. [PDF, 451KB]

Oracle extraction see Confusion networks

Pairwise ranking

(2012) Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, & Chris Callison-Burch: Joshua 4.0: packing, PRO, and paraphrases. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.283-291. [PDF, 195KB]

Parallel processing

(2012) Lane Schwartz: Better splitting algorithms for parallel corpus processing. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.109-119. [PDF, 142KB]

Perceptron

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2010) Ralf D.Brown: Taming structured perceptrons on wild feature vectors. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 384-391. [PDF, 132KB]

(2010) Simon Carter & Christof Monz: Discriminative syntactic reranking for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 143KB]

(2010) Cristina España-Bonet & Lluís Màrquez: Robust estimation of feature weights in statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 618KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

Predictive methods (see also Text prediction)

(2015) Fatemeh Azadi & Shahram Khadivi: Improved search strategy for interactive predictions in computer-assisted translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.319-332. [PDF, 957KB]

(2015) Joachim Daiber & Khalil Sima’an: Machine translation with source-predicted target morphology. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.283-296. [PDF, 647KB]

(2015) Quoc Truong Do, Sakriani Sakti, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: Improving translation of emphasis with pause prediction in speech-to-speech translation systems. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.204-208. [PDF, 1.7MB]

(2015) Carla Schelfhout: Accurately predicting post-editing time & labor for cost management. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; p.192-198. [PDF, 83KB]

(2014) Ergun Biçici & Andy Way: Referential translation machines for predicting translation quality. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.313-321. [PDF, 378KB]

(2014) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: The repetition rate of text as a predictor of the effectiveness of machine translation adaptation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 166-179. [PDF, 954KB]

(2014) Philipp Koehn, Michael Carl, Francisco Casacuberta, & Eva Marcos: CASMACAT: cognitive analysis and statistical methods for advanced computer aided trslation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.57 [PDF. 306KB]

(2014) Lucia Specia, & Kashif Shah: Predicting human translation quality. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.288-300. [PDF, 717KB]

(2013) Ergun Biçici & Josef van Genabith: CNGL: grading student answers by acts of translation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.585-591. [PDF, 129KB]

(2013) Ergun Biçici & Josef van Genabith: CNGL-CORE: referential translation machines for measuring semantic similarity. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Atlanta, Georgia, June 13-14, 2013; vol.1, pp.234-240. [PDF, 129KB]

(2013) Ergun Bicici: Referential translation machines for quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.343-351. [PDF, 300KB]

(2013) Tze Yuang Chong, Rafael E.Banchs, Eng Siong Chng, & Haizhou Li: Modeling of term-distance and term-occurrence information for improving n-gram language model performance. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.233-237. [PDF, 118KB]

(2013) Christian Hardmeier, Jörg Tiedemann & Joakim Nivre: Latent anaphora resolution for cross-lingual pronoun prediction. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.380-391. [PDF, 274KB]

(2013) Amir Hazem & Emmanuel Morin: Word co-occurrence counts prediction for bilingual terminology extraction from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1392-1400. [PDF, 162KB]

(2013) Philipp Koehn, Michael Carl, Francisco Casacuberta, & Eva Marcos: CASMACAT: Cognitive Analysis and Statistical Methods for Advanced Computer Aided Translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.411. [PDF, 219KB]

(2013) Abhijit Mishra, Pushpak Bhattacharyya, & Michael Carl: Automarically predicting sentence translation difficulty. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.346-351. [PDF, 430KB]

(2013) Maja Popović, Eleftherios Avramidis, Aljoscha Burchardt, David Vilar, & Hans Uszkoreit: What can we learn about the selection mechanism for post-editing? Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 11-17. [PDF, 160KB]

(2013) Xiaojun Wan: Co-regression for cross-language review rating prediction. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.526-531. [PDF, 220KB]

(2013) Marion Weller, Alexander Fraser, & Sabine Schulte im Walde: Using subcategorization knowledge to improve case prediction for translation into German. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.593-603. [PDF, 252KB]

(2012) Eunah Cho, Jan Niehues, & Alex Waibel: Segmentation and punctuation prediction in speech language translation using a monolingual translation system. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.252-259. [PDF, 603KB]

(2012) Yang Feng, Yang Liu, Qun Liu, & Trevor Cohn: Left-to-right tree-to-string decoding with prediction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1191-1200. [PDF, 511KB]

(2012) Ram Prakash H: Quillpad multilingual predictive transliteration system. COLING 2012: Proceedings of the Second Workshop on Advances in Text Input Methods (WTIM 2), Mumbai, December 2012; pp.107-113. [PDF, 310KB]

(2012) Joern Wuebker, Hermann Ney, & Richard Zens: Fast and scalable decoding with language model look-ahead for phrase-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.28-32. [PDF, 113KB]

(2011) Jun Hatori & Hisami Suzuki: Japanese pronunciation prediction as phrasal statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.120-128. [PDF, 580KB]

(2011) Feifan Liu, Fei Liu, & Yang Liu: Learning from Chinese-English parallel data for Chinese tense prediction. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1116-1124. [PDF, 556KB]

(2011) Stephan Peitz, Markus Freitag, Arne Mauser, & Hermann Ney: Modeling punctuation prediction as machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.238-245. [PDF, 214KB]

(2011) Lucia Specia, Najeh Hajlaoui, Catalina Hallett, & Wilker Aziz: Predicting machine translation adequacy. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.513-520. [PDF, 193KB]

Probabilistic methods see Statistical analysis

Programming and algorithms [see also Bootstrapping, Computer equipment and accessories, Dynamic programming, Parallel processing]

(2014) Ergun Biçici, Qun Liu, & Andy Way: Parallel FDA5 for fast development of accurate statistical machine translation systems. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.59-65. [PDF, 315KB]

(2014) Christian Hardmeier: Discourse in statistical machine translation [abstract]. PhD thesis, Uppsala University, 2014; 2pp. [PDF, 15KB]

(2014) Ngoc-Quang Luong, Laurent Besacier, & Benjamin Lecouteux: An efficient two-pass decoder for SMT using word confidence estimation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.117-124. [PDF, 621KB]

(2013) Matthias Büchse, Alexander Koller, & Heiko Vogler: Generic binarization for parsing and translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.145-154. [PDF, 304KB]

(2013) Greg Hanneman & Alon Lavie: Improving syntax-augmented machine translation by coarsening the label set. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.288-297. [PDF, 1205KB]

(2013) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, & Joakim Nivre: Docent: a document-level decoder for phrase-based statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.193-198. [PDF, 147KB]

(2013) Hua He, Jimmy Lin, & Adam Lopez: Massively parallel suffix array queries and on-demand phrase extraction for statistical machine translation using GPUs. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.325-334. [PDF, 157KB]

(2013) Kenneth Heafield, Ivan Pouzyrevsky, Jonathan H.Clark, & Philipp Koehn: Scalable modified Kneser-Ney language model estimation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.690-696. [PDF, 212KB]

(2013) Matthias Huck, David Vilar, Markus Freitag & Hermann Ney: A performance study of cube pruning for large-scale hierarchical machine translation. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 29-38. [PDF, 191KB]

(2013) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Improving reordering performance using higher order and structural features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.315-324. [PDF, 177KB]

(2013) Graham Neubig: Travatar: a forest-to-string machine translation engine based on tree transducers. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.91-96. [PDF, 344KB]

(2013) Malte Nuhn, Julian Schamper & Hermann Ney: Beam search for solving substitution ciphers. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1568-1576. [PDF, 284KB]

(2013) Alexander M.Rush, Yin-Wen Chang, & Michael Collins: Optimal beam search for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 274KB]

(2013) Markus Saers, Karteek Addanki, & Dekai Wu: Unsupervised transduction grammar induction via minimum description length. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.67-73. [PDF, 418KB]

(2013) Maryam Siahbani, Baskaran Sankaran, & Anoop Sarkar: Efficient left-to-right hierarchical phrase-based translation with improved reordering. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1089-1099. [PDF,351KB]

(2013) Andrei Simion, Michael Collins, & Clifford Stein: A convex alternative to IBM model 2. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1574-1583. [PDF, 172KB]

(2013) Xinyan Xiao & Deyi Xiong: Max-margin synchronous grammar induction for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.255-264. [PDF, 885KB]

(2012) Nicola Bertoldi, Robert Grabowski, Liane Guillou, Michal Novak, Sorin Slavescu, & Jose de Souza: Document-level decoding in Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 16 slides [PDF of PPT, 71KB]

(2012) Victor Chahuneau, Noah A.Smith, & Chris Dyer: pycdec: a Python interface to cdec. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.51-61. [PDF, 280KB]

(2012) Colin Cherry: Decoding. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 413KB]

(2012) Colin Cherry & Barry Haddow: Sparse features in Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 10 slides [PDF of PPT, 55KB]

(2012) Christian Federmann: Relaxed-constraint decoding for Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 5 slides [PDF of PPT, 42KB]

(2012) Kenneth Heafield, Philipp Koehn, & Alon Lavie: Language model rest costs and space-efficient storage. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1169-1178. [PDF, 161KB]

(2012) Marcin Junczys-Dowmunt: A phrase table without phrases: rank encoding for better phrase table compression. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.245-252. [PDF, 240KB]

(2012) Marcin Junczys-Dowmunt: Phrasal rank-encoding: exploiting phrase redundancy and translational relations for phrase table compression. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.63-74. [PDF, 169KB]; presentation, 75 slides [PDF of PPT, 1126KB]

(2012) Marcin Junczys-Dowmunt: A space-efficient phrase table implermentation. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #480, 1p. [HTML]

(2012) Casey Redd Kennington, Martin Kay, & Annemarie Friedrich: Suffix trees as language models. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.446-453. [PDF, 1201KB]

(2012) Alexandre Klementiev, Ann Irvine, Chris Callison-Burch, & David Yarowsky: Toward statistical machine translation without parallel corpora. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.130-140. [PDF, 344KB]

(2012) Seung-Wook Lee, Dongdong Zhang, Mu Li, Ming Zhou, & Hae-Chang Rim: Translation model size reduction for hierarchical phrase-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.291-295. [PDF, 106KB]

(2012) Peng Li, Yang Liu, & Maosung Sun: A beam search algorithm for ITG word alignment. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.673-682. [PDF, 471KB]

(2012) Lemao Liu, Tiejun Zhao, Taro Watanabe, Hailong Cao, & Conghui Zhu: Expected error minimization with ultraconservative update for SMT. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.723-732. [PDF, 144KB]

(2012) Preslav Nakov, Francisco Guzmán, & Stephan Vogel: Optimizing for sentence-level BLEU+1 yields short translations. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1979-1994. [PDF, 176KB]

(2012) Stephan Peitz, Arne Mauser, Joern Wuebker, & Hermann Ney: Forced derivations for hierarchical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.933-942. [PDF, 129KB]

(2012) Ivan Pouzvrevsky, Mohammed Mediani, & Kenneth Heafield: Bounded-memory language model building. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 13 slides [PDF of PPT, 192KB]

(2012) Lane Schwartz: Better splitting algorithms for parallel corpus processing. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.109-119. [PDF, 142KB]

(2012) Dan Ştefănescu, Radu Ion, & Sabine Hunsicker: Hybrid parallel sentence mining from comparable corpora. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.137-144. [PDF, 493KB]

(2012) Wenduan Xu & Philipp Koehn: Extending Hiero decoding in Moses with cube growing. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.133-142. [PDF, 184KB]

(2011) Alexandre Allauzen, Hélène Bonneau-Maynard, Hai-Son Le, Aurélien Max, Guillaume Wisniewski, François Yvon, Gilles Adda, Josep M.Crego, Adrien Lardilleux, Thomas Lavergne, & Artem Sokolov: LIMSI @ WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.309-315. [PDF, 107KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Methods for smoothing the optimizer instability in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.32-39. [PDF, 150KB]

(2011) Yin-Wen Chang & Michael Collins: Exact decoding of phrase-based translation models through Lagrangin relaxation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.26-37. [PDF, 312KB]

(2011) Boxing Chen, Roland Kuhn, George Foster, & Howard Johnson: Unpacking and transforming feature functions: new ways to smooth phrase tables. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.269-275. [PDF, 158KB]

(2011) Fabien Cromieres & Sadao Kurohashi: Efficient retrieval of tree translation examples for syntax-based machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.508-518. [PDF, 616KB]

(2011) Jacob Devlin, Antti-Veikko I.Rosti, Shankar Ananthakrishnan, & Spyros Matsoukas: System combination using discriminative cross-adaptation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.667-675. [PDF, 627KB]

(2011) Michel Galley & Chris Quirk: Optimal search for minimum error rate training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.38-49. [PDF, 1280KB]

(2011) Sumukh Ghodke, Steven Bird, & Rui Zhang: A breadth-first representation for tree matching in large scale forest-based translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.785-793. [PDF, 941KB]

(2011) Greg Hanneman, Michelle Burroughs, & Alon Lavie: A general-purpose rule extractor for SCFG-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.135-144. [PDF, 267KB]

(2011) Kenneth Heafield, Hieu Hoang, Philipp Koehn, Tetsuo Kiso & Marcello Federico: Left language model state for syntactic machine translation. IWSLT 2011: Proc eedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.183-190. [PDF, 286KB]

(2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

(2011) J.Scott McCarley, Abraham Ittycheriah, Salim Roukos, Bing Xiang, & Jian-ming Xu: A correction model for word alignments. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.889-898. [PDF, 236KB]

(2011) Adam Pauls & Dan Klein: Faster and smaller n-gram language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.258-267. [PDF, 246KB]

(2011) Christian Rishøj & Anders Søgaard: Factored translation with unsupervised word clusters. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.447-451. [PDF, 163KB]

(2011) Markus Saers & Dekai Wu: Reestimation of reified rules in semiring parsing and biparsing. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.70-78. [PDF, 237KB]

(2011) Thomas Schoenemann: Regularizing mono- and bi-word models for word alignment. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1313-1322. [PDF, 286KB]

(2011) Christophe Servan & Holger Schwenk: Optimising multiple metrics with MERT. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.109-117. [PDF, 141KB]; presentation, 20 slides [PDF of PPT, 535KB]

(2011) Xiadong Shi, Xiang Zhu, & Yidong Chen: Improving the hierarchical phrase-based translation model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.383-388. [PDF, 174KB]

(2011) Jun Suzuki, Kevin Duh, & Masaaki Nagata: Distributed minimum error rate training of SMT using particle swarm optimization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.649-657. [PDF, 1101KB]

(2011) Kaveh Taghipour, Shahram Khadivi, & Jia Xu: Parallel corpus refinement as an outlier detection algorithm. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.414-421. [PDF, 213KB]

(2011) Ming Tan, Wenli Zhou, Lei Zheng, & Shaojun Wang: A large scale distributed syntactic, semantic and lexical language model for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.201-210. [PDF, 192KB]

(2011) M.Inés Torres & Francisco Cascuberta: Stochastic K-TSS bi-languages for machine translation. Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, Blois (France), July 12-15; pp.98-106. [PDF, 195KB]

(2011) Kristina Toutanova & Michel Galley: Why initialization matters for IBM Model 1: multiple optima and non-strict convexity. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.461-466. [PDF, 181KB]

(2010) Jonathan H.Clark, Jonathan Weese, Byung Gyu Ahn, Andreas Zollmann, Qin Gao, Kenneth Heafield, & Alon Lavie: The machine translation toolpack for LoonyBin: an automated management of experimental machine translation hyperworkflows. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.117-126. [PDF, 153KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: A joint rule selection model for hierarchical phrase-based translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.6-11. [PDF, 213KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Yang Feng, Haitao Mi, Yang Liu, & Qun Liu: An efficient shift-reduce decoding algorithm for phrase-based machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.285-293. [PDF, 175KB]

(2010) Mikel L.Forcada, Boyan Ivanov Bonev, Sergio Ortiz Rojas, Juan Antonio Pérez Ortiz, Gema Ramírez Sánchez, Felipe Sánchez Martínez, Carme Armentano-Oller, Marco A.Montava, & Francis M.Tyers: Documentation of the open-source shallow-transfer machine translation platform Apertium; ed. Mireia Ginestí Rosell. Departament de Llenguatges i Sistemes Informàtics, Universitat d’Alacant, March 10, 2010; 214pp. [PDF, 700KB]

(2010) Qin Gao, Francisco Guzman, & Stephan Vogel: EMDC: a semi-supervised approach for word alignment. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.349-357. [PDF, 231KB]

(2010) Andrea Gesmundo & James Henderson: Faster cube pruning. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.267-274. [PDF, 437KB]

(2010) Spence Green, Michel Galley, & Christopher D.Manning: Improved models of distortion cost for statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.867-875. [PDF, 223KB]

(2010) Barry Haddow: Adding multi-threaded decoding to Moses. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.57-66. [PDF, 136KB]

(2010) Christian Hardmeier: Fast and extensible phrase scoring for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.87-96. [PDF, 123KB]

(2010) Kenneth Heafield & Alon Lavie: Combining machine translation output with open source: the Carnegie Mellon multi-engine machine translation scheme. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.27-36. [PDF, 133KB]

(2010) Liang Huang & Haitao Mi: Efficient incremental decoding for tree-to-string translation. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.273-283. [PDF, 266KB]

(2010) Philipp Koehn & Jean Senellart: Fast approximate string matching with suffix arrays and A* parsing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 178KB]

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

(2010) Mu Li, Yinggong Zhao, Dongdong Zhang, & Ming Zhou: Adaptive development data selection for log-linear model in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.662-670. [PDF, 213KB]

(2010) Peng Li, Maosong Sun, & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.710-718. [PDF, 267KB]

(2010) Peng Li, Maosong Sun & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: Towards a general and extensible phrase-extraction algorithm. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.313-320. [PDF, 382KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Andreas Maletti & Giorgio Satta: Parsing and translation algorithms based on weighted extended tree transducers. Proceedings of the 2010 Workshop on Applications of Tree Automata in Natural Language Processing, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.19-27. [PDF, 190KB]

(2010) Behrang Mohit, Rebecca Hwa, & Alon Lavie: Using variable decoding weight for language model in statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 165KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Baskaran Sankaran, Ajeet Grewal, & Anoop Sarkar: Incremental decodings for phrase-based statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 216-223. [PDF, 137KB]

(2010) Thomas Schoenemann: Computing optimal alignments for the IBM-3 translation model. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.98-106. [PDF, 130KB]

(2010) Lane Schwartz & Chris Callison-Burch: Hierarchical phrase-based grammar extraction in Joshua: suffix arrays and prefix trees. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.157-166. [PDF, 136KB]

(2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

(2010) Libin Shen, Jinxi Xu, & Ralph Weischedel: String-to-dependency statistical machine translation. Computational Linguistics 36 (4), pp. 649-671 [PDF, 233KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

(2010) Sirvan Yahyaei & Christoph Monz: Dynamic distortion in a discriminative reordering model for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.353-360. [PDF, 460KB]

Pushdown automata

Robustness

(2012) Cyril Goutte, Marine Carpuat, & George Foster: The impact of sentence alignment errors on phrase-based machine translation performance. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 147KB]

Similarity measures

(2015) Mihaela Vela & Ekaterina Lapshinova-Koltunski: Register-based machine translation evaluation with text classification techniques. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.215-228. [PDF, 506KB]

(2014) Hiroshi Echizen’ya, Kenji Araki, & Eduard Hovy: Application of prize based on sentence length in chunk-based automatic evaluation of machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.381-386. [PDF, 315KB]

(2014) Meritxell Gonzàlez, Alberto Barrón-Cedeño, & Lluís Màrquez: IPA and STOUT: leveraging linguistic and source-based features for machine translation evaluation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.394-401. [PDF, 319KB]

(2014) Rohit Gupta, Hanna Bechara, & Constantin Orasan: Intelligent translation memory matching and retrieval metric exploiting linguistic technology. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.86-89. [PDF, 176KB]

(2014) Eva Hasler, Barry Haddow, & Philipp Koehn: Dynamic topic adaptation for SMT using distribution profiles. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.445-456. [PDF, 740KB]

(2014) John E.Ortega, Felipe Sánchez-Martinez, & Mikel L.Forcada: Using any machine translation source for fuzzy-match repair in a computer-aided translation setting. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.42-53. [PDF, 557KB]

(2014) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Bilingual dictionary construction with transliteration filtering. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1013-1017. [PDF, 164KB]

(2014) Tom Vanallemeersch & Vincent Vandeghinste: Improving fuzzy matching through syntactic knowledge. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.90-99. [PDF, 430KB]

(2014) Friedel Wolff, Laurette Pretorius, & Paul Buitelaar: Missed opportunities in translation memory matching. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4401-4406. [PDF, 133KB]

(2013) Alberto Barrón-Cedeño, Lluís Màrquez, Maria Fuentes, Horacio Rodríguez, & Jordi Turmo: UPC-CORE: what can machine evaluation metrics and Wikipedia do for estimating semantic textual similarity? [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Atlanta, Georgia, June 13-14, 2013; vol.1, pp.143-147. [PDF, 103KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Using WordNet and semantic similarity for bilingual terminology mining from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.16-23. [PDF, 628KB]

(2013) Daniel Cer, Christopher D.Manning & Dan Jurafsky: Positive diversity tuning for machine translation system combination. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.320-328. [PDF, 525KB]

(2013) Avishek Dan & Pushpak Bhattacharyya: CFILT-CORE: semantic textual similarity using Universal Networking Language. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Atlanta, Georgia, June 13-14, 2013; vol.1, pp.216-220. [PDF, 128KB]

(2013) Yvette Graham, Bahar Salehi, & Timothy Baldwin: Umelb: cross-lingual textual entailment with word alignment and string similarity features. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.133-137. [PDF, 91KB]

(2013) Sergio Jimenez, Claudia Becerra, & Alexander Gelbukh: SOFTCARDINALITY: learning to identify directional cross-lingual entailment from cardinalities and SMT. [SEMEVAL-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013), Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 144KB]

(2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2013) John Philip McCrae, Philipp Cimiano, & Roman Klinger: Orthonormal explicit topic analysis for cross-lingual document matching. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1732-1740. [PDF, 192KB]

(2013) Takashi Tsunakawa, Yosuke Yamamoto, & Hiroyuki Kaji: Improving calculation of contextual similarity for constructing a bilingual dictionary via a third language. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1057-1061. [PDF, 689KB]

(2013) Jiang Zhao, Man Lan, & Zheng-Yu Niu: ECNUCS: recognizing cross-lingual textual entailment using multiple text similarity and text difference measures. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 100KB]

(2013) Zede Zhu, Miao Li, Lei Chen, & Zhenxin Yang: Building comparable corpora based on bilingual LDA model. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.278-281. [PDF, 127KB]

(2012) Walid Aransa, Holger Schwenk, & Loic Barrault: Semi-supervised transliteration mining from parallel and comparable corpora. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp. 185-192. [PDF, 650KB]

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

(2012) Daniele Falavigna & Roberto Gretter: Focusing language models for automatic speech recognition. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.171-178. [PDF, 722KB]

(2012) Septina Dian Larasati: Improving word alignment by exploiting adapted word similarity. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.41-45. [PDF, 227KB]

(2012) Johannes Leveling, Debasis Ganguly, Sandipan Dandapat, & Gareth J.F.Jones: Approximate sentence retrieval for scalable and efficient example-based machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1571-1586. [PDF, 291KB]

(2012) Li LiangYou, Gong ZhengXian, & Zhou GuoDong: Phrase-based evaluation for machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.663-672. [PDF, 201KB]

(2012) Christian M.Meyer & Iryna Gurevych: To exhibit is not to loiter: a multilingual, sense-disambiguated Wiktionary for measuring verb similarity. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1763-1780. [PDF, 771KB]

(2012) Erwan Moreau & Carl Vogel: Quality estimation: an experimental study using unsupervised similarity measures. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.120-126. [PDF, 139KB]

(2012) Preslav Nakov & Hwee Tou Ng: Improving statistical machine translation for a resource-poor language using related resource-rich languages. Journal of Artificial Intelligence Research 44 (2012); pp.179-222. [PDF, 421KB]

(2012) Michel Simard & Atsushi Fujita: A poor man’s translation memory using machine translation evaluation metrics. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 958KB]

(2012) Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, & Ondřej Bojar: Selecting data for English-to-Czech machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.374-381. [PDF, 110KB]

(2012) Xinyan Xiao, Deyi Xiong, Min Zhang, Qun Liu, & Shouxun Lin: A topic similarity model for hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.750-758. [PDF, 630KB]

(2012) Daniel Zeman: Data issues of the multilingual translation matrix. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.395-400. [PDF, 225KB]

Statistical methods (general)

(2014) Sandra Antunes & Amália Mendes: An evaluation of the role of statistical measures and frequency for MWE identification. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4046-4051. [PDF, 138KB]

(2014) Christian Buck, Kenneth Heafield, & Bas van Ooyen: N-gram counts and language models from the common crawl. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3579-3584. [PDF, 180KB]

(2014) Boxing Chen & Colin Cherry: A systematic comparison of smoothing techniques for sentence-level BLEU. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.362-367. [PDF, 316KB]

(2014) Yvette Graham, Nikita Mathur, & Timothy Baldwin: Randomized significance tests in machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.266-274. [PDF, 358KB]

(2014) Nasredine Semmar, Othman Zennaki, & Meriama Laib: Using cross-language information retrieval and statistical language modelling in example-based machine translation. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.36-44. [PDF, 274KB]

(2013) Amir Hazem & Emmanuel Morin: A comparison of smoothing techniques for bilingual lexicon extraction from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.24-33. [PDF, 546KB]

(2013) Anil Kumar Singh, Guillaume Wisniewski & François Yvon: LIMSI submission for the WMT’13 quality estimation task: an experiment with n-gram posteriors. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.398-404. [PDF, 204KB]

(2013) Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, & Peter Clark: Semi-Markov phrase-based monolingual alignment. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.590-600. [PDF, 184KB]

(2012) Hanna Béchara, Raphaël Rubino, Yifan He, Yanjun Ma, & Josef van Genabith: An evaluation of statistical post-editing systems applied to RBMT and SMT systems. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.215-230. [PDF, 243KB]

(2012) Chris Dyer: Discriminative training of language models. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 1322KB]

(2012) J.Howard Johnson: Conditional significance pruning: discarding more of huge phrase tables. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 241KB]

(2012) Philipp Koehn & Barry Haddow: Interpolated backoff for factored translation models. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 239KB]

(2012) Saab Mansour & Hermann Ney: A simple and effective weighted phrase extraction for machine translation adaptation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.193-200. [PDF, 641KB]

(2012) Marion Potet, Laurent Besacier, Hervé Blanchon, & Marwen Azouzi: Towards a better understanding of statistical post-editing. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.284-291. [PDF, 624KB]

(2012) Nick Ruiz & Marcello Federico: MDI adaptation for the lazy: avoiding normalization in LM adaptation for lecture translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.244-251. [PDF, 665KB]

(2012) Christer Samuelsson: HAL: challenging three key aspects of IBM-style statistical machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 917KB]

Suffix trees

Support vector machines

(2014) Sandipat Dandapat & Declan Groves: MTWatch: a tool for the analysis of noisy parallel data. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.41-45. [PDF, 190KB]

(2014) Jian Zhang, Liangyou Li, Andy Way, & Qun Liu: A probabilistic feature-based fill-up for SMT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.96-109. [PDF, 482KB]

(2013) David Langlois & Kamel Smaili: LORIA system for the WMT13 quality estimation shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.380-385. [PDF, 177KB]

(2013) Darnes Vilariño, David Pinto, Saúl León, Yuridiana Alemán, & Helena Gómez-Adorno: BUAP: N-gram based feature evaluation for the cross-lingual textual entailment task. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.124-127. [PDF, 83KB]

(2012) Christian Federmann: System combination using joint, binarised feature vectors. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.77-83. [PDF, 116KB]

(2012) Christian Hardmeier, Joakim Nivre, & Jörg Tiedemann: Tree kernels for machine translation quality estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.109-113. [PDF, 423KB]

(2012) David Langlois, Sylvain Raybaud, & Kamel Smaïli: LORIA system for the WMT12 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.114-119. [PDF, 96KB]

(2012) Yashar Mehdad, Matteo Negri, & Marcello Federico: Match without a referee: evaluating MT adequacy without reference translations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.171-180. [PDF, 126KB]

(2012) Radu Soricut, Nguyen Bach, & Ziyuan Wang: The SDL Language Weaver systems in the WMT12 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.145-151. [PDF, 106KB]

(2012) Chunyang Wu & Hai Zhao: Regression with phrase indicators for estimating MT quality. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.152-156. [PDF, 119KB]

(2011) Christian Hardmeier: Improving machine translation quality prediction with syntactic tree kernels. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.233-240. [PDF, 633KB] ; presentation, 18 slides [PDF]

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

(2010) Junguo Zhu, Muyun Yang, Bo Wang, Sheng Li, & Tiejun Zhao: All in strings: a powerful string-based automatic MT evaluation metric with multiple granularities. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1533-1540. [PDF, 159KB]

System development

(2014) Lori Thicke: Machine translation is not one size fits all. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp. 80-104. [PDF, 2094KB]

(2012) Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarranza, Lluís Màrquez, & Kepa Sarasola: Deep evaluation of hybrid architectures: use of different metrics in MERT weight optimization. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina España-Bonet and Aarne Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.65-76. [PDF, 498KB]

(2011) Bertrand Le Chapelain: Machine translation project and activities in EPO “the patent case”. Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.1-16. [PDF, 1271KB]

(2011) Sudip Kumar Naskar, Antonio Toral, Federico Gaspari, & Andy Way: A framework for diagnostic evaluation of MT based on linguistic checkpoints. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.529-536. [PDF, 170KB]

(2011) Eiichi Yamamoto: Future plans of machine translation system in the JPO . Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.37-51. [PDF, 1018KB]

(2010) Najeh Hajlaoui & Christian Boitet: Multilinguization and personalization of NL-based systems. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.79-87. [PDF, 453KB]

(2010) William D.Lewis: Haitian Creole: how to build and ship an MT engine from scratch in 4 days, 17 hours, & 30 minutes. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 518KB]; presentation: 24 slides [PDF, 630KB]

(2010) Martin Volk, Rico Sennrich, Christian Hardmeier, & Frida Tidström: Machine translation of TV subtitles for large scale production. JEC 2010: Second joint EM+/CNGL Workshop “Bringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.53-62. [PDF, 120KB]

Telecommunications

(2010) Alex Waibel: Preliminary findings of the “Interactive systems” vision group. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 15 slides [PDF of PPT, 1059KB]

Text prediction

(2012) Chung-chi Huang, Ping-che Yang, Mei-hua Chen, Hung-ting Hsieh, Ting-hui Kao, & Jason S.Chang: TransAhead: a writing assistant for CAT and CALL. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.16-19. [PDF,

(2012) Shahram Khadivi & Zeinab Vakil: Interactive-predictive speech-enabled computer-assisted translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.237-243. [PDF, 774KB]

(2012) Zeinab Vakil & Shahram Khadivi: A new search approach for interactive-predictive computer-assisted translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1261-1270. [PDF, 155Kb]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: Balancing user effort and translation error in interactive machine translation via confidence measures. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.173-177. [PDF, 97KB]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: On the use of confidence measures within an interactive-predictive machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 525KB]

Vector models

(2013) Boxing Chen, Roland Kuhn, & George Foster: Vector space model for adaptation in statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1285-1293. [PDF, 207KB]

XML in MT

(2011) Hajer Sahnou & Kais Haddar: Automatic translation: comparative study between linguistic methods, selection criteria and some suggestions for NooJ. Tralogy, Paris, 3-4 March 2011; 16pp. [PDF, 280KB

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]