Machine Translation Archive

Index to methods, techniques, processes

Publications since 2010

For other periods go to: publications 2005-2009; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

[To return to home page click here]

Alignment [see also Character alignment, Statistical analysis, Word alignment]

(2015) Franck Burlot & François Yvon: Morphology-aware alignments for translation to and from a synthetic language. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 188-195. [PDF, 3.1MB]

(2014) Petra Barančiková, Rudolf Rosa, & Aleš Tamchyna: Improving evaluation of English-Czech MT through paraphrasing.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 127KB]

(2014) Kevin Flanagan; Bilingual phrase-to-phrase alignment for arbitrarily-small datasets. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 83-95. [PDF, 1203KB]

(2014) Kevin Flanagan: Filling in the gaps: what we need from TM subsegment recall. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.12-23. [PDF, 508KB]

(2014) Timur Gilmanov, Olga Scrivner, & Sandra Kübler: SWIFT aligner, a multifunctional tool for parallel corpora: visualization, word alignment, and (morpho)-syntactic cross-language transfer. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2913-2919. [PDF, 284KB]

(2014) Fabrizio Gotti, Philippe Langlais, & Atefeh Farzindar: Hashtag occurrences, layout and translation: a corpus-driven analysis of tweets published by the Canadian government.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2254-2261. [PDF, 486KB]

(2014) Sharid Loáiciga, Thomas Meyer, & Andrei Popescu-Belis: English-French verb phrase alignment in Europarl for tense translation modeling. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 674-681. [PDF, 179KB]

(2014) Juan Luo & Yves Lepage: Production of phrase tables in 11 European languages using an improved sub-sentential aligner. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.664-669. [PDF, 206KB]

(2014) Saab Mansour & Herman Ney: Translation model based weighting for phrase extraction. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.35-43. [PDF, 448KB]

(2014) Mircea Petic & Daniela Gîfu: Transliteration and alignment of parallel texts from Cyrillic to Latin. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1819-1823. [PDF, 462KB]

(2014) Lise Rebout & Philippe Langlais: An iterative approach for mining parallel sentences in a comparable corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.648-655. [PDF, 201KB]

(2014) Michaela Regneri, Rui Wang, & Manfred Pinkal: Aligning predicate-argument structures for paraphrase fragment extraction. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4300-4307. [PDF, 216KB]

(2014) Dan Tufiş, Radu Ion, Ştefan Dumitrescu, & Dan Ştefănescu: Large SMT data-sets extracted from Wikipedia. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.656-663. [PDF, 232KB]

(2014) Pavel Vondřička: Aligning parallel texts with InterText. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1875-1879. [PDF, 476KB]

(2013) Ahmed El Kholy, Nizar Habash, Gregor Leusch, Evgeny Matusov, & Hassan Sawaf: Language independent connectivity strength features for phrasal pivot statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.412-418. [PDF, 200KB]; revised version.

(2013) Yufeng Chen, Chengqing Zong; & Keh-Yi Su: A joint model to identify and align bilingual named entities. Computational Linguistics 39 (2); pp.229-266. [PDF, 708KB]

(2013) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Accurate parallel fragment extraction from quasi-comparable corpora using alignment model and translation lexicon. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1144-1150. [PDF, 415KB]

(2013) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Chinese-Japanese parallel sentence extraction from quasi-comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.34-42. [PDF, 469KB]

(2013) Li Gong, Aurélien Max, & François Yvon: Improving bilingual sub-sentential alignment by sampling-based transpotting. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 431KB]

(2013) Rajdeep Gupta, Santanu Pal, & Sivaji Bandyopadhyay: Improving MT system using extracted parallel fragments of text from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.69-76. [PDF, 436KB]

(2013) Miriam Kaeshammer: Synchronous linear context-free rewriting systems for machine translation. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 68-77. [PDF, 213KB]

(2013) Georgios Kontonatsios, Ioannis Korkontzelos, Jun’ichi Tsujii, & Sophia Ananiadou: Using a random forest classifier to recognise translations of biomedical terms across languages.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.95-104. [PDF, 656KB]

(2013) Fethi Lamraoui & Philippe Langlais: Yet another fast, robust and open source sentence aligner: time to reconsider sentence alignment? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.77-84. [PDF, 610KB]

(2013) Tingting Li, Tiejun Zhao, Andrew Finch, & Chunyue Zhang: A tightly-coupled unsupervised clustering and bilingual alignment model for transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.393-398. [PDF, 698KB]

(2013) Santanu Pal, Sudio Kumar Naskar, & Sivaji Bandyopadhyay: MWE alignment in phrase based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.61-68. [PDF, 539KB]

(2013) Magdalena Plamadă & Martin Volk: Mining for domain-specific text from Wikipedia.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.112-120. [PDF, 382KB]

(2013) Xiaojun Quan, Chunyu Kit, & Yan Song: Non-monotonic sentence alignment via semisupervised learning.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.622-630. [PDF, 348KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Scalable variational inference for extracting hierarchical phrase-based translation rules. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.438-446. [PDF, 310KB]

(2013) Andrei Simion, Michael Collins, & Clifford Stein: A convex alternative to IBM model 2. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1574-1583. [PDF, 172KB]

(2013) Tian Xia, Zongcheng Ji, Shaodan Zhai, Yidong Chen, Qun Liu, & Shaojun Wang: Improving alignment of system combination by using multi-objective optimization. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.535-544. [PDF, 204KB]

(2013) Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, & Peter Clark: Semi-Markov phrase-based monolingual alignment. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.590-600. [PDF, 184KB]

(2013) Omar Zaidan & Vishal Chowdhary: Evaluating (and improving) sentence alignment under noisy conditions. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.484-493. [PDF, 194KB]

(2012) Sophie Arnoult & Khalil Sima’an: Adjunct alignment in translation data with an application to phrase-based statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.287-294. [PDF, 164KB]

(2012) Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, & Daniel Zeman: Automatic MT error analysis: Hjerson helping Addicter.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2158-2163. [PDF, 580KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Cristina Bosco, Manuela Sanguinetti, & Leonardo Lesmo: The Parallel-TUT: a multilingual and multiformat treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3982-3987. [PDF, 460KB]

(2012) Julien Bourdaillet & Philippe Langlais: Identifying infrequent translations by aligning non parallel sentences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 331KB]

(2012) Victor Chahuneau, Noah A.Smith, & Chris Dyer: pycdec: a Python interface to cdec. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.51-61. [PDF, 280KB]

(2012) Yu Chen & Andreas Eisele: MultiUN v2: UN documents with multilingual alignments.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2500-2504. [PDF, 405KB]

(2012) Cyril Goutte, Marine Carpuat, & George Foster: The impact of sentence alignment errors on phrase-based machine translation performance. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 147KB]

(2012) Petter Haugereid & Francis Bond: Extracting semantic transfer rules from parallel corpora with SMT phrase aligners. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.67-75. [PDF, 194KB]

(2012) Hieu Hoang: Phrase-based translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 103 slides [PDF of PPT, 1227KB]

(2012) Maria Holmqvist, Sara Stymne, Lars Ahrenberg, & Magnus Merkel: Alignment-based reordering for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3436-3440. [PDF, 300KB]

(2012) Joseph Max Kaufmann: JMaxAlign: a maximum entropy parallel sentence alignment tool. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 277-287. [PDF, 342KB]

(2012) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Whitepaper for shared task on learning reordering from word alignments at RSMT 2012. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.1-8. [PDF, 326KB]

(2012) Svetla Koeva, Borislav Rizov, Ivelina Stoyanova, Svetlozara Leseva, Rositsa Dekova, Angel Genov, Ekaterina Tarpomanova, Tsvetana Dimitrova, & Hristina Kukova: Application of clause alignment for statistical machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.102-110. [PDF, 135KB]

(2012) Gideon Kotzé, Vincent Vandeghinste, Scott Martens, & Jörg Tiedemann: Large aligned treebanks for syntax-based machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.467-473. [PDF, 498KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

(2012) Nitin Madnani, Joel Tetreault, & Martin Chodorow: Exploring grammatical error correction with not-so-crummy machine translation. [ACL 2012] The 7th Workshop on the Innovative Use of NLP for Building Educational Applications, Montréal, Canada, June 3-8, 2012; pp.44-53. [PDF, 227KB]

(2012) Mohammed Mediani, Jan Niehues, & Alex Waibel: Parallel phrase scoring for extra-large corpora. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.87-98. [PDF, 142KB]

(2012) Coşkun Mermer, Hamza Kaya, İlknur Durgar El-Kahlout, & Mehmet Uğur Doğan: The TÜBİTAK statistical machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.144-148. [PDF,630KB]

(2012) Yoh Okuno: Applying mpaligner to machine transliteration with Japanese-specific heuristics. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.61-65. [PDF, 348KB]

(2012) Santanu Pal & Sivaji Bandyopadhyay: Bootstrapping method for chunk alignment in phrase based SMT.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 93-100. [PDF, 246KB]

(2012) Mārcis Pinnis, Radu Ion, Dan Ştefănescu, Fangzhong Su, Inguna Skadiņa, Andrejs Vasiļjevs, &  Bogdan Babych: ACCURAT toolkit for multi-level alignment and information extraction from comparable corpora. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.91-96. [PDF, 235KB]

(2012) Magdalena Plamada & Martin Volk: Towards a Wikipedia-extracted Alpine corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.81-87. [PDF, 364KB]

(2012) Darcey Riley & Daniel Gildea: Improving the IBM alignment models using variational Bayes. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.306-310. [PDF, 103KB]

(2012) Antti-Veikko I.Rosti, Xiaodong He, Damianos Karakos, Gregor Leusch, Yuan Cao, Markus Freitag, Spyros Matsoukas, Hermann Ney, Jason R.Smith & Bing Zang: Review of hypothesis alignment algorithms for MT system combination via confusion network decoding. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.191-199. [PDF, 150KB]

(2012) Amit Sangodkar & Om P.Damani: Re-ordering source sentences for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2164-2171. [PDF, 649KB]

(2012) Michel Simard: Book review: Bitext alignment [by] Jörg Tiedemann. Computational Linguistics 38 (4); pp.799-825. [PDF, 37KB]

(2012) Fangzhong Su & Bogdan Babych: Development and application of a cross-language document comparability metric.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3956-3962. [PDF, 582KB]

(2012) Fangzhong Su & Bogdan Babych: Measuring comparability of documents in non-parallel corpora for efficient extraction of (semi-)parallel translation equivalents. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.10-19. [PDF, 188KB]

(2012) George Tambouratzis, Michalis Troullinos, Sokratis Sofianopoulos, & Marina Vassiliou: Accurate phrase alignment in a bilingual corpus for EBMT systems.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.104-111. [PDF, 294KB]

(2012) Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen, & Matthias Zumpe: A distributed resource repository for cloud-based machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2207,2213. [PDF, 441KB]

(2012) Antonio Toral, Marc Poch, Pavel Pecina, & Gregor Thurmair: Efficiency-based evaluation of aligners for industrial applications. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.57-60. [PDF, 210KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Aligning bilingual literary works: a pilot study. NAACL-HLT Workshop on Computational Linguistics for Literature, Montréal, Canada, June 8, 2012; pp.36-44. [PDF, 232KB]

(2012) Chun-Kai Wu, Yu-Chun Wang, & Richard Tzong-Han Tsai: English-Korean named entity transliteration using substring alignment and re-ranking rules. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.57-60. [PDF, 103KB]

(2012) Xiaofeng Wu, Tsyoshi Okita, Josef van Genabith, & Qun Liu: System combination with extra alignment information. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.37-44. [PDF, 153KB]

(2012) Joern Wuebker, Mei-Yuh Hwang, & Chris Quirk: Leave-one-out phrase model training for large-scale deployment. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.460-467. [PDF, 223KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Revisiting sentence alignment algorithms for alignment visualization and evaluation. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.10-16. [PDF, 430KB]

(2012) Jingbo Zhu, Tong Xiao, & Chunliang Zhang: Learning better rule extraction with translation span alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.280-284. [PDF, 235KB]

(2011) Mohit Bansal, Chris Quirk, & Robert C.Moore: Gappy phrasal alignment by agreement . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1308-1317. [PDF, 215KB]

(2011) Anja Belz & Eric Kow: Unsupervised alignment of comparable data and text resources. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.102-109. [PDF, 143KB]

(2011) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Improved statistical machine translation using multiword expressions. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.15-20. [PDF, 345KB]

(2011) Houda Bouamor, Aurélien Max, & Anne Vilnat: Monolingual alignment by edit rate computation on sentential paraphrase pairs. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.395-400. [PDF, 138KB]

(2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

(2011) Oliver Čulo, Silvia Hansen-Schirra, Karin Maksymski, & Stella Neumann: Empty links and crossing lines: querying multi-layer annotation and alignment in parallel corpora. Translation: Corpora, Computation, Cognition 1 (1), December 2011; pp.75-104. [PDF, 923KB]

(2011) Takaaki Fukunishi, Seiichi Yamamoto, Andrew Finch, & Eiichiro Sumita: Using features from a bilingual alignment model in transliteration mining. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.49-57. [PDF, 660KB]

(2011) Rashmi Gangadharaiah: Reducing asymmetry between language-pairs to improve alignment and translation quality. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1346-1350. [PDF, 239KB]

(2011) Mridul Gupta, Sanjika Hewavitharana, & Stephan Vogel: Extending a probabilistic phrase alignment approach for SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.175-182. [PDF, 573KB]

(2011) Sanjika Hewavitharana & Stephan Vogel: Extracting parallel phrases from comparable data. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.61-68. [PDF, 217KB]

(2011) Mark Hopkins, Greg Langmead & Tai Vo: Extraction programs: a unified approach to translation rule extraction. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.523-532. [PDF, 552KB]

(2011) Radu Ion, Alexandru Ceauşu, & Elena Irimia: An expectation maximization algorithm for textual unit alignment. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.128-135. [PDF, 1035KB]

(2011) Jagadeesh Jagarlamudi, Hal Daumé III, & Raghavendra Udupa: From bilingual dictionaries to interlingual document representations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.147-152. [PDF, 112KB]

(2011) Mike Tian-Jian Jiang, Chan-Hung Kuo, & Wen-Lai Hsu: English-toChinese machine transliteration using accessor variety features of source graphemes. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.86-90. [PDF, 282KB]

(2011) Wang Ling, Tiago Luís, João Graça, Luísa Coheur, & Isabel Trancoso: Reordering modeling using weighted alignment matrices. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.450-454. [PDF, 121KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Najmeh Mousavi Nejad & Shahram Khadivi: An unsupervised alignment model for sequence labeling: application to name transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.73-81. [PDF, 155KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, & Tatsuya Kawahara: An unsupervised model for joint phrase alignment and extraction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.632-641. [PDF, 782KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) Ying Qin & Guohua Chen: Forward-backward machine transliteration between English and Chinese based on combined CRFs. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.82-85. [PDF, 168KB]

(2011) Johan Segura & Violaine Prince: Using alignment to detect multiword expressions in bilingual corpora. .  Tralogy, Paris, 3-4 March 2011;  17pp. [PDF, 430KB]

(2011) Kashif Shah, Loïc Barrault, & Holger Schwenk: Parametric weighting of parallel data for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1323-1331. [PDF, 754KB]

(2011) Kiril Simov, Petya Osenova, Laska Laskova, Aleksandar Savkov, & Stanislava Kancheva: Bulgarian-English parallel treebank: word and semantic level alignment. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.29-38. [PDF, 396KB]

(2011) Wolfgang Täger: The sentence-aligned European patent corpus. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.177-184. [PDF, 234KB]

(2011) Nadi Tomeh, Marco Turchi, Guillaume Wisinewski, Alexandre Allauzen, & François Yvon: How good are your phrases? Assessing phrase quality with single class classification. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.261-268. [PDF, 691KB]

(2011) Dan Zeman, Mark Fishel, Jan Berka, & Ondřej Bojar: Addicter: what’s wrong with my translations? Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.79-88. [PDF, 276KB]; presentation, 20 slides [PDF of PPT, 359KB]

(2011) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, & Haizhou Li: Joint alignment and artificial data generation: an empirical study of pivot-based machine transliteration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1207-1215. [PDF, 319KB]

(2010) Lars Ahrenberg: Alignment-based profiling of Europarl data in an English-Swedish parallel corpus. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3398-3404. [PDF, 349KB]

(2010) Stacey Bailey: Data preparation for machine translation customization. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Gosse Bouma: Cross-lingual ontology alignment using EuroWordNet and Wikipedia. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1023-1028. [PDF, 336KB]

(2010) Fabienne Braune & Alexander Fraser: Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.81-89. [PDF, 228KB]

(2010) Anton Bryl & Josef van Genabith: f-align: an open-source alignment tool for LFG f-structures. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 229KB]

(2010) Mauro Cettolo, Marcello Federico, & Nicola Bertoldi: Mining parallel fragments from comparable texts. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.227-234. [PDF, 393KB]

(2010) Yufeng Chen, Chenqing Zong, & Keh-Yih Su: On jointly recognizing and aligning bilingual named entities. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.631-639. [PDF, 151KB]

(2010) John DeNero & Dan Klein: Discriminative modeling of extraction sets for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1453-1463. [PDF, 616KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: A fully unsupervised approach for mining parallel data from comparable corpora. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 856KB]; presentation: 30 slides [PDF, 2621KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Miquel Esplà-Gomis & Mikel L.Forcada: Combining content-based and URL-based heuristics to harvest aligned bitexts from multilingual sites with Bitextor. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.77-86. [PDF, 160KB]

(2010) Mark Fishel & Heiki-Jaan Kaalep: CorporAl: a method and tool for handling overlapping parallel corpora. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.67-76. [PDF, 149KB]; presentation [PDF, 1037KB]

(2010) Carmen Heger, Joern Wuebker, David Vilar, & Hermann Ney: A combination of hierarchical systems with forced alignments from phrase-based systems. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.291-297. [PDF, 377KB]

 (2010) Elena Irimia & Alexandru Ceausu: Dependency-based translation equivalents for factored machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 60KB]

(2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2010) Patrik Lambert, Simon Petitrenaud, Yanjun Ma & Andy Way: Statistical analysis of alignment characteristics for phrase-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 588KB]; presentation: 26 slides [PDF, 133KB]

(2010) Peng Li, Maosong Sun, & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.710-718. [PDF, 267KB]

 (2010) Peng Li, Maosong Sun & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: Towards a general and extensible phrase-extraction algorithm. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.313-320. [PDF, 382KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Discriminative pruning for discriminative ITG alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.316-324. [PDF, 972KB]

(2010) Minh-Thang Luong & Min-Yen Kan: Enhancing morphological alignment for translating highly inflected languages. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.743-751. [PDF, 610KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.101-109. [PDF, 191KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Linear inversion transduction grammar alignments as a second translation path. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 167-171. [PDF, 111KB]

(2010) Thomas Schoenemann: Computing optimal alignments for the IBM-3 translation model. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.98-106. [PDF, 130KB]

(2010) Rico Sennrich & Martin Volk: MT-based sentence alignment for OCR-generated parallel texts. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 174KB]

(2010) Jason R.Smith, Chris Quirk, & Kristina Toutanova: Extracting parallel sentences from comparable corpora using document level alignment. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.403-411. [PDF, 313KB]

(2010) Anders Søgaard: Can inversion transduction grammars generate hand alignments? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 6pp. [PDF, 602KB]; presentation by Michael Carl: 19 slides [PDF, 122KB]

(2010) Daniel Stein, Christoph Schmidt, & Hermann Ney: Sign language machine translation overkill. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.337-344. [PDF, 2052KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Exploring syntactic structural features for sub-tree alignment using bilingual tree kernels. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.306-315. [PDF, 286KB]

(2010) Jörg Tiedemann: Lingua-Align: an experimental toolkit for automatic tree-to-tree alignment.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.736-743. [PDF, 296KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

 (2010) Ventsislav Zhechev: Highlighting matched and mismatched segments in translation memory output through sub-tree alignment. Translating and the Computer 32, 18-19 November 2010, London; 11pp. [PDF, 483KB]; presentation, 33 slides [PDF of PPT, 1798KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Maximising TM performance through sub-tree alignment and SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 293KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Seeding statistical machine translation with translation memory output through tree-based structural alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.43-51. [PDF, 510KB]

Analogies and analogical modelling (see also Example-based methods)

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2012) João Silva, Luísa Coheur, Ângela Costa, & Isabel Trancoso: Dealing with unknown words in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3977-3981. [PDF, 406KB]

(2011) Kota Takeya & Yves Lepage: Marker-based chunking for analogy-based translation of chunks.  MT Summit  XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.338-345. [PDF, 171KB]

(2010) Dmitry Davidov & Ari Rappoport: Automated translation of semantic relationships. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.241-249. [PDF, 196KB]

ASR see Speech recognition

Character alignment

(2013) Lisa Beinborn, Torsten Zesch & Iryna Gurevych: Cognate production using character-based machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.883-891. [PDF, 228KB]

(2013) Yves Scherrer & Tomaž Erjavec: Modernizing historical Slovene words with character-based SMT. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.58-62. [PDF, 110KB]

(2013) Katsuhito Sudoh, Shinsuke Mori & Masaaki Nagata: Noise-aware character alignment for bootstrapping statistical machine transliteration from bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.204-209. [PDF, 617KB]

(2013) Jörg Tiedemann & Preslav Nakov: Analyzing the use of character-level translation with sparse and noisy datasets. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.676-684. [PDF, 133KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Chinese characters mapping table of Japanese, traditional Chinese and simplified Chinese. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2149-2152. [PDF, 410KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, Daisuke Kawahara, & Sadao Kurohashi: Exploiting shared Chinese characters in Chinese word segmentation: optimization for Chinese-Japanese machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.35-42. [PDF, 359KB]

(2012) Chang Liu & Hwee Tou Ng: Character-level machine translation evaluation for languages with ambiguous word boundaries. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.921-929. [PDF, 196KB]

(2012) Preslav Nakov & Jörg Tiedemann: Combining word-level and character-level models for machine translation between closely-related languages. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.301-305. [PDF, 122KB]

(2012) Graham Neubig, Taro Watanabe, Shinsuke Mori, & Tatsuya Kawahara: Machine translation without words through substring alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.165-174. [PDF, 596KB]

(2012) Jörg Tiedemann: Character-based pivot translation for under-resourced languages and domains. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.141-151. [PDF, 254KB]

(2012) Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang, & Jiajun Chen: Enhancing statistical machine translation with character alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.285-290. [PDF, 171KB]

(2012) Chunye Zhang, Tingting Li, & Tiejun Zhao: Syllable-based machine transliteration with extra phrase features.  [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.52-56. [PDF, 163KB]

(2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

(2011) Canasai Kruengkrai, Thatsanee Charoenporn, & Virach Sornlertlamvanich: Simple discriminative training for machine transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.28-31. [PDF, 514KB]

Consistency

(2014) Chi-kiu Lo & Dekai Wu: On the reliability and inter-annotator agreement of human semantic MT evaluation via HMEANT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.602-607. [PDF, 279KB]

(2013) Trevor Cohn & Lucia Specia: Modelling annotator bias with multi-task Gaussian processes: an application to machine translation quality estimation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp. 32-42. [PDF, 265KB]

(2013) Liane Guillou: Analysing lexical consistency in translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.10-18. [PDF, 350KB]

(2012) Nicola Bertoldi, Robert Grabowski, Liane Guillou, Michal Novak, Sorin Slavescu, & Jose de Souza: Document-level decoding in Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 16 slides [PDF of PPT, 71KB]

(2012) Marine Carpuat & Michel Simard: The trouble with SMT consistency. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.442-449. [PDF, 106KB]

(2012) Adam Lopez: Putting human assessments of machine translation systems in order. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.1-9. [PDF, 112KB]

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

 (2011) Yanjun Ma, Yifan He, Andy Way, & Josef van Genabith: Consistent translation using discriminative learning: a translation memory-inspired approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1239-1248. [PDF, 193KB]

 (2011) Joss Moorkens: Translation memories guarantee consistency: truth or fiction?  Translating and the Computer 33, 17-18 November 2011, London; 15pp. [PDF, 147KB]

 (2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

Context-based and context-aware methods

(2013) Waleed Ammar, Victor Chahuneau, Michael Denkowski, Greg Hanneman, Wang Ling, Austin Matthews, Kenton Murray, Nicola Segall, Alon Lavie & Chris Dyer: The CMU machine translation systems at WMT 2013: syntax, synthetic translation options, and pseudo-references.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.70-77. [PDF, 201KB]

(2013) Sankaranarayanan Ananthakrishnan, Wei Chen, Rohit Kumar, & Dennis Mehay: Source aware phrase-based decoding for robust conversational spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 222KB]

(2013) Mihael Arcan & Paul Buitelaar: Ontology label translation. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.40-46. [PDF, 127KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Context vector disambiguation for bilingual lexicon extraction from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.759-764. [PDF, 199KB]

(2013) Isao Goto, Masao Utiyama, Eiichiro Sumita, Akihiro Tamura, & Sadao Kurohashi: Distortion model considering rich context for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.155- 165. [PDF, 808KB]

(2013) Hien Vu Huy, Phuong-Thai Nguyen, Tung-Lam Nguyen, & M.L.Nguyen: Bootstrapping phrase-based statistical machine translation via WSD integration. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1042-1046. [PDF, 231KB]

(2013) Wenduan Xu, Yue Zhang, Philip Williams, & Philipp Koehn: Learning to prune: context-sensitive pruning for syntactic MT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.352-357. [PDF, 334KB]

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2013) Samira Tofighi Zahabi, Somayeh Bakhshaei, & Shahram Khadivi: Using context vectors in improving a machine translation system with bridge language.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.318-322. [PDF, 193KB]

(2012) Li Gong, Aurélien Max, & François Yvon: Towards contextual adaptation for any-text translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.292-299. [PDF, 727KB]

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

(2012) Victor Muntés-Mulero, Patricia Paladini Adell, Cristina España-Bonet, Lluís Màrquez: Context-aware machine translation for software localization. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.77-80. [PDF, 424KB]

(2012) Mei Yang & Katrin Kirchhoff: Unsupervised  translation disambiguation for cross-domain statistical machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 424KB]

(2011) Daniel Andrade, Takuya Matsuzaki, & Jun’ichi Tsujii: Learning the optimal use of dependency-parsing information for finding translations with comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.10-18. [PDF, 651KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Marta R.Costa-jussà & Rafael E.Banchs: The BM-I2R Haitian-Créole-to-English translation system description for the WMT 2011 evaluation campaign. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.452-456. [PDF, 101KB]

(2011) Zhengxian Gong, Min Zhang, & Guodong Zhou: Cache-based document-level statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.909-919. [PDF, 330KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Jan Niehues, Teresa Herrmann, Stephan Vogel, & Alex Waibel: Wider context by using bilingual langauge models in machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.198-206. [PDF, 570KB]

(2011) Takashi Onishi, Masao Utiyama, & Eiichiro Sumita: Reordering constraint based on document-level context. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.434-438. [PDF, 128KB]

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2011) Avneesh Saluja, Ian Lane, & Ying Zhang: Context-aware language modeling for conversational speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.97-104. [PDF, 177KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2011) Deyi Xiong, Min Zhang, & Haizhou Li: Enhancing language models in statistical machine translation with backward n-grams and mutual information triggers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1288-1297. [PDF, 620KB]

(2011) Jaako Väyrynen, Timo Honkela, & Marcus Dobrinkat: Context in machine translation: challenge and data set. META-NET Workshop: Context in MT, June 14, 2011; 10 slides [PDF of PPT, 146KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

(2010) Rabih Zbib, Spyros Matsoukas, Richard Schwartz, & John Makhoul: Decision trees for lexical smoothing in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the work shop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 428-437. [PDF, 2766KB]

Decomposition see Segmentation

Disambiguation (see also Word sense disambiguation)

(2015) Martin Benjamin, Amar Mukunda & Jeff Allen: Kamusi pre-D-source-side disambiguation and a sense aligned multilingual lexicon. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.27-32. [PDF, 188KB]

(2015) Teresa Herrmann, Jan Niehues, &Alex Waibel: Source discriminative word lexicon for translation disambiguation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.135-142. [PDF, 3.1MB]

(2014) Irina Burukina: Translating implicit elements in RBMT. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.182-193. [PDF, 209KB]

(2014) László János Laki & György Orosz: An efficient language independent toolkit for complete morphological disambiguation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1625-1630. [PDF, 178KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2013) Victor Chahuneau, Noah A.Smith, & Chris Dyer: Knowledge-rich morphological priors for Bayesian language models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1206-1215. [PDF, 208KB]

(2013) Annette Rios & Anne Göhring: Machine learning disambiguation of Quechua verb morphology. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.13-18. [PDF, 226KB]

(2013) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Handling ambiguities of bilingual predicate-argument structures for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1127-1136. [PDF, 269KB]

(2012) Marianna Apidianaki, Guillaume Wisniewski, Artem Sokolov, Aurélien Max, & François Yvon: WSD for n-best reranking and local language modeling in SMT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.1-9. [PDF, 108KB]

(2012) Bettina Harriehausen-Mühlbauer & Timm Heuss: Semantic web based machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.1-9. [PDF, 196KB]

(2012) Jayan V, Sunil R, & Bhadran V K: Disambiguation of pre/post positions in English – Malayalam text translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 93-101. [PDF, 249KB]

(2012) Thomas Meyer & Andrei Popescu-Belis: Using sense-labeled discourse connectives for statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.129-138. [PDF, 275KB]

(2012) Francis M.Tyers, Felipe Sánchez-Martínez, & Mikel L.Forcada: Flexible finite-state lexical selection for rule-based machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.213-220. [PDF, 211KB]

(2012) Guangyou Zhou, Kang Liu, & Jun Zhao: Exploiting bilingual translation for question retrieval in community-based question answering. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3153-3170. [PDF, 352KB]

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

(2011) Shujian Huang, Stephan Vogel, & Jiajun Chen: Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.379-383. [PDF, 182KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

(2011) Shoichi Yokoyama & Yuichi Takano: Investigation of translation disambiguation of verbs in patent sentences using word grouping. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.60-63. [PDF, 152KB]

(2010) Anabele Barreiro, Annibale Elia, Johanna Monti, & Mario Monteleone: Mixed up with machine translation: multi-word units disambiguation challenge.  Translating and the Computer 32, 18-19 November 2010, London; 21pp. [PDF, 138KB]

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

Divergence see Lexical transfer, Syntactic transfer

Estimation maximization see Learning and training, Statistical analysis

Feedback see Learning and training

Generation (see also Text prediction)

(2013) Matthias Huck, David Vilar, Markus Freitag & Hermann Ney: A performance study of cube pruning for large-scale hierarchical machine translation. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 29-38. [PDF, 191KB]

(2013) Gennadi Lembersky, Noam Ordan, & Shuly Wintner: Improving statistical machine translation by adapting translation models to translationese. Computational Linguistics 39 (4); pp.999-1023. [PDF, 220KB]

(2013) Libin Shen & Bowen Zhou: What is hidden among translation rules. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.839-844. [PDF,118KB]

(2013) Sara Stymne, Nicola Cancedda, & Lars Ahrenberg: Generation of compound words in statistical machine translation into compounding languages. Computational Linguistics 39 (4); pp.1067-1108. [PDF, 615KB]

(2013) Kun Wang, Chengqing Zong, & Key-Yih Su: Integrating translation memory into phrase-based machine translation during decoding. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.11-21. [PDF, 1776KB]

(2012) Tod Allman, Stephen Beale, & Richard Denton: Linguist’sAssistant: a multi-lingual natural language generator based on linguistic universals, typologies, and primitives. INLG 2012: Proceedings of the 7th International Natural Language Generation Conference, Utica, May 2012; pp.59-66. [PDF, 641KB]

 (2012) Ahmed El Kholy & Nizar Habash: Rich morphology generation using statistical machine translation. INLG 2012 Proceedings of the 7th International Natural Language Generation Conference, Utica, May 2012; pp.90-94. [PDF, 100KB]

(2012) Yang Feng, Yang Liu, Qun Liu, & Trevor Cohn: Left-to-right tree-to-string decoding with prediction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1191-1200. [PDF, 511KB]

(2012) Lluís Formiga, Adolfo Hernández, José B.Mariño, & Enric Monte: Improving English to Spanish out-of-domain translations by morphology generalization and generation. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.6-16. [PDF, 274KB]

(2012) Lluís Formiga, Carlos A.Henríquez Q., Adolfo Hernández, José B.Mariño, Enric Monte, & José A.R.Fonollosa: The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.275-282. [PDF, 500KB]

(2012) Spence Green & John DeNero: A class-based agreement model for generating accurately inflected translations. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.22-30. [PDF, 284KB]

(2012) Li Maoxi & Wang Mingwen: Confusion network based system combination for Chinese translation output: word-level or character-level? COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.17-24. [PDF, 282KB]

(2011) Anja Belz & Eric Kow: Unsupervised alignment of comparable data and text resources. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.102-109. [PDF, 143KB]

(2011) Jin-Woo Chung & Jong C.Park: Text parsing for sign language generation with combinatory categorial grammar. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 11pp. [PDF, 420KB]

(2011) Lei Cui, Dongdong Zhang, Mu Li, & Ming Zhou: Function word generation in statistical machine translation systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.139-146. [PDF, 184KB]

(2011) Andrew Finch, Wei Song, Kumiko Tanana-Ishii, & Eiichiro Sumita: Source language generation from pictures for machine translation on mobile devices. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.263-274. [PDF, 1504KB]

(2011) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Fritzinger: Morphological generation of German for SMT. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 21KB];  presentation: 44 slides [PDF of PPT, 307KB]

(2011) Juri Ganitkevitch, Chris Callison-Burch, Courtney Napoles, & Benjamin Van Durme: Learning sentential paraphrases from bilingual parallel corpora for text-to-text generation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1168-1179. [PDF, 311KB]

(2011) Monica Gavrila: Constrained recombination in an example-based machine translation system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. M ikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.193-200. [PDF, 417KB]; presentation, 36 slides [PDF]

(2011) François Lareau, Mark Dras, Benjamin Börschinger, & Robert Dale: Collocations in multilingual natural language generation: lexical functions meet lexical functional grammar. [ALTA 2011] Proceedings of Australasian Language Technology Association Workshop, Canberra, Australia, 1-2 December 2011; pp.95-104. [PDF, 302KB]

(2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

 (2011) Sara Stymne & Nicola Cancedda: Productive generation of compound words in statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.250-260. [PDF, 167KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2010) Arafat Ahsan, Prasanth Kolachina, Sudheer Kolachina, Dipti Misra Sharma, & Rajeev Sangal: Coupling statistical machine translation with rule-based transfer and generation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 162KB]

(2010) Mohamed Altantawy, Nizar Habash, Owen Rambow, & Ibrahim Saleh: Morphological analysis and generation of Arabic nouns: a morphemic functional approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.851-858. [PDF,               899KB]

(2010) Nicola Bertoldi: IRSTLM toolkit. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 34 slides. [PDF, 381KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Fast consensus hypothesis regeneration for machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 11-16. [PDF, 120KB]

(2010) Chen Yuncong & Pascale Fung: Unsupervised synthesis of multilingual Wikipedia articles. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.197-205. [PDF, 1040KB]

(2010) Markus Dreyer: A non-parametric model for the discovery of inflectional paradigms from plain text using graphical models over strings. Dissertation defense, November 5, 2010, Johns Hopkins University; presentation: 200 slides [PDF of PPT, 11706KB]

Inference see Learning and training, Logic-based MT

Interpretation and MT see Spoken language translation

Language models (see also Generation)

(2014) Shyam S.Agrawal, Abhimane, Shweta Bansal, & Minakshi Mahakshi: Statistical analysis of multilingual text corpus and development of language models. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2436-2440. [PDF, 815KB]

(2014) Christian Buck, Kenneth Heafield, & Bas van Ooyen: N-gram counts and language models from the common crawl. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3579-3584. [PDF, 180KB]

(2014) Nikola Ljubešić & Antonio Toral: caWaC – a web corpus of Catalan and its application to language modeling an machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1728-1732. [PDF, 138KB]

(2014) Anthony Rousseau, Paul Deléglise, & Yannick Estève: Enhancing the TED-LIUM corpus with selected data for language modeling and more TED talks. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3935-3939. [PDF, 115KB]

(2013) Victor Chahuneau, Noah A.Smith, & Chris Dyer: Knowledge-rich morphological priors for Bayesian language models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1206-1215. [PDF, 208KB]

(2013) Nal Kalchbrenner & Phil Blunsom: Recurrent continuous translation models. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1700-1709. [PDF, 274KB]

(2013) Rui Wang, Masao Utiyama, Isao Goto, Eiichiro Sumita, Hai Zhao, & Bao-Liang Lu: Converting continuous-space language models into n-gram language models for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.845-850. [PDF, 571KB]

(2012) Chris Dyer: Discriminative training of language models. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 1322KB]

(2012) Casey Redd Kennington, Martin Kay, & Annemarie Friedrich: Suffix trees as language models.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.446-453. [PDF, 1201KB]

(2012) Gennadi Lembersky, Noam Ordan, & Shuly Wintner: Language models for machine translation: original vs. translated texts. Computational Linguistics 38 (4); pp.799-825. [PDF, 200KB]

(2012) Shixiang Lu, Wei Wei, Xiaoyin Fu, & Bo Xu: Translation model based cross-lingual language model adaptation: from word models to phrase models. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.512-522. [PDF, 196KB]

(2012) Yue Zhang, Graeme Blackwood, & Stephen Clark: Syntax-based word ordering incorporating a large-scale language model. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 736-746. [PDF, 155KB]

(2011) Nadir Durrani, Helmut Schmid, & Alexander Fraser: A joint sequence translation model with integrated reordering. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1045-1054. [PDF, 254KB]

(2010) Nicola Bertoldi: Statistical language models. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 28 slides. [PDF, 495KB]

 

Lexical acquisition see Lexical resources in Data-1

Lexical transfer

(2014) David Kamholz, Jonathan Pool, & Susan M.Colowick: PanLex: building a resource for panlingual lexical translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 145KB]

(2014) Gregor Thurmair: Conceptual transfer: using local classifiers for transfer selection. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4387-4393. [PDF, 409KB]

(2013) Marine Carpuat: A semantic evaluation of machine translation lexical choice. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp.1-10. [PDF, 145KB]

(2013) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Accurate parallel fragment extraction from quasi-comparable corpora using alignment model and translation lexicon. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1144-1150. [PDF, 415KB]

(2013) Liane Guillou: Analysing lexical consistency in translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.10-18. [PDF, 350KB]

(2013) Georgios Kontonatsios, Ioannis Korkontzelos, Jun’ichi Tsujii, & Sophia Ananiadou: Using a random forest classifier to recognise translations of biomedical terms across languages.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.95-104. [PDF, 656KB]

(2013) Mikhail Kozhevnikov & Ivan Titov: Cross-lingual transfer of semantic role labeling models. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1190-1200. [PDF, 188KB]

(2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2013) Alex Rudnick & Michael Gasser: Lexical selection for hybrid MT with sequence labeling. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.102-108. [PDF, 251KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2012) Colin Cherry: Word-based models. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 57 slides [PDF of PPT, 226KB]

(2012) Petter Haugereid & Francis Bond: Extracting semantic transfer rules from parallel corpora with SMT phrase aligners. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.67-75. [PDF, 194KB]

(2012) Francis M.Tyers, Felipe Sánchez-Martínez, & Mikel L.Forcada: Flexible finite-state lexical selection for rule-based machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.213-220. [PDF, 211KB]

(2011) Petter Haugereid & Francis Bond: Extracting transfer rules for multiword expressions from parallel corpora. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.92-100. [PDF, 228KB]

 (2011) Paul Maergner, Ian Lane, & Alex Waibel: Unsupervised vocabulary selection for domain-independent simultaneous lecture translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.89-96. [PDF, 926KB]

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

 (2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2010) Alexandra Birch & Miles Osborne: LRscore for evaluating lexical and reordering quality in MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 327-332. [PDF, 123KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Monolingual distributional profiles for word substitution in machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.320-328. [PDF, 605KB]

 (2010) Elena Irimia & Alexandru Ceausu: Dependency-based translation equivalents for factored machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 60KB]

(2010) Hiroyuki Kaji, Takashi Tsunakawa, & Daisuke Okada: Using comparable corpora to adapt a translation model to domains. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2182-2188. [PDF, 432KB]

(2010) Aurélien Max, Josep Maria Crego, & François Yvon: Contrastive lexical evaluation of machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1753-1757. [PDF, 400KB]

 (2011) Christof Monz: Statistical machine translation with local language models. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.869-879. [PDF, 407KB]

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

(2010) Reinhard Rapp & Michael Zock: The noisier the better: identifying multilingual word translations using a single monolingual corpus. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.16-25. [PDF, 179KB]

(2010) Reinhard Rapp & Michael Zock: Utilizing citations of foreign words in corpus-based dictionary generation. [Coling 2010] Proceedings  of the Second Workshop on NLP Challenges in the Information Explosion Era, Beijing, China, 28 August 2010; pp.50-59. [PDF, 188KB]

 (2010) Christian Scheible: Sentiment translation through lexicon induction. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.25-30. [PDF, 119KB]

(2010) Christian Scheible, Florian Laws, Lukas Michelbacher, & Hinrich Schütze: Sentiment translation through multi-edge graphs. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1104-1112. [PDF, 236KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

(2010) Takashi Tsunakawa & Hiroyuki Kaji: Augmenting a bilingual lexicon with information for word translation disambiguation. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.30-37. [PDF, 175KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

(2010) Rabih Zbib, Spyros Matsoukas, Richard Schwartz, & John Makhoul: Decision trees for lexical smoothing in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the work shop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 428-437. [PDF, 2766KB]

Machine learning and MT see Learning and training

Morphological analysis (see also Segmentation, Tagging)

(2015) Franck Burlot & François Yvon: Morphology-aware alignments for translation to and from a synthetic language. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 188-195. [PDF, 3.1MB]

(2015) Joachim Daiber & Khalil Sima’an: Machine translation with source-predicted target morphology. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.283-296. [PDF, 647KB] 

(2015) Ahmed El Kholy & Nizar Habash: Morphological constraints for phrase pivot statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.104-116. [PDF, 601KB]

(2015) Ke Tran, Arianna Bisazza, & Christof Monz: A distributed inflection model for translating into morphologically rich languages. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.145-159. [PDF, 720KB]

(2014) Krasimir Angelov: Bootstrapping open-source English-Bulgarian computational dictionary. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1018-1023. [PDF, 158KB]

(2014) Peter Baumann & Janet Pierrehumbert: Using resource-rich languages to improve morphological analysis of under-resourced languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3355-3359. [PDF, 143KB]

(2014) Fabienne Cap, Marion Weller, Anita Ramm, & Alexander Fraser: CimS – the CIS and IMS joint submission to WMT2014 translating from English into German. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.71-78. [PDF, 197KB

(2014) Piyush Dungarwal, Rajen Chatterjee, Abhijit Mishra, Anoop Kunchukuttan, Ritesh Shah, & Pushpak Bhattacharyya: The IIT Bombay HindiÛEnglish translation system at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.90-96. [PDF, 222KB]

(2014) Weston Feely, Mehdi Manshadi, Robert Frederking, & Lori Levin: The CMU METAL Farsi NLP approach. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4052-4055. [PDF, 206KB]

(2014) Mark Fishel & Rico Sennrich: Handling technical OOVs in SMT.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.159-162. [PDF, 318KB]

(2014) Bushra Jawaid & Ondřej Bojar: Two-step machine translation with lattices. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.682-686. [PDF, 67KB]

(2014) László János Laki & György Orosz: An efficient language independent toolkit for complete morphological disambiguation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1625-1630. [PDF, 178KB]

(2014) Jonathan North Washington, Ilnar Salimzyanov, & Francis M.Tyers: Finite-state morphological transducers for three Kypchak languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3378-3385. [PDF, 1727KB]

(2014) Marion Weller, Alexander Fraser, & Ulrich Heid: Combining bilingual terminology mining and morphological modeling for domain adaptation in SMT. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.11-18. [PDF, 387KB]

(2014) Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, & Philipp Koehn: Edinburgh’s syntax-based systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.207-214. [PDF, 222KB]

(2013) Fabienne Cap, Marion Weller, & Ulrich Heid: Using a rich feature set for the identification of German MWEs. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.34-42. [PDF, 308KB]

(2013) Victor Chahuneau, Noah A.Smith, & Chris Dyer: Knowledge-rich morphological priors for Bayesian language models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1206-1215. [PDF, 208KB]

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

(2013) Marta R.Costa-jussà & Chris Quirk: Morphological, syntactical and semantic knowledge in statistical machine translation. [NAACL-HLT 2013]  Tutorials, NAACL HLT 2013, 16-18 June 2013, Atlanta, Georgia; pp.10-13. [PDF, 67KB]

(2013) Elif Eyigöz, Daniel Gildea, & Kemal Oflazer: Simultaneous word-morpheme alignment for statistical machine translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.32-40. [PDF, 259KB]

(2013) Lluís Formiga, Marta R. Costa-jussà, José B. Mariño, José A. R. Fonollosa, Alberto Barrón-Cedeño & Lluis Marquez: The TALP-UPC phrase-based translation systems for WMT13: system combination with morphology generation, domain adaptation and corpus filtering. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.134-140. [PDF, 159KB]

(2013) Ankur Gandhe & Rashmi Gangadharaiah: Hypothesis refinement using agreement constraints in machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.429-437. [PDF, 170KB]

(2013) Ahmed Hamdi, Rahma Boujelbane, Nizar Habash, & Alexis Nasr: The effects of factorizing root and pattern mapping in translating between Tunisian Arabic and standard Arabic. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.125-134. [PDF, 672KB]

(2013) László J.Laki, Attila Novák, & Borbála Siklósi: An English-to-Hungarian morpheme-based statistical machine translation system with reordering rules. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.42-50. [PDF, 246KB]

(2013) Evgeny Matusov & Gregor Leusch: OmnifluentTM English-to-French and Russian-to-English systems for the 2013 Workshop on Statistical Ma­chine Translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.158-163. [PDF, 136KB]

(2013) Maria Nadejde, Philip Williams, & Philipp Koehn: Edinburgh’s syntax-based machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.170-176. [PDF, 161KB]

(2013) Annette Rios & Anne Göhring: Machine learning disambiguation of Quechua verb morphology. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.13-18. [PDF, 226KB]

(2013) Mohammad Sadegh Rasooli, Ahmed El Kholy, & Nizar Habash: Orthographic and morphological processing for Persian-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1047-1051. [PDF, 106KB]

(2013) Mohammad Salameh, Colin Cherry, & Grzegorz Kondrak: Reversing morphological tokenization in English-to-Arabic SMT. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.47-53. [PDF, 161KB]

(2013) Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, & Alex Waibel: Maximum entropy language modeling for Russian ASR. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 167KB]

(2013) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Integrating morpho-syntactic features in English-Arabic statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.74-81. [PDF, 491KB]

(2013) Zhiyang Wang, Yajuan Lü, Meng Sun, & Qun Liu: Stem translation with affix-based rule selection for agglutinative languages. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.364-369. [PDF, 667KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Identification of fertile translations in medical comparable corpora: a morpho-compositional approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 154KB]

(2012) Thodam Doren Singh: Addressing some issues of data sparsity towards improving English-Manipuri SMT using morphological information. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.46-54. [PDF, 454KB]

(2012) Ahmed El Kholy & Nizar Habash: Rich morphology generation using statistical machine translation. INLG 2012 Proceedings of the 7th International Natural Language Generation Conference, Utica, May 2012; pp.90-94. [PDF, 100KB]

(2012) Ahmed El Kholy & Nizar Habash: Translate, predict or generate: modeling rich morphology in statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.27-34. [PDF, 174KB]

(2012) Lluís Formiga, Adolfo Hernández, José B.Mariño, & Enric Monte: Improving English to Spanish out-of-domain translations by morphology generalization and generation. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.6-16. [PDF, 274KB]

(2012) Lluís Formiga, Carlos A.Henríquez Q., Adolfo Hernández, José B.Mariño, Enric Monte, & José A.R.Fonollosa: The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.275-282. [PDF, 500KB]

(2012) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Cap: Modeling inflection and word formation in SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.664-674. [PDF, 146KB]

(2012) Spence Green & John DeNero: A class-based agreement model for generating accurately inflected translations. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.22-30. [PDF, 284KB]

(2012) Nizar Habash, Ramy Eskander, & Abdelati Hawwari: A morphological analyzer for Egyptian Arabic. SIGMORPHON-2012. Proceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology, Montréal, Canada, June 7, 2012; pp.1-9. [PDF, 147KB]

(2012) Nizar Habash: MT and Arabic language issues. [Tutorial at] AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; presentation, 62 slides. [PDF of PPT, 1973KB]

(2012) Petr Homola: A machine translation toolchain for polysynthetic languages. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.65-68. [PDF, 211KB]

(2012) Philipp Koehn & Barry Haddow: Interpolated backoff for factored translation models. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 239KB]

(2012) Septina Dian Larasati: IDENTIC corpus: morphologically enriched Indonesian-English parallel corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.902-906. [PDF, 719KB]

(2012) Emad Mohamed: Morphological segmentation and part of speech tagging for religious Arabic. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.65-71. [PDF, 708KB]

(2012) Tommi A.Pirinen & Francis M.Tyers: Compiling Apertium morphological dictionaries with HFST and using them in HFST applications.  [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.25-28. [PDF, 120KB]

(2012) Maja Popović: Morpheme and POS-based IBM1 scores and language model scores for translation quality estimation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.133-137. [PDF, 110KB]

(2012) Gábor Prószéky: Language technology methods inspired by an agglutinative, free-phrase-order language [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.26-27. [PDF]

(2012) Loganathan Ramasamy, Ondřej Bojar, & Zdeněk Žabokrtský: Morphological processing for English-Tamil statistical machine translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.113-122. [PDF, 542KB]

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

(2012) Jon Riding: Hunting the snark: the problem posed for MT by non-catenative morphologies. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 10pp. [PDF, 286KB], presentation: 26 slides [PDF, 1277KB]

(2012) Wael Salloum & Nizar Habash: Elissa: a dialectal to standard Arabic machine translation system. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 385-392. [PDF, 181KB]

(2012) V.M.Sánchez-Cartagena, M.Esplà-Gomis, F.Sánchez-Martínez, & J.A.Pérez-Ortiz: Choosing the correct paradigm for unknown words in rule-based machine translation systems.  In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina España-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.27-39. [PDF, 502KB]

(2012) Víctor M.Sánchez-Cartagena, Miquel Esplà-Gomis, & Juan Antonio Pérez-Ortiz: Source-language dictionaries help non-expert users to enlarge target-language dictionaries for machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3422-3429. [PDF, 416KB]

(2012) Nimesh Singh & Nizar Habash: Hebrew morphological preprocessing for statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.43-50. [PDF, 146KB]

(2012) David Stallard, Jacob Devlin, Michael Kayser, Yoong Keok Lee, & Regina Barzilay: Unsupervised morphology rivals supervised morphology for Arabic MT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.322-327. [PDF, 107KB]

(2012) Velislava Stoykova: Bulgarian inflectional morphology in Universal Networking Language. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 423-429. [PDF, 145KB]

(2012) Rui Wang, Petya Osenova, & Kiril Simov: Linguistically-enriched models for Bulgarian-to-English machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.10-19. [PDF, 134KB]

(2012) Jonathan North Washington, Mirlan Ipasov, & Francis M.Tyers: A finite-state morphological transducer for Kyrgyz.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.934-940. [PDF, 402KB]

(2012) Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored recurrent neural network language model in TED lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.222-228. [PDF, 645KB]

(2011) proceedings of Machine Translation and Morphologically-rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011

(2011) Ondřej Bojar: Rich morphology and what can we expect from hybrid approaches to MT [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.12. [PDF, 139KB]

(2011) Alexandru Ceauşu & Dan Tufiş: Addressing SMT data sparseness when translating into morphologically-rich languages. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.57-68. [PDF, 1886KB]

(2011) Christian Chiarcos & Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. Proceedings of the Fifth LAW Workshop (LAW V), Portland, Oregon, USA, 23-24 June 2011; pp.11-20. [PDF, 118KB]

(2011) Ann Clifton & Anoop Sarkar: Combining morpheme-based machine translation with post-processing morpheme prediction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.32-42. [PDF, 287KB]

(2011) Ahmed El Kholy & Nizar Habash: Automatic error analysis for morphologically rich languages. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.225-232. [PDF, 219KB]

(2011) Ankur Gandhe, Rashmi Gangadharaiah, Karthik Visweswariah, & Ananthakrishnan Ramanathan: Handling verb phrase morphology in highly inflected Indian languages for machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.111-119. [PDF, 256KB]

(2011) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Experiments with word alignment, normalization and clause reordering for SMT between English and German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.393-398. [PDF, 102KB]

(2011) Alon Lavie: Statistical MT with syntax and morphology: challenges and some solutions [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.13. [PDF, 139KB]

(2011) Klaus Macherey, Andrew M.Dai, David Talbot, Ashok C.Popat, & Franz Och: Language-independent compound splitting with morphological operations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1395-1404. [PDF, 562KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Preslav Nakov & Hwee Tou Ng: Translating from morphologically complex languages: a paraphrase-based approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1298-1307. [PDF, 156KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) MajaPopović: Morphemes and POS tags for n-gram based evaluation metrics. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.104-107. [PDF, 68KB]

(2011) Barbara Quaranta: Arabic and computer-aided translation: an integrated approach. Translating and the Computer 33, 17-18 November 2011, London; 14pp. [PDF, 63KB]

(2011) Adam Radziszewski & Tomasz Śniatowski: Maca – a configurable tool to integrate Polish morphological data. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.29-36. [PDF, 157KB]

(2011) Siva Reddy & Serge Sharoff: Cross-language POS taggers (and other tools) for Indian languages: an experiment with Kannada using Telugu resources. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.11-19. [PDF, 214KB]

(2011) Raivis Skadiņš, Maris Puriņš, Inguna Skadiņa, & Andrejs Vasiļjevs: Evaluation of SMT in localization to under-resourced inflected language. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.35-40. [PDF, 287KB]; presentation, 17 slides [PDF, 796KB]

(2011) Michael Subotin: An exponential translation model for target language morphology. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.230-238. [PDF, 188KB]

(2011) Zhiyang Wang, Yajuan Lü, & Qun Liu: Multi-granularity word alignment and decoding for agglutinative language translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.360-367. [PDF, 169KB]

(2011) Philip Williams & Philipp Koehn: Agreement constraints for statistical machine translation into German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.217-226. [PDF, 146KB]

(2010) Hassan Al-Haj & Alon Lavie: The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 72KB]

(2010) Alexandre Allauzen, Josep M.Crego, Ílknur Durgar El-Kahlout, & François Yvon: LIMSI’s statistical translation systems for WMT’10.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 54-59. [PDF, 129KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Mohamed Altantawy, Nizar Habash, Owen Rambow, & Ibrahim Saleh: Morphological analysis and generation of Arabic nouns: a morphemic functional approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.851-858. [PDF,               899KB]

(2010) Fadoua Ataa Allah & Siham Boulaknadel: Light morphology processing for Amazighe language. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 32-35. [PDF, 278KB]

(2010) Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, & Marcello Federico: FBK @ IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.53-58. [PDF, 346KB]

(2010) Lynne Cahill: A syllable-based approach to verbal morphology in Arabic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 19-26. [PDF, 747KB]

(2010) Bruno Cartoni & Marie-Aude Lefer: The MuLeXFoR database: representing word-formation processes in a multilingual lexicographic environment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.840-843. [PDF, 284KB]

(2010) Dana Dannélls & John J.Camilleri: Verb morphology of Hebrew and Maltese – towards an open source type theoretical resource grammar in GF. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 57-61. [PDF, 326KB]

(2010) Guy De Pauw, Naomi Maajabu, & Peter Waiganjo Wagacha: A knowledge-light approach to Luo machine translation and part-of-speech tagging. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.15-20. [PDF, 141KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.83-91. [PDF, 510KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Statistical machine translation of English-Manipuri using morpho-syntactic and semantic information. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 680KB]

(2010) Markus Dreyer: A non-parametric model for the discovery of inflectional paradigms from plain text using graphical models over strings. Dissertation defense, November 5, 2010, Johns Hopkins University; presentation: 200 slides [PDF of PPT, 11706KB]

(2010) Kais Dukes & Nizar Habash: Morphological annotation of Quranic Arabic. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.2530-2536. [PDF, 995KB]

(2010) Vladimir Eidelman, Chris Dyer, & Philip Resnik: The University of Maryland statistical machine translation system for the Fifth Workshop on Machine Translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 72-76. [PDF, 115KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.251-258. [PDF, 301KB]

(2010) Ahmed El Kholy & Nizar Habash: Orthographic and morphological processing for English-Arabic statistical machine translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 147KB]

(2010) Ahmed El Kholy & Nizar Habash: Techniques for Arabic morphological detokenization and orthographic denormalization. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 45-51. [PDF, 354KB]

(2010) Tomaž Erjavec: MULTEXT-East version 4: multilingual morphosyntactic specifications, lexicons and corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2544-2547. [PDF, 324KB]

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

(2010) Julien Gosme, Wigdan Mekki, Fathi Debili, Yves Lepage, & Nadine Lucas: The GREYC/LLACAN machine translation systems for the IWSLT 2010 campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.59-65. [PDF, 307KB]

(2010) Yvette Graham & Josef van Genabith: Factor templates for factored machine translation models. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.275-282. [PDF, 366KB]

(2010) Christian Hardmeier, Arianna Bisazza, & Marcello Federico: FBK at WMT 2010: word lattices for morphological reduction and chunk-based reordering.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 88-92. [PDF, 132KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, & Chris Quirk: A discriminative lexicon model for complex morphology. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 374KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Arabic morpho-syntactic feature disambiguation in a translation context. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.61-65. [PDF, 153KB]

(2010) Wen Li, Lei Chen, Wudaba, & Miao Li: Chained machine translation using morphemes as pivot language. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.169-177. [PDF, 170KB]

(2010) Minh-Thang Luong & Min-Yen Kan: Enhancing morphological alignment for translating highly inflected languages. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.743-751. [PDF, 610KB]

 (2010) Minh-Thang Luong, Preslav Nakov, & Min-Yen Kan: A hybrid morpheme-word representation for machine translation of morphologically rich languages.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.148-157. [PDF, 192KB]

(2010) Saab Mansour: MorphTagger: HMM-based Arabic segmentation for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.321-327. [PDF, 369KB]

(2010) Yuval Marton, Nizar Habash, & Owen Rambow: Improving Arabic dependency parsing with lexical and inflectional morphological features. Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages 13–21, Los Angeles, CA, USA, June 2010; pp.13-21. [PDF, 187KB]

(2010) Guillem Massó & Toni Badia: Dealing with sign language morphemes in statistical machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.154-157. [PDF, 543KB]

(2010) Wade Shen, Tim Anderson, Ray Slyh, & A.Ryan Aminzadeh: The MIT-LL/AFRL IWSLT-2010 MT system. Proceedings of the 7th International Workshop on Spoken L anguage Translation, 2-3 December 2010, Paris, France; pp.127-134. [PDF, 341KB]; presentation: 11 slides [PDF of PPT, 422KB]

 (2010) Raivis Skadiņš, Kārlis Goba, & Valters Šics: Improving SMT for Baltic languages with factored models. Human Language Technologies – the Baltic Perpective, 4th International Conference, Riga, Latvia, October 8, 2010; 17 slides [PDF of PPT, 742KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: The MIRACL Arabic-English statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.119-125. [PDF, 1083KB]

(2010) Reyyan Yeniterzi & Kemal Oflazer: Syntax-to-morphology mapping in factored phrase-based statistical machine translation from English to Turkish. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.454-464. [PDF, 1381KB]

(2010) Sina Zarrieß, Aoife Cahill, Jonas Kuhn, & Christian Rohrer: Cross-lingual induction of deep broad-coverage syntax: a case study on German participles. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1426-1434. [PDF, 106KB]

Post-editing and post-processing [see also Crowd sourcing]

(2015) Miguel Á.Candel-Mora: Evaluation of English to Spanish MT output of Tourism 2.0 consumer-generated reviews with post-editing purposes. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.37-47. [PDF, 284KB]

(2015) Joke Daems, Sonia Vandepitte, Robert Hartsuker, & Lieve Macken: The impact of machine translation error types on post-editing effort indicators. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Fourth Workshop on Post-editing Technology and Practice (WPTP 4); p.31-45. [PDF, 658KB]

(2015) Jinhua Du, Ankit Srivastava, Andy Way, Alfredo Maldonado-Guerra, & David Lewis: An empirical study of segment prioritization for incrementally retrained post-editing-based SMT. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.172-185. [PDF, 788KB] 

(2015) Maarit Koponen: How to teach machine translation post-editing? Experiences from a post-editing course. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Fourth Workshop on Post-editing Technology and Practice (WPTP 4); p.2-15. [PDF, 505KB]

(2015) Alan K.Melby: Quality evaluation of four translations of a kidney document: focus on reliability. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; pp.65-68. [PDF, 547KB]

(2015) Morgan O’Brien & Ana Duarte: Machine translation for enterprise technical communications – a journey of discovery. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; pp.92-105. [PDF, 742KB]

(2015) Carla Ortiz Boix & Anna Matamala: Quality assessment of postedited versus translated wildlife documentary films: a three-level approach. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Fourth Workshop on Post-editing Technology and Practice (WPTP 4); p.16-30. [PDF, 1089KB]

(2015) Carla Parra Escartín & Manuel Arcedillo: Machine translation evaluation made fuzzier: a study on post-editing productivity and evaluation metrics in commercial settings. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.131-144. [PDF, 491KB]

(2015) Xiaona Ren, Yongpeng Wei, & Rile Hu: Simplify sentence structure for improving human post-editing efficiency on Chinese-to-English patent machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.33-43. [PDF, 909KB]

 (2015) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Enhancing function word translation with syntax-based statistical post-editing. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.44-51. [PDF, 542KB]

(2015) Carla Schelfhout: Accurately predicting post-editing time & labor for cost management. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; p.192-198. [PDF, 83KB]

(2015) Lane Schwartz, Isabel Lacruz, & Tatyana Bystrova: Effects of word alignment visualization on post-editing quality & speed. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.186-199. [PDF, 560KB]

(2015) Olga Torres, Ramon Piqué Huerta, Marisa Presas Corbella, Pilar Sánchez-Gijón, Adrià Martín Mor, Pilar Cid-Leal, Anna Aguilar-Amat, Celia Rico-Pérez, Amparo Alcina-Claudet & Miguel Ángel Candel-Mora: The use of machine translation and post-editing among language service providers in Spain. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.90-95. [PDF, 132KB]

(2014) proceedings of the Workshop on Post-editing Technology and Practice (WPTP-3), AMTA 2014: Vancouver, BC, October 2014

(2014) Vicent Alabau, Jesús González-Rubio, Daniel Ortiz-Martínez, Germán Sanchis-Trilles, Francisco Casacuberta, Mercedes García-Martínez, Bartolomé Mesa-Lao, Dan Cheung Petersen, Barbara Dragsted, & Michael Carl: Integrating online and active learning in a computer-assisted translation workbench. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp.1-8. [PDF, 287KB]

(2014) Nora Aranberri, Gorka Labaka, Arantza Diaz de Ilarraza, & Kepa Sarasola Comparison of post-editing productivity between professional translators and lay users. AMTA 2014: proceedings of the eleventh conference of the  Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.20-33. [PDF, 247KB]

(2014) Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Maja Popović, Cindy Tscherwinka, David Vilar, & Hans Uszkoreit: The TaraXŰ corpus of human-annotated  machine translations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 2679-2682. [PDF, 98KB]

(2014) Petra Barančiková, Rudolf Rosa, & Aleš Tamchyna: Improving evaluation of English-Czech MT through paraphrasing.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 127KB]

(2014) Michael Carl, Mercedes García Martínez, Bartolomé Mesa-Lao, & Nancy Underwood: CFT13: a resource for research into the post-editing process. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1757-1764. [PDF, 1011KB]

(2014) Laura Casanellas & Lena Marg: Assumptions, expectations and outliers in post-editing. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.93-97. [PDF, 592KB]

(2014) Sheila Castilho, Sharon O’Brien, Fabio Alves, & Morgan O’Brien: Does post-editing increase usability? A study with Brazilian Portuguese as target language. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.183-190. [PDF, 754KB]

(2014) Joke Daems, Lieve Macken, & Sonia Vandepitte: On the origin of errors: a fine-grained analysis of MT and PE errors and their relationship. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.62-68. [PDF, 120KB]

(2014) Michael Denkowski, Alon Lavie, Isabel Lacruz, & Chris Dyer: Real time adaptive machine translation: cdec and TransCenter. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); p.123. [PDF, 47KB]

(2014) Marcello Federico, Nicola Bertoldi, Marco Trombetti, & Alessandro Cattelan: MateCat: an open source CAT tool for MT post-editing. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Tutorials, 98 slides

(2014) Mercedes García-Martínez, Karan Singla, Aniruddha Tammewar, Bartolomé Mesa-Lao, Ankita Thakur, Anusuya M.A., Srinivas Bangalore, & Michael Carl: SEECAT: ASR & Eye-tracking enabled computer-assisted translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.81-88. [PDF, 624KB]

(2014) Federico Gaspari, Antonio Toral, Sudip Kumar Naskar, Declan Groves, & Andy Way: Perception vs. reality: measuring machine translation post-editing productivity. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.60-72. [PDF, 160KB]

(2014) Ulrich Germann: Dynamic phrase tables for machine translation in an interactive post-editing scenario. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp.20-31. [PDF, 246KB]

(2014) Attila Görög: TAUS post-editing course. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); p.119. [PDF, 107KB]

(2014) Attila Görög: TAUS post-editing productivity tool. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); p.120. [PDF, 107KB]

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Yudai Kishimoto, Toshiaki Nakazawa, Daisuke Kawahara, & Sadao Kurohashi Post-editing user interface using visualization of a sentence structure. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); p.124. [PDF, 81KB]

(2014) Isabel Lacruz, Michael Denkowski, & Alon Lavie: Cognitive demand and cognitive effort in post-editing. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.73-84. [PDF, 126KB]

(2014) Varvara Logacheva & Lucia Specia: A quality-based active sample selection strategy for statistical machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2690-2695. [PDF, 230KB]

(2014) Sharon O’Brien, Joss Moorkens, & Joris Vreeke: Kanjingo – a mobile app for post-editing. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.137-141. [PDF, 736KB]

(2014) Sharon O’Brien, Joss Moorkens, & Joris Vreeke: Kanjingo: a mobile app for post-editing. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.125-127. [PDF, 231KB]

(2014) Elaine O’Curran: Machine translation and post-editing for user generated content: an LSP perspective. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.50-54. [PDF, 1152KB]

(2014) Elaine O’Curran; Translation quality in post-edited versus human-translated  s: a case study. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.113-118. [PDF, 131KB]

(2014) Maja Popović, Arle Lommel, Aljoscha Burchardt, Eleftherios Avramidis, & Hans Uszkoreit: Relations between different types of post-editing operations, cognitive effort and temporal effort. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.191-198. [PDF, 402KB]

(2014) Victoria Porro, Johanna Gerlach, Pierrette Bouillon, & Violeta Seretan: Rule-based automatic post-processing of SMT output to reduce human post-editing effort. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.66-76. [PDF, 447KB]

(2014) Lise Rebout & Philippe Langlais: An iterative approach for mining parallel sentences in a comparable corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.648-655. [PDF, 201KB]

(2014) Pilar Sánchez-Gijón, & Olga Torres-Hostench: MT post-editing into the mother tongue or into a foreign language?  Spanish-to-English MT translation output post-edited by translation trainees. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.5-19. [PDF, 226KB]

(2014) Lane Schwartz: Monolingual post-editing by a domain expert is highly effective for translation triage. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.34-4. [PDF, 75KB]

(2014) Lane Schwartz: An open source desktop post-editing tool. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); p. 122. [PDF, 103KB]

(2014) Lane O.B.Schwartz, Timothy Anderson, Jeremy Gwinnup, & Katherine M.Young: Machine translation and monolingual postediting: the AFRL WMT-14 system. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.186-194. [PDF, 539KB]

(2014) Karan Singla, David Orrego-Carmona, Ashleigh Rhea Gonzales, Michael Carl, & Srinivas Bangalore: Predicting post-editor profiles from the translation process. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Interactive and Adaptive Machine Translation; pp51-60. [PDF, 191KB]

(2014) Carlos S.C. Teixeira: Perceived vs. measured performance in the post-editing of suggestions from machine translation and translation memories. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.45-59. [PDF, 168KB]

(2014) Nancy Underwood, Bartolomé Mesa-Lao, Mercedes García Martínez, Michael Carl, Vicent Alabau,  Jesús González-Rubio, Luis A.Leiva, Germán Sanchis-Trilles, Daniel Ortíz-Martínez, & Francisco Casacuberta: Evaluating the effects of interactivity in a post-editing workbench. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.553-559. [PDF, 268KB]

(2014) ACCEPT: Automated Community Content Editing PorTal. Europrean Commission FP7-ICT-2011-7-Language Technologies. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.58. [PDF, 368KB]

(2014) ALST: linguistic and sensorial accessibility: technologies for voice-over and audio description. Ministerio de Economía y Competetitividad (Spain) Proyectos de investigación fundamental no orientada. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.128. [PDF, 227KB]

(2014) PEDAL: Post-editing with dynamic active learning. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; p.56. [PDF, 271KB]

(2013) Proceedings of the Workshop on Post-editing Technology and Practice (WPTP-2), eds. Sharon O’Brien, Michel Simard and Lucia Specia, MT Summit XIV workshop, Nice, France, 2 September, 2013.

 (2013) Alexandre Allauzen, Nicolas Pécheux, Quoc Khanh Do, Marco Dinarelli, Thomas Lavergne, Au­rélien Max, Hai-Son Le & François Yvon: LIMSI @ WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.62-69. [PDF, 210KB]

(2013) Eleftherios Avramidis & Maja Popović: Selecting feature sets for comparative and time-oriented quality estimation of machine translation output. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.329-336. [PDF, 196KB]

(2013) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Cache-based online adaptation for machine translation enhanced computer assisted translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.35-42. [PDF, 471KB]

(2013) Pierrette Bouillon: Automated Community Content Editing PorTal (ACCEPT). Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.407. [PDF, 182KB]

(2013) José Guilherme Camargo de Souza, Christian Buck, Marco Turchi & Matteo Negri: FBK-UEdin participation to the WMT13 quality estimation shared task.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.352-358. [PDF, 190KB]

(2013) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Project adaptation for MT-enhanced computer assisted translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.27-34. [PDF, 399KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB](2013) Petra Galuščáková, Martin Popel & Ondřej Bojar: PhraseFix: statistical post-editing of TectoMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.141-147. [PDF, 157KB]

(2013) Marcello Federico, Philipp Koehn, Holger Schwenk, & Marco Trombetti: Matecat: Machine Translation Enhanced Computer Assisted Translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.425. [PDF, 192KB]

(2013) Kevin Gimpel, Dhruv Batra, Chris Dyer, & Gregor Shakhnarovich: A systematic exploration of diversity in machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1100-1111. [PDF, 220KB]

(2013) Ana Guerberof Arenas: What do professional translators think about post-editing? Journal of Specialised Translation 19 (January 2013); pp.75-95. [PDF, 507KB]

(2013) Katsuhiko Hayashi, Katsuhito Sudoh, Hajime Tsukada, Jun Suzuki, & Masaaki Nagata: Shift-reduce word reordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1382-1386. [PDF, 424KB]

(2013) Alon Lavie, Olga Beregovaya, Michael Denkowski, & David Clarke: Analyzing and predicting MT utility and post-editing productivity in enterprise-scale translation projects. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.305-307. [PDF, 258KB]

(2013) Lucía Morado Vázquez, Silvia Rodriguez Vázquez, & Pierrette Bouillon: Comparing forum data post-editing performance using translation memory and machine translation output: a pilot study. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.249-256. [PDF, 574KB]

(2013) Maja Popović, Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Sven Schmeier, Cindy Tscherwinka, David Vilar, & Hans Uszkoreit: Learning from human judgments of machine translation output. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.231-238. [PDF, 328KB]

(2013) Celia Rico, Pedro L.Díez Orzas, & Felix Sasaki: Implementing ITS 2.0 for post-editing purposes. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.329-336. [PDF, 443KB]

(2013) Rudolf Rosa, David Mareček, & Aleš Tamchyna: Deepfix: statistical post-editing of statistical machine translation using deep syntactic analysis. ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.172-179. [PDF, 178KB]

(2013) Raphael Rubino, Joachim Wagner, Jennifer Foster, Johann Roturier, Rasoul Samad Zadeh Kaljahi & Fred Hollowood: DCU-Symantec at the WMT 2013 quality estimation shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.392-397. [PDF, 162KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Michel Simard & George Foster: PEPr: post-edit propagation using phrase-based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.191-198. [PDF, 465KB]

(2013) Nathalie de Sutter & Joachim van den Bogaert: Quality or productivity? Let’s do both!  Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.381-390. [PDF of PPT, 824KB]

(2013) Lori Thicke: The industrial process for quality machine translation. Journal of Specialised Translation, issue 19, January 2013; pp.8-18. [PDF, 303KB]

(2013) Yulia Tsvetkov, Chris Dyer, Lori Levin & Archna Bhatia: Generating English determiners in phrase-based translation with synthetic translation options.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.271-280. [PDF, 291KB]

(2013) Guillaume Wisniewski, Anil Kumar Singh, Natalia Segal, & François Yvon: Design and analysis of a large corpus of post-edited translations: quality estimation, failure analysis and variability of post-edition. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.117-124. [PDF, 541KB]

(2012) Vicent Alabau, Luis A. Leiva, Daniel OrtizMartínez, & Francisco Casacuberta: User evaluation of interactive machine translation systems. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.20-23. [PDF, 411KB]

(2012) Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka, & David  Vilar: Involving language professionals in the evaluation of machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1127-1130. [PDF, 320KB]]

(2012) Wilker Aziz, Sheila C.M.de Sousa, & Lucia Specia: PET: a tool for post-editing and assessing machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3982-3987. [PDF, 643KB]

(2012) Wilker Aziz & Lucia Specia: PET: a standalone tool for assessing machine translation through post-editing. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 5pp. [PDF, 446KB]; presentation by Lucia Specia: 54 slides [PDF, 694KB]

(2012) Wilker Aziz & Lucia Specia: PET: a tool for post-editing and assessing machine translation.  [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.99. [PDF, 213KB]

(2012) Hanna Béchara, Raphaël Rubino, Yifan He, Yanjun Ma, & Josef van Genabith: An evaluation of statistical post-editing systems applied to RBMT and SMT systems. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.215-230. [PDF, 243KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Michael Denkowski & Alon Lavie: Challenges in predicting machine translation utility for human post-editors. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 166KB]

(2012) Isao Goto, Masao Utiyama, & Eiichiro Sumita: Post-ordering by parsing for Japanese-English statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.311-316. [PDF, 346KB]

(2012) Andreas Kirkedal: Tree-based hybrid machine translation.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.77-86. [PDF, 260KB]

(2012) Maarit Koponen: Comparing human perceptions of post-editing effort with post-editing operations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.181-190. [PDF, 176KB]

(2012) Maarit Koponen, Wilker Aziz, Luciana Ramos, & Lucia Specia: Post-editing time as a measure of cognitive effort. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 331KB]

(2012) Isabel Lacruz, Gregory M.Shreve, & Erik Angelone: Average pause ratio as an indicator of cognitive effort in post-editing: a case study. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 607KB]

(2012) Patrick Lambert, Jean Senellart, Laurent Romary, Holger Schwenk, Florian Zipser, Patrice Lopez, & Frédéric Blain: Collaborative machine translation service for scientific texts. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 11-15. [PDF, 523KB]

(2012) Michel Lopez: Managing change when implementing MT systems. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 4pp. [PDF, 41KB]

(2012) Marianna J.Martindale: Can statistical post-editing with a small parallel corpus save a weak MT engine?  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2138-2142. [PDF, 353KB]

(2012) Alan K.Melby, Jason Housley, Paul J.Fields, & Emily Tuioti: Reliably assessing the quality of post-edited translation based on formalized structured translation specifications. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 181KB]

(2012) Bartolomé Mesa-Lao: The next generation translator’s workbench: post-editing in CASMACAT v.1.0. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 11pp. [PDF, 87KB], presentation: 16 slides [PDF, 311KB]

(2012) Mahsa Mohaghegh, Abdolhossein Sarrafzadeh, & Mehdi Mohammadi: GRAFIX: automated rule-based post editing system to improve English-Persian SMT output. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.873-882. [PDF, 405KB]

(2012) Alexander Molchanov: PROMT DeepHybrid system for WMT12 shared translation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.345-348. [PDF, 88KB]

(2012) Jacob Mundt, Kristen Parton, & Kathleen McKeown: Learning to automatically post-edit dropped words in MT. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 8pp. [PDF, 371KB]

(2012) Sharon O’Brien, Michel Simard, & Lucia Specia (eds.): Workshop on post-editing technology and practice. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 103pp. [PDF, 5524KB]

(2012) Kristen Parton, Nizar Habash, Kathleen McKeown, Gonzalo Iglesias, & Adrià de Gispert: Can automatic post-editing make MT more meaningful? EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.111-118. [PDF, 155KB]

(2012) Sergio Penkale & Andy Way: SmartMATE: an online end-to-end MT post-editing framework. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 366KB]

(2012) Daniele Pighin, Meritxell González, & Lluís Màrquez: The UPC submission to the WMT 2012 shared task on quality estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.127-132. [PDF, 123KB]

(2012) Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier, & Hervé Blanchon: Collection of a large database of French-English SMT output corrections.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.4043-4048. [PDF, 357KB]

(2012) Marion Potet, Laurent Besacier, Hervé Blanchon, & Marwen Azouzi: Towards a better understanding of statistical post-editing. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.284-291. [PDF, 624KB]

(2012) Alexandros Poulis & David Kolovratnik: To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 390KB]

(2012) Rudolf Rosa, David Mareček, & Ondřej Dušek: DEPFIX: a system for automatic correction of Czech MT outputs. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.362-368. [PDF, 114KB]

(2012) Raphaël Rubino, Stéphane Huet, Fabrice Lefèvre, & Georges Linarès: Statistical post-editing of machine translation for domain adaptation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.221-228. [PDF, 225KB]

(2012) Midori Tatsumi, Takako Aikawa, Kentaro Yamamoto, & Hitoshi Isahara: How good is crowd post-editing? Its potential and limitations. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 92KB]

(2012) Justina Valotkaite & Munshi Asadullah: Error detection for post-editing rule-based machine translation. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 117KB]

(2012) Ventsislav Zhechev: Machine translation infrastructure and post-editing performance at Autodesk. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 1676KB]

(2012) ACCEPT: Automated Community Content Editing porTal. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.89. [PDF, 74KB]

(2011) Hanna Béchara, Yanjun Ma, & Josef van Genabith: Statistical post-editing for a statistical MT system. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.308-315. [PDF, 75KB]

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

(2011) Christian Boitet, Valérie Bellynck, Achille Falaise, & Nguyen Hong-Thai: Operationalization of interactive multilingual gateways (iMAGs) in the Traouiero project. Translating and the Computer 33, 17-18 November 2011, London; 9pp. [PDF, 1737KB]

(2011) Ondřej Bojar: Analyzing error types in English-Czech machine translation. Prague Bulletin of Mathematical Linguistics 95, April 2011; pp.63-76. [PDF, 166KB]

(2011) Michael Carl, Barbara Dragsted, Jakob Elming, Daniel Hardt, & Arnt Lykke Jakobsen: The process of post-editing: a pilot study. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.131-142. [PDF, 1078KB]

(2011) Ann Clifton & Anoop Sarkar: Combining morpheme-based machine translation with post-processing morpheme prediction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.32-42. [PDF, 287KB]

(2011) Miquel Esplà, Felipe Sánchez-Martínez, & Mikel L.Forcada: Using word alignments to assist computer-aided translation users by marking which target-side words to change or keep unedited. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.81-88. [PDF, 381KB]; presentation, 41 slides [PDF]

(2011) Miquel Esplà-Gomis, Felipe Sánchez-Martínez & Mikel L.Forcada: Using machine translation in computer-aided translation to suggest the target-side words to change. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.172-179. [PDF,160KB]

(2011) Federico Gaspari, Antonio Toral & Sudip Kumar Naskar: User-focused task-oriented MT evaluation for wikis: a case study. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.13-22. [PDF, 961KB]

(2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) John Moran & David Lewis: Unobtrusive methods for low-cost manual evaluation of machine translation. Tralogy, Paris, 3-4 March 2011;  24pp. [PDF, 244KB]

(2011) Marion Potet, Raphaël Rubino, Benjamin Lecouteux, Stéphane Huet, Hervé Blanchon, Laurent Besacier, & Fabrice Lefèvre: The LIGA (LIG/LIA) machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.440-446. [PDF, 92KB]

(2011) Marion Potet, Emmanuelle Esperança-Rodier, Hervé Blanchon, & Laurent Besacier: Preliminary experiments on using users’ post-editions to enhance a SMT system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.161-168. [PDF, 346KB]

(2011) Luciana M.Scocco: Machine translation aid to the test: a study on the benefits of post-editing for translation into the weaker language. Dissertation submitted as part requirement for Master of Arts Translation and Interpreting Studies, University of Western Sydney, 4th November 2011; 14pp. [PDF, 170KB]

(2011) Sheila C.M.de Sousa, Wilker Aziz & Lucia Specia: Assessing the post-editinng effort for automatic and semi-automatic translations of DVD subtitles. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.97-103. [PDF, 109KB]

(2011) Lucia Specia: Exploiting objective annotations for measuring translation post-editing effort. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.73-80. [PDF, 365KB]; presentation: 94 slides [PDF]

(2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

(2011) Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Post-ordering in statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.316-323. [PDF, 329KB]

(2011) Hirokazu Suzuki: Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.156-163. [PDF, 725KB]

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Arnaud Vié, Luis Villarejo Muñoz, Mireia Farrús Cabeceran, & Jimmy O’Regan: Apertium advanced web interface: a first step towards interactivity and language tools convergence. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.45-51. [PDF, 280KB]

(2011) Lucas Nunes Vieira & Lucia Specia: A review of translation tools from a post-editing perspective. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.33-42. [PDF, 801KB]

(2011) Na Ye & Guiping Zhang: Study on the impact factors of the translators’ post-editing efficiency in a collaborative translation environment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.579-586. [PDF, 277KB]

(2011) Omar F.Zaidan & Chris Callison-Burch: Crowdsourcing translation: professional quality from non-professionals. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1220-1229. [PDF, 241KB]

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

(2010) Adriana Beaton & Gabriela Contreras: Sharing the Continental Airlines and SDL post-editing experience. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 33KB]

(2010) Christian Boitet, Huynh Cong Phap, Nguyen Hong Thai, & Valérie Bellynck: The iMAG concept: multilingual access gateway to an elected web sites with incremental quality increase through collaborative post-edition of MT pretranslations. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 8pp. [PDF, 1585KB]

(2010) Ilse Depraetere: What counts as useful advice in a university post-editing training context? Report on a case study. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 560KB]

(2010) Jinhua Du, Johann Roturier, & Andy Way: TMX markup: a challenge when adapting SMT to the localisation environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 656KB]

(2010) Ignacio Garcia: Does Google know better? Translators and machine translation. Translating and the Computer 32, 18-19 November 2010, London; presentation, 16 slides [PDF of PPT, 473KB]

(2010) Daniel Hardt & Jakob Elming: Incremental re-training for post-editing SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 191KB]

(2010) Yifan He, Yanjun Ma, Johann Roturier, Andy Way, & Josef van Genabith: Improving the post-editing experience using translation recommendation: a user study. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 136KB]

(2010) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Integrating n-best SMT outputs into a TM system. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.374-382. [PDF, 629KB]

(2010) Philipp Koehn: Enabling monolingual translators: post-editing vs.options. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.537-545. [PDF,

(2010) Jae-Hee Lee, Seung-Wook Lee, Gumwon Hong, Young-Sook Hwang, Sang-Bum Kim, & Hae-Chang Rim: A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.623-629. [PDF, 438KB]

 (2010) Elliott Macklovitch & Laurie Gerber: MT postediting showcase. At AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 8 slides

 (2010) Sharon O’Brien: Introduction to post-editing: who, what, how and where to next? Tutorial at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 30 slides [PDF of PPT, 269KB]

 (2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

 (2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

 (2010) Víctor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz: Tradubi: open-source social translation for the Apertium machine translation platform. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.47-56. [PDF, 223KB]

 (2010) Barbara Scott & Adriana Beaton: Evaluating vendors for machine translation and post-editing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 2pp. [PDF, 30KB]

 (2010) Lucia Specia & Atefeh Farzindar: Estimating machine translation post-editing effort with HTER. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.33-41. [PDF, 178KB]

 (2010) Midori Tatsumi & Johann Roturier: Source text characteristics and technical and temporal post-editing effort: what is their relationship? JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.43-51. [PDF, 10,492KB]

 (2010) Irina Temnikova: Cognitive evaluation approach for controlled language post-editing experiment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3485-3490. [PDF, 631KB]

(2010) Carol Van Ess-Dykema, Jocelyn Phillips, Florence Reeder, & Laurie Gerber: Paralinguist assessment decision factors for machine translation output: a case study. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 180KB]; abstract

(2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

Pre-editing and Pre-processing

(2014) Alexandru Ceausu & Sabine Hunsicker: Pre-ordering of phrase-based machine translation input in translation workflow. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3589-3592. [PDF, 407KB]

(2014) Quoc Kanh Do, Teresa Herrmann, Jan Niehues, Alexandre Allauzen, François Yvon & Alex Waibel: The KIT-LIMSI translation system for WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.84-89. [PDF, 378KB]

(2014) Piyush Dungarwal, Rajen Chatterjee, Abhijit Mishra, Anoop Kunchukuttan, Ritesh Shah, & Pushpak Bhattacharyya: The IIT Bombay HindiÛEnglish translation system at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.90-96. [PDF, 222KB]

(2014) Marcin Junczys-Dowmunt & Bruno Pouliquen: SMT of German patents at WIPO: decompounding and verb structure pre-reordering.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.217-220. [PDF, 369KB]

(2014) Liangyou Li, Xiaofeng Wu, Santiago Cortés Vaíllo, Jun Xie, Andy Way, & Qun Liu: The DCU-ICTCAS MT system at WMT 2014 on German-English translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.136-141. [PDF, 306KB]

(2014) Prashant Mathur, Mauro Cettolo, Marcello Federico, & José G.C.de Souza: Online multi-user adaptive statistical machine translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 152-165. [PDF, 657KB]

(2014) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: Word alignment-based reordering of source chunks in PB-SMT.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3565-3571. [PDF, 185KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Stephan Peitz, Joern Wuebker, Markus Freitag & Hermann Ney: The RWTH Aachen German-English machine translation system for WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.157-162. [PDF, 230KB]

(2014) Lane O.B.Schwartz, Timothy Anderson, Jeremy Gwinnup, & Katherine M.Young: Machine translation and monolingual postediting: the AFRL WMT-14 system. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.186-194. [PDF, 539KB]

(2014) Violeta Seretan, Pierrette Bouillon, & Johanna Gerlach: A large-scale evaluation of pre-editing strategies for improving user-generated content translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1793-1799. [PDF, 476KB]

(2014) Marion Wittkowsky: Integrating machine translation (MT) in the higher education of translators and technical writers. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.100-108. [PDF, 158KB]

(2014) ACCEPT: Automated Community Content Editing PorTal. Europrean Commission FP7-ICT-2011-7-Language Technologies. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.58. [PDF, 368KB]

(2013) Alexandre Allauzen, Nicolas Pécheux, Quoc Khanh Do, Marco Dinarelli, Thomas Lavergne, Au­rélien Max, Hai-Son Le & François Yvon: LIMSI @ WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.62-69. [PDF, 210KB]

(2013) Pierrette Bouillon: Automated Community Content Editing PorTal (ACCEPT). Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.407. [PDF, 182KB]

(2013) Denis Dechandon & Viviane Schütz: Building on processes, workflows, skills and IT tools to speed up the  translation process and meet clients’ expectations. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 11pp. [PDF, 688KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Johanna Gerlach, Victoria Porro, Pierrette Bouillon, & Sabine Lehmann: Combining pre-editing and post-editing to improve SMT of user-generated content.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 45-53. [PDF, 553KB]

(2013) Dan Han, Pascual Martínez-Gómez, Yusuke Miyao, Katsuhito Sudoh, & Masaaki Nagata: Using unlabeled dependency parsing for pre-ordering for Chinese-to-Japanese statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 25-33. [PDF, 334KB]

(2013) Teresa Herrmann, Jochen Weiner, Jan Niehues, & Alex Waibel: Analyzing the potential of source sentence reordering in statistical machine translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 166KB]

(2013) Sho Hoshino, Yusuke Miyao, Katsuhito  Sudoh, & Masaaki Nagata: Two-stage pre-ordering for Japanese-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1062-1066. [PDF, 544KB]

(2013) Uri Lerner & Slav Petrov: Source-side classifier preordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.513-523. [PDF, 132KB]

(2013) Shachar Mirkin, Sriram Venkatapathy, & Marc Dymetman: Confidence-driven rewriting for improved translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.257-264. [PDF, 616KB]

(2013) Shachar Mirkin, Sriram Venkatapathy, Marc Dymetman, & Ioan Calapodescu: SORT: an interactive source-rewriting tool for improved translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.85-90. [PDF, 943KB]

(2013) Hwidong Na & Jong-Hyeok Lee: A discriminative reordering parser for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 105KB]

(2013) Santanu Pal, Sudio Kumar Naskar, & Sivaji Bandyopadhyay: MWE alignment in phrase based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.61-68. [PDF, 539KB]

(2013) Raj Nath Patel, Rohit Gupta, Prakash B.Pimpale & Sasikumar M: Reordering rules for English-Hindi MT. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.34-41. [PDF, 321KB]

(2013) Juan Pino, Aurelien Waite, Tong Xiao, Adrià de Gispert, Federico Flego & William Byrne: The University of Cambridge Russian-English system at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.200-205. [PDF, 169KB]

(2013) Mohammad Sadegh Rasooli, Ahmed El Kholy, & Nizar Habash: Orthographic and morphological processing for Persian-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1047-1051. [PDF, 106KB]

(2013) Antonio Valerio Miceli Barone, & Giuseppe Attardi: Pre-reordering for machine translation using transition-based walks on dependency parse trees. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.164-169. [PDF, 207KB]

(2013) Fatiha Sadat: Towards a hybrid rule-based and statistical Arabic-French machine translation system. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.579-583. [PDF, 510KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Karthik Visweswariah, Mitesh M.Khapra, & Ananthakrishnan Ramanathan: Cut the noise: mutually reinforcing reordering and alignments for improved machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1275-1284. [PDF, 250KB]

(2012) Karunesh Kumar Arora & R.Mahesh K.Sinha: Improving statistical machine translation through co-joining parts of verbal constructs in English-Hindi translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.95-101. [PDF, 164KB]

(2012) Michael Denkowski, Greg Hanneman, & Alon Lavie: The CMU-Avenue French-English translation system. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.261-266. [PDF, 97KB]

(2012) Lluís Formiga & José A.R.Fonollosa: Dealing with input noise in statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.319-328. [PDF, 129KB]

(2012) Anita Gojun & Alexander Fraser: Determining the placement of German verbs in English-to-German SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 726-735. [PDF, 138KB]

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2012) Jenny Lu & Patricia Paladini Adell: Beyond MT: source content quality and process automation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 4pp. [PDF, 192KB]

(2012) Graham Neubig, Taro Watanabe, & Shinsuke Mori: Inducing a discriminative parser to optimize machine translation reordering. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.843-853. [PDF, 644KB]

(2012) Thi Thanh Thao Phan, & Izabella Thomas: E-V MT of proper names: error analysis and some proposed solutions. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #530, 1p. [HTML]

(2012) Manny Rayner, Pierrette Bouillon, & Barry Haddow: Using source-language transformations to address register mismatches in SMT. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 76KB]

(2012) Johann Roturier, Linda Mitchell, Robert Grabowski & Melanie Siegel: Using automatic machine translation metrics to analyze the impact of source reformulations. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 509KB]

(2012) Amit Sangodkar & Om P.Damani: Re-ordering source sentences for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2164-2171. [PDF, 649KB]

(2012) Amira Shoukry & Ahmed Rafea: Preprocessing Egyptian dialect tweets for sentiment mining. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.47-56. [PDF, 1193KB]

(2012) Nimesh Singh & Nizar Habash: Hebrew morphological preprocessing for statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.43-50. [PDF, 146KB]

(2012) Sara Stymne: Clustered word classes for preordering in statistical machine translation. EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP,  23-27 April 2012, Avignon, France; pp. 28-34. [PDF, 108KB]

(2012) Rui Wang, Petya Osenova, & Kiril Simov: Linguistically-augmented Bulgarian-to-English statistical machine translation model. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.119-128. [PDF, 163KB]

(2012) Joern Wuebker & Hermann Ney: Phrase model training for statistical machine translation with word lattices of preprocessing alternatives. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.450-459. [PDF, 160KB]

(2012) Daniel Zeman: Data issues of the multilingual translation matrix. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.395-400. [PDF, 225KB]

(2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Maxim Khalilov & Khalil Sima’an: ILLC-UvA translation system for EMNLP-WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.413-419. [PDF, 591KB]

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2011) Santanu Pal, Tanmoy Chakraborty, & Sivaji Bandyopadhyay: Handling multiword expressions in phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.215-224. [PDF, 161KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

 (2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

 (2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Arnaud Vié, Luis Villarejo Muñoz, Mireia Farrús Cabeceran, & Jimmy O’Regan: Apertium advanced web interface: a first step towards interactivity and language tools convergence. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.45-51. [PDF, 280KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Jennifer Beaupre & Kent Taylor: Content quality for better MT: a practical guide to quality at the source. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 69KB]

(2010) Arianna Bisazza & Marcello Federico: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 235-243. [PDF, 372KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.251-258. [PDF, 301KB]

(2010) Susan Howlett & Mark Dras: Dual-path phrase-based statistical machine translation. Australasian Language Technology Workshop 2010 (ALTA Workshop 2010), 9-10 December 2008, Melbourne, Australia; pp.32-40. [PDF, 633KB]

(2010) Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, & Kevin Duh: Head finalization: a simple reordering rule for SOV languages.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 321KB]

(2010) Laura Elisabeth Jehl: Machine translation for Twitter. Master of Science, University of Edinburgh, 2010. iv, 53pp. [PDF, 398KB]

(2010) Weimin Jiang: Pre-editing for machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

(2010) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Vs and OOVs: two problems for translation between German and English. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 183-188. [PDF, 105KB]

(2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

Rapid development of MT systems

(2014) Ergun Biçici, Qun Liu, & Andy Way: Parallel FDA5 for fast development of accurate statistical machine translation systems. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.59-65. [PDF, 315KB]

(2014) Abu-MaTran: automatic building of machine translation. Project duration: January 2013 – December 2016. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.132. [PDF, 679KB]

(2013) George Tambouratzis, Sokratis Sofianopoulos, & Marina Vassiliou: Language-independent hybrid MT with PRESEMT. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.123-130. [PDF, 186KB]

(2012) Victor Chahuneau, Noah A.Smith, & Chris Dyer: pycdec: a Python interface to cdec. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.51-61. [PDF, 280KB]

(2011) William D.Lewis, Robert Munro, & Stephan Vogel: Crisis MT: developing a cookbook for MT in crisis situations. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.501-511. [PDF, 115KB]

(2011) Pim Otte & Francis M.Tyers: Rapid rule-based machine translation between Dutch and Afrikaans. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.153-160. [PDF, 317KB]

(2011) George Tambouratzis, Fotini Simistira, Sokratis Sofianopoulos, Nikos Tsimboukakis, & Marina Vassiliou: A resource-light phrase scheme for language-portable MT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.185-1 92. [PDF, 186KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: A semi-supervised batch-mode active learning strategy for improved statistical machine translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.126-134. [PDF, 256KB]

(2010) William D.Lewis: Haitian Creole: how to build and ship an MT engine from scratch in 4 days, 17 hours, & 30 minutes. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 518KB]; presentation: 24 slides [PDF, 630KB]

(2010) William Lewis: Haitian Creole: developing MT for a low data language. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 24pp. [PDF, 3425KB]

(2010) Robert Munro: Crowdsourcing translation for emergency response in Haiti: the global collaboration of local knowledge. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 331KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

Re-ordering (see also Generation, Syntactic transfer)

(2014) Abdullah Alrajeh & Mahesan Niranjan: Bayesian reordering model with feature selection. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.477-485. [PDF, 356KB]

(2014) Alexandru Ceausu & Sabine Hunsicker: Pre-ordering of phrase-based machine translation input in translation workflow. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3589-3592. [PDF, 407KB]

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Teresa Herrmann, Jan Niehues, & Alex Waibel: Manual analysis of structurally informed reordering in German-English machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4379-4386. [PDF, 127KB]

(2014) Matthias Huck, Hieu Hoang, & Philip Koehn: Augmenting string-to-tree and tree-to-string translation with non-syntactic phrases. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.486-498. [PDF, 269KB]

(2014) Bushra Jawaid & Ondřej Bojar: Two-step machine translation with lattices. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.682-686. [PDF, 67KB]

(2014) Hideki Isozaki, Natsume Kouchi, & Tsutomu Hirao: Dependency-based automatic enumeration of semantically equivalent word orders for evaluating Japanese translations. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.287-292. [PDF, 264KB]

(2014) Marcin Junczys-Dowmunt & Bruno Pouliquen: SMT of German patents at WIPO: decompounding and verb structure pre-reordering.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.217-220. [PDF, 369KB]

(2014) Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah, & Pushpak Bhattacharyya: Śata-Anuvādak: tackling multiway translation of Indian languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 1781-1787. [PDF, 233KB]

(2014) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: Word alignment-based reordering of source chunks in PB-SMT.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3565-3571. [PDF, 185KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Maryam Siahbani & Anoop Sarkar: Expressive hierarchical rule extraction for left-to-right translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.1-14. [PDF, 814KB]

(2014) Sara Stymne, Jörg Tiedemann, & Joakim Nivre: Estimating word alignment quality for SMT reordering tasks. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.275-286. [PDF, 344KB]

(2014) Katsuhito Sudoh, Masaaki Nagata, Shinsuke Mori, & Tatsuya Kawahara: Japanese-to-English patent translation system based on domain-adapted word segmentation and post-ordering. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.234-248. [PDF, 743KB]

(2013) Arianna Bisazza & Marcello Federico: Efficient solutions for word reordering in German-English phrase-based statistical machine transla­tion. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.440-451. [PDF, 876KB]

 (2013) Colin Cherry: Improved reordering for phrase-based translation using sparse features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.22-31. [PDF, 187KB]

(2013) Eunah Cho, Thanh-Le Ha, Mohammed Mediani, Jan Niehues, Teresa Herrmann, Isabel Slawik & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2013. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.104-108. [PDF, 107KB]

 (2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hieu Hoang, & Philipp Koehn: Can Markov models over minimal translation units help phrase-based  SMT?  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.399-405. [PDF, 220KB]

(2013) Minwei Feng, Jan-Thorsten Peter & Hermann Ney: Advancements in reordering models for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.322-332. [PDF, 383KB]

(2013) Markus Freitag, Minwei Feng, Matthias Huck, Stephan Peitz & Hermann Ney: Reverse word order models. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.159-166. [PDF, 373KB]

(2013) Isao Goto, Masao Utiyama, Eiichiro Sumita, Akihiro Tamura, & Sadao Kurohashi: Distortion model considering rich context for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.155- 165. [PDF, 808KB]

(2013) Dan Han, Pascual Martínez-Gómez, Yusuke Miyao, Katsuhito Sudoh, & Masaaki Nagata: Using unlabeled dependency parsing for pre-ordering for Chinese-to-Japanese statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 25-33. [PDF, 334KB]

(2013) Katsuhiko Hayashi, Katsuhito Sudoh, Hajime Tsukada, Jun Suzuki, & Masaaki Nagata: Shift-reduce word reordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1382-1386. [PDF, 424KB]

(2013) Teresa Herrmann, Jochen Weiner, Jan Niehues, & Alex Waibel: Analyzing the potential of source sentence reordering in statistical machine translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 166KB]

(2013) Teresa Herrmann, Jan Niehues, & Alex Waibel: Combining word reordering methods on different linguistic abstraction levels for statistical machine translation. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 39-47. [PDF, 131KB]

(2013) Sho Hoshino, Yusuke Miyao, Katsuhito  Sudoh, & Masaaki Nagata: Two-stage pre-ordering for Japanese-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1062-1066. [PDF, 544KB]

(2013) Fei Huang & Cezar Pendus: Generalized reordering rules for improved SMT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.387-392. [PDF, 113KB]

(2013) Zhongqiang Huang, Jacob Devlin & Rabih Zbib: Factored soft source syntactic constraints for hierarchical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.556-566. [PDF, 350KB]

(2013) Matthias Huck, Joern Wuebker, Felix Rietig & Hermann Ney: A phrase orientation model for hierarchical machine translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.452-463. [PDF, 345KB]

(2013) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Improving reordering performance using higher order and structural features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.315-324. [PDF, 177KB]

(2013) László J.Laki, Attila Novák, & Borbála Siklósi: An English-to-Hungarian morpheme-based statistical machine translation system with reordering rules. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.42-50. [PDF, 246KB]

(2013) Uri Lerner & Slav Petrov: Source-side classifier preordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.513-523. [PDF, 132KB]

(2013) Junhui Li, Philip Resnik, & Hal Daumé III: Modeling syntactic and semantic structures in hierarchical phrase-based translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.540-549. [PDF, 224KB]

(2013) Peng Li, Yang Liu & Maosong Sun: Recursive autoencoders for ITG-based translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.567-577. [PDF, 304KB]

(2013) Shuo Li, Derek F.Wong & Lidia S.Chao: Experiments with POS-based restructuring and alignment-based re-ordering for statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.82-87. [PDF, 273KB]

(2013) Antonio Valerio Miceli Barone, & Giuseppe Attardi: Pre-reordering for machine translation using transition-based walks on dependency parse trees. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.164-169. [PDF, 207KB]

(2013) Thuy Linh Nguyen & Stephan Vogel: Integrating phrase-based reordering features into a chart-based decoder for machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1587-1596. [PDF, 335KB]

(2013) Hwidong Na & Jong-Hyeok Lee: A discriminative reordering parser for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 105KB]

(2013) Raj Nath Patel, Rohit Gupta, Prakash B.Pimpale & Sasikumar M: Reordering rules for English-Hindi MT. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.34-41. [PDF, 321KB]

(2013) Stephan Peitz, Saab Mansour, Jan-Thorsten Peter, Christoph Schmidt, Joern Wuebker, Matthias Huck, Markus Freitag, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2013. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.193-199. [PDF, 220KB]

(2013) Raphael Rubino, Antonio Toral, Santiago Cortés Vaíllo, Jun Xie, Xiaofeng Wu, Stephen Doherty, & Qun Liu: The CNGL-DCU-Prompsit translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.213-218. [PDF, 152KB]

(2013) Hinrich Schütze: The operation sequence model: integrating translation and reordering operations in a single left-to-right model [abstract]. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.1. [PDF, 166KB]

(2013) Hendra Setiawan, Bowen Zhou & Bing Xiang: Anchor graph: global reordering contexts for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.501-512. [PDF, 242KB]

(2013) Maryam Siahbani, Baskaran Sankaran, & Anoop Sarkar: Efficient left-to-right hierarchical phrase-based translation with improved reordering. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1089-1099. [PDF,351KB]

(2013) Karthik Visweswariah, Mitesh M.Khapra, & Ananthakrishnan Ramanathan: Cut the noise: mutually reinforcing reordering and alignments for improved machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1275-1284. [PDF, 250KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

 (2013) Hui Zhang, Kristina Toutanova, Chris Quirk, & Jianfeng Gao: Beyond left-to-right: multiple decomposition structures for SMT. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.12-21. [PDF, 161KB]

(2012) Arianna Bisazza & Marcello Federico: Modified distortion matrices for phrase-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.478-487. [PDF, 742KB]

(2012) Fabienne Braune, Anita Gojun, & Alexander Fraser: Long-distance reordering during search for hierarchical phrase-based SMT. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.177-184. [PDF, 207KB]

(2012) Colin Cherry: Decoding. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 413KB]

(2012) Colin Cherry, Robert C.Moore, & Chris Quirk: On hierarchical re-ordering and permutation parsing for phrase-based decoding.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.200-209. [PDF, 284KB]

(2012) Han Dan, Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Head finalization reordering for Chinese-to-Japanese machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.57-66. [PDF, 257KB]

(2012) Jacob Dlougach & Irina Galinskaya: Building a reordering system using tree-to-string hierachical model. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.27-36. [PDF, 1115KB]

(2012) Minwei Feng, Weiwei Sun, & Hermann Ney: Semantic cohesion model for phrase-based SMT. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.867-878. [PDF, 227KB]

(2012) Minwei Feng, Jan-Thorsten Peter, & Hermann Ney: Sequence labeling-based reordering model for phrase-based SMT. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.260-267. [PDF, 660KB]

(2012) Minwei Feng & Hermann Ney: A tagging-style reordering model for phrase-based SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.17-26. [PDF, 140KB]

(2012) Ulrich Germann: Syntax-aware phrase-based statistical machine translation: system description. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.292-297. [PDF, 94KB]

(2012) Anita Gojun & Alexander Fraser: Determining the placement of German verbs in English-to-German SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 726-735. [PDF, 138KB]

(2012) Isao Goto, Masao Utiyama, & Eiichiro Sumita: Post-ordering by parsing for Japanese-English statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.311-316. [PDF, 346KB]

(2012) Benjamin Gottesman: Statistical machine translation for depassivizing German part-of-speech sequences. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.55-60. [PDF, 441KB]

(2012) Rohit Gupta, Raj Nath Patel, & Ritesh Shah: Learning improved reordering models for Urdu, Farsi and Italian using SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.37-45. [PDF, 520KB]

(2012) Hieu Hoang: Phrase-based translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 103 slides [PDF of PPT, 1227KB]

(2012) Maria Holmqvist, Sara Stymne, Lars Ahrenberg, & Magnus Merkel: Alignment-based reordering for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3436-3440. [PDF, 300KB]

(2012) Matthias Huck, Stephan Peitz, Markus Freitag, & Hermann Ney: Discriminative reordering extensions for hierarchical phrase-based machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.313-320. [PDF, 211KB]

(2012) Matthias Huck, Stephan Peitz, Markus Freitag, Malte Nuhn, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.304-311. [PDF, 127KB]

(2012) Hideki Isozaki: Head finalization: translation from SVO to SOV [abstract]. Keynote at: IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; 1p. [PDF, 526KB]

(2012) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Report of the shared task on learning reordering from word alignments at RSMT 2012. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.9-15. [PDF, 328KB]

(2012) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Whitepaper for shared task on learning reordering from word alignments at RSMT 2012. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.1-8. [PDF, 326KB]

(2012) Anoop Kunchukuttan & Pushpak Bhattacharyya: Partially modelling word reordering as a sequence labelling problem. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.47-54. [PDF, 123KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Josef van Genabith: Using syntactic head information in hierarchical phrase-based translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.232-242. [PDF, 283KB]

(2012) Dennis N.Mehay & Chris Brew: CCG syntactic reordering models for phrase-based machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.210-221. [PDF, 184KB]

(2012) Jiří Navrátil, Karthik Visweswariah, & Ananthakrishnan Ramanathan: A comparison of syntactic reordering methods for English-German machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2043-2058. [PDF, 213KB]

(2012) Graham Neubig, Taro Watanabe, & Shinsuke Mori: Inducing a discriminative parser to optimize machine translation reordering. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.843-853. [PDF, 644KB]

(2012) Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.349-355. [PDF, 149KB]

(2012) Ananthakrishnan Ramanathan & Karthik Visweswariah: A study of word-classing for MT reordering.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3971-3976. [PDF, 409KB]

(2012) Felipe Sánchez-Martínez, Rafael C.Carrasco, Miguel A.Martínez-Prieto, & Joaquín Adiego: Generalized bywords for bitext compression and translation spotting. Journal of Artificial Intelligence Research 43; pp.389-418. [PDF, 418KB]

(2012) Amit Sangodkar & Om P.Damani: Re-ordering source sentences for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2164-2171. [PDF, 649KB]

(2012) Sara Stymne: Clustered word classes for preordering in statistical machine translation. EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP,  23-27 April 2012, Avignon, France; pp. 28-34. [PDF, 108KB]

(2012) Deyi Xiong, Min Zhang, & Haizhou Li: Modeling the translation of predicate-argument structure for SMT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.902-911. [PDF, 172KB]

(2012) Ping Xu & Pascale Fung: Cross-lingual language modelling with syntactic reordering for low-resource speech recognition. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.766-776. [PDF, 275KB]

(2012) Nan Yang, Mu Li, Dongdong Zhang, & Nenghai Yu: A ranking-based approach to word reordering for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.912-920. [PDF,

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Machine translation by modelling predicate-argument structure transformation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3019-3036. [PDF, 1473KB]

(2012) Yue Zhang, Graeme Blackwood, & Stephen Clark: Syntax-based word ordering incorporating a large-scale language model. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 736-746. [PDF, 155KB]

(2011) JacobAndreas, Nizar Habash, & Owen Rambow: Fuzzy syntactic reordering for phrase-based statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.227-236. [PDF, 221KB]

(2011) Alexandra Birch & Miles Osborne: Reordering metrics for MT. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1027-1035. [PDF, 976KB]

(2011) Nadir Durrani, Helmut Schmid, & Alexander Fraser: A joint sequence translation model with integrated reordering. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1045-1054. [PDF, 254KB]

(2011) Jakob Elming & Martin Haulrich: Reordering by parsing. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.35-42. [PDF, 361KB]

(2011) Marcello Federico, Arianna Bisazza & Christian Hardmeier: Morphological processing and word reordering for statistical MT of highly inflected languages [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 37 slides [PDF of PPT, 835KB]

(2011) Andrew Finch, Chooi-Ling Goh, Graham Neubig, & Eiichiro Sumita: The NICT translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.49-56. [PDF, 285KB]

(2011) Teresa Herrmann, Mohammed Mediani, Jan Niehues, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.379-385. [PDF, 106KB]

(2011) Chooi-Ling Goh, Takashi Onishi, & Eiichiro Sumita: Rule-based reordering constraints for phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.113-120. [PDF, 446KB]

(2011) Jing He & Hongyu Liang: Word-reordering for statistical machine translation using trigram language model. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1288-1293. [PDF, 513KB]

(2011) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Experiments with word alignment, normalization and clause reordering for SMT between English and German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.393-398. [PDF, 102KB]

(2011) Susan Howlett & Mark Dras: Clause restructuring for SMT not absolutely helpful . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.384-388. [PDF, 91KB]

(2011) Bushra Jawaid & Daniel Zeman: Word-order issues in English-to-Urdu statistical machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.87-106. [PDF, 1006KB]

(2011) Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno, & Hideto Kazawa: Training a parser for machine translation reordering. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.183-192. [PDF, 192KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Maxim Khalilov & Khalil Sima’an: ILLC-UvA translation system for EMNLP-WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.413-419. [PDF, 591KB]

(2011) Wang Ling, Tiago Luís, João Graça, Luísa Coheur, & Isabel Trancoso: Reordering modeling using weighted alignment matrices. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.450-454. [PDF, 121KB]

(2011) Wang Ling, João Graça, David Martins de Matos, Isabel Trancoso, & Alan Black: Discriminative phrase-based lexicalized reordering models using weighted reordering graphs. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.47-55. [PDF, 275KB]

(2011) Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li: Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044. [PDF, 462KB]

(2011) Takashi Onishi, Masao Utiyama, & Eiichiro Sumita: Reordering constraint based on document-level context. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.434-438. [PDF, 128KB]

(2011) Ananthakrishnan Ramanathan, Pushpak Bhattacharyya, Karthik Visweswariah, Kushal Ladha, & Ankur Gandhe: Clause-based reordering constraints to improve statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1351-1355. [PDF, 132KB]

(2011) Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Post-ordering in statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.316-323. [PDF, 329KB]

(2011) David Talbot, Hideto Kazawa, Hiroshi Ichikawa, Jason Katz-Brown, Masakazu Seno, & Franz J.Och: A lightweight evaluation framework for machine translation reordering. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.12-21. [PDF, 371KB]

(2011) Karthik Visweswariah, Rajakrishnan Rajkumar, Ankur Gandhe, Ananthakrishnan Ramanathan, & Jiri Navratil: A word reordering model for improved machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.486-496. [PDF, 252KB]

(2011) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Extracting pre-ordering rules from chunk-based dependency trees for Japanese-to-English translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.300-307. [PDF, 859KB]

(2011) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Extracting pre-ordering rules from predicate-argument structures. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.29-37. [PDF, 1079KB]

(2011) Bing Xiang, Niyu Ge, & Abraham Ittycheriah: Improving reordering for statistical machine translation with smoothed priors and syntactic features. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.61-69. [PDF, 200KB]

(2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

(2011) Zhongguang Zheng, Yao Meng, & Hao Yu: Lexical-based reordering model for hierarchical phrase-based machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.389-396. [PDF, 385KB]

(2010) Avinesh PVS: A data mining approach to learn reorder rules for SMT. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.52-57. [PDF, 114KB]

(2010) Alexandra Birch & Miles Osborne: LRscore for evaluating lexical and reordering quality in MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 327-332. [PDF, 123KB]

(2010) Arianna Bisazza & Marcello Federico: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 235-243. [PDF, 372KB]

(2010) Hailong Cao & Eiichiro Sumita: Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21. [PDF, 65KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Improving Arabic-to-English statistical machine translation by reordering post-verbal subjects for alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.178-183. [PDF, 108KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Reordering matrix post-verbal subjects for Arabic-to-English SMT. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 287KB] 

(2010) Marta R.Costa-jussà & José R.Fonollosa: Using linear interpolation and weighted reordering hypotheses in the Moses system. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1712-1718. [PDF, 389KB]

(2010) Josep Maria Crego & François Yvon: Improving reordering with linguistically informed bilingual n-grams. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.197-205. [PDF, 270KB]

(2010) Jinhua Du & Andy Way: A discriminative latent variable-based “DE” classifier for Chinese-English SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.286-294. [PDF, 698KB]

(2010) Jinhua Du & Andy Way: The impact of source-side syntactic reordering on hierarchical phrase-based SMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 746KB]; presentation: 34 slides [PDF, 540KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Minwei Feng, Arne Mauser, & Hermann Ney: A source-side decoding sequence model for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 313KB]

(2010) Niyu Ge: A direct syntax-driven reordering model for phrase-based machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.849-857. [PDF, 161KB]

(2010) Dmitriy Genzel: Automatically learning source-side reordering rules for large scale machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.376-384. [PDF, 187KB]

(2010) Christian Hardmeier, Arianna Bisazza, & Marcello Federico: FBK at WMT 2010: word lattices for morphological reduction and chunk-based reordering.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 88-92. [PDF, 132KB]

(2010) Katsuhiko Hayashi, Hajime Tsukada, Katsuhito Sudoh, Kevin Duh, & Seiichi Yamamoto: Hierarchical phrase-based machine translation with word-based reordering model. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.439-446. [PDF, 1202KB]

(2010) Yanqing He, Yu Zhou, Chengqing Zong, & Huilin Wang: A novel reordering model based on multi-layer phrase for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.447-455. [PDF, 234KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Learning phrase boundaries for hierarchical phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.383-390. [PDF, 378KB]

 (2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

 (2010) Susan Howlett & Mark Dras: Dual-path phrase-based statistical machine translation. Australasian Language Technology Workshop 2010 (ALTA Workshop 2010), 9-10 December 2008, Melbourne, Australia; pp.32-40. [PDF, 633KB]

(2010) Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, & Kevin Duh: Head finalization: a simple reordering rule for SOV languages.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 321KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Improved phrase-based SMT with syntactic reordering patterns learned from lattice scoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 257KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Source-side syntactic reordering patterns with functional words for improved phrase-based SMT. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.19-27. [PDF, 208KB]

(2010) Maxim Khalilov & Khalil Sima’an: A discriminative syntactic model for source permutation via tree transduction. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.92-100. [PDF, 760KB]

(2010) Maxim Khalilov & Khalil Sima’an: The ILLC-UvA SMT system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.197-203. [PDF, 1209KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2010) Young-Suk Lee, Bing Zhao, & Xiaoqiang Luo: Constituent reordering and syntax models for English-to-Japanese statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.626-634. [PDF, 214KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Shui Liu, Sheng Li, Tiejun Zhao, Min Zhang, & Pengyuan Liu: Head-modifier relation based non-lexical reordering model for phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.748-756. [PDF, 297KB]

(2010) Gideon Maillette de Buy Wenniger, Maxim Khalilov, & Khalil Sima’an: A toolkit for visualizing the coherence of tree-based reordering with word-alignments. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.97-106. [PDF, 138KB]; presentation [PDF, 762KB]

(2010) Evgeny Matusov & Selçuk Köprü: Improving reordering in statistical machine translation from Farsi. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 148KB]

(2010) Jan Niehues, Teresa Herrmann, Mohammed Mediani, & Alex Waibel: The Karlsruhe Institute for Technology translation system for the ACL-WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 138-142. [PDF, 87KB]

 (2010) Hendra Setiawan, Chris Dyer, & Philip Resnik: Discriminative word alignment with a function word reordering model.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.534-544. [PDF, 200KB]

(2010) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Vs and OOVs: two problems for translation between German and English. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 183-188. [PDF, 105KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, & Qun Liu: Learning lexicalized reordering models from reordering graphs. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.12-16. [PDF, 1365KB]

(2010) Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, Tsutomu Hirao, & Masaaki Nagata: Divide and translate: improving long distance reordering in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 418-427. [PDF, 569KB]

(2010) Sriram Venkatapathy, Rajeev Sangal, Aravind Joshi, & Karthik Gali: A discriminative approach to dependency based statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.66-74. [PDF, 187KB]

(2010) David Vilar, Daniel Stein, Matthias Huck, & Hermann Ney: Jane: open source hierarchical translation, extended with reordering and lexicon models. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 262-270. [PDF, 136KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated reordering: evaluation and analysis. Computational Linguistics 36 (3), pp. 535-568 [PDF, 745KB]

(2010) Sirvan Yahyaei & Christoph Monz: Dynamic distortion in a discriminative reordering model for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.353-360. [PDF, 460KB]

Reusability

(2012) Tommi A.Pirinen & Francis M.Tyers: Compiling Apertium morphological dictionaries with HFST and using them in HFST applications.  [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.25-28. [PDF, 120KB]

(2011) Preslav Nakov: Reusing parallel corpora between related languages [abstract] AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; 1p. [PDF, 184KB]

(2011) Antonio Toral, Mireia Ginestí-Rosell, & Francis Tyers: An Italian to Catalan RBMT system reusing data from existing language pairs. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.77-81. [PDF, 161KB]

Segmentation (see also Morphological analysis, Tagging)

(2015) Jinhua Du, Ankit Srivastava, Andy Way, Alfredo Maldonado-Guerra, & David Lewis: An empirical study of segment prioritization for incrementally retrained post-editing-based SMT. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.172-185. [PDF, 788KB] 

(2015) Hassan Shavarani, Maryam Siahbani, Ramtin Mehdizadeh Seraj, & Anoop Sarkar:  Learning segmentations that balance latency versus quality in spoken language translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.217-224. [PDF, 3.5MB]

(2014) Kevin Flanagan: Filling in the gaps: what we need from TM subsegment recall. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.12-23. [PDF, 508KB]

(2014) Mikel L.Forcada: On the annotation of TMX translation memories for advanced leveraging in computer-aided translation.LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4374-4378. [PDF, 123KB]

(2014) Fabrizio Gotti, Philippe Langlais, & Atefeh Farzindar: Hashtag occurrences, layout and translation: a corpus-driven analysis of tweets published by the Canadian government.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2254-2261. [PDF, 486KB]

(2014) Carla Parra Escartin: Chasing the perfect splitter: a comparison of different compound splitting tools. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2990-2994. [PDF, 3340-3347B]

(2014) Katsuhito Sudoh, Masaaki Nagata, Shinsuke Mori, & Tatsuya Kawahara: Japanese-to-English patent translation system based on domain-adapted word segmentation and post-ordering. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.234-248. [PDF, 743KB]

(2013) Joris Driesen, Peter Bell, Mark Sinclair, & Steve Renals: Description of the UEDIN system for German ASR. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 216KB]

(2013) Nadir Durrani, Alexander Fraser, & Helmut Schmid: Model with minimal translation units, but decode with phrases. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1-11. [PDF, 172KB]

(2013) Masato Hagiwara & Satoshi Sekine: Accurate word segmentation using transliteration and language model projection. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics,  Short papers, Sofia, Bulgaria, August 4-9 2013; pp.183-189. [PDF, 548KB]

(2013) Chien-Lin Huang, Paul R.Dixon, Shigeki Matsuda, Youzheng Wu, Xugang Lu, Masahiro Saiko, & Chiori Hori: The NICT ASR system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 225KB]

(2013) Gideon Maillette de Buy Wenniger & Khalil Sima’an: Hierarchical alignment decomposition labels for Hiero grammar rules. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 19-28. [PDF, 128KB]

(2013) Juan Pino, Aurelien Waite, Tong Xiao, Adrià de Gispert, Federico Flego & William Byrne: The University of Cambridge Russian-English system at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.200-205. [PDF, 169KB]

(2013) Mohammad Sadegh Rasooli, Ahmed El Kholy, & Nizar Habash: Orthographic and morphological processing for Persian-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1047-1051. [PDF, 106KB]

(2013) Hassan Sajjad, Francisco Guzmán, Preslav Nakov, Ahmed Abdelali, Kenton Murray, Fahad Al Obaidli, & Stephan Vogel: QCRI at IWSLT 2013: experiments in Arabic-English and English-Arabic spoken language translation.  [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 176KB]

(2013) Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Andre Ljolje, & Rathinavelu Chengalvarayan: Segmentation strategies for streaming speech translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.230-238. [PDF, 186KB]

(2013) Sara Stymne, Nicola Cancedda, & Lars Ahrenberg: Generation of compound words in statistical machine translation into compounding languages. Computational Linguistics 39 (4); pp.1067-1108. [PDF, 615KB]

(2013) Jennifer Williams, Rafael Banchs, & Haizhou Li: Meaning unit segmentation in English and Chinese: a new approach to discourse phenomena.  Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.1-9. [PDF, 885KB]

(2013) Dekai Wu, Karteek Addanki, Markus Saers, & Meriem Beloucif: Learning to Freestyle: hip hop challenge-response induction via transduction rule segmentation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.102-112. [PDF, 398KB]

(2013) Mahsa Yarmohammadi, Vivek K.Rangarajan Sridhar, Srinivas Bangalore, & Baskaran Sankaran: Incremental segmentation and decoding strategies for simultaneous translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1032-1036. [PDF, 192KB]

(2013) Hui Zhang, Kristina Toutanova, Chris Quirk, & Jianfeng Gao: Beyond left-to-right: multiple decomposition structures for SMT. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.12-21. [PDF, 161KB]

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) James Breen, Timothy Baldwin, & Francis Bond: Segmentation and translation of Japanese multi-word loanwords. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; pp. 61-69. [PDF, 249KB]

(2012) Odbayar Chimeddorj & Atsushi Fujii: Enhancing lemmatization for Monglian and its application to statistical machine translation. COLING 2012: Proceedings of the 10th Workshop on Asian Language Resources, Mumbai, December 2012; pp.115-123. [PDF, 675KB]

(2012) Eunah Cho, Jan Niehues, & Alex Waibel: Segmentation and punctuation prediction in speech language translation using a monolingual translation system. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.252-259. [PDF, 603KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, Daisuke Kawahara, & Sadao Kurohashi: Exploiting shared Chinese characters in Chinese word segmentation: optimization for Chinese-Japanese machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.35-42. [PDF, 359KB]

(2012) Hiroshi Echizen’ya, Kenji Araki, & Eduard Hovy: Optimization for efficient determination of chunk in automatic evaluation for machine translation. COLING: Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology, December 2012; pp.17-30. [PDF, 339KB]

(2012) Hyoung-Gyu Lee & Hae-Chang Rim: Decoder-based discriminative training of phrase segmentation for statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.611-620. [PDF, 237KB]

(2012) Chang Liu & Hwee Tou Ng: Character-level machine translation evaluation for languages with ambiguous word boundaries. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.921-929. [PDF, 196KB]

(2012) Saab Mansour & Hermann Ney: Arabic-segmentation combination strategies for statistical machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3915-3920. [PDF, 351KB]

(2012) Jiří Maršík & Ondřej Bojar: TrTok: a fast and trainable tokenizer for natural languages. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.75-85. [PDF, 141KB]

(2012) Emad Mohamed: Morphological segmentation and part of speech tagging for religious Arabic. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.65-71. [PDF, 708KB]

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

(2012) Chunye Zhang, Tingting Li, & Tiejun Zhao: Syllable-based machine transliteration with extra phrase features.  [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.52-56. [PDF, 163KB]

(2011) Rafael E.Banchs: Comparative evaluation of Spanish segmentation strategies for Spanish-Chinese transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.41-48. [PDF, 198KB]

(2011) Andrew Finch, Paul Dixon, & Eiichiro Sumita: Integrating models derived from non-parametric Bayesian co-segmentation into a statistical machine transliteration system. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.23-27. [PDF, 266KB]

(2011) Andrew Finch, Chooi-Ling Goh, Graham Neubig, & Eiichiro Sumita: The NICT translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.49-56. [PDF, 285KB]

(2011) Takaaki Fukunishi, Seiichi Yamamoto, Andrew Finch, & Eiichiro Sumita: Using features from a bilingual alignment model in transliteration mining. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.49-57. [PDF, 660KB]

(2011) Hideto Ikeda, Nguyen Thanh Hung, & Ze Zhong Li: Functional approach for patent translation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.64-69. [PDF, 144KB]

(2011) Alon Lavie & Hassan Al-Haj: The impact of Arabic morphological segmentation on broad-scale phrase-based SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 38 slides [PDF of PPT, 362KB]

(2011) Hyoung-Gyu Lee, Joo-Young Lee, Min-Jeong Kim, Hae-Chang Rim, Joong-Hwi Shin, & Young-Sook Hwang: Phrase segmentation model using collocation and translational entropy.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.198-205. [PDF, 203KB]

(2011) Klaus Macherey, Andrew M.Dai, David Talbot, Ashok C.Popat, & Franz Och: Language-independent compound splitting with morphological operations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1395-1404. [PDF, 562KB]

(2011) Coşkun Mermer & Murat Saraçlar: Unsupervised Turkish morphological segmentation for statistical machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 136KB]; presentation: 41 slides [PDF of PPT, 278KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) Sergio Ortiz Rojas & Santiago Cortés Vaíllo: Using Apertium linguistic data for tokenization to improve Moses SMT performance. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.29-34. [PDF, 282KB]

(2011) Ying Qin & Guohua Chen: Forward-backward machine transliteration between English and Chinese based on combined CRFs. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.82-85. [PDF, 168KB]

(2011) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jesús González-Rubio, Jorge González, & Francisco Casacuberta: Bilingual segmentation for phrasetable pruning in statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.257-264. [PDF, 374KB]; presentation, 15 slides [PDF]

(2011) Joern Wuebker, Matthias Huck, Saab Mansour, Markus Freitag, Minwei Feng, Stephan Peitz, Christoph Schmidt, & Hermann Ney: The RWTH Aachen machine translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.106-113. [PDF, 193KB]

(2011) Ning Xi, Guangchao Tang, Boyuan Li, & Yinggong Zhao: Word alignment combination over multiple word segmentation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.1-5. [PDF, 329KB]

(2010) Hassan Al-Haj & Alon Lavie: The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 72KB]

(2010) Hailong Cao, Andrew Finch, & Eiichiro Sumita: Syntactic constraints on phrase extraction for phrase-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.28-33. [PDF, 84KB]

(2010)  Marta R.Costa-jussà, Vidas Daudaravicius, & Rafael E.Banchs: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 975KB]

(2010) Vladimir Eidelman, Chris Dyer, & Philip Resnik: The University of Maryland statistical machine translation system for the Fifth Workshop on Machine Translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 72-76. [PDF, 115KB]

(2010) Ahmed El Kholy & Nizar Habash: Techniques for Arabic morphological detokenization and orthographic denormalization. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 45-51. [PDF, 354KB]

(2010) Andrew Finch & Eiichiro Sumita: A Bayesian model of bilingual segmentation for transliteration. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.259-266. [PDF, 511KB]

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Learning phrase boundaries for hierarchical phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.383-390. [PDF, 378KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussà, Vidas Daudaravicius, Rafael E.Banchs, & José B.Mariño: UPC-BMIC-VDU system description for the IWSLT 2010: testing several collocation segmentations in a phrase-based SMT system.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.189-195. [PDF, 439KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussà, Vidas Daudaravicius, Rafael E.Banchs, & José B.Mariño: Using collocation segmentation to augment the phrase table. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 98-102. [PDF, 1398KB]

(2010) Chung-chi Huang, Ho-ching Yen, Shih-ting Huang, & Jason S.Chang: Using sublexical translations to handle the OOV problem in MT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 132KB]

(2010) Wen Li, Lei Chen, Wudaba, & Miao Li: Chained machine translation using morphemes as pivot language. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.169-177. [PDF, 170KB]

(2010) Saab Mansour: MorphTagger: HMM-based Arabic segmentation for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.321-327. [PDF, 369KB]

(2010) Saab Mansour, Stephan Peitz, David Vilar, Joern Wuebker, & Hermann Ney: The RWTH Aachen machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.163-168. [PDF, 248KB]

(2010) Coşkun Mermer & Ahmet Afşın Akın: Unsupervised search for the optimal segmentation for statistical machine translation. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.31-36. [PDF, 118KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Hwidong Na & Jong-Hyeok Lee: The POSTECH’s statistical machine translation system for the IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.153-156. [PDF, 290KB]

(2010) ThuyLinh Nguyen, Stephan Vogel, & Noah A.Smith: Nonparametric word segmentation for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.815-823. [PDF, 267KB]

(2010) Michael Paul, Andrew Finch, & Eiichiro Sumita: Integration of multiple bilingually-learned segmentation schemes into statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 400-408. [PDF, 256KB]

(2010) Glenn Slayden, Mei-Yuh Hwang, & Lee Schwartz: Thai sentence-breaking for large-scale SMT. [Coling 2010] Proceedings of the 1st Workshop on South and Southeast Asian Natural Language Processing, Beijing, China, August 2010; pp.8-16. [PDF, 334KB]

(2010) Yiou Wang, Kiyotaka Uchimoto, Jun’ichi Kazama, Canasai Kruengkrai, & Kentaro Torisawa: Adapting Chinese word segmentation for machine translation based on short units.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1758-1764. [PDF, 409KB]

(2010) Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Joint tokenization and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1200-1208. [PDF, 342KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Learning translation boundaries for phrase-based decoding. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.136-144. [PDF, 450KB]

Semantic analysis and representation (see also Disambiguation, Entailment, Latent semantic analysis, Treebanks)

(2015) Meriem Beloucif, Markus Saers, & Dekai Wu: Improving semantic SMT via soft semantic role label constraints on ITG alignmens. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.333-345. [PDF, 647KB]  

(2014) Marianna Apidianaki, Emilia Verzeni, & Diana McCarthy: Semantic clustering of pivot paraphrases. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4270-4275. [PDF, 243KB]

(2014) Anabela Barreiro, Fernando Batista, Ricardo Ribeiro, Helena Moniz, & Isabel Trancoso: OpenLogos semantico-syntactic knowledge-rich bilingual dictionaries. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3774-3781. [PDF, 163KB]

(2014) Lars Borin, Jens Allwood, & Gerard de Melo: Bring vs.MTRoget: evaluating automatic thesaurus translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2115-2121. [PDF, 252KB]

(2014) Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John McCrae, Philipp Cimiano, & Roberto Navigli: Representing multilingual data as linked data: the case of BabelNet 2.0.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.401-408. [PDF, 674KB]

(2014) Tatiana Erekhinskaya, Meghana Satpute, & Dan Moldovan: Multilingual eXtended Word Net Knowledge Base semantic parsing and translation of glosses.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2990-2994. [PDF, 82KB]

(2014) Michaela Regneri, Rui Wang, & Manfred Pinkal: Aligning predicate-argument structures for paraphrase fragment extraction. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4300-4307. [PDF, 216KB]

(2014) Gregor Thurmair: Conceptual transfer: using local classifiers for transfer selection. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4387-4393. [PDF, 409KB]

(2014) Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, & Xiuhong Zhang: Not an interlingua, but close: comparison of English AMRs to Chinese and Czech. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1765-1772. [PDF, 417KB]

(2013) see Proceedings of Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation, June 13, 2013 Atlanta, Georgia

(2013) Jacob Andreas, Andreas Vlachos, & Stephen Clark: Semantic parsing as machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.47-52. [PDF, 133KB]

(2013) Marzieh Bazrafshan & Daniel Gildea: Semantic roles for string to tree machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.419-423. [PDF, 141KB]

(2013) Alexandra Birch, Barry Haddow, Ulrich Germann, Maria Nadejde, Christian Buck & Philipp Koehn: The feasibility of HMEANT as a human MT evaluation metric. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.52-61. [PDF, 222KB]

(2013) Marta R.Costa-jussà & Chris Quirk: Morphological, syntactical and semantic knowledge in statistical machine translation. [NAACL-HLT 2013]  Tutorials, NAACL HLT 2013, 16-18 June 2013, Atlanta, Georgia; pp.10-13. [PDF, 67KB]

(2013) Mahmoud Ghoneim & Mona Diab: Multiword expressions in the context of statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1181-1187. [PDF, 120KB]

(2013) Meritxell Gonzàlez, Maria Mateva, Ramona Enache, Cristina España, Lluís Màrquez, Borislav Popov, & Aarne Ranta: MT techniques in a retrieval system of semantically enriched patents. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.273-280. [PDF, 908KB]

(2013) Mikhail Kozhevnikov & Ivan Titov: Cross-lingual transfer of semantic role labeling models. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1190-1200. [PDF, 188KB]

(2013) Mike Lewis & Mark Steedman: Unsupervised induction of cross-lingual semantic relations. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.681-692. [PDF, 149KB]

(2013) Junhui Li, Philip Resnik, & Hal Daumé III: Modeling syntactic and semantic structures in hierarchical phrase-based translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.540-549. [PDF, 224KB]

(2013) Chi-kiu Lo & Dekai Wu: Can informal genres be better translated by tuning on automatic semantic metrics? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.93-100. [PDF, 598KB]

(2013) Chi-kiu Lo & Dekai Wu: Human semantic MT evaluation with HMEANT for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 317KB]

(2013) Chi-kiu Lo, Meriem Beloucif, & Dekai Wu: Improving machine translation into Chinese by tuning against Chinese MEANT. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 284KB]

(2013) Chi-kiu Lo, Karteek Addanki, Markus Saers & Dekai Wu: Improving machine translation by training against an automatic semantic frame based evaluation metric. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.375-381. [PDF, 370KB]

(2013) Chi-kiu Lo & Dekai Wu: MEANT at WMT 2013: A tunable, accurate yet inexpensive semantic frame based MT evaluation metric. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.422-428. [PDF, 563KB]

(2013) John Philip McCrae, Philipp Cimiano, & Roman Klinger: Orthonormal explicit topic analysis for cross-lingual document matching. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1732-1740. [PDF, 192KB]

(2013) Tsuyoshi Okita, Qun Liu, & Josef van Genabith: Shallow semantically-informed PBSMT and HPBSMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.177-184. [PDF, 281KB]

(2013) Nathan Schneider, Behrang Mohit, Chris Dyer, Kemal Oflazer, & Noah A.Smith: Supersense tagging for Arabic: the MT-in-the-middle attack.  [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.661-667. [PDF, 143KB]

(2013) José G.C.de Souza, Miquel Esplà-Gomis, Marco Turchi, & Matteo Negri: Exploiting qualitative information from automatic word alignment for cross-lingual NLP tasks.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.771-776. [PDF, 277KB]

(2013) Mei Tu, Yu Zhou, & Chengqing Zong: A novel translation framework based on rhetorical structure theory.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.370-374. [PDF, 610KB]

(2013) Jennifer Williams, Rafael Banchs, & Haizhou Li: Meaning unit segmentation in English and Chinese: a new approach to discourse phenomena.  Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.1-9. [PDF, 885KB]

(2013) Will Y.Zou, Richard Socher, Daniel Cer, & Christopher D.Manning: Bilingual word embeddings for phrase-based machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1393-1398. [PDF, 337KB]

(2012) see Proceedings of SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012

(2012) Kartik Asooja, Jorge Gracia, Nitish Aggarwal, & Asunión Gómez Pérez: Using cross-lingual explicit semantic analysis for improving ontology translation. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.25-35. [PDF, 143KB]

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

(2012) Erwin Fernandez-Ordoñez, Rada Mihalcea, & Samer Hassan: Unsupervised word sense disambiguation with multilingual representations.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.847-851. [PDF, 253KB]

(2012) Douwe Gelling & Trevor Cohn: Using senses in HMM word alignment. NAACL-HLT Workshop on the Induction of Linguistic Structure, Montréal, Canada, June 3-8, 2012; pp.39-46. [PDF, 156KB]

(2012) Bettina Harriehausen-Mühlbauer & Timm Heuss: Semantic web based machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.1-9. [PDF, 196KB]

(2012) Bevan Jones, Jacob Andreas, Daniel Bauer, Karl Moritz Hermann, & Kevin Knight: Semantics-based machine translation with hyperedge replacement grammars. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1359-1376. [PDF, 169KB]

(2012) Jungi Kim, Jinseok Nam, & Iryna Gurevych: Learning semantics with deep belief network for cross-language information retrieval. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.579-588. [PDF, 517KB]

(2012) Chi-kiu Lo, Anand Karthik Tumuluru, & Dekai Wu: Fully automatic semantic MT evaluation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.243-252. [PDF, 379KB]

(2012) Yashar Mehdad, Matteo Negri, & Marcello Federico: Match without a referee: evaluating MT adequacy without reference translations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.171-180. [PDF, 126KB]

(2012) Christian M.Meyer & Iryna Gurevych: To exhibit is not to loiter: a multilingual, sense-disambiguated Wiktionary for measuring verb similarity. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1763-1780. [PDF, 771KB]

(2012) Thomas Meyer & Andrei Popescu-Belis: Using sense-labeled discourse connectives for statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.129-138. [PDF, 275KB]

(2012) Fernando Pereira: Low-pass semantics [abstract]. META-FORUM, Brussels, June 19-21, 2012; 1 p. [PDF, 15KB]

(2011) Yusuke Abe, Takafumi Suzuki, Bing Liang, Takehito Utsuro, Mikio Yamamoto, Suguru Matsuyoshi, & Yasuhide Kawada: Example-based translation of Japanese functional expressions utilizing semantic equivalence classes. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.91-103. [PDF, 776KB]

(2011) Fabio Alves, Adrianna Pagano, & Igor L. da Silva: Modeling (un)packing of meaning in translation: insights from effortful text production. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.153-163. [PDF, 1064KB]

(2011) Wilker Aziz, Miguel Rios, & Lucia Specia: Improving chunk-based semantic role labelling with lexical features.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.226-232. [PDF, 99KB]

(2011) Wilker Aziz, Miguel Rios, & Lucia Specia: Shallow semantic trees for SMT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.316-322. [PDF, 106KB]

(2011) Rafael E.Banchs & Marta R.Costa-jussà: A semantic feature for statistical machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.126-134. [PDF, 576KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Stephen Beale: Using Linguist’s Assistant for language description and translation. Proceedings of the IJCNLP 2011 System Demonstrations, Chiang Mai, Thailand, November 9, 2011; pp.5-8. [PDF, 956KB]

(2011) Yufeng Chen & Chengqing Zong: A semantic-specific model for Chinese named entity translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.138-146. [PDF, 282KB]

(2011) Boxing Chen, Roland Kuhn, & George Foster: Semantic smoothing and fabrication of phrase pairs for SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.144-150. [PDF, 335KB]

(2011) Ido Dagan & Shachar Mirkin: Textual entailment inference in machine translation [abstract]. Machine Translation and Morphologically-rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 39 slides [PDF of PPT, 663KB]

(2011) Nachum Dershowitz: Using semantics in translation [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 76 slides [PDF of PPT, 18,100KB]

(2011) Qin Gao & Stephan Vogel: Corpus expansion for statistical machine translation with semantic role label substitution rules. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.294-298. [PDF, 151KB]

(2011) Qin Gao & Stephan Vogel: Utilizing target-side semantic role labels to assist hierarchical phrase-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.107-115. [PDF, 352KB]

(2011) Yoshihiko Hayashi: A representation framework for cross-lingual/interlingual lexical semantic correspondences. Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011), ed. Johan Bos and Stephen Pulman, January 12-14, Oxford, UK; pp.155-164. [PDF, 257KB]

(2011) Leonardo Lesmo, Alessandro Mazzei, & Daniele Radicioni: Linguistic processing in the ATLAS project. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 25 slides [PDF of PPT, 5367KB]

(2011) Chi-kiu Lo & Dekai Wu: MEANT: an inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.220-229. [PDF, 935KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Chi-kiu Lo & Dekai Wu: Structured vs. flat semantic role representations for machine translation evaluation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.10-20. [PDF, 758KB]

(2011) Jeff Ma, Spyros Matsoukas, & Richard Schwartz: Improving low-resource statistical machine translation with a novel semantic word clustering algorithm. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.352-359. [PDF, 160KB]

(2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

(2011) Daniel Marcu: Meaning-equivalent semantics for understanding, generation, translation, and evaluation. Keynote at: IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker

(2011) Johanna Monti, Anabela Barreiro, Annibale Elia, Federica Marano, & Antonella Napoli: Taking on new challenges in multi-word unit processing for machine translation. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.11-19. [PDF, 148KB]

(2011) Daniele Pighin & Lluís Màrquez: Automatic projection of semantic structures: an application of pairwise translation ranking. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.1-9. [PDF, 142KB]

(2011) Lonneke van der Plas, Paola Merlo, & James Henderson: Scaling up automatic cross-lingual semantic role annotation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.299-304. [PDF, 161KB]

(2011) Shumin Wu & Martha Palmer: Semantic mapping using automatic word alignment and semantic role labeling. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.21-30. [PDF, 200KB]

(2010) Marianna Apidianaki & Yifan He: An algorithm for cross-lingual sense-clustering tested in a MT evaluation setting. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.219-226. [PDF, 474KB]

(2010) Kathryn Baker, Michael Bloodgood, Chris Callison-Burch, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, Scott Miller, & Christine Piatko: Semantically-informed syntactic machine translation: a tree-grafting approach. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 360KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Bilingual sense similarity for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 185KB]

(2010) Dmitry Davidov & Ari Rappoport: Automated translation of semantic relationships. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.241-249. [PDF, 196KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Statistical machine translation of English-Manipuri using morpho-syntactic and semantic information. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 680KB]

(2010) Steffen Eger & Ineta Sejane: Computing semantic similarity from bilingual dictionaries. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1217-1225 [PDF, 918KB]

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

(2010) Jan Hajic: Building bridges using innovative approaches in machine translation. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 24 slides [PDF of PPT, 1109KB]

(2010) Chang Liu, Daniel Dahlmeier, & Hwee Tou Ng: TESLA: translation evaluation of sentences with linear-programming-based analysis. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 354-359. [PDF, 181KB]

(2010) Ding Liu & Daniel Gildea: Semantic role features for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.716-724. [PDF, 388KB]

(2010) Chi-kiu Lo & Dekai Wu: Evaluating machine translation utility via semantic role labels. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2873-2877. [PDF, 417KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Yuval Marton: Improved statistical machine translation with hybrid phrasal paraphrases derived from monolingual text and shallow lexical resource. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 293KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

(2010) Roberto Navigli & Simone Paolo Ponzetto: BabelNet: building a very large multilingual semantic network. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.216-225. [PDF, 229KB]

(2010) Christian Scheible: Sentiment translation through lexicon induction. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.25-30. [PDF, 119KB]

(2010) Christian Scheible, Florian Laws, Lukas Michelbacher, & Hinrich Schütze: Sentiment translation through multi-edge graphs. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1104-1112. [PDF, 236KB]

(2010) Ineke Schuurman & Vincent Vandeghinste: Cultural aspects of spatiotemporal analysis in multilingual applications. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3308-3313. [PDF, 318KB]

(2010) Lonneke van der Plas, Tanja Samardžić, & Paola Merlo: Cross-lingual validity of PropBank in the manual annotation of French. [ACL-2010] Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden, 15-16 July 2010; pp.113-117. [PDF, 102KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

(2010) Billy Tak-Ming Wong: Semantic evaluation of machine translation.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2884-2888. [PDF, 574KB]

(2010) Shumin Wu, Jinho D.Choi, & Martha Palmer: Detecting cross-lingual semantic similarity using parallel PropBanks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 274KB]

Semantic transfer  see Lexical transfer

Speech recognition

(2015) Michael Heck, Quoc Truong Do, Sakriani Sakti, Graham Neubig, & Satoshi Nakamura: The NAIST English speech recognition system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.105-111. [PDF, 2.8MB]

(2015) Kevin Kilgour & Alex Waibel: Multifeature modular deep neural network acoustic models. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.159-166. [PDF, 2.6MB]

(2015) Anna Matamala: The ALST project: technologies for audiovisual translation. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.79-89. [PDF, 269KB]

(2015) Markus Müller, Thai-Son Nguyen, Matthias Sperber, Kevin Kilgour, Sebastian Stüker, & Alex Waibel: The 2015 KIT IWSLT speech-to-text systems for English and German. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.70-75. [PDF, 2.1MB]

(2015) Markus Müller & Alex Waibel: Using language adaptive deep neural networks for improved multilingual speech recognition. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.167-172. [PDF, 1.8MB]

(2015) Van Huy Nguyen, Quoc Bao Nguyen, Tat Thang Vu, & Chi Mai Luong: The IOIT English ASR system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.84-87. [PDF, 1.2MB]

(2015) Tran Huy Dat, Jonathan William Dennis, & Ng Wen Zheng Terence: The I2R ASR system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.88-92. [PDF, 1.8MB]

(2014) Alejandro Armando, Pierrette Bouillon, Manny Rayner, & Nikos Tsourakis: A tool for building multilingual voice questionnaires. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.177-181. [PDF, 279KB]

(2014) Mercedes García-Martínez, Karan Singla, Aniruddha Tammewar, Bartolomé Mesa-Lao, Ankita Thakur, Anusuya M.A., Srinivas Bangalore, & Michael Carl: SEECAT: ASR & Eye-tracking enabled computer-assisted translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.81-88. [PDF, 624KB]

(2014) Anthony Rousseau, Paul Deléglise, & Yannick Estève: Enhancing the TED-LIUM corpus with selected data for language modeling and more TED talks. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3935-3939. [PDF, 115KB]

(2014) Nicholas Ruiz, & Marcello Federico: Assessing the impact of speech recognition errors on machine translation quality. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation   in the Americas, Vancouver, BC, October 22-26; pp.261-274. [PDF, 560KB]

(2014) ALST: linguistic and sensorial accessibility: technologies for voice-over and audio description. Ministerio de Economía y Competetitividad (Spain) Proyectos de investigación fundamental no orientada. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.128. [PDF, 227KB]

(2013) Heike Adel, Ngoc Thang Vu, & Tanja Schultz: Combination of recurrent neural networks and factored language models for code-switching language modeling. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.206-211. [PDF, 210KB]

(2013) Sankaranarayanan Ananthakrishnan, Wei Chen, Rohit Kumar, & Dennis Mehay: Source aware phrase-based decoding for robust conversational spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 222KB]

(2013) Peter Bell, Fergus McInnes, Siva Reddy Gangireddy, Mark Sinclair, Alexandra Birch, & Steve Renals: The UEDIN English ASR system for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 173KB]

(2013) Rahma Boujelbane, Mariem Ellouze Khemekhem, Siwar BenAyed, & Lamia Hadrich Belguith: Building bilingual lexicon to create dialect Tunisian corpora and adapt language model. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.88-93. [PDF, 454KB]

(2013) Joris Driesen, Peter Bell, Mark Sinclair, & Steve Renals: Description of the UEDIN system for German ASR. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 216KB]

(2013) Daniele Falavigna, Roberto Gretter, Fabio Brugnara, & Diego Giuliani: FBK @ IWSLT 2013 – ASR tracks. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 4pp. [PDF, 104KB]

(2013) Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Nadir Durrani, Matthias Huck, Philipp Koehn, Thanh-Le Ha, Jan Niehues, Mohammed Mediani, Teresa Herrmann, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 128KB]

(2013) Michael Heck, Sebastian Stüker, Sakriani  Sakti, Alex Waibel, & Satoshi Nakamura: Incremental unsupervised training for university lecture recognition. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 162KB]

(2013) Shahab Jalalvand & Daniele Falavigna: Parameter optimization for iterative confusion network decoding in weather-domain speech recognition. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 631KB]

(2013) Kevin Kilgour, Christian Mohr, Michael Heck, Quoc Bao Nguyen, Van Huy Nguyen, Evgeniy Shin, Igor Tseyzer, Jonas Gehring, Markus Müller, Matthias Sperber, Sebastian Stüker & Alex Waibel: The 2013 KIT IWSLT speech-to-text systems for German and English. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 86KB]

(2013) Ngoc-Quan Pham, Hai-Son Le, Tat-Thang Vu, & Chi-Mai Luong: The speech recognition and machine translation system of IOIT for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 240KB]

(2013) Matt Post, Gaurav Kumar, Adam Lopez, Damianos Karakos, Chris Callison-Burch, & Sanjeev Khudanpur: Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 192KB]

(2013) Sakriani Sakti, Keigo Kubo, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: The NAIST English speech recognition system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 109KB]

(2013) M.Ali Basha Shaik, Zoltan Tüske, Simon Wiesler, Markus Nußbaum-Thom, Stephan Peitz, Ralf Schlüter & Hermann Ney: The RWTH Aachen German and English LVCSR systems for IWSLT-2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 230KB]

(2013) Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Andre Ljolje, & Rathinavelu Chengalvarayan: Segmentation strategies for streaming speech translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.230-238. [PDF, 186KB]

(2012) Dia Abu Zeina, Moustafa Elshafei, Husni Al-Muhtaseb, & Wasfi Al-Khatib: Rescoring N-best hypotheses for Arabic speech recognition: a syntax-mining approach. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.57-64. [PDF, 846KB]

(2012) Eunah Cho, Jan Niehues, & Alex Waibel: Segmentation and punctuation prediction in speech language translation using a monolingual translation system. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.252-259. [PDF, 603KB]

(2012) Ştefan Daniel Dumitrescu, Radu Ion, Dan Ştefănescu, Tiberiu Boroş, & Dan Tufiş: Romanian to English automatic MT experiments at IWSLT12 – system description paper. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.136-143. [PDF, 812KB]

(2012) D.Falavigna, R.Gretter, F.Brugnara, & D.Giuliani: FBK@IWSLT 2012 – ASR track. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.81-86. [PDF, 732KB]

(2012) Daniele Falavigna & Roberto Gretter: Focusing language models for automatic speech recognition. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.171-178. [PDF, 722KB]

(2012) M.Federico, M.Cettolo, L.Bentivogli, M.Paul, & S.Stüker: Overview of the IWSLT 2012 evaluation campaign. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.12-33. [PDF, 772KB]; presentation: 44 slides [PDF of PPT, 3701KB]

(2012) Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues, & Giovanni Moretti: The IWSLT 2011 evaluation campaign on automatic talk translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3543-3540. [PDF, 411KB]

(2012) Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, & Pawel Swietojanski: The UEDIN systems for the IWSLT 2012 evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.46-53. [PDF, 651KB]; presentation: 103 slides [PDF of PPT, 384KB]

(2012) Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, & Alex Waibel: The KIT-NAIST (contrastive) English ASR system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.91-95. [PDF, 637KB]

(2012) Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: A method for translation of paralinguistic information. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.158-163. [PDF, 759KB]

(2012) Shahram Khadivi & Zeinab Vakil: Interactive-predictive speech-enabled computer-assisted translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.237-243. [PDF, 774KB]

(2012) Heinrich Kolkhorst, Kevin Kilgour, Sebastian Stüker, & Alex Waibel: Evaluation of interactive user corrections for lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.217-221. [PDF, 635KB]

(2012) Stephan Peitz, Saab Mansour, Markus Freitag, Minwei Feng, Matthias Huck, Joern Wuebker, Malte Nuhn, Markus Nußbaum-Thom, & Hermann Ney: The RWTH Aachen speech recognition and machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.69-76. [PDF, 623KB]; presentation, 19 slides [PDF of PPT, 165KB]

(2012) N.Ruiz, A.Bisazza, R.Cattoni, & M.Federico: FBK’s machine translation systems for IWSLT 2012’s TED lectures. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.61-68. [PDF, 623KB]; presentation, 27 slides [PDF of PPT, 378KB]

(2012) Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebatian Stüker, Sakriani Sakri, Graham Neubig, Tomoki Toda, Satoshi Nakamura, & Alex Waibel: The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.87-90. [PDF, 621KB]

(2012) Hassan Sawaf: Automatic speech recognition and hybrid machine translation for high-quality closed-captioning and subtitling for video broadcast. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 244KB]

(2012) Ping Xu & Pascale Fung: Cross-lingual language modelling with syntactic reordering for low-resource speech recognition. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.766-776. [PDF, 275KB]

(2012) Hitoshi Yamamoto, Youzheng Wu, Chien-Lin Huang, Xugang Lu, Paul R.Dixon, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: The NICT ASR system for IWSLT2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.34-37. [PDF, 700KB]; presentation, 16 slides [PDF of PPT, 1114KB]

(2012) Dong Yu: Who can understand your speech better – deep neural network of Gaussian mixture model? [Abstract of] Keynote speech at: IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; p.9. [PDF, 546KB]

(2012) transLectures: transcription and translation of video lectures.  [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.204. [PDF, 142KB]

(2012) George Wright: Speech analysis and archive research. META-FORUM, Brussels, June 19-21, 2012; 17 slides. [PDF of PPT, 541KB]

(2011) Kazuhiko Abe, Youzheng Wu, Chien-lin Huang, Paul R.Dixon, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: The NICT ASR system for IWSLT2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.28-33. [PDF, 266KB]

(2011) A.Ryan Aminzadeh, Tim Anderson, Ray Slyh, Brian Ore, Eric Hansen, Wade Shen, Jennifer Drexler, & Terry Gleason: The MIT-LL/AFRL IWSLT-2011 MT system. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.34-40. [PDF, 254KB]

(2011) Dimitra Anastasiou: Survey on speech, machine translation and gestures in ambient assisted living. Tralogy, Paris, 3-4 March 2011; pp. [PDF, 185KB]

(2011) Arianna Bisazza, Nick Ruiz, & Marcello Federico: Fill-up versus interpolation methods for phrase-based SMT adaptation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.136-143. [PDF, 323KB]

(2011) Paul R.Dixon, Andrew Finch, Chiori Hori, & Hideki Kashioka: Investigation of the effects of ASR tuning on speech translation performance. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.167-174. [PDF, 238KB]

(2011) Marcello Federico, Luisa Bentivogli, Michael Paul, & Sebastian Stüker: Overview of the IWSLT 2011 evaluation campaign. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.11-27. [PDF, 486KB]

(2011) Sadaoki Furui: Data-intensive approaches for ASR. Keynote at: IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker

(2011) Jie Jiang, Zeeshan Ahmed, Julie Carson-Berndsen, Peter Cahill, & Andy Way: Phonetic representation-based speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.81-88. [PDF, 320KB]

(2011) Kevin Kilgour, Christian Saam, Christian Mohr, Sebastian Stüker, & Alex Waibel: The 2011 KIT QUAERO speech-to-text system for Spanish. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.199-205. [PDF, 236KB]

(2011) Lori Lamel, Sandrine Courcinous, Julien Despres, Jean-Luc Gauvain, Yvan Josse, Kevin Kilgour, Florian Kraft, Viet Bac Le, Hermann Ney, Markus Nußbaum-Thom, Ilya Oparin, Tim Schlippe, Ralf Schlüter, Tanja Schultz, Thiago Fraga da Silva, Sebastian Stüker, Martin Sundermeyer, Bianca Vieru, Ngoc Thang Vu, Alexander Waibel, & Cécile Woehrling: Speech recognition for machine translation in Quaero. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.121-128. [PDF, 310KB]; presentation, 33 slides. [PDF of PPT]

 (2011) Benjamin Lecouteux, Laurent Besacier, & Hervé Blanchon: LIG English-French spoken language translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.68-72 [PDF, 98KB]

(2011) Stephan Peitz, Markus Freitag, Arne Mauser, & Hermann Ney: Modeling punctuation prediction as machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.238-245. [PDF, 214KB]

(2011) Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk, & Yannick Estève: LIUM’s systems for the IWSLT 2011 speech translation tasks. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.79-85. [PDF, 250 KB]

(2011) N.Ruiz, A.Bisazza, F.Brugnara, D.Falavigna, D.Giuliani, S.Jaber, R.Gretter, & M.Federico: FBK@IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.86-93. [PDF, 203KB]

(2011) Sebastian Stüker, Kevin Kilgour, Christian Saam, & Alex Waibel: The 2011 KIT English ASR system for the IWSLT evaluation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.94-97. [PDF, 198KB]

(2011) Keiji Yasuda, Hideo Okuma, Masao Utiyama, & Eiichiro Sumita: Annotating data selection for improving machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.269-274. [PDF, 352KB]

(2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Mark Liberman: Obituary: Frederick Jelinek. Computational Linguistics 36 (4), pp. 595-599 [PDF, 48KB]

(2010) Daniel Chung Yong Lim, Ian Lane & Alex Waibel: Real-time spoken language identification and recognition for speech-to-speech translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.307-312. [PDF, 477KB]

(2010) Anthony Rousseau, Loïc Barrault, Paul Deléglise, & Yannick Estève: LIUM’s statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.113-117. [PDF, 246KB]

Speech synthesis

(2015) Anna Matamala: The ALST project: technologies for audiovisual translation. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.79-89. [PDF, 269KB]

(2014) ALST: linguistic and sensorial accessibility: technologies for voice-over and audio description. Ministerio de Economía y Competetitividad (Spain) Proyectos de investigación fundamental no orientada. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.128. [PDF, 227KB]

(2011) Dimitra Anastasiou: Survey on speech, machine translation and gestures in ambient assisted living. Tralogy, Paris, 3-4 March 2011; pp. [PDF, 185KB]

Syntactic analysis and representation

(2015) Makoto Morishita, Koichi Akabe, Yuto Hatakoshi, Graham Neubig, Koichiro Yoshino and Satoshi Nakamura: Parser self-training for syntax-based machine translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.232-239. [PDF, 2.6MB]

(2015) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Enhancing function word translation with syntax-based statistical post-editing. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.44-51. [PDF, 542KB]

(2014) Tatiana Erekhinskaya, Meghana Satpute, & Dan Moldovan: Multilingual eXtended Word Net Knowledge Base semantic parsing and translation of glosses.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2990-2994. [PDF, 82KB]

(2014) Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Matthias Huck, Rico Sennrich, Nadir Durrani, Maria Nadejde, Philip Williams, Philipp Koehn, Teresa Herrmann, Eunach Cho, & Alex Waibel; EU-BRIDGE MT: combined machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.105-113. [PDF, 217KB]

(2014) Teresa Herrmann, Jan Niehues, & Alex Waibel: Manual analysis of structurally informed reordering in German-English machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4379-4386. [PDF, 127KB]

(2014) Matthias Huck, Hieu Hoang, & Philip Koehn: Augmenting string-to-tree and tree-to-string translation with non-syntactic phrases. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.486-498. [PDF, 269KB]

(2014) Tom Vanallemeersch & Vincent Vandeghinste: Improving fuzzy matching through syntactic knowledge. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.90-99. [PDF, 430KB]

(2014) Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, & Philipp Koehn: Edinburgh’s syntax-based systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.207-214. [PDF, 222KB]

(2013) Proceedings of Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation, June 13, 2013 Atlanta, Georgia

(2013) Matthias Büchse, Alexander Koller, & Heiko Vogler: Generic binarization for parsing and translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.145-154. [PDF, 304KB]

(2013) Marta R.Costa-jussà & Chris Quirk: Morphological, syntactical and semantic knowledge in statistical machine translation. [NAACL-HLT 2013]  Tutorials, NAACL HLT 2013, 16-18 June 2013, Atlanta, Georgia; pp.10-13. [PDF, 67KB]

(2013) Ankur Gandhe & Rashmi Gangadharaiah: Hypothesis refinement using agreement constraints in machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.429-437. [PDF, 170KB]

(2013) Nathan David Green & Zdeněk Žabokrtský: Improvements to syntax-based machine translation using ensemble dependency parsers. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 19-24. [PDF, 202KB]

(2013) Greg Hanneman & Alon Lavie: Improving syntax-augmented machine translation by coarsening the label set. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.288-297. [PDF, 1205KB]

(2013) Zhongqiang Huang, Jacob Devlin & Rabih Zbib: Factored soft source syntactic constraints for hierarchical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.556-566. [PDF, 350KB]

(2013) Uri Lerner & Slav Petrov: Source-side classifier preordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.513-523. [PDF, 132KB]

(2013) Junhui Li, Philip Resnik, & Hal Daumé III: Modeling syntactic and semantic structures in hierarchical phrase-based translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.540-549. [PDF, 224KB]

(2013) Yang Liu: A shift-reduce parsing algorithm for phrase-based string-to-dependency translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1-10. [PDF, 290KB]

(2013) Maria Nadejde, Philip Williams, & Philipp Koehn: Edinburgh’s syntax-based machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.170-176. [PDF, 161KB]

(2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

(2013) Michal Novák, Zdeněk Žabokrtský & Anna Nedoluzhko: Two case studies on translating pronouns in a deep syntax framework. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1037-1041. [PDF, 348KB]

(2013) Rudolf Rosa, David Mareček, & Aleš Tamchyna: Deepfix: statistical post-editing of statistical machine translation using deep syntactic analysis. ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.172-179. [PDF, 178KB]

(2013) Rasoul Samad Zadeh Kaljahi, Jennifer Foster, Raphael Rubino, Johann Roturier, & Fred Hollowood: Parser accuracy in quality estimation of machine translation: a tree kernel approach. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1092-1096. [PDF, 114KB]

(2013) Violeta Seretan: On translating syntactically-flexible expressions [abstract]. Invited talk at [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; p.11. [PDF, 284KB]

(2013) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Integrating morpho-syntactic features in English-Arabic statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.74-81. [PDF, 491KB]

(2013) Marion Weller, Alexander Fraser, & Sabine Schulte im Walde: Using subcategorization knowledge to improve case prediction for translation into German.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.593-603. [PDF, 252KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2013) Wenduan Xu, Yue Zhang, Philip Williams, & Philipp Koehn: Learning to prune: context-sensitive pruning for syntactic MT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.352-357. [PDF, 334KB]

(2013) Soichi Yokoyama: Analysis of parallel structures in patent sentences, focusing on the head words. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.24-27. [PDF,  1040KB]

(2012) Proceedings of SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012

(2012) Dia Abu Zeina, Moustafa Elshafei, Husni Al-Muhtaseb, & Wasfi Al-Khatib: Rescoring N-best hypotheses for Arabic speech recognition: a syntax-mining approach. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.57-64. [PDF, 846KB]

(2012) Hala Almaghout, Jie Jiang, & Andy Way: Extending CCG-based syntactic constraints in hierarchical phrase-based SMT. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.193-200. [PDF, 228KB]

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Fabienne Braune, Anita Gojun, & Alexander Fraser: Long-distance reordering during search for hierarchical phrase-based SMT. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.177-184. [PDF, 207KB]

(2012) Colin Cherry, Robert C.Moore, & Chris Quirk: On hierarchical re-ordering and permutation parsing for phrase-based decoding.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.200-209. [PDF, 284KB]

(2012) Michael Denkowski, Greg Hanneman, & Alon Lavie: The CMU-Avenue French-English translation system. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.261-266. [PDF, 97KB]

(2012) Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, & David Mareček: Formemes in English-Czech deep syntactic MT. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.267-274. [PDF, 316KB]

(2012) Yang Feng, Dongdong Zhang, Mu Li, Ming Zhou, & Qun Liu: Hierarchical chunk-to-string translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.950-958. [PDF, 152KB]

(2012) Yang Feng, Yang Liu, Qun Liu, & Trevor Cohn: Left-to-right tree-to-string decoding with prediction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1191-1200. [PDF, 511KB]

(2012) Ryan Georgi, Fei Xia, & William D.Lewis: Measuring the divergence of dependency structures cross-linguistically to improve syntactic projection algorithms.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.771-778. [PDF, 1013KB]

(2012) Ulrich Germann: Syntax-aware phrase-based statistical machine translation: system description. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.292-297. [PDF, 94KB]

(2012) Kenneth Heafield: Chart-based decoding. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 77 slides [PDF of PPT, 1257KB]

(2012) Hieu Hoang: Phrase-based translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 103 slides [PDF of PPT, 1227KB]

(2012) Rasoul Samad Zadeh Kaljahi, Raphael Rubino, Johann Roturier, & Jennifer Foster: A detailed analysis of phrase-based and syntax-based machine translation: the search for systematic differences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 140KB]

(2012) Gideon Kotzé, Vincent Vandeghinste, Scott Martens, & Jörg Tiedemann: Large aligned treebanks for syntax-based machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.467-473. [PDF, 498KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Joseph van Genabith: Head-driven hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.33-37. [PDF, 268KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Josef van Genabith: Using syntactic head information in hierarchical phrase-based translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.232-242. [PDF, 283KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li, & Ming Zhou: Re-training monolingual parser bilingually for syntactic SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.854-862. [PDF, 708KB]

(2012) Graham Neubig, Taro Watanabe, & Shinsuke Mori: Inducing a discriminative parser to optimize machine translation reordering. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.843-853. [PDF, 644KB]

(2012) Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.349-355. [PDF, 149KB]

(2012) Ananthakrishnan Ramanathan & Karthik Visweswariah: A study of word-classing for MT reordering.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3971-3976. [PDF, 409KB]

(2012) Ariya Rastrow, Mark Dredze, & Sanjeev Khudanpur: Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.175-183. [PDF, 419KB]

(2012) Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann Roturier, Rasul Samad Zadeh Kaljahi, & Fred Hollowood: DCU-Symantec submission for the WMT 2012 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.138-144. [PDF, 125KB]

(2012) Baskaran Sankaran, Majid Razmara, & Anoop Sarkar: Kriya – an end-to-end hierarchical phrase-based MT system. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.83-98. [PDF,

(2012) Sokratis Sofianopoulos, Marina Vassiliou, & George Tambouratzis: Implementing a language-independent MT methodology. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.1-10. [PDF, 285KB]

(2012) David Vilar: DFKI’s SMT system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.382-387. [PDF, 90KB]

(2012) Jonathan Weese, Chris Callison-Burch, & Adam Lopez: Using categorial grammar to label translation rules. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.222-231. [PDF, 165KB]

(2012) Philip Williams & Philipp Koehn: GHKM rule extraction and scope-3 parsing in Moses. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.388-394. [PDF, 122KB]

(2012) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Akamon: an open source toolkit for tree/forest-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.127-132. [PDF, 501KB]

(2012) Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored recurrent neural network language model in TED lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.222-228. [PDF, 645KB]

(2012) Tong Xiao, Jingbo Zhu, Hao Zhang, & Qiang Li: NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.19-24. [PDF, 258KB]

(2012) Nan Yang, Mu Li, Dongdong Zhang, & Nenghai Yu: A ranking-based approach to word reordering for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.912-920. [PDF,

(2012) Hui Zhang & David Chiang: An exploration of forest-to-string translation: does translation help or hurt parsing? [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.317-321. [PDF, 49KB]

(2012) Yue Zhang & Joakim Nivre: Analyzing the effect of global learning and beam-search on transition-based dependency parsing. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1391-1400. [PDF, 282KB]

(2012) Yue Zhang, Graeme Blackwood, & Stephen Clark: Syntax-based word ordering incorporating a large-scale language model. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 736-746. [PDF, 155KB]

(2011) Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]

(2011) Pratyush Banerjee, Hala Almaghout, Sudip Naskar, Johann Roturier, Jie Jiang, Andy Way, & Josef van Genabith: The DCU machine translation systems for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.41-48. [PDF, 274KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, & Haizhou Li: SMT helps bitext dependency parsing. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.73-83. [PDF, 583KB]

(2011) Christian Chiarcos & Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. Proceedings of the Fifth LAW Workshop (LAW V), Portland, Oregon, USA, 23-24 June 2011; pp.11-20. [PDF, 118KB]

(2011) Fabien Cromieres & Sadao Kurohashi: Efficient retrieval of tree translation examples for syntax-based machine translation.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.508-518. [PDF, 616KB]

(2011) Jakob Elming & Martin Haulrich: Reordering by parsing. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.35-42. [PDF, 361KB]

(2011) Marcello Federico: Aspects of tree-based statistical machine translation. Lecture at: Sixth Machine Translation Marathon, 5-10 September, Trento, Italy; 66 slides [PDF of PPT, 2112KB]

(2011) Christian Federmann, Sabine Hunsicker, Yu Chen, & Rui Wang: DFKI system combination using syntactic information at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.104-109. [PDF, 294KB]

(2011) Sumukh Ghodke, Steven Bird, & Rui Zhang: A breadth-first representation for tree matching in large scale forest-based translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.785-793. [PDF, 941KB]

(2011) Kevin Gimpel & Noah A.Smith: Quasi-synchronous phrase dependency grammars for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.474-485. [PDF, 360KB]

(2011) Yoav Goldberg: Agreement matters: challenges of translating into a morphologically rich language, and the advantages of a syntax-based system. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 41KB];  presentation: 108 slides [PDF of PPT, 342KB]

(2011) Isao Goto, Masao Utiyama, Takashi Onishi, & Eiichiro Sumita: A comparison study of parsers for patent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.448-455. [PDF, 156KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

(2011) Greg Hanneman & Alon Lavie: CMU syntax-based machine translation at WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.365-371. [PDF, 118KB]

(2011) Christian Hardmeier: Improving machine translation quality prediction with syntactic tree kernels. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.233-240. [PDF, 633KB] ; presentation, 18 slides [PDF]

(2011) Kenneth Heafield, Hieu Hoang, Philipp Koehn, Tetsuo Kiso & Marcello Federico: Left language model state for syntactic machine translation. IWSLT 2011: Proc eedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.183-190. [PDF, 286KB]

(2011) Hieu Hoang: Mixed syntax translation. Invited talk at: Sixth Machine Translation Marathon, 5-10 September 2011, Trento, Italy; 75 slides [PDF of PPT, 1361KB]

(2011) Deirdre Hogan, Jennifer Foster, & Josef van Genabith: Decreasing lexical data sparsity in statistical syntactic parsing – experiments with named entities. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.14-19. [PDF, 97KB]

(2011) Mark Hopkins, Greg Langmead & Tai Vo: Extraction programs: a unified approach to translation rule extraction. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.523-532. [PDF, 552KB]

(2011) Hideto Ikeda, Nguyen Thanh Hung, & Ze Zhong Li: Functional approach for patent translation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.64-69. [PDF, 144KB]

(2011) Miriam Kaeshammer & Dominikus Wetzel: Enriching phrase-based statistical machine translation with POS information.  [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 2011, Hissar, Bulgaria, 13 September 2011; pp.33-40. [PDF, 127KB]

(2011) Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno, & Hideto Kazawa: Training a parser for machine translation reordering. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.183-192. [PDF, 192KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Kevin Knight: Integrated morphology for translation [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; presentation: 104 slides [PDF of PPT, 2895KB]

(2011) Philipp Koehn: Tree-based models: statistical machine translation. Lecture at: Sixth Machine Translation Marathon, 5-10 September, Trento, Italy; 120 slides [PDF of PPT, 1806KB]

(2011) Alexander Koller & Marco Kuhlmann: A generalized view on parsing and translation [abstract]. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; 1p. [PDF, 12KB]

(2011) Wenjing Lang, Qiaoli Zhou, Guiping Zhang, & Dongfeng Cai: Multi-stage Chinese dependency parsing based on dependency direction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.64-71. [PDF, 128KB]

(2011) Alon Lavie: Statistical MT with syntax and morphology: challenges and some solutions [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.13. [PDF, 139KB]

(2011) Lemao Liu, Tiejun Zhao, Chao Wang, & Hailong Cao: A unified and discriminative soft syntactic constraint model for hierarchical phrase-based translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.253-260. [PDF, 197KB]

(2011) Ryan McDonald, Slav Petrov, & Keith Hall: Multi-source transfer of delexicalized dependency parsers. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.62-72. [PDF, 284KB]

(2011) Matouš Macháček & Ondřej Bojar: Approximating a deep-syntactic metric for MT evaluation and tuning. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.92-98. [PDF, 130KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

(2011) Kemal Oflazer: Syntax-to-morphology mapping in factored SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; presentation: 60 slides [PDF of PPT, 491KB]

(2011) Jason Riesa, Ann Irvine, & Daniel Marcu: Feature-rich language-independent syntax-based alignment for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.497-507. [PDF, 1053KB]

(2011) Alexander M.Rush & Michael Collins: Exact decoding of syntactic translation models through Lagrangian relaxation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.72-82. [PDF, 203KB]

(2011) Lane Schwartz, Chris Callison-Burch, William Schuler, & Stephen Wu: Incremental syntactic language models for phrase-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.620-631. [PDF, 249KB]

(2011) Anders Søgaard: Data point selection for cross-language adaptation of dependency parsers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.682-686. [PDF, 124KB]

(2011) Anders Søgaard & Martin Haulrich: Sentence-level instance-weighting for graph-based and transition-based dependency parsing. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.43-47. [PDF, 86KB]

(2011) George Tambouratzis, Fotini Simistira, Sokratis Sofianopoulos, Nikos Tsimboukakis, & Marina Vassiliou: A resource-light phrase scheme for language-portable MT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.185-1 92. [PDF, 186KB]

(2011) Reut Tsarfaty: Morphology, syntax, and what’s in between: reifying cross-linguistic variation through cross-linguistic parsing. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 109KB]; presentation: 80 slides [PDF of PPT, 1321KB]

(2011) Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, & Adam Lopez: Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.478-484. [PDF, 164KB]

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

(2011) Dekai Wu: Syntactic SMT and semantic SMT. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.12-13. [PDF, 75KB]

(2011) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Simple but effective approaches to improving tree-to-tree model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.261-268. [PDF, 225KB]

(2011) Jiajun Zhang, Feifei Zhai, & Chenqing Zong: Augmenting string-to-tree translation models with fuzzy use of source-side syntax. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.204-215. [PDF, 317KB]

(2011) Bing Zhao, Young-Suk Lee, Xiaoqiang Luo, & Liu Li: Learning to transform and select elementary trees for improved syntax-based machine translations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.846-855. [PDF, 711KB]

(2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) proceedings of SSST-4, Fourth Workshop on Syntax and Structure in Statistical Translation, Dekai Wu (ed.): COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010.

(2010) Hala Almaghout, Jie Jiang, & Andy Way: CCG augmented hierarchical phrase-based machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.211-218. [PDF, 653KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Kathryn Baker, Michael Bloodgood, Chris Callison-Burch, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, Scott Miller, & Christine Piatko: Semantically-informed syntactic machine translation: a tree-grafting approach. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 360KB]

(2010) Anton Bryl & Josef van Genabith: f-align: an open-source alignment tool for LFG f-structures. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 229KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) Hailong Cao & Eiichiro Sumita: Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21. [PDF, 65KB]

(2010) Simon Carter & Christof Monz: Discriminative syntactic reranking for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 143KB]

(2010) David Chiang: Learning to translate with source and target syntax. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1443-1452. [PDF, 357KB]

(2010) Tagyoung Chung & Daniel Gildea: Effects of empty categories on machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.636-645. [PDF, 198KB]

(2010) Josep Maria Crego & François Yvon: Improving reordering with linguistically informed bilingual n-grams. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.197-205. [PDF, 270KB]

(2010) Jinhua Du & Andy Way: The impact of source-side syntactic reordering on hierarchical phrase-based SMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 746KB]; presentation: 34 slides [PDF, 540KB]

(2010) Tomaž Erjavec: MULTEXT-East version 4: multilingual morphosyntactic specifications, lexicons and corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2544-2547. [PDF, 324KB]

(2010) Qin Gao & Stephan Vogel: Training phrase-based machine translation models on the cloud. Open source machine translation toolkit Chaski. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.37-46. [PDF, 158KB]

(2010) Guillem Gascó Mora & Joan Andreu Sánchez Peiró: Syntax augmented inversion transduction grammars for machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 62KB]

(2010) Niyu Ge: A direct syntax-driven reordering model for phrase-based machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.849-857. [PDF, 161KB]

(2010) Jesús Giménez: Empirical machine translation and its evaluation. Presentation of Best Thesis Award for 2008 at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 39 slides [PDF of PPT presentation, 496KB]

(2010) Spence Green & Christopher D.Manning: Better Arabic parsing: baselines, evaluations, and analysis. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.394-402. [PDF, 148KB]

(2010) Greg Hanneman, Jonathan Clark & Alon Lavie: Improved features and grammar selection for syntax-based MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 82-87. [PDF, 97KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Christian Hardmeier: Fast and extensible phrase scoring for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.87-96. [PDF, 123KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Hieu Hoang & Philipp Koehn: Improved translation with source syntax labels. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 409-417. [PDF, 1079KB]

(2010) Liang Huang & Haitao Mi: Efficient incremental decoding for tree-to-string translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.273-283. [PDF, 266KB]

(2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

(2010) Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, & Kevin Duh: Head finalization: a simple reordering rule for SOV languages.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 321KB]

(2010) Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, & Chris Quirk: A discriminative lexicon model for complex morphology. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 374KB]

(2010) Jae Dong Kim, Ralf D.Brown, & Jaime G.Carbonell: Chunk-based EBMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

(2010) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Ziyuan Wang, Jonathan Weese, & Omar F.Zaidan: Joshua 2.0: a toolkit for parsing-based machine translation with syntax, semirings, discriminative training and other goodies. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 101KB]

(2010) Shui Liu, Sheng Li, Tiejun Zhao, Min Zhang, & Pengyuan Liu: Head-modifier relation based non-lexical reordering model for phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.748-756. [PDF, 297KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Adam Lopez: Context-free translation models. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 160 slides. [PDF, 3817KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

(2010) Francisco Oliveira, Fai Wong, Iok-Sai Hong & Ming-Chui Dong: Parsing extended constraint synchronous grammar in Chinese-Portuguese machine translation. International Conference on Computational Processing of Portuguese Language (PROPOR 2010), Porto Alegre, Brazil, 2010; 6pp. [PDF, 91KB]

(2010) Francisco Oliveira, Fai Wong, & Iok-Sai Hong: Systematic processing of long sentences in rule based Portuguese-Chinese machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010) Lucia Silva: Fine-tuning in Brazilian Portuguese-English statistical transfer machine translation: verbal tenses.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.58-63. [PDF, 251KB]

(2010) Glenn Slayden, Mei-Yuh Hwang, & Lee Schwartz: Thai sentence-breaking for large-scale SMT. [Coling 2010] Proceedings of the 1st Workshop on South and Southeast Asian Natural Language Processing, Beijing, China, August 2010; pp.8-16. [PDF, 334KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

(2010) Daniel Stein, Stephan Peitz, David Vilar, & Hermann Ney: A cocktail of deep syntactic features for hierarchical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 221KB]

(2010) Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, Tsutomu Hirao, & Masaaki Nagata: Divide and translate: improving long distance reordering in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 418-427. [PDF, 569KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Exploring syntactic structural features for sub-tree alignment using bilingual tree kernels. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.306-315. [PDF, 286KB]

(2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

(2010) Jörg Tiedemann: Lingua-Align: an experimental toolkit for automatic tree-to-tree alignment.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.736-743. [PDF, 296KB]

(2010) David Vilar, Daniel Stein, Stephan Peitz, & Hermann Ney: If I only had a parser: poor man’s syntax for hierarchical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.345-352. [PDF, 400KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Wei Wang, Jonathan May, Kevin Knight, & Daniel Marcu: Re-structuring, re-labeling, and re-aligning for syntax-based machine translation. Computational Linguistics 36 (2), pp. 247-277 [PDF, 984KB]

(2010) Jonathan Weese & Chris Callison-Burch: Visualizing data structures in parsing-based machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.127-136. [PDF, 538KB]

(2010) Tong Xiao, Jingbo Zhu, Hao Zhang, & Muhua Zhu: An empirical study of translation rule extraction with multiple parsers. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1345-1353. [PDF, 388KB]

(2010) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated reordering: evaluation and analysis. Computational Linguistics 36 (3), pp. 535-568 [PDF, 745KB]

(2010) Huichao Xue & Rebecca Hwa: Syntax-driven machine translation as a model of ESL revision. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1373-1381. [PDF, 258KB]

(2010) Reyyan Yeniterzi & Kemal Oflazer: Syntax-to-morphology mapping in factored phrase-based statistical machine translation from English to Turkish. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.454-464. [PDF, 1381KB]

(2010) Sina Zarrieß, Aoife Cahill, Jonas Kuhn, & Christian Rohrer: Cross-lingual induction of deep broad-coverage syntax: a case study on German participles. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1426-1434. [PDF, 106KB]

Syntactic transfer [see also Re-ordering]

(2015) Nina Seemann, Fabienne Braune, & Andreas Maletti: A systematic evaluation of MBOT in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.200-214. [PDF, 585KB]

(2014) Grégoire Détrez, Víctor M.Sánchez-Cartagena, & Aarne Ranta: Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4394-4400. [PDF, 146KB]

(2014) Matthias Huck, Hieu Hoang, & Philip Koehn: Augmenting string-to-tree and tree-to-string translation with non-syntactic phrases. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.486-498. [PDF, 269KB]

(2014) Austin Matthews, Waleed Ammar, Archna Bhatia, Weston Feely, Greg Hanneman, Eva Schlinger, Swabha Swayamdipta, Yulia Tsvetkov, Alon Lavie, & Chris Dyer: The CMU machine translation systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.142-149. [PDF, 230KB]

(2014) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: Word alignment-based reorderingof source chunks in PB-SMT.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3565-3571. [PDF, 185KB]

(2014) Daniel Quernheim & Fabienne Cap: Large-scale exact decoding: the IMS-TTT submission to WMT14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.163-170. [PDF, 513KB]

(2013) Waleed Ammar, Victor Chahuneau, Michael Denkowski, Greg Hanneman, Wang Ling, Austin Matthews, Kenton Murray, Nicola Segall, Alon Lavie & Chris Dyer: The CMU machine translation systems at WMT 2013: syntax, synthetic translation options, and pseudo-references.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.70-77. [PDF, 201KB]

(2013) Marzieh Bazrafshan & Daniel Gildea: Semantic roles for string to tree machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.419-423. [PDF, 141KB]

(2013) Ondřej Bojar, Rudolf Rosa & Aleš Tamchyna: Chimera – three heads for English-to-Czech translation.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.92-98. [PDF, 175KB]

(2013) Fabienne Braune, Nina Seemann, Daniel Quernheim, & Andreas Maletti: Shallow local multi bottom-up tree transducers in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.811-821. [PDF, 276KB]

(2013) Martin Čmejrek, Haitao Mi, & Bowen Zhou: Flexible and efficient hypergarph interactions for joint hierarchical and forest-to-string decoding. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.545-555. [PDF, 195KB]

(2013) Petra Galuščáková, Martin Popel & Ondřej Bojar: PhraseFix: statistical post-editing of TectoMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.141-147. [PDF, 157KB]

(2013) Shuo Li, Derek F.Wong & Lidia S.Chao: Experiments with POS-based restructuring and alignment-based re-ordering for statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.82-87. [PDF, 273KB]

(2013) Fandong Meng, Jun Xie, Linfeng Song, Yajuan Lü, & Qun Liu: Translation with source constituency and dependency trees. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1066-1076. [PDF, 341KB]

(2013) Graham Neubig: Travatar: a forest-to-string machine translation engine based on tree transducers. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.91-96. [PDF, 344KB]

(2013) Michal Novák, Zdeněk Žabokrtský & Anna Nedoluzhko: Two case studies on translating pronouns in a deep syntax framework. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1037-1041. [PDF, 348KB]

(2013) Francisco Oliveira, Derek F.Wong, Lidia S.Chao, Liang Tian, & Liangye He: Influence of part-of-speech and phrasal category universal tag-set in tree-to-tree translation models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.907-911. [PDF, 127KB]

(2013) Oscar Täckström, Ryan McDonald, & Joakim Nivre: Target language adaptation of discriminative transfer parsers. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1061-1071. [PDF, 258KB]

(2013) Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura, & Manabu Okumura: Part-of-speech induction in dependency trees for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.841-851. [PDF, 769KB]

(2013) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Handling ambiguities of bilingual predicate-argument structures for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1127-1136. [PDF, 269KB]

(2012) Karteek Addanki, Chi-kui Lo, Markus Saers, & Dekai Wu: LTG vs. ITG coverage of cross-lingual verb frame alternations. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.295-302. [PDF, 283KB]

(2012) Phil Blunsom: Synchronous grammars and translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 45 slides [PDF of PPT, 456KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: EBMT system of Kyoto University in OLYMPICS task at IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.96-101. [PDF, 771KB]

(2012) Jacob Dlougach & Irina Galinskaya: Building a reordering system using tree-to-string hierachical model. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.27-36. [PDF, 1115KB]

(2012) Greg Durrett, Adam Pauls, & Dan Klein: Syntactic transfer using a bilingual lexicon. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1-11. [PDF, 286KB]

(2012) Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, & David Mareček: Formemes in English-Czech deep syntactic MT. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.267-274. [PDF, 316KB]

(2012) Yang Feng, Dongdong Zhang, Mu Li, Ming Zhou, & Qun Liu: Hierarchical chunk-to-string translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.950-958. [PDF, 152KB]

(2012) Michael Gasser: Toward a rule-based system for English-Amharic translation. [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.41-46. [PDF, 319KB]

(2012) Ryan Georgi, Fei Xia, & William D.Lewis: Measuring the divergence of dependency structures cross-linguistically to improve syntactic projection algorithms.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.771-778. [PDF, 1013KB]

(2012) Daniel Gildea: On the string translations produced by multi bottom-up tree transducers. Computational Linguistics 38 (3); pp.673-693. [PDF, 426KB]

(2012) Andreas Kirkedal: Tree-based hybrid machine translation.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.77-86. [PDF, 260KB]

(2012) Daniel Quernheim & Kevin Knight: Towards probabilistic acceptors and transducers for feature structures. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.76-85. [PDF, 180KB]

(2012) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: A comparative study of target dependency structures for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.100-104. [PDF, 750KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Machine translation by modelling predicate-argument structure transformation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3019-3036. [PDF, 1473KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Tree-based translation without using parse trees. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3037-3054. [PDF, 1800KB]

(2012) Hui Zhang & David Chiang: An exploration of forest-to-string translation: does translation help or hurt parsing? [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.317-321. [PDF, 49KB]

(2012) Jingbo Zhu, Tong Xiao, & Chunliang Zhang: Learning better rule extraction with translation span alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.280-284. [PDF, 235KB]

(2011) Daniel Emilio Beck: Syntax-based statistical machine translation using tree automata and tree transducers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.36-40. [PDF, 124KB]

(2011) Aswarth Dara, Prashanth Mannem, Hemanth Sagar Bayyarapu & Avinesh PVS: Transferring syntactic relations from English to Hindi using alignments on local word groups. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.447-455. [PDF, 188KB]

(2011) Chenchen Ding, Takashi Inui, & Mikio Yamamoto: Long-distance hierarchical structure transformation rules utilizing function words. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.159-166. [PDF, 215KB]

(2011) Marcello Federico: Aspects of tree-based statistical machine translation. Lecture at: Sixth Machine Translation Marathon, 5-10 September, Trento, Italy; 66 slides [PDF of PPT, 2112KB]

(2011) Wenbin Jiang, Qun Liu, & Yajuan Lü: Relaxed cross-lingual projection of constituent syntax. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1192-1201. [PDF, 278KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

(2011) Yang Liu, Qun Liu, & Yajuan Lü: Adjoining tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1278-1287. [PDF, 159KB]

(2011) Jan-Thorsten Peter, Matthias Huck, Hermann Ney, & Daniel Stein: Soft string-to-dependency hierarchical machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.246-253.[PDF, 249KB]

(2011) Victor M.Sánchez-Cartagnena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortiz: Enriching a statistical machine translation system trained on small parallel corpora with rule-based bilingual phrases. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.90-96. [PDF, 125KB]

(2011) Víctor M.Sánchez-Cartagena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortíz: Integrating shallow-transfer rules into phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.562-569. [PDF, 106KB]

(2011) V.Shukla & R.M.K.Sinha: Divergence patterns for Urdu to English and English to Urdu translation. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.21-28. [PDF, 773KB]

(2011) Ashish Vaswani, Haitao Mi, Liang Huang & David Chiang: Rule Markov models for fast tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.856-864. [PDF, 122KB]

(2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

(2011) Hao Zhang, Licheng Fang, Peng Xu, & Xiaoyun Wu: Binarized forest to string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.835-845. [PDF, 212KB]

(2011) Jingbo Zhu & Tong Xiao: Improving decoding generalization for tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.418-423. [PDF, 311KB]

(2010) David Chiang: Learning to translate with source and target syntax. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1443-1452. [PDF, 357KB]

(2010) Martin Čmejrek & Bowen Zhou: Two methods for extending hierarchical rules from the bilingual chart parsing. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.180-188. [PDF, 200KB]

(2010) Sobha Lalitha Devi, Vijay Sundar Ram R, Bagyavathi T, & Praveen Pralayankar: Syntactic structure transfer in a Tamil-Hindi MT system – a hybrid approach [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Automatic determination of number of clusters for creating templates in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 582KB]; presentation: 37 slides [PDF, 496KB]

(2010) Yvette Graham: Sulis: an open source transfer decoder for deep syntactical statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.17-26. [PDF, 705KB]

(2010) Yvette Graham & Josef van Genabith: Factor templates for factored machine translation models. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.275-282. [PDF, 366KB]

(2010) Hieu Hoang & Philipp Koehn: Improved translation with source syntax labels. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 409-417. [PDF, 1079KB]

(2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

(2010) Marcin Junczys-Dowmunt: A maximum entropy approach to syntactic translation rule filtering [abstract].  CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

(2010) Maxim Khalilov & Khalil Sima’an: A discriminative syntactic model for source permutation via tree transduction. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.92-100. [PDF, 760KB]

(2010) Maxim Khalilov & Khalil Sima’an: The ILLC-UvA SMT system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.197-203. [PDF, 1209KB]

(2010) Young-Suk Lee, Bing Zhao, & Xiaoqiang Luo: Constituent reordering and syntax models for English-to-Japanese statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.626-634. [PDF, 214KB]

(2010) Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Chinese syntactic reordering through contrastive analysis of predicate-predicate patterns in Chinese-to-Korean SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 817KB]

(2010) Jin-Ji Li, Jungi Kim & Jong-Hyeok Lee: Transferring syntactic relations of subject-verb-object pattern in Chinese-to-Korean SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 774KB]

(2010) Ding Liu & Daniel Gildea: Semantic role features for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.716-724. [PDF, 388KB]

(2010) Yang Liu & Qun Liu: Joint parsing and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.707-715. [PDF, 211KB]

(2010) Gideon Maillette de Buy Wenniger, Maxim Khalilov, & Khalil Sima’an: A toolkit for visualizing the coherence of tree-based reordering with word-alignments. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.97-106. [PDF, 138KB]; presentation [PDF, 762KB]

 (2010) Andreas Maletti & Giorgio Satta: Parsing and translation algorithms based on weighted extended tree transducers. Proceedings of the 2010 Workshop on Applications of Tree Automata in Natural Language Processing, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.19-27. [PDF, 190KB]

(2010) Andreas Maletti: A tree transducer model for synchronous tree-adjoing grammars. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1067-1076. [PDF, 170KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, Tsutomu Hirao, & Masaaki Nagata: Divide and translate: improving long distance reordering in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 418-427. [PDF, 569KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Discriminative induction of sub-tree alignment using limited labeled data. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1047-1055. [PDF, 370KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Exploring syntactic structural features for sub-tree alignment using bilingual tree kernels. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.306-315. [PDF, 286KB]

(2010) Vincent Vandeghinste & Scott Martens: Bottom-up transfer in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 943KB]; presentation: 26 slides [PDF, 771KB]

(2010) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Fine-grained tree-to-string translation rule extraction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.325-334. [PDF, 766KB]

 (2010) Hui Zhang, Min Zhang, Haizhou Li, & Eng Siong Chang: Non-isomorphic forest pair translation. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.440-450. [PDF, 307KB]

(2010) Zhemin Zhu, Delphine Bernhard, & Iryna Gurevych: A monolingual tree-based translation model for sentence simplification. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1353-1361. [PDF, 260KB]

Tagging (see also Annotation and features, Segmentation)

(2014) Yves Scherrer & Benoît Sagot: A language-indendent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2013) Eunah Cho, Thanh-Le Ha, Mohammed Mediani, Jan Niehues, Teresa Herrmann, Isabel Slawik & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2013. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.104-108. [PDF, 107KB]

(2013) Minwei Feng, Jan-Thorsten Peter & Hermann Ney: Advancements in reordering models for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.322-332. [PDF, 383KB]

(2013) Dan Han, Pascual Martínez-Gómez, Yusuke Miyao, Katsuhito Sudoh, & Masaaki Nagata: Using unlabeled dependency parsing for pre-ordering for Chinese-to-Japanese statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 25-33. [PDF, 334KB]

(2013) Eric Joanis, Darlene Stewart, Samuel Larkin & Roland Kuhn: Transferring markup tags in statistical machine translation: a two-stream approach.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 73-81. [PDF, 599KB]

(2013) Francisco Oliveira, Derek F.Wong, Lidia S.Chao, Liang Tian, & Liangye He: Influence of part-of-speech and phrasal category universal tag-set in tree-to-tree translation models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.907-911. [PDF, 127KB]

(2013) Celia Rico, Pedro L.Díez Orzas, & Felix Sasaki: Implementing ITS 2.0 for post-editing purposes. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.329-336. [PDF, 443KB]

(2013) Nathan Schneider, Behrang Mohit, Chris Dyer, Kemal Oflazer, & Noah A.Smith: Supersense tagging for Arabic: the MT-in-the-middle attack.  [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.661-667. [PDF, 143KB]

(2013) Oscar Täckström, Dipanjan Das, Slav Petrov, Ryan McDonald, & Joakim Nivre: Token and type constraints for cross-lingual part-of-speech tagging. Transactions of the Association for Computational Linguistics 1 (2013); pp.1-12 [PDF, 3217KB]

(2013) Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura, & Manabu Okumura: Part-of-speech induction in dependency trees for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.841-851. [PDF, 769KB]

(2013) Mengqiu Wang, Wanxiang Che, & Christopher D.Manning: Joint word alignment and bilingual named entity recognition using dual decomposition. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1073-1082. [PDF, 424KB]

(2012) Renu Balyan, Sudip Kumar Naskar, Antonio Toral, & Niladri Chatterjee: A diagnostic evaluation approach targeting MT systems for Indian languages. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 61-71. [PDF, 359KB]

(2012) Odbayar Chimeddorj & Atsushi Fujii: Enhancing lemmatization for Monglian and its application to statistical machine translation. COLING 2012: Proceedings of the 10th Workshop on Asian Language Resources, Mumbai, December 2012; pp.115-123. [PDF, 675KB]

(2012) Minwei Feng & Hermann Ney: A tagging-style reordering model for phrase-based SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.17-26. [PDF, 140KB]

(2012) Sungchul Kim, Kristina Toutanova, & Hwanjo Yu: Multilingual named entitity recognition using parallel data and metadata from Wikipedia. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.694-702. [PDF, 417KB]

(2012) Emad Mohamed: Morphological segmentation and part of speech tagging for religious Arabic. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.65-71. [PDF, 708KB]

(2012) Dávid Márk Nemeskey & Eszter Simon: Automatically generated NE tagged corpora for Englsih and Hungarian. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.38-46. [PDF, 110KB]

(2012) Maja Popović: Morpheme and POS-based IBM1 scores and language model scores for translation quality estimation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.133-137. [PDF, 110KB]

(2012) Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann Roturier, Rasul Samad Zadeh Kaljahi, & Fred Hollowood: DCU-Symantec submission for the WMT 2012 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.138-144. [PDF, 125KB]

(2012) Sara Stymne: Clustered word classes for preordering in statistical machine translation. EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP,  23-27 April 2012, Avignon, France; pp. 28-34. [PDF, 108KB]

(2011) Dipanjan Das & Slav Petrov: Unsupervised part-of-speech tagging with bilingual graph-based projections. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.600-609. [PDF, 225KB]

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2011) Matouš Macháček & Ondřej Bojar: Approximating a deep-syntactic metric for MT evaluation and tuning. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.92-98. [PDF, 130KB]

(2011) Christof Monz: Statistical machine translation with local language models. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.869-879. [PDF, 407KB]

(2011) Jan Niehues, Teresa Herrmann, Stephan Vogel, & Alex Waibel: Wider context by using bilingual langauge models in machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.198-206. [PDF, 570KB]

(2011) MajaPopović: Morphemes and POS tags for n-gram based evaluation metrics. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.104-107. [PDF, 68KB]

(2011) Siva Reddy & Serge Sharoff: Cross-language POS taggers (and other tools) for Indian languages: an experiment with Kannada using Telugu resources. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.11-19. [PDF, 214KB]

(2010) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, & Christine Piatko: A modality lexicon and its use in automatic tagging. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1402-1407. [PDF, 375KB]

(2010) Guy De Pauw, Naomi Maajabu, & Peter Waiganjo Wagacha: A knowledge-light approach to Luo machine translation and part-of-speech tagging. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.15-20. [PDF, 141KB]

(2010) Jae-Hee Lee, Seung-Wook Lee, Gumwon Hong, Young-Sook Hwang, Sang-Bum Kim, & Hae-Chang Rim: A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.623-629. [PDF, 438KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Mohamed Outahajala, Lahbib Zekouar, Paolo Rosso, & M.Antònia Martí: Tagging Amazigh with AnCoraPipe. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 52-56. [PDF, 326KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

Text alignment see Alignment

Text categorization see Domain identification

Training see Learning and training

Tree transduction see Syntactic transfer

Word alignment

(2015) Oliver Adams, Graham Neubig, Trevor Cohn & Steven Bird: Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.248-255. [PDF, 2.8MB]

(2015) Takahiro Nomura, Hajime Tsukada, & Tomoyoshi Akiba: Improvement of word alignment models for Vietnamese-to-English translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.112-116. [PDF, 1.3MB]

(2015) Lane Schwartz, Isabel Lacruz, & Tatyana Bystrova: Effects of word alignment visualization on post-editing quality & speed. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.186-199. [PDF, 560KB]

(2014) Timur Gilmanov, Olga Scrivner, & Sandra Kübler: SWIFT aligner, a multifunctional tool for parallel corpora: visualization, word alignment, and (morpho)-syntactic cross-language transfer. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2913-2919. [PDF, 284KB]

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: Word alignment-based reordering of source chunks in PB-SMT.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3565-3571. [PDF, 185KB]

(2014) Sara Stymne, Jörg Tiedemann, & Joakim Nivre: Estimating word alignment quality for SMT reordering tasks. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.275-286. [PDF, 344KB]

(2013) Marianna Apidianaki: LIMSI: cross-lingual word sense disambiguation using translation sense clustering. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.178-182. [PDF, 113KB]

(2013) Fabienne Cap, Marion Weller, & Ulrich Heid: Using a rich feature set for the identification of German MWEs. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.34-42. [PDF, 308KB]

(2013) Victor Chahuneau, Noah A.Smith, & Chris Dyer: Knowledge-rich morphological priors for Bayesian language models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1206-1215. [PDF, 208KB]

(2013) Elif Eyigöz, Daniel Gildea & Kemal Oflazer: Multi-rate HMMs for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.494-502. [PDF, 570KB]

(2013) Elif Eyigöz, Daniel Gildea, & Kemal Oflazer: Simultaneous word-morpheme alignment for statistical machine translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.32-40. [PDF, 259KB]

(2013) Manaal Faruqui & Chris Dyer: An information theoretic approach to bilingual word clustering.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.777-783. [PDF, 263KB]

(2013) Yang Feng & Trevor Cohn: A Markov model of machine translation using non-parametric Bayesian inference. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.333-342. [PDF, 312KB]

(2013) Yarin Gal & Phil Blunsom: A systematic Bayesian treatment of the IBM alignment models. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.969-977. [PDF, 290KB]

(2013) Yvette Graham, Bahar Salehi, & Timothy Baldwin: Umelb: cross-lingual textual entailment with word alignment and string similarity features. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.133-137. [PDF, 91KB]

(2013) Rima Harastani, Béatrice Daille & Emmanuel Morin: Ranking translation candidates acquired from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.401-409. [PDF, 276KB]

(2013) Shuhei Kondo, Kevin Duh & Yuji Matsumoto: Hidden Markov tree model for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.503-511. [PDF, 531KB]

(2013) Hong-Seok Kwon, Hyeong-Won Seo, & Jae-Hoon Kim: Bilingual lexicon extraction via pivot language and word alignment tool.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.11-15. [PDF, 666KB]

(2013) Shuo Li, Derek F.Wong & Lidia S.Chao: Experiments with POS-based restructuring and alignment-based re-ordering for statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.82-87. [PDF, 273KB]

(2013) Qun Liu, Zhaopeng Tu, & Shouxun Lin: A novel graph-based compact representation of word alignment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.358-363. [PDF, 211KB]

(2013) Xiaodong Liu, Kevin Duh, & Yuji Matsumoto: Topic models + word alignment = a flexible framework for extracting bilingual dictionary from comparable corpus. Proceedings of the Seventeenth Conference on Computational Natural Language Learning, Sofia, Bulgaria, 8-9 August 2013; pp.212-221. [PDF, 487KB]

(2013) Gideon Maillette de Buy Wenniger & Khalil Sima’an: A formal characterization of parsing word alignments by synchronous grammars with empirical evidence to the ITG hypothesis. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 58-67. [PDF, 150KB]

(2013) Gideon Maillette de Buy Wenniger & Khalil Sima’an: Hierarchical alignment decomposition labels for Hiero grammar rules. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 19-28. [PDF, 128KB]

(2013) Michael Matuschek, Christian M.Meyer, & Iryna Gurevych: Multilingual knowledge in aligned Wiktionary and OmegaWiki for translation applications. Translation: Computation, Corpora,  Cognition 3 (1), June 2013; pp.87-118. [PDF, 2898KB]

(2013) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A hybrid word alignment model for phrase-based statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.94-101. [PDF, 297KB]

(2013) Martha Alicia Rocha & Joan Andreu Sánchez: Towards the supervised machine translation: real word alignments and translations in a multi-task active learning process. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.239-247. [PDF, 632KB]

(2013) Thomas Schoenemann: Training nondeficient variants of IBM-3 and IBM-4 for word alignment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.22-31. [PDF, 300KB]

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

(2013) José G.C.de Souza, Miquel Esplà-Gomis, Marco Turchi, & Matteo Negri: Exploiting qualitative information from automatic word alignment for cross-lingual NLP tasks.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.771-776. [PDF, 277KB]

(2013) Oscar Täckström, Dipanjan Das, Slav Petrov, Ryan McDonald, & Joakim Nivre: Token and type constraints for cross-lingual part-of-speech tagging. Transactions of the Association for Computational Linguistics 1 (2013); pp.1-12 [PDF, 3217KB]

(2013) Liling Tan & Francis Bond: XLING: matching query sentences to a parallel corpus using topic models for word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.167-170. [PDF, 176KB]

(2013) Marco Turchi & Matteo Negri: ALTN: word alignment features for cross-lingual textual entailment. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.128-132. [PDF, 386KB]

(2013) Karthik Visweswariah, Mitesh M.Khapra, & Ananthakrishnan Ramanathan: Cut the noise: mutually reinforcing reordering and alignments for improved machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1275-1284. [PDF, 250KB]

(2013) Mengqiu Wang, Wanxiang Che, & Christopher D.Manning: Joint word alignment and bilingual named entity recognition using dual decomposition. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1073-1082. [PDF, 424KB]

(2013) Bing Xiang, Xiaoqiang Luo, & Bowen Zhou: Enlisting the ghost: modeling empty categories for machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.822-831. [PDF, 294KB]

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2013) Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, & Peter Clark: A lightweight and high performance monolingual word aligner.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.702-707. [PDF, 231KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2013) Will Y.Zou, Richard Socher, Daniel Cer, & Christopher D.Manning: Bilingual word embeddings for phrase-based machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1393-1398. [PDF, 337KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

(2012) M.Talha Çakmak, Süleyman Acar, & Gülşen Eryiğit: Word alignment for English-Turkish language pair.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2177-2180. [PDF, 577KB]

(2012) İlknur Durgar El-Kahlout: Recent improvements in Turkish-English machine translation [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.11-13. [PDF]

(2012) Douwe Gelling & Trevor Cohn: Using senses in HMM word alignment. NAACL-HLT Workshop on the Induction of Linguistic Structure, Montréal, Canada, June 3-8, 2012; pp.39-46. [PDF, 156KB]

(2012) Stephen Grimes, Katherine Peterson, & Xuansong Li: Automatic word alignment tools to scale production of manually aligned parallel texts.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2194-2198. [PDF, 259KB]

(2012) Quoc Hung-Ngo & Werner Winiwarter: A visualizing annotation tool for semi-automatically building a bilingual corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.67-74. [PDF, 820KB]

(2012) Marcin Junczys-Dowmunt: A phrase table without phrases: rank encoding for better phrase table compression. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.245-252. [PDF, 240KB]

(2012) Marcin Junczys-Dowmunt: Phrasal rank-encoding: exploiting phrase redundancy and translational relations for phrase table compression. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.63-74. [PDF, 169KB]; presentation, 75 slides [PDF of PPT, 1126KB]

(2012) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Report of the shared task on learning reordering from word alignments at RSMT 2012. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.9-15. [PDF, 328KB]

(2012) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Whitepaper for shared task on learning reordering from word alignments at RSMT 2012. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.1-8. [PDF, 326KB]

(2012) Anoop Kunchukuttan & Pushpak Bhattacharyya: Partially modelling word reordering as a sequence labelling problem. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.47-54. [PDF, 123KB]

(2012) Patrik Lambert & Rafael E.Banchs: BIA: a discriminative phrase alignment toolkit. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.43-53. [PDF, 136KB]

(2012) Septina Dian Larasati: Improving word alignment by exploiting adapted word similarity. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.41-45. [PDF, 227KB]

(2012) Adrien Lardilleux, François Yvon, & Yves Lepage: Hierarchical sub-sentential alignment with Anymalign. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.279-286. [PDF, 173KB]

(2012) Dawn Lawrie, James Mayfield, Paul McNamee, & Douglas W.Oard: Creating and curating a cross-language person-entity linking collection.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3106-3110. [PDF, 312KB]

(2012) Peng Li, Yang Liu, & Maosung Sun: A beam search algorithm for ITG word alignment. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.673-682. [PDF, 471KB]

(2012) Toshiaki Nakazawa & Sadao Kurohashi: Alignment by bilingual generation and monolingual derivation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1963-1978. [PDF, 1172KB]

(2012) Preslav Nakov & Jörg Tiedemann: Combining word-level and character-level models for machine translation between closely-related languages. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.301-305. [PDF, 122KB]

(2012) Tsuyoshi Okita: Annotated corpora for word alignment between Japanese and English and its evaluation with MAP-based word aligner.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3241-3248. [PDF, 522KB]

(2012) Reinhard Rapp, Serge Sharoff, & Bogdan Babych: Identifying word translations from comparable documents without a seed lexicon.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.460-466. [PDF, 329KB]

(2012) Darcey Riley & Daniel Gildea: Improving the IBM alignment models using variational Bayes. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.306-310. [PDF, 103KB]

(2012) Houda Saadane, Ouafa Benterki, Nasredine Semmar, & Christian Fluhr: Using Arabic transliteration to improve word alignment from French-Arabic parallel corpora. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.38-46. [PDF,

(2012) Antonio Toral, Marc Poch, Pavel Pecina, & Gregor Thurmair: Efficiency-based evaluation of aligners for industrial applications. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.57-60. [PDF, 210KB]

(2012) Zhaopeng Tu, Yang Liu, Yifan He, Josef van Genabith, Qun Liu, & Shouxun Lin: Combining multiple alignments to improve machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1249-1260. [PDF, 178KB]

(2012) Ashish Vaswani, Liang Huang, & David Chiang: Smaller alignment models for better translations: unsupervised word alignment with the l0-norm. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.311-319. [PDF, 127KB]

(2012) Ivan Vulić & Marie-Francine Moens: Detecting highly confident word translations from comparable corpora without any prior knowledge. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 449-459. [PDF, 339KB]

(2012) Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang, & Jiajun Chen: Enhancing statistical machine translation with character alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.285-290. [PDF, 171KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Tree-based translation without using parse trees. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3037-3054. [PDF, 1800KB]

(2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2011) Aswarth Dara, Prashanth Mannem, Hemanth Sagar Bayyarapu & Avinesh PVS: Transferring syntactic relations from English to Hindi using alignments on local word groups. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.447-455. [PDF, 188KB]

(2011) John DeNero & Klaus Macherey: Model-based aligner combination using dual decomposition. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.420-429. [PDF, 315KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Kevin Duh, Katsuhito Sudoh, Tomoharu Iwata, & Hajime Tsukada: Alignment inference and Bayesian adaptation for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.114-121. [PDF, 117KB]

(2011) Chris Dyer, Jonathan Clark, Alon Lavie, & Noah A.Smith: Unsupervised word alignment with arbitary features. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.201-210. [PDF, 303KB]

(2011) Miquel Esplà, Felipe Sánchez-Martínez, & Mikel L.Forcada: Using word alignments to assist computer-aided translation users by marking which target-side words to change or keep unedited. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.81-88. [PDF, 381KB]; presentation, 41 slides [PDF]

(2011) Qin Gao, Will Lewis, Chris Quirk, & Mei-Yuh Hwang: Incremental training and intentional over-fitting of word alignment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.106-113. [PDF, 157KB]

(2011) Kevin Gimpel & Noah A.Smith: Generative models of monolingual and bilingual gappy patterns. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.512-522. [PDF, 379KB]

(2011) Teresa Herrmann, Mohammed Mediani, Jan Niehues, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.379-385. [PDF, 106KB]

(2011) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Experiments with word alignment, normalization and clause reordering for SMT between English and German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.393-398. [PDF, 102KB]

(2011) Shujian Huang, Stephan Vogel, & Jiajun Chen: Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.379-383. [PDF, 182KB]

(2011) Patrik Lambert & Rafael Banchs: BIA: a discriminative phrase alignment toolkit. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; 23 slides [PDF of PPT, 288KB]

(2011) Li Zezhong, Hideto Ikeda, & Nguyen Thanh Hung: Exploiting multi-features for word alignment in patent MT. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.86-90. [PDF, 238KB]

(2011) Hui Lin & Jeff Bilmes: Word alignment via submodular maximization over matroids. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.170-175. [PDF, 159KB]

(2011) J.Scott McCarley, Abraham Ittycheriah, Salim Roukos, Bing Xiang, & Jian-ming Xu: A correction model for word alignments.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.889-898. [PDF, 236KB]

(2011) Coşkun Mermer & Murat Saraçlar: Bayesian word alignment for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.182-187. [PDF, 140KB]

(2011) Adam Meyers, Michiko Kosaka, Shasha Liao, & Nianwen Xue: Improving MT word alignment using aligned multi-stage parses. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.88-97. [PDF, 174KB]

(2011) Mirabela Navlea & Amalia Todiraşcu: Cognate identification for a French-Romanian lexical alignment system: empirical study. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.145-152. [PDF, 271KB]

(2011) Tsuyoshi Okita & Andy Way: MWE-sensitive word alignment in factored translation model. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 42KB]; presentation: 51 slides [PDF of PPT, 650KB]

(2011) Jason Riesa, Ann Irvine, & Daniel Marcu: Feature-rich language-independent syntax-based alignment for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.497-507. [PDF, 1053KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

(2011) Thomas Schoenemann: Probabilistic word alignment under the L0-norm. CoNLL-2011: Proceedings of the Fifteenth Conference on Computational Natural Language Learning, Portland, Oregon, USA, 23-24 June 2011; pp.172-180. [PDF, 165KB]

(2011) Thomas Schoenemann: Regularizing mono- and bi-word models for word alignment. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1313-1322. [PDF, 286KB]

(2011) Liang Tian, Fai Wong, & Sam Chao: Word alignment using GIZA++ on Windows. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.369-372. [PDF, 192KB]

(2011) Nadi Tomeh, Alexandre Allauzen, & François Yvon: Discriminative weighted alignment matrices for statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.305-312. [PDF, 413KB]; presentation, 38 slides [PDF]

(2011) Zhaopeng Tu, Yang Liu, Qun Liu, & Shouxun Lin: Extracting hierarchical rules from a weighted alignment matrix. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1294-1303. [PDF, 238KB]

(2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

(2011) Zhiyang Wang, Yajuan Lü, & Qun Liu: Multi-granularity word alignment and decoding for agglutinative language translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.360-367. [PDF, 169KB]

(2011) Shumin Wu & Martha Palmer: Semantic mapping using automatic word alignment and semantic role labeling. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.21-30. [PDF, 200KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2011) Ning Xi, Guangchao Tang, Boyuan Li, & Yinggong Zhao: Word alignment combination over multiple word segmentation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.1-5. [PDF, 329KB]

(2011) Jinxi Xu & Jinying Chen: How much can we gain from supervised word alignment?  ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.165-169. [PDF, 71KB]

(2010) Vamshi Ambati, Stephan Vogel & Jaime Carbonell: Active learning-based elicitation for semi-supervised word alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.365-370. [PDF, 182KB]

(2010) Daniel Andrade, Tetsuya Nasukawa, & Jun’ichi Tsujii: Robust measurement and comparison of context similarity for finding translation pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.19-27. [PDF, 709KB]

(2010) Alexandra Birch & Philipp Koehn: Statistical machine translation: IBM models and word alignment. Presentation at Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 26 January, Dublin, Ireland; 40 slides. [PDF of PPT, 172KB]

 (2010) Samuel Brody: Unsupervised dependency parsing via word alignment. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1214-1222. [PDF, 344KB]

(2010) Hailong Cao, Andrew Finch, & Eiichiro Sumita: Syntactic constraints on phrase extraction for phrase-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.28-33. [PDF, 84KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Improving Arabic-to-English statistical machine translation by reordering post-verbal subjects for alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.178-183. [PDF, 108KB]

(2010) Hercules Dalianis, Hao-chun Xing, & Xin Zhang: Creating a reusable English-Chinese parallel corpus for bilingual dictionary construction. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1700-1705. [PDF, 409KB]

(2010) Qin Gao & Stephan Vogel: Consensus versus expertise: a case study of word alignment with Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.30-34. [PDF, 389KB]

(2010) Qin Gao, Francisco Guzman, & Stephan Vogel: EMDC: a semi-supervised approach for word alignment. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.349-357. [PDF, 231KB]

(2010) Qin Gao, Nguyen Bach, & Stephan Vogel: A semi-supervised word alignment algorithm with partial manual alignments. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 1-10. [PDF, 278KB]

 (2010) Adrià de Gispert, Juan Pino, & William Byrne: Hierarchical phrase-based translation grammars extracted from alignment posterior probabilities.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.545-554. [PDF, 162KB]

(2010) João V.Graça, Kuzman Ganchev, & Ben Taskar: Learning tractable word alignment models with complex constraints. Computational Linguistics 36 (3), pp. 481-504 [PDF, 675KB]

(2010) Stephen Grimes, Xuansong Li, Ann Bies, Seth Kulick, Xiaoyi Ma, & Stephanie Strassel: Creating Arabic-English parallel word-aligned treebank corpora at LDC. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp.102-107. [PDF, 747KB]

(2010) Maria Holmqvist: Heuristic word alignment with parallel phrases. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.744-748. [PDF, 307KB]

(2010) Shujian Huang, Kangxi Li, Xinyu Dai, & Jianjun Chen: Improving word alignment by semi-supervised ensemble. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.135-143. [PDF, 585KB]

(2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Arabic morpho-syntactic feature disambiguation in a translation context. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.61-65. [PDF, 153KB]

(2010) Patrik Lambert: Statistical machine translation: IBM models and word alignment. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 40 slides. [PDF, 230KB]

(2010) Adrien Lardilleux, Julien Gosme, & Yves Lepage: Bilingual lexicon induction: effortless evaluation of word alignment tools and production of resources for improbable language pairs. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.252-256. [PDF, 330KB]

(2010) Jae-Hee Lee, Seung-Wook Lee, Gumwon Hong, Young-Sook Hwang, Sang-Bum Kim, & Hae-Chang Rim: A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.623-629. [PDF, 438KB]

 (2010) Lianhau Lee, Aiti Aw, Min Zhang, & Haizhou Li: EM-based hybrid model for bilingual terminology extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.639-646. [PDF, 114KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M.Strassel, & Kazuaki Maeda: Enriching word alignment with linguistic tags. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2189-2195. [PDF, 509KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Yang Liu, Qun Liu, & Shouxun Lin: Discriminative word alignment by linear modeling. Computational Linguistics 36 (3), pp. 303-339 [PDF, 327KB]

(2010) Zhanyi Liu, Haifeng Wang, Hua Wu, & Sheng Li: Improving statistical machine translation with monolingual collocation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.825-833. [PDF, 405KB]

(2010) Yanjun Ma: Constrained word alignment models for statistical machine translation. Presentation of Best Thesis Award for 2009 at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 16 slides [PDF of PPT presentation, 325KB]

(2010) Lieve Macken: An annotation scheme and gold standard for Dutch-English word alignment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3369-3374. [PDF, 379KB]

 (2010) Lieve Macken & Walter Daelemans: A chunk-driven bootstrapping approach to extracting translation patterns [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Gideon Maillette de Buy Wenniger, Maxim Khalilov, & Khalil Sima’an: A toolkit for visualizing the coherence of tree-based reordering with word-alignments. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.97-106. [PDF, 138KB]; presentation [PDF, 762KB]

(2010) Pascual Martínez Gómez, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, & Germán Sanchis-Trilles: A deterministic annealing-based training algorithm for statistical machine translation models.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 712KB]

(2010) ThuyLinh Nguyen, Stephan Vogel, & Noah A.Smith: Nonparametric word segmentation for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.815-823. [PDF, 267KB]

(2010) Jan Niehues, Teresa Herrmann, Mohammed Mediani, & Alex Waibel: The Karlsruhe Institute for Technology translation system for the ACL-WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 138-142. [PDF, 87KB]

(2010) Tsuyoshi Okita, Alfredo Maldonado Guerra, Yvette Graham, & Andy Way: Multi-word expression-sensitive word alignment. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.26-34. [PDF, 239KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Sujith Ravi & Kevin Knight: Does GIZA++ make search errors?  Computational Linguistics 36 (3), pp. 295-302 [PDF, 616KB]

(2010) Gábor Recski, András Rung, Attila Zséder, & András Kornai: NP alignment in bilingual corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3379-3382. [PDF, 305KB]

(2010) Jason Riesa & Daniel Marcu: Hierarchical search for word alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.157-166. [PDF, 413KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: A systematic comparison between inversion transduction grammar and linear transduction grammar for word alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.10-18. [PDF, 279KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Word alignment with stochastic bracketing linear inversion transduction grammar.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.341-344. [PDF, 97KB]

 (2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

(2010) Nasredine Semmar & Laib Meriama: Using a hybrid word alignment approach for automatic construction and updating of Arabic to French lexicons. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 114-119. [PDF, 309KB]

 (2010) Hendra Setiawan, Chris Dyer, & Philip Resnik: Discriminative word alignment with a function word reordering model.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.534-544. [PDF, 200KB]

 (2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

(2010) Hiroyuki Shindo, Akinori Fujino, & Masaaki Nagata: Word alignment with synonym regularization. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.137-141. [PDF, 492KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, & Qun Liu: Learning lexicalized reordering models from reordering graphs. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.12-16. [PDF, 1365KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Discriminative induction of sub-tree alignment using limited labeled data. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1047-1055. [PDF, 370KB]

(2010) Nadi Tomeh, Alexandre Allauzen, Guillaume Wisniewski & François Yvon: Refining word alignment with discriminative training. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 203KB]

(2010) Shumin Wu, Jinho D.Choi, & Martha Palmer: Detecting cross-lingual semantic similarity using parallel PropBanks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 274KB]

(2010) Bing Xiang, Yonggang Deng, & Bowen Zhou: Diversify and combine: improving word alignment for machine translation on low-resource languages. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.22-26. [PDF, 86KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Learning translation boundaries for phrase-based decoding. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.136-144. [PDF, 450KB]

 (2010) Jinxi Xu & Antti-Veikko I.Rosti: Combining unsupervised and supervised alignments for MT: an empirical study.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.667-673. [PDF, 74KB]

 (2010) Shaojun Zhao & Daniel Gildea: A fast fertility Hidden Markov model for word alignment using MCMC.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.596-605. [PDF, 163KB]

Word graphs see Graphs

Word sense disambiguation

(2015) Yuanmei Lu, Toshiaki Nakazawa, & Sadao Kurohashi: Korean-Chinese word translation using Chinese character knowledge. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.256-269. [PDF, 589KB]

(2013) Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Rohit Kumar, Enoch Kan, Rohit Prasad, & Prem Natarjan: Semi-supervised word sense disambiguation for mixed-initiative conversational spoken language translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 43-50. [PDF, 536KB]

(2013) Marianna Apidianaki, Nikola Ljubešić, & Darja Fišer: Cross-lingual WSD for translation extraction from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.1-10. [PDF, 388KB]

(2013) Marianna Apidianaki: LIMSI: cross-lingual word sense disambiguation using translation sense clustering. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.178-182. [PDF, 113KB]

(2013) Wilker Aziz & Lucia Specia: Multilingual WSD-like constraints for paraphrase extraction. Proceedings of the Seventeenth Conference on Computational Natural Language Learning, Sofia, Bulgaria, 8-9 August 2013; pp.202-211. [PDF, 248KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using word sense disambiguation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.952-956. [PDF, 117KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Context vector disambiguation for bilingual lexicon extraction from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.759-764. [PDF, 199KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Towards a generic approach for bilingual lexicon extraction from comparable corpora. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 143-150. [PDF, 531KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Using WordNet and semantic similarity for bilingual terminology mining from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.16-23. [PDF, 628KB]

(2013) Lars Bungum, Björn Gambäck, André Lynum, & Erwin Marsi: Improving word translation disambiguation by capturing multiword expressions with dictionaries. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, June 13-14, 2013; pp.21-30. [PDF, 155KB]

(2013) Marine Carpuat: NRC: a machine translation approach to cross-lingual word sense disambiguation (SemEval-2013 Task 10). [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.188-192. [PDF, 100KB]

(2013) Marine Carpuat: A semantic evaluation of machine translation lexical choice. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp.1-10. [PDF, 145KB]

(2013) Yoan Gutiérrez, Yenier Castañeda, Andy González, Rainel Estrada, Dennys D.Ping, Jose I.Abreu, Roger Pérez, Antonio Fernández Orquín, Andrés Mintoyo, Rafael Muñoz, & Franc Camara: UMICC_DLSI: reinforcing a ranking algorithm with sense frequencies and multidimensional semantic resources to solve multililingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.241-249. [PDF, 489KB]

(2013) Hien Vu Huy, Phuong-Thai Nguyen, Tung-Lam Nguyen, & M.L.Nguyen: Bootstrapping phrase-based statistical machine translation via WSD integration. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1042-1046. [PDF, 231KB]

(2013) Els Lefever & Véronique Hoste: SemEval-2013 Task 10: cross-lingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.158-166. [PDF, 307KB]

(2013) Steve L.Manion & Raazesh Sainudiin: DAEBAK!: peripheral diversity for multilingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.250-254. [PDF, 142KB]

(2013) Roberto Navigli, David Jurgens & Daniele Vannella: SemEval-2013 Task 12: multilingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.222-231. [PDF, 345KB]

(2013) Liling Tan & Francis Bond: XLING: matching query sentences to a parallel corpus using topic models for word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.167-170. [PDF, 176KB]

(2013) Maarten van Gompel & Antal van den Bosch: WSD2: parameter optimisation for memory-based cross-lingual word-sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.183-187. [PDF, 125KB]

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) Marianna Apidianaki: Measuring the adequacy of cross-lingual paraphrases in a machine translation setting. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.63-72. [PDF, 137KB]

(2012) Marianna Apidianaki, Guillaume Wisniewski, Artem Sokolov, Aurélien Max, & François Yvon: WSD for n-best reranking and local language modeling in SMT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.1-9. [PDF, 108KB]

(2012) Erwin Fernandez-Ordoñez, Rada Mihalcea, & Samer Hassan: Unsupervised word sense disambiguation with multilingual representations.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.847-851. [PDF, 253KB]

(2012) Els Lefever, Véronique Hoste, & Martine De Cock: Discovering missing Wikipedia inter-language links by means of cross-lingual word sense disambiguation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.841-846. [PDF, 345KB]

(2012) André Lynum, Erwin Marsi, Lars Bungum, & Björn Gambäck: Disambiguating word translations with target language models. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #477, 1p. [HTML]

(2012) Roberto Navigli & Simone Paolo Ponzetto: Joining forces pays off: multilingual joint word sense disambiguation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1399-1410. [PDF, 413KB]

(2012) Špela Vintar, Darja Fišer, & Aljoša Vrščaj: Were the clocks striking or surprising? Using WSD to improve MT performance. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.87-92. [PDF, 283KB]

(2012) Mei Yang & Katrin Kirchhoff: Unsupervised  translation disambiguation for cross-domain statistical machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 424KB]

(2011) Pushpak Bhattacharyya: IndoWordNet and multilingual resource conscious word sense disambiguation. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.29-30. [PDF, 677KB]

(2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

(2011) Mitesh M.Khapra, Salil Joshi, Arindam Chatterjee, & Pushpak Bhattacharyya: Together we can: bilingual bootstrapping for WSD. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.561-569. [PDF, 137KB]

(2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

(2011) Els Lefever, Véronique Hoste, & Martine De Cock: ParaSense or how to use parallel corpora for word sense disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.317-322. [PDF, 119KB]

(2011) Erwin Marsi, André Lynum, Lars Bungum, & Björn Gambäck: Word translation disambiguation without parallel texts. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.66-74. [PDF, 898KB]

(2011) Seung-Hoon Na & Hwee Tou Ng: Enriching document representation via translation for improved monolingual information retrieval. SIGIR July 24-28, 2011, Beijing, China; 10pp. [PDF, 543KB]

(2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS projectTralogy, Paris, 3-4 March 2011;  17pp. [PDF, 169KB]

(2011) Alex Rudnick: Towards cross-language word sense disambiguation for Quechua. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 201, Hissar, Bulgaria, 13 September 2011; pp.133-138. [PDF, 125KB]

(2010) Hiroyuki Kaji, Takashi Tsunakawa, & Daisuke Okada: Using comparable corpora to adapt a translation model to domains. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2182-2188. [PDF, 432KB]

(2010) Mitesh M.Khapra, Saurabh Sohoney, Anup Kulkarni, & Pushpak Bhattacharyya: Value for money: balancing annotation effort, lexicon building and accuracy for multilingual WSD. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.555-563. [PDF, 237KB]

(2010) Els Lefever & Véronique Hoste: Construction of a benchmark data set for cross-lingual word sense disambiguation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1584-1590. [PDF, 465KB]

(2010) Fatiha Sadat: Using co-occurrence tendencies to improve cross-language information retrieval. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1101-1111. [PDF, 1846KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

 (2010) Takashi Tsunakawa & Hiroyuki Kaji: Augmenting a bilingual lexicon with information for word translation disambiguation. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.30-37. [PDF, 175KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

 (2010) Yao Ziyuan: Breaking the language barrier: a game-changing approach. [Private publication], April 1, 2010; 25pp. [PDF, 296KB]