Machine Translation Archive

Index to linguistic categories, features, grammars and processes

Publications 2005-2009

For other periods go to: Publications since 2010; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

[To return to home page click here]

Adjectives

(2008) Bruno Cartoni: Lexical resources for automatic translation of constructed neologisms: the case study of relational adjectives. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

Adverbs

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

Anaphora resolution

 (2007) Keh-Yi Su: [review of] Ruslan Mitkov, Anaphora resolution. Machine Translation 21 (2), June 2007; pp.135-137. [see publication]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

Annotation

(2009) Enrique Amigó, Jesús Giménez, Julio Gonzalo, & Felisa Verdejo: The contribution of linguistic features to automatic machine translation evaluation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 518KB]

(2009) Ondřej Bojar & Zdeněk Žabokrtský: CzEng 0.9: large parallel treebank with rich annotation. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.63-83 [PDF, 217KB]

(2009) E.Boldrini, S.Ferrández, R.Izquierdo, D.Tomás, & J.L.Vicedo: A parallel corpus labeled using open and restricted domain ontologies [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 65KB]

(2009) Boxing Chen, George Foster, & Roland Kuhn: Phrase translation model enhanced with association based features. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.25-32. [PDF, 158KB]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

(2009) Jakob Elming, Nizar Habash, & Josep M.Crego: Combination of statistical word alignments based on multiple preprocessing schemes.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.93-110.

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Christian Federmann, Silke Theison, Andreas Eisele, Hans Uszkoreit, Yu Chen, Michael Jellinghaus, & Sabine Hunsicker: Translation combination using factored word substitution.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.70-74. [PDF, 100KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Christian Hardmeier & Martin Volk: Using linguistic annotations in statistical machine translation of film subtitles. In: Kristina Jokinen and Eckhard Bick (eds.) Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA 2009), 15-16 May 2009, Odense, Denmark; pp.57-64. [PDF,  158KB]

(2009) Hieu Hoang & Philipp Koehn: Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379. [PDF, 184KB]

(2009) Michal Hrusecky, Tomas Caithaml, & Chris Dyer: Feature function overhaul. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 13pp. [PDF, 88KB]

(2009) Philipp Koehn: Discriminative training and factored translation models. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.69-81 [PDF, 75 slides, 530KB]

(2009) Germán Sanchis-Trilles, Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Online language model adaptation for spoken dialog translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 160-167. [PDF, 349KB]; presentation [PDF of PPT, 250KB]

(2009) Sanghoun Song & Francis Bond: Online search interface for the Sejong Korean-Japanese bilingual corpus and auto-interpretation of phrase alignment. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.146-149. [PDF, 153KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008) Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, & Haejoong Lee: Management of large annotation projects involving multiple human judges: a case study of GALE machine translation post-editing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 144KB]

(2008) João Graça, Joana Paulo Pardal, Luisa Coheur, & Diamantino Caseiro: Building a golden collection of parallel multi-language word alignments. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 301KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Jin-Ji Li, Dong-Il Kim, & Jong-Hyeok Lee: Annotation guidelines for Chinese-Korean word alignment.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 195KB]

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

(2008) Qun Liu, Zhongjun He, Yang Liu, & Shouxun Lin: Maximum entropy based rule selection model for syntax-based statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.89-97. [PDF, 916KB]

(2008) Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, & Joakim Nivre: Swedish-Turkish parallel treebank. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 80KB]

(2008) Petya Osenova, Kiril Simov, & Eelco Mossel: Language resources for semantic document annotation and crosslingual retrieval.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 465KB]

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hammam Riza, Hitoshi Isahara, & Purev Jaimal: Synset assignment for bi-lingual dictionary with limited resource. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.673-678. [PDF, 549KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Suphanut Thayaboon, Chumpol Mokarat, & Hitoshi Isahara: Enhanced tools for online collaborative language resource development. IJCNLP 2008: Sixth Workshop on Asian Language Resources, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.105-106. [PDF, 310KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated BTG for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1009-1016. [PDF, 141KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: A linguistically annotated reordering model for BTG-based statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 149-152. [PDF, 319KB]

(2008) Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Word alignment annotation in a Japanese-Chinese parallel corpus.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 195KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Matthias Buch-Kromann: Computing translation units and quantifying parallelism in parallel dependency treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.69-76 [PDF, 361KB]

(2007) Philipp Koehn & Hieu Hoang: Factored translation models. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 868-876. [PDF, 225KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2007) Martin Volk, Joakim Lundborg, & Maël Mettler: A search tool for parallel treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.85-92 [PDF, 270KB]

(2006) Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, & Victor Sizov: A syntactically and semantically tagged corpus of Russian: state of the art and prospects.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1378-1381 [PDF, 596KB]

(2006) Dan Flickinger: Identifying complex phenomena in a corpus via a treebank lens. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.125-129 [PDF, 89KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Svetla Koeva, Svetlozara Lesseva, & Maria Todorova: Bulgarian sense tagged corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.79-86. [PDF, 494KB]

(2006) Ivana Kruijff-Korbayová, Klára Chvátalová, & Oana Postolache: Annotation guidelines for Czech-English word alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1256-1261 [PDF, 608KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

(2006) Márton Miháltz & Gábor Pohl: Exploiting parallel corpora for supervised word sense disambiguation in English-Hungarian machine translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1294-1297 [PDF, 339KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) E. Saquete, P.Martínez-Barco, R.Muñoz, M.Negri, M.Speranza, & R.Sprugnoli: Multilingual extension of a temporal expression normalizer using annotated corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 261 KB]

(2006) Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, & Scott Piao: ASSIST: automated semantic assistance for translators. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.139-142 [PDF, 69KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2005) Jesús Cardeñosa, Carolina Gallardo & Luis Iraola: An XML-UNL model for knowledge-based annotation. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.300-308 [abstract, PDF, 99KB]

(2005) Youjin Chung & Jong-Hyeok Lee: Practical word-sense disambiguation using co-occurring concept codes [abstract]. Machine Translation 19 (1),2005; pp.59-82.

(2005) Jesús Giménez & Lluís Màrquez: Combining linguistic data views for phrase-based SMT. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 145-148. [PDF, 73KB]

(2005) Carol Nichols & Rebecca Hwa: Word alignment and cross-lingual resource acquisition. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 69-72. [PDF, 331KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.85-90. [PDF, 936KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.71-78. [PDF, 1139KB]

Aspect

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

Belief representation

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

Case grammar and case frames

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) R.Basili, D.De Cao, D.Croce, B.Coppola, & A.Moschitti: Cross-language frame semantic transfer in bilingual corpora [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 76KB]

(2009) Karel Pala & Aleš Horák: Multilingual features of complex valency frames. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.41-49. [PDF, 145KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Dekai Wu & Pascale Fung: Can semantic role labeling improve SMT? EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.218-225. [PDF, 1141KB]

(2009) Shoichi Yokoyama & Masumi Okuyama: Translation disambiguation of patent sentences using case frames. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 25-32. [PDF, 75KB]

(2007) Sanae Fujita & Francis Bond: A method of creating new valency entries [abstract]. Machine Translation 21 (1), March 2007; pp.1-28.

(2007) Pascale Fung, Zhaojun Wu, Yongsheng Yang, & Dekai Wu: Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.75-84 [PDF, 477KB]

(2007) Kristina Toutanova & Hisami Suzuki: Generating case markers in machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.49-56 [PDF, 168KB]

(2005) Kuntal Dey & Pushpak Bhattacharyya: Universal Networking Language based analysis and generation for Bengali case structure constructs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.215-229 [abstract, PDF, 88KB]

(2005) Brian Murphy & Carl Vogel: A comparative evaluation of UNL participant relations using a five-language parallel corpus. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.64-76 [abstract, PDF, 137KB]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

Categorial grammar

(2009) Rejwanul Haque, Sudip Kumar Naskar, Yanjun Ma & Andy Way: Using supertags as source language context in SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.234-241. [PDF, 247KB]

(2009) Hany Hassan, Khalil Sima’an, & Andy Way: A syntactified direct translation model with linear-time decoding. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1182-1191. [PDF, 163KB]

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorical grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

(2007) Alexandra Birch, Miles Osborne, & Philipp Koehn: CCG supertags in factored statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 9-16 [PDF, 175KB]

(2007) Hany Hassan, Khalil Sima’an, & Andy Way: Supertagged phrase-based statistical machine translation.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 288-295 [PDF, 175KB]

(2007) Michael White, Rajakrishnan Rajkumar, & Scott Martin: Towards broad coverage surface realization with CCG. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.22-30 [PDF, 2020KB]

Closely related languages

(2009) Vishal Goyal & Gurpreet Singh Lehal: Evaluation of Hindi to Punjabi machine translation system. International Journal of Computer Science Issues, vol.4, no.1 (2009), pp.36-39. [PDF, 101KB]

(2009) Petr Homola, Vladislav Kuboň, & Jernej Vičič: Shallow transfer between Slavic languages. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.219-232. [PDF, 390KB]

(2009) Preslav Nakov & Hwee Tou Ng: Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1358-1367. [PDF, 209KB]

(2009) R.Mahesh K.Sinha: Developing English-Urdu machine translation via Hindi. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 7pp. [PDF, 643KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

(2008) Vishal Goyal & Gurpreet Singh Lehal: Comparative study of Hindi and Punjabi language scripts. Nepalese Linguistics, Journal of the Linguistics Society of Nepal, vol. 23 (2008), pp. 67-82. [PDF, 290KB]

(2008) Petr Homola: A distributed database for mobile NLP applications. ACL-08: HLT. Workshop on Mobile Language Processing. Proceedings of the workshop, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 27-28. [PDF, 71KB]

(2008) Petr Homola & Vladislav Kuboň: Improving machine translation between closely related Romance languages . EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.72-77. [PDF, 527KB]

(2008) Gurpreet Singh Josan & Gurpreet Singh Lehal: A Punjabi to Hindi machine translation system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.157-160. [PDF, 190KB]

(2008) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Hindi Urdu machine transliteration using finite-state transducers.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.537-544. [PDF,774KB]

(2008) Jan Nouza, Jan Silovsky, Jindrich Zdansky, Petr Cerva, Martin Kroul, & Josef Chaloupka: Czech-to-Slovak adapted broadcast news transcription system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2683-2686; abstract [PDF, 57KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: Machine translation between Turkic languages. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 189-192 [PDF, 135KB]

(2007) Jernej Vicic: Rapid development of RBMT systems for related languages. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 12pp. [PDF, 154KB]

(2007) David Vilar, Jan-T. Peter, & Hermann Ney: Can we translate letters?  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 33-39 [PDF, 124KB]

(2006) Carme Armentano i Oller & Mikel L. Forcada: Open-source machine translation between small languages: Catalan and Aranese Occitan.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.51-54. [PDF, 62KB]

(2006) Boštan Dvořák, Petr Homola, & Vladislav Kuboň: Exploiting similarity in the MT into a minority language.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.59-64. [PDF, 379KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Kevin P. Scannell: Machine translation for closely related language pairs.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.103-107. [PDF, 139KB]

(2005) Antonio M.Corbi-Bellot, Mikel L. Forcada, Sergio Ortíz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, & Kepa Sarasola: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 79-86. [PDF, 130KB]

(2005) Karin Müller: Revealing phonological similarities between related languages from automatically generated parallel corpora.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 33-40. [PDF, 114KB]

Code switching see Dialects and registers

Cognates

(2009) Svetlin Nakov, Preslav Nakov, & Elena Paskaleva: Unsupervised extraction of false friends from parallel bi-texts using the web as a corpus.  [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp. 292-298. [PDF, 235KB]

(2007) Ruslan Mitkov, Viktor Pekar, Dimitar Blagoev, & Andrea Mulloni: Methods for extracting and classifying pairs of cognates and false friends [abstract]. Machine Translation 21 (1), March 2007; pp.29-53.

(2007) Preslav Nakov & Marti Hearst: UCB system description for the WMT 2007 shared task. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 212-215 [PDF, 129KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2005) Grzegorz Kondrak: Cognates and word alignment in bitexts. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.305-312. [PDF, 179KB]

Cohesion see Reference

Collocations see Multiword expressions

Compositionality

(2008) Béatrice Daille & Emmanuel Morin: A effective compositional model for lexical alignment. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp 95-102. [PDF, 401KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2005) Masatsugu Tonoike, Mitsuhiro Kida, Toshihiro Takagi, Yauhiro Sasaki, Takehito Utsuro, & Satoshi Sato: Effect of domain-specific corpus in compositional translation estimation for technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.114-119. [PDF, 193KB]

Context-free grammars

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2009) Hongfei Jiang, Muyun Yang, Tiejun Zhao, Sheng Li & Bo Wang: A statistical machine translation model based on a syntactic synchronous grammar. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.125-128. [PDF, 514KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Anders Søgaard & Jonas Kuhn: Empirical lower bounds on alignment error rates in syntax-based machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.19-27. [PDF, 138KB]

(2009) Ashish Venugopal, Andreas Zollmann, Noah A.Smith, & Stephan Vogel: Preference grammars: softening syntactic constraints to improve statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 212KB]

(2007) Ashish Venugopal, Andreas Zollmann, & Stephan Vogel: An efficient two-pass approach to synchronous-CFG driven statistical MT. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.500-507 [PDF, 279KB]

(2007) Francisco Oliveira, Fai Wong, Ka-Seng Leong, Chio-Kin Tong, & Ming-Chui Dong: Query translation for cross-language information retrieval by parsing constraint synchronous grammar. International Conference on Machine Learning and Cybernetics, 19-22, August, 2007, Hong Kong; 6pp. [PDF, 218KB]

(2007) Hao Zhang & Daniel Gildea: Factorization of synchronous context-free grammars in linear time. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.25-32 [PDF, 151KB]

Controlled languages

(2009) Xavier Blanco: Remarks about linguistic analysis, normalization and translation of Spanish “What to do in case of fire” texts. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.43-48 [abstract]

(2009) Krzysztof Bogacki: Controlled languages and machine translation. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.49-55 [abstract]

(2009) Tessa Cornally: Achieving a better machine translation from French to English via a controlled language. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.69-73 [abstract]

(2009) Rafael Guzmán: Uncontrolled terminology and MT: the importance of making good comparisons. Translation Journal 13 (2), April 2009; 7pp. [PDF, 128KB]

(2009) Henri Madec: Cognitive models of yesterday and today in machine translation and their implication for controlled languages.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.172-178 [abstract]

(2009) Johann Roturier: Controlled language for MT in action. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 11pp. [PDF of PPT, 162KB]

(2009) Zuzanna Rudas: Polish controlled language and its machine translation into French.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.231-235 [abstract]

(2009) Irina Temnikova & Constantin Orasan: Post-editing experiments with MT for a controlled language.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.249-255 [abstract]

(2009) Xiaohong Wu: Building a linguistic database for Chinese interrogative sentences.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.260-268 [abstract]

(2008) Iñaki Alegria & Kepa Sarasola: [review of] Sylviane Cardey, Peter Greenfield, Séverine Vienney (eds.) Machine translation, controlled languages and specialised languages. Machine Translation 22 (4), December 2008; pp.259-261. [see publication]

(2008) Pierrette Bouillon, Glenn Flores, Maria Georgescul, Sonia Halimi, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, Yukie Nakao, Manny Rayner, Marianne Santaholma, Marianne Starlander, & Nikos Tsourakis: Many-to-many multilingual medical speech translation on a PDA. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.314-323. [PDF, 666KB]

(2008) Tobias Kuhn & Rolf Schwitter: Writing support for controlled natural languages. Australasian Language Technology Workshop 2008 (ALTW2008): Proceedings of the Workshop, 8-10 December 2008, CSIRO ICT Centre, Hobart; pp.46-54. [PDF, 326KB]

(2007) Takako Aikawa, Lee Schwartz, Ronit King, Mo Corston-Oliver, & Carmen Lozano: Impact of controlled language on translation quality and post-editing in a statistical machine translation environment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.1-7 [PDF, 314KB]

(2007) Sharon O’Brien & Johann Roturier: How portable are controlled language rules? A comparison of two empirical MT studies. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.345-352 [PDF, 77KB]

(2006) proceedings of CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006.

(2006) Michael Blench [presenter]: Automated translation system Job Bank. AMTA 2006:  7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; presentation on User Track [PDF of PPT presentation, 383KB]

(2006) Nikos Chatzichrisafis, Pierrette Bouillon, Manny Rayner, Marianne Santaholma, Marianne Starlander & Beth Ann Hockey: Evaluating task performance for a unidirectional controlled language medical speech translation system.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.9-16 [PDF, 109KB]

(2006) Graliński Filip, Jassem Krzysztof, Wagner Agnieszka, & Wypych Mikołaj: Text normalization as a special case of machine translation. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 6-10 November 2006; pp.51-56. [PDF, 318KB]

(2006) Uwe Muegge: Fully automatic high quality machine translation of restricted text: a case study. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 6pp. [PDF, 43KB]

(2006) David M.Rojas & Takako Aikawa: Predicting MT quality as a function of the source language.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2534-2537 [PDF, 613KB]

(2006) Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff & Oren Etzioni: Ambiguity reduction for machine translation: human-computer collaboration.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.193-202 [PDF, 247KB]

(2005) Stephen Beale, Sergei Nirenburg, Marjorie McShane, & Tod Allman: Document authoring the Bible for minority language translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.63-70. [PDF, 415KB]

(2005) Andre Castilla, Alice Bacic, & Sergio Furuie: Machine translation on the medical domain: the role of BLEU/NIST and METEOR in a controlled vocabulary setting. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.47-54. [PDF, 156KB]

(2005) Sharon O’Brien: Methodologies for measuring the correlations between post-editing effort and machine translatability [abstract]. Machine Translation 19 (1), 2005; pp.37-58.

(2005) Nathalie de Preux: How much does using controlled language improve machine translation results? Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 14pp. [PDF, 109KB]

(2005) Laura Ramirez Polo & Johann Haller: Controlled language and the implementation of machine translation for technical documentation. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 9pp. [PDF, 109KB]; presentation [PDF, 117KB]

(2005) Marianne Starlander, Pierrette Bouillon, Nikos Chatzichrisafis, Marianne Santaholma, Manny Rayner, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, & Yukie Nakao: Practicing controlled language through a help system integrated into the medical speech translation system (MedSLT). MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.188-194. [PDF, 169KB]

(2005) Andy Way & Nano Gough: Controlled translation in an example-based environment: what do automatic evaluation metrics tell us? [abstract]. Machine Translation 19 (1), 2005; pp.1-36.

Coordination

(2009) Karin Harbusch & Gerard Kempen: Generating clausal coordinate ellipsis multilingually: a uniform approach based on postediting. Proceedings of the 12th European Workshop on Natural Language Generation, March 30-31, 2009, Athens, Greece; pp.138-145. [PDF, 389KB]

(2009) Hideharu Okuma, Kazuo Hara, Masashi Shimbo, & Yuji Matsumoto: Bypassed alignment graph for learning coordination in Japanese sentences. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.5-8. [PDF, 474KB]

Cyrillic coding and transliteration

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

Dates

(2009) Julien Gosme, Yves Lepage & Adrien Lardilleux: Translation of sublanguages by subgrammars. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.77-84. [PDF, 116KB]

Dependency relations

(2009) Nguyen Bach, Qin Gao, & Stephan Vogel: Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 211KB]

(2009) Wanxiang Che, Zhenghua Li, Yongqiang Li, Yuhang Guo, Bing Qin, & Ting Liu: Multingual dependency-based syntactic and semantic parsing. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.49-54. [PDF, 633KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Michel Galley & Christopher D.Manning: Quadratic-time dependency parsing for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.773-781. [PDF, 235KB]

(2009) Nikesh Garera, Chris Callison-Burch & David Yarowsky: Improving translation lexicon induction from monolingual corpora via dependency contexts and part-of-speech equivalences. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.129-137. [PDF, 450KB]

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

 (2009) Yvette Graham, Josef van Genabith, & Anton Bryl: F-structure transfer-based statistical machine translation. Proceedings of the LFG09 conference, Miriam Butt and Tracy Holloway King (eds.) CSLI Publications, 2009; pp.318-337. [PDF, 1227KB]

(2009) Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Marti, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Nainwen Xue, & Yi Zhang: The CoNLL-2009 Shared task: syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.1-18. [PDF, 220KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

(2009) Saša Hasan & Hermann Ney: Comparison of extended lexicon models in search and rescoring for SMT. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.17-20. [PDF, 167KB]

(2009) Hany Hassan, Khalil Sima’an, & Andy Way: A syntactified direct translation model with linear-time decoding. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1182-1191. [PDF, 163KB]

(2009) Yifan He & Andy Way: Learning labeled dependencies in machine translation evaluation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.44-51. [PDF, 426KB]

(2009) Jeremy G.Kahn, Matthew Snover, & Mari Ostendorf: Expected dependency pair match: predicting translation quality with expected syntactic structure [abstract]. Machine Translation 23 (2/3), September 2009; pp.169-179.

(2009) Sadao Kurohashi: Fully syntactic example-based machine translation [abstract]. Invited talk [at] Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; p.1. [PDF, 22KB]; presentation [PDF of PPT, 2353KB]

(2009) Roser Morante, Vincent Van Asch, & Antal van den Bosch: Joint memory-based learning of syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.25-30. [PDF, 108KB]

(2009) Hwidong Na, Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Improving fluency by reordering target constituents using MST parser in English-to-Japanese phrase-based SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.276-283. [PDF, 381KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Martin Popel & Zdeněk Žabokrtský: Improving English-Czech tectogrammatical MT. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.115-134 [PDF, 265KB]

(2009) Han Ren, Donghong Ji, Jing Wan, & Mingyao Zhang: Parsing syntactic and semantic dependencies for multiple languages with a pipeline approach. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.97-102. [PDF, 228KB]

(2009) James Smith & Stephen Clark: EBMT for SMT: a new EBMT-SMT hybrid. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.3-10. [PDF, 144KB]; presentation [PDF of PPT, 564KB]

(2009) Ankit K.Srivastava & Andy Way: Using percolated dependencies for phrase extraction in SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.316-323. [PDF, 91KB]

(2009) Oscar Täckström: Multilingual semantic parsing with a pipeline of linear classifiers. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 103KB]

(2009) Yotaro Watanabe, Masayuki Asahara, & Yuji Matsumoto: Multilingual syntactic-semantic dependency parsing with three-stage approximate max-margin linear models. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 727KB]

(2009) Deyi Xiong, Min Zhang, Aiti Aw & Haizhou Li: A source dependency model for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 371-378. [PDF, 87KB]

(2009) Peng Xu, Jaeho Kang, Michael Ringgard, & Franz Och: Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253. [PDF, 273KB]

(2009) Kun Yu & Junichi Tsujii: Extracting bilingual dictionary from comparable corpora with dependency heterogeneity. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.121-124. [PDF, 132KB]

(2009) Zdeněk Žabokrtský & Martin Popel: Hidden Markov tree model in dependency-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.145-148. [PDF, 154KB]

(2009) Zdeněk Žabokrtský: TectoMT: software framework for developing MT systems (and other NLP applications). Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.51-56 [PDF, 570KB]

(2009) Sina Zarrieß & Jonas Kuhn: Exploiting translational correspondences for pattern-independent MWE identification.  [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.23-30. [PDF, 132KB]

(2009) Hai Zhao, Wenliang Chen, Jun’ichi Kazama, Kiyotaka Uchimoto, & Kentaro Torisawa: Multilingual dependency learning: exploiting rich features for tagging syntactic and semantic dependencies. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.61-66. [PDF, 364KB]

(2009) Hai Zhao, Yan Song, Chunyu Kit, & Guodong Zhou: Cross language dependency parsing using a bilingual lexicon. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.55-63. [PDF, 547KB]

(2009) Jing Zheng, Necip Fazil Ayan, Wen Wang, & David Burkett: Using syntax in large-scale audio document translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) Kalika Bali, Sankaran Baskaran, & A.Kumaran: A dependency treelet-based phrasal SMT: evaluation and issues in English-Hindi language pair.  ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 292KB]

 (2008) Ondřej Bojar: Exploiting linguistic data in machine translation. Doctoral thesis, Charles University, Faculty of Mathematics and Physics, Prague 2008; 135pp. [PDF, 1045KB]

(2008) Ondřej Bojar: Problems of deep syntactic English-to-Czech MT. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 33 slides. [PDF, 248KB]

(2008) Ondřej Bojar & Adam Lopez: Tree-based translation. [tutorial handout at] Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 7pp. [PDF, 248KB]

(2008) Mauro Cettolo, Marcello Federico, Daniele Pighin, & Nicola Bertoldi: Shallow-syntax phrase-based translation: joint versus factored string-to-chunk models. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 56-64. [PDF, 176KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Yuqing Guo, Josef van Genabith, & Haifeng Wang: Dependency-based n-gram models for general purpose sentence realisation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.297-304. [PDF, 138KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Yanjun Ma, Sylwia Ozdowska, Yanli Sun, & Andy Way: Improving word alignment using syntactic dependencies. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.69-77. [PDF, 181KB]

(2008) David Mareček, Zdeněk Žabokrtský, & Václav Novák: Automatic alignment of Czech and English deep syntactic dependency trees. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.104-113. [PDF, 586KB]

(2008) Aurélien Max, Rafik Makhloufi, & Philippe Langlais: Explorations in using grammatical dependencies for contextual phrase translation disambiguation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.114-119. [PDF, 568KB]

(2008) Toshiaki Nakazawa & Sadao Kurohashi: Linguistically-motivated tree-based probabilistic phrase alignment. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.163-171. [PDF, 603KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60. [PDF, 98KB]

(2008) Joakim Nivre, Igor M.Boguslavsky, & Leonid L.Iomdin: Parsing the SynTagRus treebank of Russian.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.641-648. [PDF, 266KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Libin Shen, Jinxi Xu, & Ralph Weischedel: A new string-to-dependency machine translation algorithm with a target dependency language model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 577-585. [PDF, 147KB]

(2008) Zdeněk Žabokrtský, Jan Ptáček, & Petr Pajas: TectoMT: highly modular MT system with tectogrammatics used as transfer layer. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.167-170. [PDF, 136KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2007) Eckard Bick: Dan2eng: wide-coverage Danish-English machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.37-43 [PDF, 621KB]

(2007) Bernd Bohnet: The induction and evaluation of word order rules using corpora based on the two concepts of topological models. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.38-45 [PDF, 1835KB]

(2007) Matthias Buch-Kromann: Breaking the barrier of context-freeness: towards a linguistically adequate probabilistic dependency model of parallel texts. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.31-40 [PDF, 432KB]; presentation [PDF, 1285KB]

(2007) Matthias Buch-Kromann: Computing translation units and quantifying parallelism in parallel dependency treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.69-76 [PDF, 361KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Nizar Habash: Syntactic preprocessing for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.215-222 [PDF, 118KB]

(2007) Irene Langkilde-Geary: Declarative syntactic processing of natural language using concurrent constraint programming and probabilistic dependency modeling. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.55-63 [PDF, 1969KB]

(2007) Dennis N. Mehay & Chris Brew: BLEUÂTRE: flattening syntactic dependencies for MT evaluation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.122-131 [PDF, 370KB]; presentation [PDF, 802KB]

(2007) Arul Menezes & Chris Quirk: Using dependency order templates to improve generality in translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 1-8 [PDF, 153KB]

(2007) Toshiaki Nakazawa, Yu Kun, & Sadao Kurohashi: Structural phrase alignment based on consistency criteria. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.337-344 [PDF, 202KB]

(2007) Joakim Nivre: Data-driven dependency parsing across languages and domains: perspectives from the CoNLL 2007 Shared Task. IWPT 2007: Proceedings of the 10th International Conference on Parsing Technologies, June 23-24, 2007, Prague, Czech Republic; pp.168-170. [PDF, 69KB]

(2007) Karolina Owczarzak & Josef van Genabith: Evaluating machine translation with LFG dependencies [abstract]. Machine Translation 21 (2), June 2007; pp.95-119.

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Labelled dependencies in machine translation evaluation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 104-111 [PDF, 239KB]

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Dependency-based automatic evaluation for machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.80-87 [PDF, 245KB]

(2007) Deyi Xiong, Qun Liu, & Shouxun Lin: A dependency treelet string correspondence model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 40-47 [PDF, 411KB]

(2006) Yafa Al-Raheb, A.Akrout, J. van Genabith, & J. Dichy: DCU 250 Arabic dependency bank: an LFG gold standrad resource for the Arabic Penn treebank.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.105-116. [PDF, 355KB]

(2006) Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, & Victor Sizov: A syntactically and semantically tagged corpus of Russian: state of the art and prospects.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1378-1381 [PDF, 596KB]

(2006) Giuseppe Attardi: Experiments with a multilanguage non-projective dependency parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.166-170. [PDF, 104KB]

(2006) Sabine Buchholz & Erwin Marsi: CoNLL-X shared task on multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.149-164. [PDF, 158KB]

(2006) Yuchang Cheng, Masayuki Asahara & Yuji Matsumoto: Multi-lingual dependency parsing at NAIST.  CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.191-195. [PDF, 192KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Colin Cherry & Dekang Lin: A comparison of syntactically motivated word alignment spaces. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.145-152 [PDF, 150KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Yuan Ding & Martha Palmer: Better learning and decoding for syntax based SMT using PSDIG. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.37-45 [PDF, 391KB]

(2006) Eva Hajičová: Old linguists never die, they only get obligatorily deleted. Computational Linguistics 32 (4), pp. 457-469. [PDF, 98KB]

(2006) Saša Hasan, Oliver Bender, & Hermann Ney: Reranking translation hypotheses using structural properties.  EACL-2006: Proceedings of the Workshop on Learning Structured Informationa in Natural Language Applications, April 3, 2006, Trento, Italy; pp.41-48. [PDF, 100KB]

(2006) Radu Ion, Alexandru Ceauşu, & Dan Tufiş: Dependency-based phrase alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1290-1293 [PDF, 430KB]

(2006) Richard Johansson & Pierre Nugues: Investigating multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.206-210. [PDF, 75KB]

(2006) Philippe Langlais & Fabrizio Gotti: EBMT by tree-phrasing [abstract]. Machine Translation 20 (1),2006; pp.1-23.

(2006) Ryan McDonald, Kevin Lerman, & Fernando Pereira: Multilingual dependency analysis with a two-stage discriminative parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.216-220. [PDF, 79KB]

(2006) Arul Menezes, Kristina Toutanova, & Chris Quirk: Microsoft Research treelet translation system: NAACL 2006 Europarl evaluation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 158-161 [PDF, 676KB]

(2006) Sylwia Ozdowska: Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.53-60 [PDF, 366 KB]

(2006) Christopher Quirk & Arul Menezes: Dependency treelet translation: the convergence of statistical and example-based machine translation? [abstract] Machine Translation 20 (1),2006; pp.43-65.

(2006) Chris Quirk & Simon Corston-Oliver: The impact of parse quality on syntactically-informed statistical machine translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 62-69. [PDF, 203KB]

(2006) Sebastian Riedel, Ruket Ҫakıcı, & Ivan Meza-Ruiz: Multi-lingual dependency parsing with incremental integer linear programming. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.226-230. [PDF, 72KB]

(2006) Koichiro Ryu, Shigeki Matsubara, & Yasuyoshi Inagaki: Simultaneous English-Japanese spoken language translation based on incremental dependency parsing and transfer. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.683-690. [PDF, 420KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Otakar Smrž: Tips and tricks of the Prague Arabic dependency treebank.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.25-34. [PDF, ]

(2006) Toon Witkam: History and heritage of the DLT (Distributed Language Technology) project. [Utrecht: private publication, 2006]. 11pp. [PDF, 170KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2005) Etienne Blanc: About and around the French enconverter and the French deconverter. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.157-166 [abstract, PDF, 21KB]

(2005) Ondřej Bojar & Jan Hajič: Extracting translations verb frames. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.2-6 [PDF, 207KB]

(2005) Yuan Ding & Martha Palmer: Machine translation using probabilistic synchronous dependency insertion grammars.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 541-548. [PDF, 369KB]

(2005) Heidi J. Fox: Dependency-based statistical machine translation.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 91-96. [PDF, 101KB]

(2005) Philippe Langlais, Fabrizio Gotti, Didier Bourigault, & Claude Coulombe: EBMT by tree-phrasing: a pilot study MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.71-80. [PDF, 387KB]

(2005) Arul Menezes & Chris Quirk: Dependency treelet translation: the convergence of statistical and example-based machine-translation? MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.99-108. [PDF, 314KB]

(2005) Arul Menezes & Chris Quirk: Microsoft Research treelet translation system: IWSLT evaluation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 4pp. [PDF, 240KB]

(2005) Sylwia Ozdowska: Using bilingual dependencies to align words in English/French parallel corpora.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 127-132. [PDF, 89KB]

(2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

(2005) Toon Witkam: A new road to automatic translation. [Utrecht: private publication, 26 November 2005]. 13pp. [PDF, 177KB]

Diacritics

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2008) Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278 [PDF, 701KB]

(2007) Mona Diab, Mahmoud Ghoneim, & Nizar Habash: Arabic diacritization in the context of statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.143-149 [PDF, 105KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

Dialects and registers

 (2009) Ahmed Abdelali, Steve Helmreich, & Ron Zacharski: Investigations on standard Arabic geographical classification.  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 193KB]

(2009) Rodolfo Delmonte, Antonella Bristot, Sara Tonelli, & Emanuele Pianta: English/Veneto resource poor machine translation with STILVEN.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.82-89 [abstract]

(2008) Thamar Solorio & Yang Liu: Part-of-speech tagging for English-Spanish code-switched text. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1051-1060. [PDF, 129KB]

(2007) Jing-Shin Chang & Chun-Kai Kung: A Chinese-to-Chinese statistical machine translation model for mining synonymous simplified-traditional Chinese terms. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.81-87 [PDF, 314KB]

(2007) Mona Diab & Nizar Habash: Arabic dialect processing tutorial. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.5-6 [PDF, 122KB]

(2007) Yves Scherrer: Adaptive string distance measures for bilingual dialect lexicon induction. ACL 2007: proceedings of the Student Research Worksho, Prague, Czech Republic, June 2007; pp. 55-60 [PDF, 122KB]

(2006) Mona Diab & Nizar Habash: Arabic dialect processing.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 61pp. [PDF of PPT presentation, 1514KB]

(2006) Nizar Habash & Owen Rambow: MAGEAD: a morphological analyzer and generator for the Arabic dialects. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.681-688. [PDF, 112KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

(2006) Keith J.Miller & Michelle Vanni: Formal vs. informal: register-differentiated Arabic MT evaluation in the PLATO paradigm.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.161-166 [PDF, 296KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

Dialogue

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Germán Sanchis-Trilles, Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Online language model adaptation for spoken dialog translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 160-167. [PDF, 349KB]; presentation [PDF of PPT, 250KB]

(2008) Sherri Condon, Jon Phillips, Christy Doran, John Aberdeen, Dan Parvaz, Beatrice Oshika, Greg Sanders, & Craig Schlendoff: Applying automated metrics to speech translation dialogs.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 178KB]

(2008) Farzad Ehsani, Jim Kimzey, Elaine Zuber, Demitrios Master, & Karen Sudre: Speech to speech translation for nurse patient interaction. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.54-59. [PDF, 92KB]

(2008) Andrew Finch & Eiichiro Sumita: Dynamic model interpolation for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.208-215. [PDF, 428KB]

(2008) Long Jiang & Ming Zhou: Generating Chinese couplets using a statistical MT approach. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.377-384. [PDF, 284KB]

(2008) Azza Abdel Monem, Khaled Shaalan, Ahmed Rafea, & Hoda Baraka: Generating Arabic text in multilingual speech-to-speech machine translation framework [abstract]. Machine Translation 22 (4), December 2008; pp.205-258.

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2007) Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, & Sachin Kajarekar: IraqComm: a next generation translation system. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2841-2844; abstract [PDF, 23KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Improving spoken language translation by automatic disfluency removal: evidence from conversational speech transcripts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.385-389 [PDF, 140KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Harold Somers: Theoretical and methodological issues regarding the use of language technologies for patients with limited English proficiency. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.206-213 [PDF, 637KB]; presentation [PDF, 464KB]

(2007) Ying Zhang & Stephan Vogel: PanDoRA: a large-scale two-way statistical machine translation system for hand-held devices. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.543-550 [PDF, 472KB]

(2006) Farzad Ehsani, Jim Kimzey, Demetrios Master, Karen Lesea & Hunil Park: Speech to speech translation for medical triage in Korean.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.17-23 [PDF, 96KB]

(2006) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth Narayanan: Cross-lingual dialog model for speech to speech translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1858; abstract [PDF, 74KB]

(2006) Diana Inkpen, Muath Alzghool, Gareth J.F.Jones & Douglas W.Oard: Investigating cross-language speech retrieval for a spontaneous conversational speech collection.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 61-64 [PDF, 43KB]

(2006) Djamal Mostefa, Olivier Hamon, & Khalid Choukri: Evaluation of automatic speech recognition and speech language translation within TC-STAR: results from the first evaluation campaign.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.149-154 [PDF, 388KB]

(2006) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Marianne Santaholma, Marianne Starlander, Beth Ann Hockey, Yukie Nakao, Hitoshi Isahara & Kyoko Kanzaki: MedSLT: a limited-domain unidirectional grammar-based medical speech translator.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.44-47 [PDF, 75KB]

(2006) Stephanie Seneff, Chao Wang & John Lee: Combining linguistic and statistical methods for bi-directional English Chinese translation in the flight domain. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.213-222 [PDF, 152KB]

(2006) Toshiyuki Takezawa & Tohru Shimizu: Performance improvement of dialog speech translation by rejecting unreliable utterances. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1100; abstract [PDF, 78KB]

(2005) Naoki Asanoma, Setsuo Yamada, Osamu Furuse, & Masahiro Oku: Building a conversation corpus by text derivation from "germ dialogs". 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 27-32. [PDF, 82KB]

(2005) Emil Ettelaie, Sudeep Gandhe, Panayiotis Georgiou, Kevin Knight, Daniel Marcu, Shrikanth Narayanan, David Traum, & Robert Belvin: Transonics: a practical speech-to-speech translator for English-Farsi medical dialogues.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 89-92  [PDF, 352KB]

(2005) Manny Rayner, Nikos Chatzichrisafis, Pierrette Bouillon, Yukie Nakao, Hitoshi Isahara, Kyoko Kanzaki, Beth Ann Hockey, Marianne Santaholma, & Marianne Starlander: Japanese speech understanding using grammar specialization. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 26-27. [PDF, 54KB]

(2005) Mutsuko Tomokiyo, Gérard Chollet, & Solange Hollard: Studies of emotional expression sin oral dialogues: towards an extension of Universal Networking Language. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.286-299 [abstract, PDF, 13KB]

Discontiguous relations

(2009) Josep M. Crego & François Yvon: Gappy translation units under left-to-right SMT decoding. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.66-73. [PDF, 551KB]

(2009) Hongfei Jiang, Sheng Li, Muyun Yang, & Tiejun Zhao: A study of translation rule classification for syntax-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.45-50. [PDF, 466KB]

(2009) Jan Niehues & Muntsin Kolss: A POS-based model for long-range reorderings in SMT. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.206-214. [PDF, 200KB]

(2009) Jun Sun, Min Zhang, & Chew Lim Tan: A non-contiguous tree sequence alignment-based model for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.914-922. [PDF, 459KB]

(2008) Dimitra Anastasiou: Identification of idioms by machine translation: a hybrid research system vs. three commercial systems.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.12-20. [PDF, 498KB]

(2008) Yanqing He & Chengqing Zong: A generalized reordering model for phrase-based statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.117-124. [PDF, 661KB]

(2008) Adam David Lopez: Machine translation by pattern matching. PhD thesis, University of Edinburgh, 2008. 138pp. [PDF, 1171KB]

(2007) Rens Bod: Unsupervised syntax-based machine translation: the contribution of discontiguous phrases. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.51-56 [PDF, 87KB]

(2006) Michael Carl & Ecaterina Rascu: A dictionary lookup strategy for translating of discontinuous phrases. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.49-57 [PDF, 197KB]

Discourse representation

(2009) Lorraine Goeuriot, Emmanuel Morin & Béatrice Daille: Compilation of specialized comparable corpora in French and Japanese. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.55-63. [PDF, 186KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Textual entailment features for machine translation evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.37-41. [PDF, 378KB]

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2008) Vivek Kumar Sridhar, Srinivas Bangalore, & Shrikanth S.Narayanan: Factored translation models for enriching spoken language translation with prosody. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2723-2726; abstract [PDF, 60KB]

(2006) Arendse Bernth: EasyEnglish Analyzer: taking controlled language from sentence to discourse level. CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 7pp. [PDF, 116KB]

(2006) Eva Hajičová: Old linguists never die, they only get obligatorily deleted. Computational Linguistics 32 (4), pp. 457-469. [PDF, 98KB]

(2006) Bing Zhao & Eric P. Xing: BiTAM: bilingual topic admixture models for word alignment. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.969-976. [PDF, 574KB]

(2005) Ralf Brown: Context-sensitive retrieval for example-based translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.9-15. [PDF, 273KB]

Ellipsis

(2009) Karin Harbusch & Gerard Kempen: Generating clausal coordinate ellipsis multilingually: a uniform approach based on postediting. Proceedings of the 12th European Workshop on Natural Language Generation, March 30-31, 2009, Athens, Greece; pp.138-145. [PDF, 389KB]

(2009) Gumwon Hong, Seung-Wook Lee & Hae-Chang Rim: Bridging morpho-syntactic gap between source and target sentences for English-Korean statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.233-236. [PDF, 178KB]

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorical grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

(2006) Daniel Hardt: [review of] Marjorie J.McShane, A theory of ellipsis. Machine Translation 20 (3),2006; pp.221-222. [see publication]

Emphasis and stress

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese [abstract]. Machine Translation 22 (4), December 2008; pp.181-203.

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.78-87. [PDF, 660KB]

Entailment (see also Logical aspects)

(2009) Shachar Mirkin, Lucia Specia, Nicola Cancedda, Ido Dagan, Marc Dymetman, & Idan Szpektor: Source-language entailment modeling for translating unknown terms. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.791-799. [PDF, 152KB]

(2009) Sebastian Padó, Daniel Cer, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Measuring machine translation quality as semantic equivalence: a metric based on entailment features [abstract]. Machine Translation 23 (2/3), 2009; pp.181-193.

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2005) Dekai Wu: Recognizing paraphrases and textual entailment using inversion transduction grammars.  ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-30. [PDF, 113KB]

Focus

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese [abstract]. Machine Translation 22 (4), December 2008; pp.181-203.

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.78-87. [PDF, 660KB]

Function words

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

Functional Generative Grammar

 (2008) Ondřej Bojar: Exploiting linguistic data in machine translation. Doctoral thesis, Charles University, Faculty of Mathematics and Physics, Prague 2008; 135pp. [PDF, 1045KB]

(2007) Keith Hall & Petr Němec: Generation in machine translation from deep syntactic trees. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.57-64 [PDF, 237KB]

Grammar formalisms see also: Case grammar, Categorial grammar, Context-free grammars, Definite Clause Grammar, Fulcrum approach, Functional Generative Grammar, Generalised Phrase Structure Grammar, Head-driven Phrase Struture Grammar, Inversion transduction grammar, Left Associative Grammar, Lexical Functional Grammar, Meaning-text model, Montague grammar, Principles and parameters, Role and reference grammar, Transduction grammars, Transformational-generative grammar, Tree Adjoining Grammar, Unification grammar

(2009) Anders Søgaard: On the complexity of alignment problems in two synchronous grammar formalisms.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.60-68. [PDF, 161KB]

(2009) Eric Wehrli, Luka Nerima & Yves Scherrer: Deep linguistic multilingual translation and bilingual dictionaries.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.90-94. [PDF, 71KB]

(2008) Anders Søgaard: Range concatenation grammars for translation. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.103-106. [PDF, 130KB]

(2007) Kevin Knight: Capturing practical natural language transformations [abstract]. Machine Translation 21 (2), June 2007; pp.121-133.

(2007) Fai Wong : The parsing algorithm of translation corresponding tree (TCT) grammar . Third Asian-Pacific Congress on Computational Mechanics (APCOM'07) in conjunction with Eleventh International Conference on the Enhancement and Promotion of Computational Methods in Engineering and Science (EPMESC XI), December 3-6, 2007, Kyoto, Japan; 10pp. [PDF, 294KB]

(2006) Liang Huang: Efficient algorithms for richer formalisms: parsing and machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 223-226 [PDF, 87KB]

(2006) Slim Mesfar: Standard Arabic formalization and linguistic platform for its analysis. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.84-94. [PDF, 226KB]

Grammars and grammar acquisition

(2009) Vamshi Ambati, Alon Lavie, & Jaime Carbonell: Extraction of syntactic translation models from parallel data using syntax from source and target languages. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.190-197. [PDF, 251KB]

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Steve DeNeefe & Kevin Knight: Synchronous tree adjoining machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.727-736. [PDF, 398KB]

(2009) Jorge González & Francisco Casacuberta: GREAT: a finite-state machine translation toolkit implementing a grammatical inference approach for transducer inference (GIATI). Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference, Athens, Greece, 30 March 2009; pp.24-32. [PDF, 125KB]

(2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: Compiling hierarchical word reordering and decoding in phrase-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.78-86. [PDF, 1161KB]

(2009) Markus Saers, Joakim Nivre, & Dekai Wu: Learning stochastic bracketing inversion transduction grammars with a cubic time biparsing algorithm. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 29-32. [PDF, 126KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Benjamin Snyder, Tahira Naseem, & Regina Barzilay: Unsupervised multilingual grammar induction. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.73-81. [PDF, 273KB]

(2009) István Varga & Soichi Yokoyama: Transfer rule generation for a Japanese-Hungarian machine translation system. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.356-362. [PDF, 152KB]

(2008) Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey, & Manny Rayner: Developing non-European translation pairs in a medium-vocabulary medical speech translation system. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Inductive detection of language features via clustering minimal pairs: towards feature-rich grammars in machine translation.  Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.78-86. [PDF, 246KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Greg Hanneman, Edmund Huber, Abhaya Agarwal, Vamshi Ambati, Alok Parlikar, Erik Peterson, & Alon Lavie: Statistical transfer systems for French-English and German-English machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.163-166. [PDF, 72KB]

(2008) Michael Jellinghaus: A data-driven approach to deep machine translation. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 28 slides. [PDF, 405KB]

(2008) Alon Lavie, Alok Parlikar, & Vamshi Ambati: Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.87-95. [PDF, 1610KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Marianne Santaholma: Multilingual grammar resources in multilingual application development. Coling 2008: Proceedings of the Workshop on Grammar Engineerings across Frameworks, 24 August 2008, Manchester, UK; pp.25-32. [PDF, 108KB]

(2008) Hao Zhang, Daniel Gildea, & David Chiang: Extracting synchronous grammar rules from word-level alignments in linear time. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1081-1088. [PDF, 181KB]

(2007) Krzysztof Jassem & Tomasz Kowalski: Machine translation using scarce bilingual corpora. TASK Quarterly 11, no.1-2, 21-33. [PDF, 216KB]

(2007) Felipe Sánchez-Martínez & Mikel L.Forcada: Automatic induction of shallow-transfer rules for open-source machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.181-190 [PDF, 336KB]; poster [PDF, 29KB]

(2006) Yuan Ding & Martha Palmer: Better learning and decoding for syntax based SMT using PSDIG. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.37-45 [PDF, 391KB]

(2006) Krzysztof Jassem & Kowalski Tomasz: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Janna Khegai: Grammatical framework (GF) for MT in sublanguage domains. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.95-104 [PDF, 252KB]

(2006) Tracy Holloway King: [review of] Hristo Georgiev, English algorithmic grammar. Machine Translation 20 (3),2006; pp.217-220. [see publication]

(2006) Rebecca Nesson, Stuart Shieber & Alexander Rush: Induction of probabilistic synchronous tree-insertion grammars for machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.128-137 [PDF, 376KB]

(2005) Polat Kadir, Koichi Yamada & Hiroshi Kinukawa: Comparative study on Japanese and Uyghur grammars for an English-Uyghur machine translation system . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.432-437. [PDF, 304KB]

(2005) Jonas Kuhn: Parsing word-aligned parallel corpora in a grammar induction context. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 17-25. [PDF, 150KB]

Head-driven Phrase Structure Grammar

(2009) Xianchao Wu, Takuya Matsuzaki, Naoaki Okazaki, Yusuke Miyao, & Jun’ichi Tsujii: The UOT system: improve string-to-tree translation using head-driven phrase structure grammar and predicate-argument structures. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 99-106. [PDF, 496KB]; poster [PDF, 304KB]

(2009) Yi Zhang, Rui Wang, & Stephan Oepen: Hybrid multilingual parsing with HPSG for SRL. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.31-36. [PDF, 115KB]

(2008) Francis Bond, Eric Nichols, Darren Scott Appling, & Michael Paul: Improving statistical machine translation by paraphrasing the training data. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.150-157 [PDF, 132KB]; presentation [PDF,189KB]

(2008) Janne Bondi Johannessen, Torbjørn Nordgård, & Lars Nygaard: Evaluation of linguistics-based translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 249KB]

(2006) Sara Stymne & Lars Ahrenberg: A bilingual grammar for translation of English-Swedish verb frame divergences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.9-17 [PDF, 139KB]

(2005) Francis Bond, Stephan Oepen, Melanie Siegel, Ann Copestake, & Dan Flickinger: Open Source machine translation with DELPH-IN  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.15-22. [PDF, 2700KB]

Historical linguistics

(2009) Walther von Hahn: Challenges in machine (aided) translation for historical languages. EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 734KB]

(2009) Roland Meyer: Tagging Old Russian texts via modern translations – applications and resources for historical MT. EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 142KB]

 (2009) Cristina Vertan: Machine (aided) translation for historical texts – an overview of current solutions. EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 453KB]

Homonyms, homographs and homophones (see also Word sense disambiguation)

(2009) Oi Yee Kwong: Homophones and tonal patterns in English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.21-24. [PDF, 204KB]

(2009) Oi Yee Kwong: Phonological context approximation and homophone treatment for NEWS 2009 English-Chinese transliteration shared task. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.76-79. [PDF, 178KB]

Idioms

(2008) Dimitra Anastasiou: Identification of idioms by machine translation: a hybrid research system vs. three commercial systems.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.12-20. [PDF, 498KB]

(2008) Piotr Bański & Radsław Moszczyński: Enhancing an English-Polish electronic dictionary for multiword expression research. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 56KB]

(2007) Koichi Takeuchi, Takashi Kanehila, Kazuki Hilao, Takeshi Abekawa, & Kyo Kageura: Flexible automatic look-up of English idiom entries in dictionaries. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.451-458 [PDF, 489KB]

(2006) Begoña Villada Moirón & Jörg Tiedemann: Identifying idiomatic expressions using word-alignment. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 291KB]

Imperatives and commands

(2009) Marcel Puig Portella & Eleni Papadopoulou: Treatment of the imperative forms in the machine translation between Catalan, Spanish and Greek.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.198-202 [abstract]

Interrogatives and questions

(2009) Rodolfo Delmonte, Antonella Bristot, Sara Tonelli, & Emanuele Pianta: English/Veneto resource poor machine translation with STILVEN.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.82-89 [abstract]

(2009) Xiaohong Wu: Building a linguistic database for Chinese interrogative sentences.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.260-268 [abstract]

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese [abstract]. Machine Translation 22 (4), December 2008; pp.181-203.

(2008) Chen-li Kuo & Allan Ramsay: Translating emphatic/contrastive focus from English to Mandarin Chinese. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.78-87. [PDF, 660KB]

Language identification

(2009) Hakan Ceylan & Yookyung Kim: Language identification of search engine queries. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.1066-1074. [PDF, 239KB]

(2009) David Kurokawa, Cyril Goutte & Pierre Isabelle: Automatic detection of translated text and its impact on machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 81-88. [PDF, 118KB]

(2009) Daniel Chung Yong Lim & Ian Lane: Language identification for speech-to-speech translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) Yo Ehara & Kumiko Tanaka-Ishii: Multilingual text entry using automatic language detection. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.441-448. [PDF, 428KB]

(2008) Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, & Robert Frederking: NineOneOne: recognizing and classifying speech for handling minority language emergency calls.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 95KB]

(2008) Hans van Halteren: Source language markers in EUROPARL translations.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.937-944. [PDF, 144KB]

(2005) Beatrice Alex: An unsupervised system for identifying English inclusions in German text. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.133-138. [PDF, 83KB]

(2005) Pavol Zavarsky, Yoshiki Mikami & Shota Wada: Language and encoding scheme identification of extremely large sets of multilingual text . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.354-355. [PDF, 84KB]

Language universals

 (2008) Gloria Corpas Pastor, Ruslan Mitkov, Naveed Afzal, & Viktor Pekar: Translation universals: do they exist? A corpus-based NLP study of convergence and simplification. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.75-81. [PDF, 566KB]

(2005) Luis Iraola: Using WordNet for linking UWs to the UNL UW system. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.370-379 [abstract, PDF, 59KB]

Latent semantic analysis

(2009) Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Xian Wu & Zhong Su: Domain adaptation with latent semantic association for named entity recognition. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.281-289. [PDF, 203KB]

(2008) Brett W.Bader & Peter A.Chew: Enhancing multilingual latent semantic analysis with term alignment information.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.49-56. [PDF, 295KB]

(2008) Peter A.Chew, Brett W.Bader, & Ahmed Abdelali: Latent morpho-semantic analysis: multilingual information retrieval with character n-grams and mutual information. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.129-136. [PDF, 117KB]

(2007) Peter A.Chew & Ahmed Abdelali: Benefits of the ‘massively parallel Rosetta stone’: cross-language information retrieval with over 30 languages. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 872-879 [PDF, 145KB]

(2007) Yonggang Deng & Yuqing Gao: Guiding statistical word alignment models with prior knowledge. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 1-8 [PDF, 245KB]

(2007) Yik-Cheung Tam, Ian Lane, & Tanja Schultz: Bilingual LSA-based adaptation for statistical machine translation [abstract].  Machine Translation 21 (4) December 2007; pp.187-207.

(2007) Yik-Cheung Tam, Ian Lane, & Tanja Schultz: Bilingual-LSA based LM adaptation for spoken language translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 520-527 [PDF, 162KB]

(2007) Yik-Cheung Tam & Tanja Schultz: Bilingual LSA-based translation lexicon adaptation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2461-2464; abstract [PDF, 23KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Florence Reeder: Measuring MT adequacy using Latent Semantic Analysis.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.176-184 [PDF, 180KB]

Lexical Functional Grammar

 (2009) Yvette Graham, Josef van Genabith, & Anton Bryl: F-structure transfer-based statistical machine translation. Proceedings of the LFG09 conference, Miriam Butt and Tracy Holloway King (eds.) CSLI Publications, 2009; pp.318-337. [PDF, 1227KB]

(2008) Tirthankar Dasgupta, Sandipan Dandapat, & Anupam Basu: Prototype machine translation system from text-to-Indian sign language. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.19-26. [PDF, 149KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Yuqing Guo, Josef van Genabith, & Haifeng Wang: Dependency-based n-gram models for general purpose sentence realisation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.297-304. [PDF, 138KB]

(2008) Janne Bondi Johannessen, Torbjørn Nordgård, & Lars Nygaard: Evaluation of linguistics-based translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 249KB]

(2007) Mark Hopkins & Jonas Kuhn: Deep grammars in a tree labeling approach to syntax-based statistical machine translation. ACL 2007: proceedings of the Workshop on Deep Linguistic Processing, Prague, Czech Republic, 28 June 2007; pp.33-40 [PDF, 156KB]

(2007) Karolina Owczarzak & Josef van Genabith: Evaluating machine translation with LFG dependencies [abstract]. Machine Translation 21 (2), June 2007; pp.95-119.

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Labelled dependencies in machine translation evaluation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 104-111 [PDF, 239KB]

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Dependency-based automatic evaluation for machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.80-87 [PDF, 245KB]

(2006) Yafa Al-Raheb, A.Akrout, J. van Genabith, & J. Dichy: DCU 250 Arabic dependency bank: an LFG gold standrad resource for the Arabic Penn treebank.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.105-116. [PDF, 355KB]

Linguistics and MT

(2008) Alexander Täuschel: Linguistic aspects in machine translation. [Offenbach a.M.: private], 2008. 22pp. [PDF, 325KB]

(2007) Doug Arnold, Louis Des Tombe, & Steven Krauwer: MT in the 1970s and 1980s. [priv.publ., April 2007]. 16pp. [PDF, 76KB]

(2007) Horacio Rodríguez: [review of] Kornai, András, Mathematical linguistics. Machine Translation 21 (4),December 2007; pp.253-256. [see publication]

(2007) Karen Spärck Jones: Computational linguistics: what about the linguistics?  Computational Linguistics 33 (3), pp. 437-441. [PDF, 48KB]

(2006) Marcello Federico: [review of] Mark Johnson, Sanjeev P.Khudanpur, Mari Ostendorf and Roni Rosenfeld (eds.), Mathematical foundations of speech and language processing. Machine Translation 20 (3),2006; pp.223-225. [see publication]

(2006) Eva Hajičová: Old linguists never die, they only get obligatorily deleted. Computational Linguistics 32 (4), pp. 457-469. [PDF, 98KB]

(2006) Daniel Marcu & Alan K.Melby: Data-driven machine translation: a conversation with linguistics and translation studies. AMTA 2006:  7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; invited talk [PDF of PPT presentation, 998KB]; expanded version with annotations and revisions, January 2008; PDF, 768KB]

Loan words

(2006) Badam-Osor Khaltar, Atsushi Fujii, & Testuya Ishikawa: Extracting loan words from Mongolian corpora and producing a Japanese-Mongolian bilingual dictionary. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.657-664. [PDF, 297KB]

Logical aspects (see also Entailment)

(2006) Dennis Spohr & Ulrich Heid: Modeling monolingual and bilingual collocation dictionaries in description logics. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.65-72 [PDF, 357KB]

(2005) Brian Murphy & Carl Vogel: A comparative evaluation of UNL participant relations using a five-language parallel corpus. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.64-76 [abstract, PDF, 137KB]

Meaning-text model

(2008) A.S.Andreyeva: Lexical-functional correspondences and their use in the system of machine translation ETAP-3. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.64-72. [PDF,  164KB]

(2008) Igor Mel’čuk & Leo Wanner: Morphological mismatches in machine translation [abstract]. Machine Translation 22 (3), September 2008; pp.101-152.

(2008) Simon Mille & Leo Wanner: Multilingual summarization in practice: the case of patent claims. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.120-129. [PDF, 602KB]

(2006) Igor Mel’čuk & Leo Wanner: Syntactic mismatches in machine translation [abstract]. Machine Translation 20 (2),2006; pp.81-138.

(2005) Igor M.Boguslavsky, Leonid L.Iomdin & Victor G.Sizov: Interactive enconversion by means of the Etap-3 system. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.230-240 [abstract, PDF, 72KB]

Minimal Recursion Semantics

(2008) Janne Bondi Johannessen, Torbjørn Nordgård, & Lars Nygaard: Evaluation of linguistics-based translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 249KB]

(2006) Stephan Oepen & Jan Tore Lønning: Discriminant-based MRS banking.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1250-1255 [PDF, 430KB]

(2005) Dan Flickinger, Jan Tore Lønning, Helge Dyvik, Stephan Oepen, & Francis Bond: SEM-I rational MT: enriching deep grammars with a semantic interface for scalable machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.165-172. [PDF, 249KB]

Minority languages

(2009) Vamshi Ambati & Jaime Carbonell: Proactive learning for building machine translation systems for minority languages. NAACL-HLT-2009: Active Learning for Natural Language Processing (ALNLP-09), Proceedings of the workshop, June 5, 2009, Boulder, Colorado; pp.58-61. [PDF, 101KB]

(2009) Hendrik J.Groenewald & Wildrich Fourie: Introducing the Autshumato integrated translation environment. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.190-196. [PDF , 410KB]

(2009) Lori Levin: Adaptable, community-controlled, language technologies for language maintenance. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.8-11 [PDF, 223KB]

(2009) Nicholas Ostler: “The jungle is neutral” – newcomer languages face new media. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.1-7 [PDF, 265KB]

(2009) Francis M.Tyers, Linda Wiechetek, & Trond Trosterud: Developing prototypes for machine translation between two Sámi languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.120-127. [PDF, 340KB]

(2008) Scott Hourin: Language technology in humanitarian aid and disaster response. [Abstract of keynote speech at] IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, Hawaii, USA.

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, & Robert Frederking: NineOneOne: recognizing and classifying speech for handling minority language emergency calls.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 95KB]

(2008) Udhyakumar Nallasamy, Alan W.Black, Tanja Schultz, Robert Frederking, & Jerry Weltman: Speech translation for triage of emergency phonecalls in minority languages. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.48-53. [PDF, 229KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl, & Paul Schmidt: Evaluation of a machine translation system for low resource languages: METIS-II.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 78KB]

(2007) Michael Cronin: [review of] Albert Branchadell and Lovell Margaret West (eds.) Less translated languages. Machine Translation 21 (1), March 2007; pp.69-71. [see publication]

(2007) Harold Somers: Theoretical and methodological issues regarding the use of language technologies for patients with limited English proficiency. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.206-213 [PDF, 637KB]; presentation [PDF, 464KB]

(2006) proceedings of 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, LREC-2006: Fifth International Confe rence on Language Resources and Evaluation, Genoa, Italy, 23 May 2006. [PDF, 5391KB]

(2006) Oliver Streiter, Kevin P.Scanell, & Martin Stuflesser: Implementing NLP projects for noncentral languages: instructions for funding bodies, strategies for developers [abstract]. Machine Translation 20 (4),2006; pp.267-289.

(2005) Alison Alvarez, Lori Levin, Robert Frederking, Erik Peterson & Jeff Good: Semi-automated elicitation corpus generation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.388-395. [PDF, 131KB]

(2005) Stephen Beale, Sergei Nirenburg, Marjorie McShane, & Tod Allman: Document authoring the Bible for minority language translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.63-70. [PDF, 415KB]

(2005) Petr Homola & Vladislav Kuboň: A machine translation system into a minority language. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.31-35 [PDF, 178KB]

Modality

(2005) Seong-Bae Park, Jeong-Woo Son, & Yoon-Shik Tae: Augmentation of modality translation rules in Korean-to-English machine translation by rule learning. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.93-100. [PDF, 2045KB]

Monologues

(2006) Hitomi Tohyama & Shigeki Matsubara: Collection of simultaneous interpreting patterns by using bilingual spoken monologue corpus. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2564-2569 [PDF, 552KB]

Multiword expressions [see also Nouns and Noun phrases, Prepositional phrases, Verbs and verb phrases]

(2009) Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, & Jason S.Chang: Acquiring translation equivalences of multiword expressions by normalized correlation frequencies. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.478-486. [PDF, 371KB]

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Zhixiang Ren, Yajuan Lü, Jie Cao, Qun Liu, & Yun Huang: Improving statistical machine translation using domain bilingual multiword expressions. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.47-54. [PDF, 170KB]

(2009) Massoud Sharifi-Atashgah & Mahmood Bijankhan: Corpus-based analysis for multi-token units in Persian. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 817KB]

(2009) Svetlana Sheremetyeva: On extracting multiword NP terminology for MT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.205-212. [PDF, 215KB]

(2009) R.Mahesh K.Sinha: Mining complex predicates in Hindi using a parallel Hindi-English corpus. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.40-46. [PDF, 112KB]

(2009) Eric Wehrli, Violeta Seretan, Luka Nerima, & Lorenza Russo: Collocations in a rule-based MT system: a case study evaluation of their translation adequacy. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.128-135. [PDF, 375KB]

(2009) Sina Zarrieß & Jonas Kuhn: Exploiting translational correspondences for pattern-independent MWE identification.  [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.23-30. [PDF, 132KB]

(2009) Ali Zogheib: Genetic algorithm-based multi-word automatic language translation. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.751-760. [PDF, 141KB]

(2008) A.S.Andreyeva: Lexical-functional correspondences and their use in the system of machine translation ETAP-3. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.64-72. [PDF,  164KB]

(2008) Piotr Bański & Radsław Moszczyński: Enhancing an English-Polish electronic dictionary for multiword expression research. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 56KB]

(2008) Béatrice Daille & Emmanuel Morin: A effective compositional model for lexical alignment. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp 95-102. [PDF, 401KB]

(2008) Daiga Deksne, Raivis Skadiņš, & Inguna Skadiņa: Dictionary of multiword expressions for translation into highly inflected languages.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 262KB]

(2008) María Fernández Parra & Pius ten Hacken: Beyond terms; multi-word units in Multiterm Extract. Translating and the Computer 30, 27-28 November 2008, London; 18pp. [PDF, 157KB]

 (2007) David Hardcastle: Generalizing syntactic collocates for creative language generation. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.13-21 [PDF, 2114KB]

(2006) Asif Ekbal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A modified joint source-channel model for transliteration. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.191-198. [PDF, 159KB]

(2006) Patrik Lambert & Rafael Banchs: Grouping multi-word expressions according to part-of-speech in statistical machine translation. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.9-16 [PDF, 318KB]

(2006) Paul McNamee & James Mayfield: Translation of multiword expressions using parallel suffix arrays.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.100-109 [PDF, 733KB]

(2006) Scott S.L.Piao, Guangfan Sun, Paul Rayson, & Qi Yuan: Automatic extraction of Chinese multiword expressions with a statistical tool. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 396KB]

(2006) Reinhard Rapp & Carlos Martin Vide: Example-based machine translation using a dictionary of word pairs.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1268-1273 [PDF, 352KB]

(2006) Violeta Seretan & Eric Wehrli: Accurate collocation extraction using a multilingual parser. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.953-960. [PDF, 168KB]

(2006) Serge Sharoff, Bogdan Babych, & Anthony Hartley: Using comparable corpora to solve problems difficult for human translators. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.739-746. [PDF, 250KB]

(2006) Serge Sharoff, Bogdan Babych, & Anthony Hartley: Using collocations from comparable corpora to find translation equivalents.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.465-470 [PDF, 1104KB]

(2006) Dennis Spohr & Ulrich Heid: Modeling monolingual and bilingual collocation dictionaries in description logics. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.65-72 [PDF, 357KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

(2006) Tamás Váradi: Multiword units in an MT lexicon. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 73-78 [PDF, 635KB]

(2006) Sriram Venkatapathy & Aravind K.Joshi: Using information about multi-word expressions for the word-alignment task. Coling-ACL 2006: Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, Sydney, July 2006; pp.20-27 [PDF, 120KB]

 (2006) Joachim Wagner: [review of] Nadja Nesselhauf, Collocations in a learner corpus. Machine Translation 20 (4),2006; pp.301-303. [see publication]

(2005) Marcello Federico & Nicola Bertoldi: A word-to-phrase statistical translation model. ACM Transactions on Speech and Language Processing 2 (2), pp.1-24. [abstract, PDF, 13KB]

(2005) Young-Sook Hwang & Yutaka Sasaki: Context-dependent SMT model using bilingual verb-noun collocation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 549-556. [PDF, 1005KB]

(2005) Patrik Lambert & Rafael Banchs: Data inferred multi-word expressions for statistical machine translation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.396-403. [PDF, 182KB]

Names and named entities

 (2009) see [ACL-IJCNLP 2009] NEWS 2009. Named Entities Workshop: shared task on transliteration. Proceedings of the Workshop, Suntec, Singapore, 7 August 2009

 (2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Ming-Wei Chang, Dan Goldwasser, Dan Roth, & Yuancheng Tu: Unsupervised constraint driven learning for transliteration discovery. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.299-307. [PDF, 580KB]

(2009) Loic Dugast, Jean Senellart, & Philipp Koehn: Statistical post editing and dictionary extraction: Systran/Edinburgh submissions for ACL-WMT2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.110-114. [PDF, 245KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Izaskun Fernandez, Iñaki Alegria, & Nerea Ezeiza: Using Wikipedia for named-entity translation. [SEPLN 2009] SALTMIL 2009, Donostia-San Sebastián, Spain. “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7; pp.27-35. [PDF, 550KB]

(2009) Tatiana Gornostay & Inguna Skadiņa: English-Latvian toponym processing: translation strategies and linguistic patterns. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.81-87. [PDF, 266KB]

(2009) Filip Graliński, Krzysztof Jassem, & Michał Marcińczuk: An environment for named entity recognition and translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.88-95. [PDF, 430KB]

(2009) Filip Graliński, Krzysztof Jassem, & Michal Marcińczuk: Named entity recognition in machine anonymization. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.77-89. [PDF, 144KB]

(2009) Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Xian Wu & Zhong Su: Domain adaptation with latent semantic association for named entity recognition. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.281-289. [PDF, 203KB]

(2009) Martin Jansche & Richard Sproat: Named entity transcription with pair n-gram models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.32-35. [PDF, 113KB]

(2009) Heng Ji: Mining name translations from comparable corpora by creating bilingual information networks. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.34-47. [PDF, 120KB]

(2009) Alexandre Klementiev & Dan Roth: Names entity transliteration and discovery in multilingual corpora.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.79-92.

(2009) Maoxi Li, Jiajun Zhang, Yu Zhou, & Chengqing Zong: The CASIA statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 83-90. [PDF, 317KB]; presentation [PDF of PPT, 463KB]

(2009) Djamel Mostefa, Mariama Laïb, Stéphane Chaudiron, Khalid Choukri, & Gaël de Chalendar: A multilingual named entity corpus for Arabic, English and French.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.213-216. [PDF, 571KB]

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2009) Peter Nabende: Evaluation of dynamic Bayesian network models for entitiy name transliteration. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; pp.99-100. [PDF, 151KB]

(2009) Sara Noeman: Transliteration using phrase based SMT approach on substrings.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.199-204. [PDF, 348KB]

(2009) Bruno Pouliquen & Ralf Steinberger: Automatic construction of multilingual name dictionaries.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.59-78.

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2009) Aarthi Reddy, Richard Rose, Hani Safadi, Samuel Larkin & Gilles Boulianne: Incorporating knowledge of source language text in a system for dictation of document translations. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 112-119. [PDF, 167KB]

(2009) R.Mahesh K.Sinha: Automated mining of names using parallel Hindi-English corpus. ACL-IJCNLP-2009: 7th Workshop on Asian Language Resources (ALR-7), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp. 11-18. [PDF, 312KB]

(2009) Raghavendra Udupa, K.Saravanan, A.Kumaran, & Jagadeesh Jagarlamudi: MINT: a method for effective and scalable mining of named entity transliterations from large comparable corpora.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.799-807. [PDF, 1120KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2008) Bogdan Babych & Anthony Hartley: Sensitivity of automated MT evaluation metrics on higher quality MT output: BLEU vs task-based evaluation methods. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

(2008) Sivaji Bandyopadhyay: Multilingual named entity recognition. IJCNLP 2008: Workshop on NER for South and South East Asian Languages, Proceedings of the workshop, 12 January 2008, Hyderabad, India; pp.3-4. [PDF, 37KB]

(2008) Stephan Busemann & Yajing Zhang: Identifying foreign person names in Chinese text. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 255KB]

(2008) Asif Ekbal, Rejwanul Haque, Amitava Das, Venkateswarlu Poka, & Sivaji Bandyopadhyay: Language independent named entity recognition in Indian languages. IJCNLP 2008: Workshop on NER for South and South East Asian Languages, Proceedings of the workshop, 12 January 2008, Hyderabad, India; pp.33-40. [PDF, 160KB]

(2008) Dan Goldwasser & Dan Roth: Active sample selection for named entity transliteration. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 53-56. [PDF, 538KB]

(2008) Dan Goldwasser & Dan Roth: Transliteration as constrained optimization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.353-362. [PDF, 645KB]

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2008) Fei Huang, Ahmad Emami, & Imed Zitouni: When Harry met Harri, [Arabic name] and [Chinese name]: cross-lingual name spelling normalization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.391-399. [PDF, 165KB]

(2008) Shasha Liao: Combining source and target language information for name tagging of machine translation output. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 19-24. [PDF, 98KB]

(2008) Inderjeet Mani, Alex Yeh, & Sherri Condon: Learning to match names across languages. Coling 2008: Proceedings of the 2nd Workshop on Multi-source, Multilingual Information Extraction and Summarization, 23 August 2008, Manchester, UK; pp.2-9. [PDF, 182KB]

(2008) Denis Maurel: Prolexbase: a multilingual relational lexical database of proper names. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 145KB]

(2008) Alexander E. Richman & Patrick Schone: Mining Wiki resources for multilingual named entity recognition. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1-9. [PDF, 134KB]

(2008) K.Saravanan & A.Kumaran: Some experiments in mining named entitiy transliteration pairs from comparable corpora. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.26-33. [PDF, 805KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2008) Kristen Summers & Diane Chandler: Language processing for analysis and investigation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.454-456. [PDF, 647KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

(2008) Yu-Chun Wang, Richard Tzong-Han Tsai, & Wen-Lian Hsu: Learning patterns from the web to translate named entities for cross language information retrieval.  IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.281-288. [PDF, 464KB]

(2008) Wolodja Wentland, Johannes Knopp, Carina Silberer, & Matthias Hartung: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 451KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Bogdan Babych & Anthony Hartley: Sensitivity of automated models for MT evaluation: proximity-based vs. performance-based methods. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 22pp. [PDF of PPT presentation, 150KB]

(2007) Guihong Cao, Jianfeng Gao, & Jian-Yun Nie: A system to mine large-scale bilingual dictionaries from monolingual web pages. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.57-64 [PDF, 539KB]

(2007) Dan Goldwasser, Ming-Wei Chang, Yuancheng Tu & Dan Roth: Constraint-driven transliteration discovery. In: Recent Advances in Natural Language Processing: selected papers from RANLP 2007, ed. Nicolas Nikolov, Galia Angelova, Ruslan Mitkov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.21-40. [PDF, 525KB]

(2007) Chu-Ren Huang, Petr Šimon, & Shu-Kai Hsieh: Automatic discovery of named entity variants – grammar-driven approaches to non-alphabetical transliterations. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 153-156 [PDF, 246KB]

 (2007) Heng Ji & Ralph Grishman: Collaborative entity extraction and translation. In: Recent Advances in Natural Language Processing: selected papers from RANLP ’97, ed. Nicolas Nikolov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.73-84. [PDF, 303KB]

(2007) Long Jiang, Ming Zhou, Lee-Feng Chien & Cheng Niu: Named entity translation with web mining and transliteration. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.1629-1634. [PDF, 491KB]

(2007) Sarvnaz Karimi, Andrew Turpin, & Falk Scholer: Corpus effects on the evaluation of automated transliteration systems. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 640-647 [PDF, 1215KB]

(2007) Sarvnaz Karimi, Falk Scholer, & Andrew Turpin: Collapsed consonant and vowel models: new approaches for English-Persian transliteration and back-transliteration.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 648-655 [PDF, 195KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2007)  S.Saleem, K.Subramanian, R.Prasad, David Stallard, Chia-Lin Kao, P.Natarajan, & R.Suleiman: Improvements in machine translation for English/Iraqi speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2445-2448; abstract [PDF, 23KB]

(2007) Khaled Shaalan & Hafsa Raza: Person name entity recognition for Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 17-24 [PDF, 591KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2007)  Jian-Cheng Wu & Jason S.Chang: Learning to find English to Chinese transliterations on the web. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 187KB]

(2007) Chien-Cheng Wu & Jason S.Chang: Learning to find transliteration on the Web. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Demonstration Program, pp.21-22 [PDF, 1621KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2006) Iñaki Alegria, Nerea Ezeiza, & Izaskun Fernandez: Named entities translation based on comparable corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 455KB]

(2006) Marine Carpuat, Yihai Shen, Xiaofeng Yu, & Dekai Wu: Toward integrating word sense and entity disambiguation into statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 37-44  [PDF, 330KB]

(2006) Conrad Chen & Hsin-Hsi Chen: A high-accurate Chinese-English NE backward translation system combining both lexical information and web statistics.  Coling-ACL 2006: Proceedings of the Coling/ACL 2 006 Main Conference Poster Sessions, Sydney, July 2006; pp.81-88. [PDF, 313KB]

(2006) Sherri Condon & Keith J.Miller: Name transliteration: current methods, applications, and evaluation in name transliteration and name translation. Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 46pp. [PDF of PPT presentation, 1103KB]

(2006) Sherri Condon & Keith J. Miller: What’s in a name: current methods, applications, and evaluation in multilingual name search and matching. [Abstract for tutorial, in] HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 299-300 [PDF, 20KB]

(2006) Asif Ekbal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A modified joint source-channel model for transliteration. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.191-198. [PDF, 159KB]

(2006) Andrew T. Freeman, Sherri L. Condon & Christopher M. Ackerman: Cross linguistic name matching in English and Arabic: a “one to many mapping” extension of the Levenshtein edit distance algorithm. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 471-478 [PDF, 371KB]

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

(2006) Mehdi M. Kashani, Fred Popowich, & Fatiha Sadat: Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.76-83. [PDF, 179KB]

(2006) Alexandre Klementiev & Dan Roth: Weakly supervised named entity transliteration and discovery from multilingual comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.817-824. [PDF, 173KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2006) Richard Sproat, Tao Tao, & ChenXiang Zhai: Named entity transliteration with comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.73-80. [PDF, 232KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

(2005) Ondřej Bojar, Petr Homola, & Vladislav Kuboň: Problems of reusing an existing MT system. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.179-184. [PDF, 73KB]

(2005) Hany Hassan & Jeffrey Sorensen: An integrated approach for Arabic-English named entity translation. ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 87-93. [PDF, 108KB]

(2005) Fei Huang: Cluster-specific named entity transliteration. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 435-444. [PDF, 169KB]

(2005) Fei Huang, Ying Zhang, & Stephan Vogel: Mining key phrase translations from web corpora. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 483-490. [PDF, 310KB]

(2005) Thamar Solorio: Exploiting named entity taggers in a second language. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.25-30.[PDF, 70KB]

(2005) Gregor Thurmair: Improving machine translation quality. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit: invited paper [PDF, 209KB]

Nouns and noun phrases

(2009) Lars Bungum & Stephan Oepen: Automatic translation of Norwegian noun compounds. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.136-143. [PDF, 465KB]

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Le Thi Sinh: French-Vietnamese noun phrase translation.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.158-163 [abstract]

(2009) Svetlana Sheremetyeva: On extracting multiword NP terminology for MT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.205-212. [PDF, 215KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Sara Stymne & Maria Holmqvist: Processing of Swedish compounds for phrase-based statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.182-191. [PDF, 532KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Sanjika Hewavitharana, Alon Lavie, & Stephan Vogel: Experiments with a noun-phrase driven statistical machine translation system. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.247-253 [PDF, 241KB]

(2007) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Getting to know Moses: initial experiments on German-English factored translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 181-184 [PDF, 123KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

 (2007) Heng Ji & Ralph Grishman: Collaborative entity extraction and translation. In: Recent Advances in Natural Language Processing: selected papers from RANLP ’97, ed. Nicolas Nikolov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.73-84. [PDF, 303KB]

(2007) Rebecca Nesson & Stuart M.Shieber: Extraction phenomena in synchronous TAG syntax and semantics. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.9-16 [PDF, 696KB]

(2006) Chris Brockett, William B.Dolan, & Michael Gamon: Correcting ESL errors using phrasal SMT techniques.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.249-256. [PDF, 109KB]

(2006) Michael Carl & Ecaterina Rascu: A dictionary lookup strategy for translating of discontinuous phrases. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.49-57 [PDF, 197KB]

(2006) Gábor Pohl: English-Hungarian NP alignment in MetaMorpho TM. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.69-74 [PDF, 81KB]

(2006) Bettina Schrader: Non-probabilistic alignment of rare German and English nominal expressions.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1274-1277 [PDF, 251KB]

(2005) Patrik Lambert & Rafael Banchs: Data inferred multi-word expressions for statistical machine translation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.396-403. [PDF, 182KB]

Number (singular, plural)

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

Numbers and numerals

(2008) Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, & Ming Zhou: Measure word generation for English-Chinese SMT systems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 89-96. [PDF, 237KB]

Paraphrases

(2009) Anabela Barreiro & Luís Miguel Cabral: ReEscreve: a translator-friendly multi-pupose paraphrasing software tool. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 594KB]; presentation, 37pp. [PDF of PPT, 1344KB]

(2009) Chris Dyer, Hendra Setiawan, Yuval Marton, & Philip Resnik: The University of Maryland statistical machine translation system for the Fourth Workshop on Machine Translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.145-149. [PDF, 120KB]

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Shachar Mirkin, Lucia Specia, Nicola Cancedda, Ido Dagan, Marc Dymetman, & Idan Szpektor: Source-language entailment modeling for translating unknown terms. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.791-799. [PDF, 152KB]

(2009) Matthew G.Snover, Nitin Madnani, Bonnie Dorr, & Richard Schwartz: TER-Plus: paraphrase, semantic, and alignment enhancements to Translation Error Rate [abstract]. Machine Translation 23 (2/3), September 2009; pp.117-127.

(2009) Bo Wang, Tiejun Zhao, Muyun Yang, & Sheng Li: References extension for the automatic evaluation of MT by syntactic hybridization.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.37-44. [PDF, 378KB]

(2008) Francis Bond, Eric Nichols, Darren Scott Appling, & Michael Paul: Improving statistical machine translation by paraphrasing the training data. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.150-157 [PDF, 132KB]; presentation [PDF,189KB]

(2008) Chris Callison-Burch, Trevor Cohn, & Mirella Lapata: ParaMetric: an automatic evaluation metric for paraphrasing. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.97-104. [PDF, 232KB]

(2008) Chris Callison-Burch: Syntactic constraints on paraphrases extracted from parallel corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.196-205. [PDF, 187KB]

(2008) Atsushi Fujita & Satoshi Sato: A probabilistic model for measuring grammaticality and similarity of automatically generated paraphrases of predicate phrases. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.225-232. [PDF, 498KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Nitin Madnani, Philip Resnik, Bonnie J.Dorr, & Richard Schwartz: Are multiple reference translations necessary? Investigating the value of paraphrased reference translations in parameter optimization. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.143-152. [PDF, 655KB]

(2008) Aurélien Max & Michael Zock: Looking up phrase rephrasings via a pivot language. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.77-84. [PDF, 257KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Preslav Nakov: Improved statistical machine translation using monolingual paraphrases. ECAI 2008: 18th European Conference on Artificial Intelligence, July 21-25, Patras, Greece; pp.338-342. [PDF, 199KB]

(2008) Shiqi Zhao, Haifeng Wang, Ting Liu, & Sheng Li: Pivot approach for extracting paraphrase patterns from bilingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 780-788. [PDF, 188KB]

(2008) Shiqi Zhao, Cheng Niu, Ming Zhou, Ting Liu, & Sheng Li: Combining multiple resources to improve SMT-based paraphrasing model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1021-1029. [PDF, 188KB]

(2007) Pernilla Danielsson: In other words: using paraphrases in translation. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 11pp. [PDF, 51KB]

(2007) Nitin Madnani, Necip Fazil Ayan, Philip Resnick, & Bonnie J.Dorr: Using paraphrases for parameter tuning in statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 120-127 [PDF, 148KB]

(2006) Chris Callison-Burch, Philipp Koehn & Miles Osborne: Improved statistical machine translation using paraphrases.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 17-24 [PDF, 178KB]

(2006) David Kauchak & Regina Barzilay: Paraphrasing for automatic evaluation. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 455-462 [PDF, 196KB]

(2006) Igor Mel’čuk & Leo Wanner: Syntactic mismatches in machine translation [abstract]. Machine Translation 20 (2),2006; pp.81-138.

(2006) Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: Contextual bitext-derived paraphrases in automatic MT evaluation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 86-93 [PDF, 222KB]

(2006) Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, & Hitoshi Isahara: Automatic detection and semi-automatic revision of non-machine-translatable parts of a sentence.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.703-708 [PDF, 330KB]

(2006) Liang Zhou, Chin-Yew Lin, & Eduard Hovy: Re-evaluating machine translation results with paraphrase support.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 77-84. [PDF, 324KB]

(2005) Colin Bannard & Chris Callison-Burch: Paraphrasing with bilingual parallel corpora. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 597-604. [PDF, 196KB]

(2005) Chris Brockett & William B.Dolan: Support vector machines for paraphrase identification and corpus construction. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 1-8. [PDF, 115KB]

(2005) Michael Carl, Ecaterina Rascu, & Paul Schmidt: Using template-grammars for shake & bake paraphrasing. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 66-72. [PDF, 107KB]

(2005) William B.Dolan & Chris Brockett: Automatically constructing a corpus of sentential paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 9-16. [PDF, 115KB]

(2005) Andrew Finch, Young-Sook Hwang, & Eiichiro Sumita: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 17-24. [PDF, 111KB]

(2005) Atsushi Fujita & Kentaro Inui: A class-oriented approach to building a paraphrase corpus. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 25-32. [PDF, 300KB]

(2005) Richard Power & Donia Scott: Automatic generation of large-scale paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 73-79. [PDF, 49KB]

(2005) Dekai Wu: Recognizing paraphrases and textual entailment using inversion transduction grammars.  ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-30. [PDF, 113KB]

Parentheses

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

Parsing see Syntactic analysis, Tagging

Particles

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Craig Kopris: Endoclitics in Pashto: can they really do that?  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 6pp. [PDF, 485KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2005) R. Mahesh K. Sinha & Anil Thakur: Handling ki in Hindi for Hindi-English MT. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.356-363. [PDF, 289KB]

Phonology and phonetics

(2009) L.Baghai-Ravary: Moderating strong accents. Translating and the Computer 31, 19-20 November 2009, London; 13pp. [PDF, 186KB]

(2009) Colin Cherry & Hisami Suzuki: NEWS 2009 machine transliteration shared task system description: transliteration with letter-to-phoneme technology. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.69-71. [PDF, 289KB]

(2009) Priyanka Gupta & Vishal Goyal: Implementation of rule based algorithm for sandhi-vicheda of compound Hindi words. International Journal of Computer Science Issues, vol.3 (2009), pp. 45-49. [PDF, 122KB]

(2009) Oi Yee Kwong: Graphemic approximation of phonological context for English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.186-193. [PDF, 258KB]

(2009) Oi Yee Kwong: Phonological context approximation and homophone treatment for NEWS 2009 English-Chinese transliteration shared task. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.76-79. [PDF, 178KB]

(2009) Antoine Laurent, Paul Deléglise, & Sylvain Meignier: Grapheme to phonese conversion using an SMT system. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Mohammad Mehdi Arab & Ali Azimizadeh: Construction of a Persian letter-to-sound conversion system based on classification and regression tree. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 676KB]

(2009) Jong-Hoon Oh, Kiyotaka Uchimoto, & Kentaro Torisawa: Can Chinese phonemes improve machine transliteration?: a comparative study of English-to-Chinese transliteration models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.658-667. [PDF, 181KB]

(2009) Taraka Rama, Anil Kumar Singh, & Sudbeer Kolachina: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.90-95. [PDF, 404KB]

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2009) Sravana Reddy & Sonjia Waxmonsky: Substring-based transliteration with conditional random fields. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.92-95. [PDF, 90KB]

(2008) Jin-Shea Kuo, Haizhou Li, & Chih-Lung Lin: Mining transliterations from web query results: an incremental approach. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.16-23. [PDF, 400KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2006) Jin-Shea Kuo, Haizhou Li, & Ying-Kuei Yang: Learning transliteration lexicons from the Web. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1129-1136. [PDF, 428KB]

(2006) Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat, & ChengXiang Zhai: Unsupervised name entity transliteration using temporal and phonetic correlation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 250-257. [PDF, 139KB]

(2005) Karin Müller: Revealing phonological similarities between related languages from automatically generated parallel corpora.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 33-40. [PDF, 114KB]

Postpositions

(2008) Sibansu Mukhopadhyay, Biswanath Barik, & Sudeshna Sarkar: Problems of transfer grammar: the Hindi postpositional correlatives in Bangla. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 180KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

Pragmatics in MT

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2006) David Farwell & Stephen Helmreich: Pragmatics-based MT and the translation of puns. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.187- 193 [PDF, 157KB]

Prepositional phrases

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

(2006) Martin Volk: How bad is the problem of PP-attachment? A comparison of English, German and Swedish. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.81-88. [PDF, 133KB]

(2005) Rajat Kumar Mohanty, Ashish Francis Almeida & Pushpak Bhattacharyya: Prepositional phrase attachment and interlingua. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.241-253 [abstract, PDF, 15KB]

Prepositions

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2007) Samar Husain, Dipti Misra Sharma, & Manohar Reddy: Simple preposition correspondence: a problem in English to Indian language machine translation. ACL 2007: proceedings of the 4th ACL-SIGSEM Workshop on Prepositions, Prague, Czech Republic, 28 June 2007; pp.51-58 [PDF, 104KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2006) Sudip Kumar Naskar & Sivaji Bandyopadhyay: Handling of prepositions in English to Bengali machine translation. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.89-94. [PDF, 208KB]

(2005) Ebba Gustavii: Target language preposition selection - an experiment with transformation based learning and aligned bilingual data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 112-118. [PDF, 45KB]

Pronouns

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorical grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

Punctuation

(2008) Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, & Mehmet Uğur Doğan: The TÜBÍTAK-UEKAE statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.138-142 [PDF, 239KB; presentation [PDF, 179KB]

(2007) Roldano Cattoni, Nicola Bertoldi, & Marcello Federico: Punctuating confusion networks for speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2453-2456; abstract [PDF, 23KB]

(2006) Evgeny Matusov, Arne Mauser, & Hermann Ney: Automatic sentence segmentation and punctuation prediction for spoken language translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 158-165 [PDF, 117KB]

(2007) Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, & Hermann Ney: Improving speech translation w ith automatic boundary prediction. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2449-2452; abstract [PDF, 61KB]

Reduplicatives

(2005) R. M. K. Sinha, & Anil Thakur: Dealing with replicative words in Hindi for machine translation to English. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.157-164. [PDF, 298KB]

Reference (see also Anaphora resolution)

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Anja Belz, Albert Gatt, Ehud Reiter & Jette Viethen: The attribute selection for generation of referring expressions challenge. [Introduction to Shared Task Evaluation Challenge.] MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.73-74 [PDF, 438KB]

(2007) Anja Belz & Albert Gatt: The attribute selection for GRE challenge: overview and evaluation results. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.75-83 [PDF, 438KB]

(2007) Bernd Bohnet: IS-FBN, IS-FBS, IS-IAC: the adaptation of two classic algorithms for the generation of referring expresssions in order to produce expressions like humans do. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.84-86 [PDF, 676KB]

(2007) Raquel Hervás & Pablo Gervás: NIL: attribute selection for matching the task corpus using relative attribute groupings obtained from the test data. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.87-89 [PDF, 694KB]

 (2007) Heng Ji & Ralph Grishman: Collaborative entity extraction and translation. In: Recent Advances in Natural Language Processing: selected papers from RANLP ’97, ed. Nicolas Nikolov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.73-84. [PDF, 303KB]

(2007) J.D.Kelleher: DIT: frequency based incremental attribute selection for GRE. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.90-91 [PDF, 502KB]

(2007) Advaith Siddharthan & Ann Copestake: Evaluating an open-domain GRE algorithm on closed domains system IDs: CAM-B, CAM-T, CAM-BU and CAM-TU. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.92-94 [PDF, 727KB]

(2007) Philipp Spanger, Kurosawa Takahiro, & Tokunaga Takenobu: TITCH: attribute selection based on discrimination power and frequency. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.98-100 [PDF, 688KB]

(2007) Mariët Theune, Pascal Touset, Jette Viethen, & Emiel Krahmer: Cost-based attribute selection for GRE (GRAPH-SC/GRAPH-FP). MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.95-97 [PDF, 721KB]

(2007) Kees van Deemter & Albert Gatt: Content determination in GRE: evaluating the evaluator. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.101-103 [PDF, 754KB]

Role and reference grammar

(2009) Yasser Salem & Brian Nolan: Designing an XML lexicon architecture for Arabic machine translation based on role and reference grammar.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.221-229. [PDF, 691KB]

Sentiment

(2009) Julian Brooke, Milan Tofiloski, & Maite Taboada: Cross-linguistic sentiment analysis: from English to Spanish. [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp.50-57. [PDF, 65KB]

(2009) Xiaojun Wan: Co-training for cross-lingual sentiment classification. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.235-243. [PDF, 273KB]

(2005) Mutsuko Tomokiyo, Gérard Chollet, & Solange Hollard: Studies of emotional expressions in oral dialogues: towards an extension of Universal Networking Language. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.286-299 [abstract, PDF, 13KB]

Shake & Bake method

(2005) Michel Carl, Paul Schmidt, & Jörg Schütz: Reversible template-based Shake & Bake generation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.17-25. [PDF, 367KB]

(2005) Michael Carl, Ecaterina Rascu, & Paul Schmidt: Using template-grammars for shake & bake paraphrasing. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 66-72. [PDF, 107KB]

Sign languages

(2008) Jan Bungeroth, Daniel Stein, Philippe Dreuw, Hermann Ney, Sara Morrissey, Andy Way, & Lynette van Zijl: The ATIS sign language corpus. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 643KB]

(2008) Tirthankar Dasgupta, Sandipan Dandapat, & Anupam Basu: Prototype machine translation system from text-to-Indian sign language. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.19-26. [PDF, 149KB]

(2007) Matt Huenerfauth, Liming Zhao, Erdan Gu, & Jan Allbeck: Design and evaluation of an American Sign Language generator. ACL 2007: Proceedings of the Workshop on Embodied Language Processing, June 29, 2007, Prague, Czech Republic; pp.51-58. [PDF, 204KB]

(2007) Sara Morrissey, Andy Way, Daniel Stein, Jan Bungeroth, & Hermann Ney: Combining data-driven MT systems for improved sign language translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.329-336 [PDF, 189KB]

(2007) Daniel Stein, Philippe Dreuw, Hermann Ney, Sara Morrissey, & Andy Way: Hand in hand: automatic sign language to English translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.214-220 [PDF, 280KB]; poster [PDF, 97KB]

(2006) Sara Morrissey & Andy Way: Lost in translation: the problems of using mainstream MT evaluation metrics for sign language translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.91-98. [PDF, 465KB]

(2005) Matt Huenerfauth: American Sign Language generation: multimodal NLG with multiple linguistic channels. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 37-42. [PDF, 91KB]

(2005) Sara Morrissey & Andy Way: An example-based approach to translating sign language MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.109-116. [PDF, 369KB]

Speech acts

(2006) David Farwell & Stephen Helmreich: Pragmatics-based MT and the translation of puns. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.187- 193 [PDF, 157KB]

Spelling checkers

(2009) Mehrnoush Shamsfard, Soheila Kiani & Yaser Shahedi: STeP-1: standard text preparation for Persian language. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 659KB]

Sublanguage [see also Domain restriction and specification]

(2009) Julien Gosme, Yves Lepage & Adrien Lardilleux: Translation of sublanguages by subgrammars. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.77-84. [PDF, 116KB]

(2009) Tsai Yi-Jung: Research into the practicality of machine translation of administrative documents.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.256-259 [abstract]

(2006) Janna Khegai: Grammatical framework (GF) for MT in sublanguage domains. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.95-104 [PDF, 252KB]

Syllables

(2009) Xue Jiang, Le Sun & Dakun Zhang: A syllable-based name transliteration system. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.96-99. [PDF, 168KB]

Synonyms

(2008) Takashi Tsunakawa & Jun’ichi Tsujii: Bilingual synonym identification with spelling variations. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.457-464. [PDF, 691KB]

(2006) Lonneke van der Plas & Jörg Tiedemann: Finding synonyms using automatic word alignment and measures of distributional similarity.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.866-873. [PDF, 223KB]

(2005) Hideki Kashioka: Training data modification for SMT: considering groups of synonymous sentences. ACL-2005: Workshop on Empirical Modeling of Semantic Equivalence and Entailment, University of Michigan, Ann Arbor, 30 June 2005; pp. 19-24. [PDF, 113KB]

Temporal expressions

 (2006) E. Saquete, P.Martínez-Barco, R.Muñoz, M.Negri, M.Speranza, & R.Sprugnoli: Multilingual extension of a temporal expression normalizer using annotated corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 261 KB]

Tense

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2007) Vincent Vandeghinste, Peter Dirix, & Ineke Schuurman: The effect of a few rules on a data-driven MT system. METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 154KB]

(2006) Yang Ye, Victoria Li Foss um, & Steven Abney: Latent featrures in automatic tense translation between Chinese and English. Coling-ACL 2006: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, Sydney, 22-23 July 2006; pp.48-55. [PDF, 79KB]

Terminology

(2009) Caroline Barrière: The web as a source of informative background knowledge. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 787KB]

(2009) Sanaa Benmessaoud: Effective terminology management using computers. Translation Journal 13 (1), January 2009; 7pp. [PDF, 140KB]

(2009) David Calvert: Tiptoeing towards TBX: strategies for terminology management at a language services provider. Translating and the Computer 31, 19-20 November 2009, London; 14pp. [PDF, 205KB]

(2009) Alain Désilets, Louis-Philippe Huberdeau, Marc Laporte, & Jean Quirion: Building a collaborative multilingual terminology system. Translating and the Computer 31, 19-20 November 2009, London; 11pp. [PDF, 349KB]

(2009) Rafael Guzmán: Uncontrolled terminology and MT: the importance of making good comparisons. Translation Journal 13 (2), April 2009; 7pp. [PDF, 128KB]

(2009) Miguel A.Jiménez-Crespo: Conventions in localization: a corpus study of original vs. translated web texts. Journal of Specialised Translation 12 (July 2009); pp.79-102. [PDF, 285KB]

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Els Lefever, Lieve Macken & Veronique Hoste: Language-independent bilingual terminology extraction from a multilingual parallel corpus.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.496-504. [PDF, 137KB]

(2009) Raphaël Rubino: Exploring context variation and lexicon coverage in project-based approach for term translation. [RANLP 2009] Student Research Workshop [held at], International conference: Recent Advances in Natural Language Processing. Proceedings ed. Irina Temnikova, Ivelina Nikolova, Natalia Konstantinova, Borovets, Bulgaria, 14-15 September 2009; pp.66-70. [PDF, 470KB]

(2009) Svetlana Sheremetyeva: On extracting multiword NP terminology for MT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.205-212. [PDF, 215KB]

(2008) Vincent Claveau: Automatic translation of biomedical terms by supervised machine learning.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 202KB]

(2008) Mohammad Daoud, Asanobu Kitamoto, Christian Boitet, & Mathieu Mangeot: CLIR-based collaborative construction of a multilingual terminological dictionary for cultural resources. Translating and the Computer 30, 27-28 November 2008, London; 15pp. [PDF, 516KB]

(2008) María Fernández Parra & Pius ten Hacken: Beyond terms; multi-word units in Multiterm Extract. Translating and the Computer 30, 27-28 November 2008, London; 18pp. [PDF, 157KB]

(2008) Le An Ha, Gabriela Fernandez, Ruslan Mitkov, & Gloria Corpas: Mutual bilingual terminology extraction. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 87KB]

(2008) Masaki Itagaki & Takako Aikawa: Post-MT term swapper: supplementing a statistical machine translation system with a user dictionary.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 352KB]

(2008) Lieve Macken, Els Lefever, & Veronique Hoste:  Linguistically-based sub-sentential alignment for terminology extraction from a bilingual automotive corpus. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.529-536. [PDF, 145KB]

(2008) Yohei Morishita, Takehito Utsuro, & Mikio Yamamoto: Integrating a phrase-based SMT model and a bilingual lexicon for human in semi-automatic acquisition of technical term translation lexicon. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.153-162. [PDF, 988KB]

(2008) Rogelio Nazar, Leo Wanner, & Jorge Vivaldi: Two-step flow in bilingual lexicon extraction from unrelated corpora. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.140-149. [PDF, 537KB]

(2008) Tadaaki Oshio, Tomoharu Mitsuhashi, & Tsuyoshi Kakita: Use of the technical field-oriented user dictionaries. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 462-465. [PDF, 382KB]

(2008) Tom Vanallemeersch & Cornelia Wermuth: Linguistics-based word alignment for medical translators. Journal of Specialised Translation 9 (January 2008); pp.20-38. [PDF, 224KB]

(2008) Jian-Cheng Wu, Peter Wei-Huai Hsu, Chiung-Hui Tseng, & Jason S. Chang: Mining the web for domain-specific translations. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.212-221 PDF, 750B]

(2008) Xianchao Wu, Naoaki Okazaki, Takashi Tsunakawa, & Jun’ichi Tsujii: Improving English-to-Chinese translation for technical terms using morphological information. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.202-211 [PDF, 799KB]

(2007) Jing-Shin Chang & Chun-Kai Kung: A Chinese-to-Chinese statistical machine translation model for mining synonymous simplified-traditional Chinese terms. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.81-87 [PDF, 314KB]

(2007) Rachel Chrem: WIPO’s activities in patent translation and terminology. Invited talk at MT Summit XI Workshop on patent translation, 11 September, 2007, Copenhagen, Denmark; 42 slides [PDF of PPT presentation, 2099KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

(2007) Michael Leifeld & Urs Huechting: The long way to a corporate platform for multilingual contents and translation processes: corporate translation management in practice. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 5pp. [PDF, 110KB]

(2007) Chengye Lu, Yue Xu, & Shlomo Geva: Improving translation accuracy in web-based translation extraction. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.31-35. [PDF, 94KB]

(2007) Doris Marty-Albisser: Client centric multilingual leverage – scale the skills. Invited talk at MT Summit XI, 10-14 September 2007, Copenhagen, Denmark; 25pp. [PDF of PPT presentation, 686KB]

(2007) E.Morin, B.Daille, K.Takeuchi, & K.Kageura: Bilingual terminology mining – using brain, not brawn comparable corpora.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 664-671 [PDF, 143KB]

(2007) Antoni Oliver & Mercè Vàzquez: A free terminology extraction suite. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 28pp. [PDF, 182KB]

(2007) Jost Zetzsche: Translating on good terms. Translation Journal 11 (2), April 2007; 4pp. [PDF, 74KB]

(2006) Gaolin Fang, Hao Yu, & Fumihito Nishino: Chinese-English term translation mining based on semantic prediction.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.199-206. [PDF, 240KB]

(2006) André Le Meur & Marie-Jeanne Derouin: Integrated bilingual specialist dictionaries – the LexTerm initiative. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 8pp. [PDF, 77KB]; presentation by Marie-Jeanne Derouin: 32 slides [PDF, 579KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Anja Ruetten: Termprofile.com – supporting a conference interpreter’s workflow. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 6pp. [PDF, 91KB]

(2006) Gilles Sérasset, Francis Brunet-Manquat, & Elena Chiocchetti: Multilingual legal terminology on the Jibiki platform: the LexALP project. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.937-944. [PDF, 433KB]

(2006) Eric Wehrli: TwicPen: hand-held scanner and translation software for non-native readers. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Interactive Presentation Sessions, Sydney, July 2006; pp.61-64. [PDF, 229KB]

(2005) Ulrike Bernardi, Andras Bocsak, & Jörg Porsiel: Are we making ourselves clear? Terminology management and machine translation at Volkswagen. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 41-49. [PDF, 273KB]

(2005) Cyril Chantrier: EXTER: a breakthrough solution for efficient terminology extraction. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 19pp. [PDF, 161KB]

(2005) Young-Gil Kim, Seong-Il Yang, Munpyo Hong, Chang-Hyun Kim, Young-Ae Seo, Cheol Ryu, Sang-Kyu Park, & Se-Young Park: Terminology construction workflow for Korean-English patent MT MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.55-59. [PDF, 147KB]

(2005) Jong-Hoon Oh & Key-Sun Choi: Automatic extraction of English-Korean translations for constituents of technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.67-72. [PDF, 226KB]

(2005) Masatsugu Tonoike, Mitsuhiro Kida, Toshihiro Takagi, Yauhiro Sasaki, Takehito Utsuro, & Satoshi Sato: Effect of domain-specific corpus in compositional translation estimation for technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.114-119. [PDF, 193KB]

(2005) José Vega, Benoît Lamey, & Jacques Vergne: MyCatex: a language-independent term extractor. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 10pp. [PDF, 179KB]

(2005) Jian-Cheng Wu, Tracy Lin, & Jason S.Chang: Learning source-target surface patterns for web-based terminology translation. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 37-40. [PDF, 153KB]

(2005) Daniel Zielinski & Yamile Ramirez Safar: Research meets practice: t-survey 2005. An online survey on terminology extraction and terminology management. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 27pp. [PDF, 152KB]

Text prediction

(2009) Sergio Barrachina, Oliver Bender, Francisco Casacuberta, Jorge Civera, Elsa Cubel, Shahram Khadivi, Antonio Lagarda, Hermann Ney, Jesús Tomás, Enrique Vidal, & Juan-Miguel Vilar: Statistical approaches to computer-assisted translation. Computational Linguistics 35 (1), pp.3-28. [PDF, 252KB]

(2009) Sander Canisius & Antal van den Bosch: A constraint satisfaction approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.182-189. [PDF, 533KB]

(2009) Philipp Koehn: Human translation and machine translation – invited talk. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; 80pp. [PDF of PPT, 599KB]

(2009) Philipp Koehn: A process study of computer-aided translation [abstract]. Machine Translation 23 (4), November 2009; pp.241-263.

(2009) Philipp Koehn & Barry Haddow: Interactive assistance to human translators using statistical machine translation methods. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 73-80. [PDF, 217KB]

(2009) Philipp Koehn: A web-based interactive computer aided translation tool. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Software Demonstrations, Suntec, Singapore, 3 August 2009; pp.17-20. [PDF, 127KB]

(2008) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jorge Civera, Francisco Casacuberta, Enrique Vidal, & Hieu Hoang: Improving interactive machine translation via mouse actions. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.485-494. [PDF, 148KB]

(2006) Jorge Civera, Antonio L. Lagarda, Elsa Cubel, Francisco Casacuberta, Enrique Vidal, Juan M. Vilar, & Sergio Barrachina: A computer-assisted translation tool based on finite-state technology. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.33-40 [PDF, 139KB]

(2008) Alain Désilets, Marta Stojanovic, Jean-François Lapointe, Rick Rose, & Aarthi Reddy: Evaluating productivity gains of hybrid ASR-MT systems for translation dictation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.158-165 [PDF, 421KB]; presentation [PDF, 548KB]

(2006) Saša Hasan, Shahram Khadivi, Richard Zens, & Hermann Ney: A flexible architecture for CAT applications. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.81-88 [PDF, 160KB]

(2006) Shahram Khadivi, Richard Zens, & Hermann Ney: Integration of speech to computer-assisted translation using finite-state automata. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.467-474. [PDF, 348KB]

(2006) Elliott Macklovitch: TransType2: the last word.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.167-172 [PDF, 30KB]

(2006) Richard Zens & Hermann Ney: N-gram posterior probabilities for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 72-77 [PDF, 133KB]

(2005) Elliott Macklovitch, Ngoc Tran Nguyen & Guy Lapalme: Tracing translations in the making. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.323-330. [PDF, 793KB]

(2005) Oliver Bender, Saša Hasan, David Vilar, Richard Zens, & Hermann Ney: Comparison of generation strategies for interactive machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 33-40. [PDF, 83KB]

(2005) Nicola Ueffing & Hermann Ney: Application of word-level confidence measures in interactive statistical machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 262-270. [PDF, 173KB]

Thesaurus method (see also Treebanks)

(2008) Violaine Prince & Jacques Chauche: Building a bilingual representation of the Roget thesaurus for French to English machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 142KB]

(2008) Jessica Ramírez, Masayuki Asahara, & Yuji Matsumoto: Japanese-Spanish thesaurus construction using English as a pivot. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.473-480. [PDF, 787KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging reusability: cost-effective lexical acquisition for large-scale ontology translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.945-952. [PDF, 159KB]

(2006) Dan Ştefănescu & Dan Tufiş: Aligning multilingual thesauri. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.475-478 [PDF, 461KB]

Tones and tone languages

(2009) Oi Yee Kwong: Homophones and tonal patterns in English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.21-24. [PDF, 204KB]

Transduction grammars

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) Guillem Gascó & Joan Andreu Sánchez: UPV translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 118-123. [PDF, 331KB]; presentation [PDF of PPT, 603KB]

(2009) Aria Haghighi, John Blitzer, John DeNero & Dan Klein: Better word alignments with supervised ITG models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 301KB]

(2009) Markus Saers, Joakim Nivre, & Dekai Wu: Learning stochastic bracketing inversion transduction grammars with a cubic time biparsing algorithm. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 29-32. [PDF, 126KB]

(2009) Anders Søgaard & Dekai Wu: Emprical lower bounds on translation unit error rate for the full class of inversion transduction grammars. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 33-36. [PDF, 87KB]

 (2009) Anders Søgaard: On the complexity of alignment problems in two synchronous grammar formalisms.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.60-68. [PDF, 161KB]

(2009) Markus Saers & Dekai Wu: Improving phrase-based translation via word alignments from stochastic inversion transduction grammars.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.28-36. [PDF, 554KB]

(2009) Min Zhang & Haizhou Li: Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.698-707. [PDF, 294KB]

(2008) Boxing Chen, Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: I2R multi-pass machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 46-51. [PDF, 177KB]; presentation [PDF, 910KB]

(2008) Yanqing He & Chengqing Zong: A generalized reordering model for phrase-based statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.117-124. [PDF, 661KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Germán Sanchis-Trilles & Joan Andreu Sánchez: Using parsed corpora for estimating stochastic inversion transduction grammars.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 3pp. [PDF, 48KB]

(2008) Anders Søgaard: Range concatenation grammars for translation. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.103-106. [PDF, 130KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated BTG for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1009-1016. [PDF, 141KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: A linguistically annotated reordering model for BTG-based statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 149-152. [PDF, 319KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, Haitao Mi, Qun Liu, & Shouxun Liu: Refinements in BTG-based statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.505-512. [PDF, 336KB]

(2008) Hirofumi Yamamoto, Hideo Okuma, & Eiichiro Sumita: Imposing constraints from the source tree on ITG constraints for SMT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.1-9. [PDF, 171KB]

(2008) Dakun Zhang, Le Sun, & Wenbo Li: A structured prediction approach for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.649-654. [PDF, 521KB]

(2007) Wen-Han Chao & Zhou-Jun Li: Incorporating constituent structure constraint into discriminative word alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.97-103 [PDF, 112KB]

(2007) Colin Cherry & Dekang Lin: Inversion transduction grammar for joint phrasal translation modeling. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.17-24 [PDF, 178KB]

(2007) Markus Dreyer, Keith Hall, & Sanjeev Khudanpur: Comparing reordering constraints for SMT using efficient BLEU oracle computation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.103-110 [PDF, 206KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Colin Cherry & Dekang Lin: A comparison of syntactically motivated word alignment spaces. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.145-152 [PDF, 150KB]

(2006) J. A. Sánchez & J. M. Benedí: Obtaining word phrases with stochastic inversion translation grammars for phrase-based statistical machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.179-186 [PDF, 129KB]

(2006) J.A.Sánchez & J.M.Benedí: Stochastic inversion transduction grammars for obtaining word phrases for phrase-based statistical machine translation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 130-133 [PDF, 91KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

(2006) Hao Zhang & Daniel Gildea: Efficient search for inversion transduction grammar.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 224-231. [PDF, 172KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2005) Liang Huang, Hao Zhang, & Daniel Gildea: Machine translation as lexicalized parsing with hooks. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.65-73. [PDF, 130KB]

(2005) Dekai Wu: Recognizing paraphrases and textual entailment using inversion transduction grammars.  ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-30. [PDF, 113KB]

(2005) Dekai Wu: Statistical machine translation part II: tree-based SMT. [Tutorial at] IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; p.276. [PDF, 33KB]

(2005) Hao Zhang & Daniel Gildea: Stochastic lexicalized inversion transduction grammar for alignment. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 4787-482. [PDF, 102KB]

Transliteration

(2009) see [ACL-IJCNLP 2009] NEWS 2009. Named Entities Workshop: shared task on transliteration. Proceedings of the Workshop, Suntec, Singapore, 7 August 2009

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Ming-Wei Chang, Dan Goldwasser, Dan Roth, & Yuancheng Tu: Unsupervised constraint driven learning for transliteration discovery. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.299-307. [PDF, 580KB]

(2009) Colin Cherry & Hisami Suzuki: Discriminative substring decoding for transliteration. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1066-1075. [PDF, 583KB]

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

(2009) Tatiana Gornostay & Inguna Skadiņa: English-Latvian toponym processing: translation strategies and linguistic patterns. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.81-87. [PDF, 266KB]

(2009) Vishal Goyal & Gurpreet Singh Lehal: Hindi-Punjabi machine transliteration system (for machine translation system). Atti Della Fondazione Giorgio Ronchi, vol. 64, no. 1 (2009), pp. 27-35. [PDF, 161KB]

(2009) Martin Jansche & Richard Sproat: Named entity transcription with pair n-gram models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.32-35. [PDF, 113KB]

(2009) Amit Kirschenbaum & Shuly Wintner: Lightly supervised transliteration for machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.433-441. [PDF, 147KB]

(2009) Alexandre Klementiev & Dan Roth: Names entity transliteration and discovery in multilingual corpora.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.79-92.

(2009) Kevin Knight & Richard Sproat: Writing systems, transliteration and decipherment.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Tutorials, Boulder, Colorado, May 31 - June 5, 2009; pp.15-16. [PDF, 68KB]

(2009) Oi Yee Kwong: Homophones and tonal patterns in English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.21-24. [PDF, 204KB]

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2009) Peter Nabende: Evaluation of dynamic Bayesian network models for entitiy name transliteration. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; pp.99-100. [PDF, 151KB]

(2009) Sara Noeman: Transliteration using phrase based SMT approach on substrings.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.199-204. [PDF, 348KB]

(2009) Jong-Hoon Oh, Kiyotaka Uchimoto, & Kentaro Torisawa: Can Chinese phonemes improve machine transliteration?: a comparative study of English-to-Chinese transliteration models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.658-667. [PDF, 181KB]

(2009) Michael Paul, Andrew Finch & Eiichiro Sumita: NICT@WMT09: model adaptation and transliteration for Spanish-English SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.105-109. [PDF, 95KB]

(2009) Vladimir Pervouchine, Haizhou Li, & Bo Lin: Transliteration alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.136-144. [PDF, 572KB]

(2009) Anand Arokia Raj & Harikrishna Maganti: Transliteration based search engine for multilingual information access. NAACL-HLT-2009: The Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3), Proceedings of the Workshop, June 4, 2009, Boulder, Colorado; pp.12-20. [PDF, 208KB]

(2009) Taraka Rama, Anil Kumar Singh, & Sudbeer Kolachina: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.90-95. [PDF, 404KB]

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2009) Raghavendra Udupa, K.Saravanan, A.Kumaran, & Jagadeesh Jagarlamudi: MINT: a method for effective and scalable mining of named entity transliterations from large comparable corpora.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.799-807. [PDF, 1120KB]

(2008) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Orthographic disambiguation incorporating transliterated probability. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.48-55. [PDF, 525KB]

(2008) Andrew Finch & Eiichiro Sumita: Phrase-based machine transliteration. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; 6pp. [PDF, 203KB]

(2008) Surya Ganesh, Sree Harsha, Prasad Pingali, & Vasudeva Varma: Statistical transliteration for cross language information retrieval using HMM alignment and CRF. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.42-47. [PDF, 127KB]

(2008) Dan Goldwasser & Dan Roth: Active sample selection for named entity transliteration. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 53-56. [PDF, 538KB]

(2008) Dan Goldwasser & Dan Roth: Transliteration as constrained optimization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.353-362. [PDF, 645KB]

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2008) Fei Huang, Ahmad Emami, & Imed Zitouni: When Harry met Harri, [Arabic name] and [Chinese name]: cross-lingual name spelling normalization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.391-399. [PDF, 165KB]

(2008) HaiXiang Huang & Atsushi Fujii: Effects of related term extraction in transliteration into Chinese. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.643-648. [PDF, 374KB]

(2008) Chengguo Jin, Dong-Il Kim, Seung-Hoon Na, & Jong-Hyeok Lee: Automatic extraction of English-Chinese transliteration pairs using dynamic window and tokenizer. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.9-15. [PDF, 385KB]

(2008) Jin-Shea Kuo & Haizhou Li: Multi-view co-training of transliteration model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.373-380. [PDF, 465KB]

(2008) Jin-Shea Kuo, Haizhou Li, & Chih-Lung Lin: Mining transliterations from web query results: an incremental approach. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.16-23. [PDF, 400KB]

(2008) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Hindi Urdu machine transliteration using finite-state transducers.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.537-544. [PDF,774KB]

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

(2008) Jong-Hoon Oh & Hitoshi Isahara: Hypothesis selection in machine transliteration: a web mining approach. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.233-240. [PDF, 461KB]

(2008) Tejinder Singh Saini, Gurpreet Singh Lehal, & Virinder S.Kalra: Shahmukhi to Gurmukhi transliteration system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.177-180. [PDF, 319KB]

(2008) K.Saravanan & A.Kumaran: Some experiments in mining named entitiy transliteration pairs from comparable corpora. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.26-33. [PDF, 805KB]

(2008) Kristen Summers & Diane Chandler: Language processing for analysis and investigation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.454-456. [PDF, 647KB]

(2008) Harshit Surana & Anil Kumar Singh: A more diserning and adaptable multilingual transliteration mechanism for Indian languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.64-71. [PDF, 393KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

(2008) M.S.Vijaya, G.Shivapratap, V.Dhanalakshmi, V.P. Ajith, & K.P.Soman: Sequence labeling approach for English to Tamil transliteration using memory-based learning.  ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 7pp. [PDF, 195KB]

(2008) Wolodja Wentland, Johannes Knopp, Carina Silberer, & Matthias Hartung: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 451KB]

(2008) Fan Yang, Jun Zhao, Bo Zou, Kang Liu, & Feifan Liu: Chinese-English backward transliteration assisted with mining monolingual web pages. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 541-549. [PDF, 593KB]

 (2007) Dan Goldwasser, Ming-Wei Chang, Yuancheng Tu & Dan Roth: Constraint-driven transliteration discovery. In: Recent Advances in Natural Language Processing: selected papers from RANLP 2007, ed. Nicolas Nikolov, Galia Angelova, Ruslan Mitkov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.21-40. [PDF, 525KB]

(2007) Chu-Ren Huang, Petr Šimon, & Shu-Kai Hsieh: Automatic discovery of named entity variants – grammar-driven approaches to non-alphabetical transliterations. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 153-156 [PDF, 246KB]

(2007) Long Jiang, Ming Zhou, Lee-Feng Chien & Cheng Niu: Named entity translation with web mining and transliteration. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.1629-1634. [PDF, 491KB]

(2007) Mehdi M.Kashani, Eric Joanis, Roland Kuhn, George Foster, & Fred Popowich: Integration of an Arabic transliteration module into a statistical machine translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp.17-24 [PDF, 145KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2007) Jong-Hoon Oh & Hitoshi Isahara: Machine transliteration using multiple transliteration engines and hypothesis re-ranking. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.353-360 [PDF, 253KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Bootstrapping a stochastic transducer for Arabic-English transliteration extraction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 864-871 [PDF, 189KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2007) Yu-Chun Wang, Cheng-Wei Lee, Richard Tzong-Han Tsai, & Wen-Lian Hsu: IASL system for NTCIR-6 Korean-Chinese cross-language information retrieval. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.26-30. [PDF, 384KB]

(2007)  Jian-Cheng Wu & Jason S.Chang: Learning to find English to Chinese transliterations on the web. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 187KB]

(2007) Chien-Cheng Wu & Jason S.Chang: Learning to find transliteration on the Web. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Demonstration Program, pp.21-22 [PDF, 1621KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2006) Sherri Condon & Keith J.Miller: Name transliteration: current methods, applications, and evaluation in name transliteration and name translation. Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 46pp. [PDF of PPT presentation, 1103KB]

(2006) Asif Ekbal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A modified joint source-channel model for transliteration. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.191-198. [PDF, 159KB]

(2006) Mehdi M. Kashani, Fred Popowich, & Fatiha Sadat: Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.76-83. [PDF, 179KB]

(2006) Alexandre Klementiev & Dan Roth: Weakly supervised named entity transliteration and discovery from multilingual comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.817-824. [PDF, 173KB]

(2006) Jin-Shea Kuo, Haizhou Li, & Ying-Kuei Yang: Learning transliteration lexicons from the Web. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1129-1136. [PDF, 428KB]

(2006) M.G.Abbas Malik: Punjabi transliteration. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1137-1144. [PDF, 1234KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2006) Richard Sproat, Tao Tao, & ChenXiang Zhai: Named entity transliteration with comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.73-80. [PDF, 232KB]

(2006) Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat, & ChengXiang Zhai: Unsupervised name entity transliteration using temporal and phonetic correlation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 250-257. [PDF, 139KB]

(2006) LiLi Xu, Atsushi Fujii, & Tetsuya Ishikawa: Modeling impression in probabilistic transliteration into Chinese.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 242-249. [PDF, 365KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

(2005) Fei Huang: Cluster-specific named entity transliteration. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 435-444. [PDF, 169KB]

Tree Adjoining Grammar (TAG)

(2009) Xavier Carreras & Michael Collins: Non-projective parsing for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.200-209. [PDF, 174KB]

(2009) Steve DeNeefe & Kevin Knight: Synchronous tree adjoining machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.727-736. [PDF, 398KB]

(2009) Rejwanul Haque, Sudip Kumar Naskar, Yanjun Ma & Andy Way: Using supertags as source language context in SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.234-241. [PDF, 247KB]

(2007) Hany Hassan, Khalil Sima’an, & Andy Way: Supertagged phrase-based statistical machine translation.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 288-295 [PDF, 175KB]

(2007) Rebecca Nesson & Stuart M.Shieber: Extraction phenomena in synchronous TAG syntax and semantics. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.9-16 [PDF, 696KB]

(2007) Stuart M.Shieber: Probabilistic synchronous tree-adjoining grammars for machine translation: the argument from bilingual dictionaries. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.88-95 [PDF, 152KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Liang Huang, Kevin Knight, & Aravind Joshi: A syntax-directed translator with extended domain of locality. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.1-8. [PDF, 120KB]

(2006) Rebecca Nesson, Stuart Shieber & Alexander Rush: Induction of probabilistic synchronous tree-insertion grammars for machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.128-137 [PDF, 376KB]

(2005) Yuan Ding & Martha Palmer: Machine translation using probabilistic synchronous dependency insertion grammars.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 541-548. [PDF, 369KB]

Unification grammar

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

(2006) Martin Kay: A life of language. Computational Linguistics 31 (4), pp. 425-438. [PDF, 97KB]

(2005) Francis Bond, Stephan Oepen, Melanie Siegel, Ann Copestake, & Dan Flickinger: Open Source machine translation with DELPH-IN  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.15-22. [PDF, 2700KB]

(2005) Martin Kay: A life of language. [Paper given in acknowledgement of the Life-time Achievement Award at the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, Michigan, 27th June, 2005]; 13pp. [PDF, 1232KB]

Unknown words (see also Names and named entities)

(2009) Younès Bahou, Lamia Hadrich Belguith, & Abdelmajid Ben Hamadou: Disfluency and out-of-vocabulary word processing in Arabic speech understanding. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 825KB]

(2009) Bruno Cartoni: Lexical morphology in machine translation: a feasibility study.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.130-138. [PDF, 129KB]

(2009) Colin Cherry & Hisami Suzuki: Discriminative substring decoding for transliteration. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1066-1075. [PDF, 583KB]

(2009) Beate Dorow, Florian Laws, Lukas Michelbacher, Christian Scheible, & Jason Utt: A graph-theoretic algorithm for automatic extension of translation lexicons. Proceedings of the EACL 2009 Workshop on GEMS: Geometrical Models of Natural Language Semantics, Athens, Greece, 31 March 2009; pp.91-95. [PDF, 81KB]

(2009) Nizar Habash: REMOOV: a tool for online handling of out-of-vocabulary words in machine translation. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.217-220. [PDF, 342KB]

(2009) Krister Lindén: Guessers for finite-state transducer lexicons [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 103KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Shachar Mirkin, Lucia Specia, Nicola Cancedda, Ido Dagan, Marc Dymetman, & Idan Szpektor: Source-language entailment modeling for translating unknown terms. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.791-799. [PDF, 152KB]

(2009) Harold Somers, Sandipan Dandapat, & Sundip Kumar Naskar: A review of EBMT using proportional analogies. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.53-60. [PDF, 337KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

(2009) Yuejie Zhang, Yang Wang, & Xiangyang Xue: English-Chinese bi-directional OOV translation based on web mining and supervised learning. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.129-132. [PDF, 325KB]

(2008) Karunesh Arora, Michael Paul & Eiichiro Sumita: Translation of unknown words in phrase-based statistical machine translation for languages of rich morphology. First International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU-2008), Hanoi University of Technology, Hanoi, Vietnam, May 5-7, 2008; pp. 70-75. [PDF, 187KB]

(2008) Bruno Cartoni: Lexical resources for automatic translation of constructed neologisms: the case study of relational adjectives. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Matthias Eck, Stephan Vogel, & Alex Waibel: Communicating unknown words in machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 303KB]

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

(2008) Nizar Habash: Four techniques for online handling of out-of-vocabulary words in Arabic-English statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 57-60. [PDF, 81KB]

(2008) Jin-Shea Kuo & Haizhou Li: Multi-view co-training of transliteration model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.373-380. [PDF, 465KB]

(2008) Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, & Mehmet Uğur Doğan: The TÜBÍTAK-UEKAE statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.138-142 [PDF, 239KB; presentation [PDF, 179KB]

(2008) Ruiqiang Zhang & Eiichiro Sumita: Chinese unknown word translation by subword re-segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.225-232. [PDF, 360KB]

(2007) Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arısoy, Murat Saraçlar, & Andreas Stolcke: Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.380-387 [PDF, 136KB]

(2007) Etienne Denoual: Analogical translation of unknown words in a statistical machine translation framework. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.135-141 [PDF, 94KB]

(2007) Philippe Langlais & Alexandre Patry: Translating unknown words using analogical learning. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 877-886. [PDF, 155KB]

(2007) Chengye Lu, Yue Xu, & Shlomo Geva: Improving translation accuracy in web-based translation extraction. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.31-35. [PDF, 94KB]

(2007) Chen-Yu Su, Tien-Chien Lin & Shih-Hung Wu: Using Wikipedia to translate OOV terms on MLIR. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.109-115. [PDF, 462KB]

(2007) Dong Zhou, Mark Truran, Tim Brailsford & Helen Ashman: NTCIR-6 experiments using pattern matched translation extraction. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.145-151. [PDF, 130KB]

(2006) Viktor Pekar, Ruslan Mitkov, Dimitar Blagoev, & Andrea Mulloni: Finding translations for low-frequency words in comparable corpora [abstract].  Machine Translation 20 (4),2006; pp.247-266.

(2006) Ayu Purwarianti, Masatoshi Tsuchiya, & Seiichi Nakagawa: Indonesian-Japanese CLIR using only limited resource. Coling-ACL 2006: Proceedings of the workshop on How can computational linguistics improve information retrieval? Sydney, July 2006; pp.1-8. [PDF, 80KB]

(2006) Gregor Thurmair: Using corpus information to improve MT quality. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.45-48. [PDF, 371KB]

(2005) Jiun-Hung Lin, Min-Shiang Shia, Kao-Hung Lin, Shu-Jung Lin, Scott Yu, & Wen-Hsiang Lu: Search-result-based method for unknown term translation in cross-language information retrieval. Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 4pp. [PDF, 559KB]

(2005) Jinming Min, Le Sun, & Junlin Zhang: ISCAS in English-Chinese CLIR at NTCIR-5.  Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 7pp. [PDF, 277KB]

Verbs and verb phrases

(2009) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Evaluation-guided pre-editing of source text: improving MT-tractability of light verb constructions. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.36-43 [PDF, 336KB]

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2009) Gan Jin: Some problems in a French-Chinese machine translation system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.138-142 [abstract]

(2009) Jin-Ji Li, Jungi Kim, Dong-Il Kim, & Jong-Hyeok Lee: Chinese syntactic reordering for adequate generation of Korean verbal phrases in Chinese-to-Korean SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.190-196. [PDF, 323KB]

(2009) Wei-Yun Ma & Kathleen McKeown: Where’s the verb? Correcting machine translation during question answering. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.333-336. [PDF, 170KB]

(2009) Karel Pala & Aleš Horák: Multilingual features of complex valency frames. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.41-49. [PDF, 145KB]

(2009) R.Mahesh K.Sinha: Mining complex predicates in Hindi using a parallel Hindi-English corpus. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.40-46. [PDF, 112KB]

(2009) Shoichi Yokoyama & Masumi Okuyama: Translation disambiguation of patent sentences using case frames. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 25-32. [PDF, 75KB]

(2009) Sina Zarrieß & Jonas Kuhn: Exploiting translational correspondences for pattern-independent MWE identification.  [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.23-30. [PDF, 132KB]

(2008) Takeshi Abekawa & Kyo Kageura: What prompts translators to modify draft translations? An anlysis of basic modification patterns for use in the automatic notification of awkwardly translated text. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.241-248. [PDF, 393KB]

(2008) Debasri Chakrabarti, Hemang Mandalia, Ritwik Priya, Vaijayanthi Sarma, & Pushpak Bhattacharyya: Hindi compound verbs and their automatic extraction. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.27-30. [PDF, 62KB]

 (2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Jin-Ji Li, Dong-Il Kim, & Jong-Hyeok Lee: Annotation guidelines for Chinese-Korean word alignment.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 195KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2007) R.Mahesh K. Sinha: Using rich morphology in resolving certain Hindi-English machine translation divergence. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.429-433 [PDF, 48KB]

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

(2006) Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita, & Yuji Matsumoto: Augmenting a semantic verb lexicon with large scale collection of example sentences.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.365-368 [PDF, 283KB]

(2006) Lucia Specia, Maria das Graças Volpe Nunes, & Mark Stevenson: Translation context sensitive WSD. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.227-232 [PDF, 206KB]

(2006) Lucia Specia: A hybrid relational approach for WSD – first results. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Student Research Workshop, Sydney, July 2006; pp.55-60. [PDF, 75KB]

(2006) Sara Stymne & Lars Ahrenberg: A bilingual grammar for translation of English-Swedish verb frame divergences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.9-17 [PDF, 139KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

(2005) Ondřej Bojar & Jan Hajič: Extracting translations verb frames. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.2-6 [PDF, 207KB]

(2005) Adrià de Gispert: Phrase linguistic classification and generalization for improving statistical machine translation. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 67-72. [PDF, 90KB]

(2005) Young-Sook Hwang & Yutaka Sasaki: Context-dependent SMT model using bilingual verb-noun collocation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 549-556. [PDF, 1005KB]

Word association

(2009) Boxing Chen, George Foster, & Roland Kuhn: Phrase translation model enhanced with association based features. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.25-32. [PDF, 158KB]

(2008) Patrik Lambert & Rafael E.Banchs: Word association models and search strategies for discriminative word alignment. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.97-103. [PDF, 633KB]

Written forms

(2009) Yuxiang Jia, Danqing Zhu, & Shiwen Yu: A noisy channel model for grapheme-based machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.88-91. [PDF, 189KB]

(2009) Kevin Knight & Richard Sproat: Writing systems, transliteration and decipherment.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Tutorials, Boulder, Colorado, May 31 - June 5, 2009; pp.15-16. [PDF, 68KB]

(2009) Philipp Koehn & Barry Haddow: Edinburgh’s submission to all tracks of the WMT2009 shared task with reordering and speed improvements to Moses.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.160-164. [PDF, 105KB]

(2009) Oi Yee Kwong: Graphemic approximation of phonological context for English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.186-193. [PDF, 258KB]

(2009) Mohammad Mehdi Arab & Ali Azimizadeh: Construction of a Persian letter-to-sound conversion system based on classification and regression tree. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 676KB]

(2009) Taraka Rama, Anil Kumar Singh, & Sudbeer Kolachina: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.90-95. [PDF, 404KB]

(2009) R.Mahesh K.Sinha: Developing English-Urdu machine translation via Hindi. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 7pp. [PDF, 643KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

(2008) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Orthographic disambiguation incorporating transliterated probability. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.48-55. [PDF, 525KB]

(2008) Anurag Bhardwaj, Damien Jose, & Venu Govindaraju: Script independent word spotting in multilingual documents. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.48-54. [PDF, 122KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Vishal Goyal & Gurpreet Singh Lehal: Comparative study of Hindi and Punjabi language scripts. Nepalese Linguistics, Journal of the Linguistics Society of Nepal, vol. 23 (2008), pp. 67-82. [PDF, 290KB]

(2007) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Support vector machine based orthographic disambiguation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.21-30 [PDF, 979KB]; poster [PDF, 403KB]

(2007) Mona Diab, Mahmoud Ghoneim, & Nizar Habash: Arabic diacritization in the context of statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.143-149 [PDF, 105KB]

(2007) David Vilar, Jan-T. Peter, & Hermann Ney: Can we translate letters?  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 33-39 [PDF, 124KB]

(2006) Wei Wang, Kevin Knight & David Marcu: Capitalizing machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 1-8 [PDF, 171KB]

(2005) Etienne Denoual & Yves Lepage: BLEU in characters: towards automatic MT evaluation in languages without word delimiters. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.79-84. [PDF, 495KB]