Machine Translation Archive

Index of Languages

Publications 2005-2009

Languages M to Z

[For languages A to D click here; for languages E click here, for languages F to L click here]

[For publications since 2010 click here; for publications before 2005 click here]

[to return to home page click here]

Maltese

(2005) Michael Rosner: Towards HLT evaluation at the University of Malta. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 19pp. [PDF from PPT, 128KB]

Mapundugun

(2006) Christian Monson, Ariadna Font Llitjos, Roberto Aranovich, Lori Levin, Ralf Brown, Eric Peterson, Jaime Carbonell, & Alon Lavie: Building NLP systems for two resource-scarce indigenous languages: Mapudungun and Quechua. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.15-24. [PDF, 124KB]

Marathi

(2009) Mitesh M.Khapra, Sapan Shah, Piyush Kedia, & Pushpak Bhattacharyya: Projecting parameters for multilingual word sense disambiguation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.459-467. [PDF, 670KB]

(2005) Nitin Verma & Pushpak Bhattacharyya: Automatic generation of multilingual lexicon by using WordNet. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.380-391 [abstract, PDF, 16KB]

Marathi to English

(2008) Manoj Kumar Chinnakotla, Sagar Ranadive, Om P.Damani & Pushpak Bhattacharyya: Hindi and Marathi to English cross language information. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; abstract, p.64. [PDF, 12KB]

Mongolian to English

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hammam Riza, Hitoshi Isahara, & Purev Jaimal: Synset assignment for bi-lingual dictionary with limited resource. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.673-678. [PDF, 549KB]

Mongolian to Japanese

(2006) Badam-Osor Khaltar, Atsushi Fujii, & Testuya Ishikawa: Extracting loan words from Mongolian corpora and producing a Japanese-Mongolian bilingual dictionary. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.657-664. [PDF, 297KB]

Norwegian

(2009) Kevin Unhammer & Trond Trosterud: Reuse of free resources in machine translation between Nynorsk and Bokmål. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 35-42. [PDF, 219KB]

 (2008) Torbjørg Breivik: Establishing the Norwegian HLT resource collection. SLTC 2008: Second Swedish Language Technology Conference, November 20-21, 2008, Stockholm; pp.29-30. [PDF, 319KB]

Norwegian to English

(2009) Lars Bungum & Stephan Oepen: Automatic translation of Norwegian noun compounds. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.136-143. [PDF, 465KB]

(2008) Janne Bondi Johannessen, Torbjørn Nordgård, & Lars Nygaard: Evaluation of linguistics-based translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 249KB]

(2008) Stephan Oepen: Grammar-based processing and re-usable hybrid MT – an experience report in the wilderness. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 32 slides. [PDF, 254KB]

(2007) Stephan Oepen, Erik Velldal, Jan Tore Lønning, Paul Meurer, Victoria Rosén, & Dan Flickinger: Towards hybrid quality-oriented machine translation – on linguistics and probabilities in MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.144-153 [PDF, 328KB]

(2006) Dan Flickinger: Identifying complex phenomena in a corpus via a treebank lens. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.125-129 [PDF, 89KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2006) Lars Nygaard, Jan Tore Lønning, Torbjørn Nordgård, and Stephan Oepen: Using a bi-lingual dictionary in lexical transfer. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.233-238 [PDF, 133KB]

(2005) Erik Velldal & Stephan Oepen: Maximum entropy models for realization ranking. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.109-116. [PDF, 217KB]

(2005) Stephan Oepen, Helge Dyvik, Dan Flickinger, Jan Tore Lønning, Paul Meurer, & Victoria Rosén: Holistic regression testing for high-quality MT: some methodological and technological reflections. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 198-204. [PDF, 173KB]

Norwegian to Swedish

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

Occitan to Catalan

(2006) Carme Armentano i Oller & Mikel L. Forcada: Open-source machine translation between small languages: Catalan and Aranese Occitan.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.51-54. [PDF, 62KB]

Panjabi see Punjabi

Pashto

(2009) Craig Kopris: Endoclitics in Pashto: can they really do that?  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 6pp. [PDF, 485KB]

Pashto to English

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2005) Andreas Kathol, Kristin Precoda, Dimitra Vergyri, Wen Wang, & Susanne Riehemann: Speech translation for low-resource languages: the case of Pashto. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2273-2276; abstract [PDF, 44KB]

Persian

(2009) Mohammad Mehdi Arab & Ali Azimizadeh: Construction of a Persian letter-to-sound conversion system based on classification and regression tree. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 676KB]

(2009) Mehrnoush Shamsfard, Soheila Kiani & Yaser Shahedi: STeP-1: standard text preparation for Persian language. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 659KB]

(2009) Massoud Sharifi-Atashgah & Mahmood Bijankhan: Corpus-based analysis for multi-token units in Persian. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 817KB]

(2008) Mohamad Ali Honarpisheh, Gholamreza Ghassem-Sani, & Ghassem Mirroshandel: A multi-document multi-lingual automatic summarization system. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.733-738. [PDF, 320KB]

Persian to English

(2009) Yonggang Deng & Bowen Zhou: Optimizing word alignment combination for phrase table training. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.229-232. [PDF, 132KB]

(2009) Chakaveh Saedi, Mehrnoush Shamsfard, & Yasaman Motazedi: Automatic translation between English and Persian texts. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 1001KB]

(2008) C.-L. Kao, S.Saleem, R.Prasad, F.Choi, P.Natarajan, D.Stallard, K.Krstovski, & M.Kamali: Rapid development of an English/Farsi speech-to-speech translation system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.166-173 [PDF, 143KB]; presentation [PDF, 95KB]

(2008) Andreas Kathol & Jing Zheng: Strategies for building a Farsi-English SMT system from limited resources. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2731-2734; abstract [PDF, 62KB]

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2008) Vivek Kumar Sridhar, Srinivas Bangalore, & Shrikanth S.Narayanan: Factored translation models for enriching spoken language translation with prosody. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2723-2726; abstract [PDF, 60KB]

(2008) NIST: MetricsMATR challenge. [NIST, 2008]. 6pp. [PDF, 35KB]

(2007) Nguyen Bach, Matthais Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2006) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth Narayanan: Cross-lingual dialog model for speech to speech translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1858; abstract [PDF, 74KB]

(2005) Emil Ettelaie, Sudeep Gandhe, Panayiotis Georgiou, Kevin Knight, Daniel Marcu, Shrikanth Narayanan, David Traum, & Robert Belvin: Transonics: a practical speech-to-speech translator for English-Farsi medical dialogues.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 89-92  [PDF, 352KB]

Philippine languages see Cebuano, English to Tagalog, Tagalog to English

Polish to English

(2009) Krzysztof Bogacki: Controlled languages and machine translation. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.49-55 [abstract]

(2009) Filip Graliński, Krzysztof Jassem, & Michał Marcińczuk: An environment for named entity recognition and translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.88-95. [PDF, 430KB]

(2007) Jacek Gintrowicz & Krzysztof Jassem: Using regular expressions in translation memories. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.2, October 15-17, 2007, Wisla, Poland; pp.87-92. [PDF, 200KB]

Polish to French

(2009) Krzysztof Bogacki: Controlled languages and machine translation. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.49-55 [abstract]

(2009) Marcin Junczys-Dowmunt: It’s all about the trees – towards a hybrid syntax-based MT system. Proceedings of the International Multiconference on Computer Science and Information Technology, Mrągowo, Poland, 12-14 October 2009; pp.219-226. [PDF, 125KB]

(2009) Zuzanna Rudas: Polish controlled language and its machine translation into French.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.231-235 [abstract]

Polish to German

(2007) Monika Rosińska: Collecting Polish-German parallel corpora in the Internet. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 15-17 October 2007; pp.285-292. [PDF, 550KB]

Portuguese

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2005) Jorge Marques Pelizzoni & Maria das Graças Volpe Nunes: Flexibility, configurability and optimality in UNL deconversion via multiparadigm programming . In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute);  pp.175-194 [abstract, PDF, 13KB]

(2005) Palmira Marrafa: The representation of complex telic predicates in wordnets: the case of lexical-conceptual structure deficitary verbs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.109-116 [abstract, PDF, 14KB]

(2005) Thamar Solorio: Exploiting named entity taggers in a second language. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.25-30.[PDF, 70KB]

Portuguese to Chinese

(2009) Fai Wong & Sam Chao: PCT: Portuguese-Chinese machine translation systems. International Conference on The Teaching of Computer-aided Translation, Chinese University, Hong Kong, 2009; 8pp. [PDF, 132KB]

(2008) Fai Wong & Kwok Kit Leung: The design of web based machine translation server based on grid infrastructure . The Fourth International Conference on Networked Computing and Advanced Information Management (NCM2008), Gyeongju, Korea, 2008; pp.713-718. [PDF, 226KB]

(2007) Fai Wong : The parsing algorithm of translation corresponding tree (TCT) grammar . Third Asian-Pacific Congress on Computational Mechanics (APCOM'07) in conjunction with Eleventh International Conference on the Enhancement and Promotion of Computational Methods in Engineering and Science (EPMESC XI), December 3-6, 2007, Kyoto, Japan; 10pp. [PDF, 294KB]

(2007) Fai Wong, Mingchui Dong, Chiwai Tang, & Francisco Oliveira: Translation technologies of Portuguese to Chinese machine translation system: PCT Assistente. Proceedings of the 4th Chinese Digitization Forum (CDF), Jan. 24-26, 2007, Macau SAR, China; 11pp. [PDF, 186KB]

Portuguese to English (see also Brazilian Portuguese to English)

(2009) Anabela Barreiro & Luís Miguel Cabral: ReEscreve: a translator-friendly multi-pupose paraphrasing software tool. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 594KB]; presentation, 37pp. [PDF of PPT, 1344KB]

(2008) Diana Santos & Alberto Simões: Portuguese-English word alignment: some experiments. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 68KB]

(2005) Andre Castilla, Alice Bacic, & Sergio Furuie: Machine translation on the medical domain: the role of BLEU/NIST and METEOR in a controlled vocabulary setting. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.47-54. [PDF, 156KB]

Portuguese to Spanish

(2008) Jorge González & Francisco Casacuberta: Linguistic categorisation in machine translation using stochastic finite state transducers. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 55-62. [PDF, 321KB]

(2008) Petr Homola & Vladislav Kuboň: Improving machine translation between closely related Romance languages . EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.72-77. [PDF, 527KB]

Punjabi

(2008) Vishal Goyal & Gurpreet Singh Lehal: Comparative study of Hindi and Punjabi language scripts. Nepalese Linguistics, Journal of the Linguistics Society of Nepal, vol. 23 (2008), pp. 67-82. [PDF, 290KB]

(2008) Tejinder Singh Saini, Gurpreet Singh Lehal, & Virinder S.Kalra: Shahmukhi to Gurmukhi transliteration system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.177-180. [PDF, 319KB]

(2006) M.G.Abbas Malik: Punjabi transliteration. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1137-1144. [PDF, 1234KB]

Punjabi to Hindi

(2008) Gurpreet Singh Josan & Gurpreet Singh Lehal: A Punjabi to Hindi machine translation system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.157-160. [PDF, 190KB]

Pushto see Pashto

Quechua

(2006) Christian Monson, Ariadna Font Llitjos, Roberto Aranovich, Lori Levin, Ralf Brown, Eric Peterson, Jaime Carbonell, & Alon Lavie: Building NLP systems for two resource-scarce indigenous languages: Mapudungun and Quechua. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.15-24. [PDF, 124KB]

Quechua to Spanish

(2008) Michael Mohler & Rada Mihalcea: BABYLON parallel text builder: gathering parallel texts for low-density languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 63KB]

Romance languages

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2006) Thomas Koller: [review of] Emanuela Cresti and Massimo Moneglia (eds.), C-ORAL-ROM, Integrated reference corpora for spoken Romance languages. Machine Translation 20 (4),2006; pp.297-300. [see publication]

Romanian

 (2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

Romanian to English

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) Constantin Orasan & Oana Andreea Chiorean: Evaluation of a cross-lingual Romanian-English multi-document summariser.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 74KB]

(2006) Phil Blunsom & Trevor Cohn: Discriminative word alignments with conditional random fields. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.65-72. [PDF, 248KB]

(2005) Helena M. Caseli, Maria G.V. Nunes, & Mikel L. Forcada: LIHLA: shared task system description. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.111-114. [PDF, 68KB]

(2005) Natalia Eliţa & Antonina Birladeanu: An application using example-based machine translation. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.9-11 [PDF, 159KB]

(2005) Natalia Elita & Antonina Birladeanu: A first step in integrating an EBMT into the Semantic Web MT Summit X, Phuket, Thailand, September 12, 2005, Proceedings of Workshop on Semantic Web Technologies for Machine Translation; pp.13-15. [PDF, 177KB]

(2005) Alexander Fraser & Daniel Marcu: ISI’s participation in the Romanian-English alignment task. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 91-94. [PDF, 88KB]

(2005) Adam Lopez & Philip Resnik: Improved HMM alignment models for languages with scarce resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 83-86. [PDF, 103KB]

(2005) Joel Martin, Rada Mihalcea, & Ted Pedersen: Word alignment for languages with scarce resources. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 65-74. [PDF, 170KB]

(2005) Juan Miguel Vilar: Experiments using MAR for aligning corpora. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 95-98. [PDF, 93KB]

Russian

(2009) Roland Meyer: Tagging Old Russian texts via modern translations – applications and resources for historical MT. EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 142KB]

(2009) Raghavendra Udupa, K.Saravanan, A.Kumaran, & Jagadeesh Jagarlamudi: MINT: a method for effective and scalable mining of named entity transliterations from large comparable corpora.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.799-807. [PDF, 1120KB]

(2008) Dan Goldwasser & Dan Roth: Active sample selection for named entity transliteration. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 53-56. [PDF, 538KB]

(2008) Joakim Nivre, Igor M.Boguslavsky, & Leonid L.Iomdin: Parsing the SynTagRus treebank of Russian.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.641-648. [PDF, 266KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Alla Rozovskaya & Richard Sproat: Multilingual word sense disambiguation: a comparative cross-linguistic study. ACL 2007: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, June 29, 2007, Prague, Czech Republic; pp.82-87. [PDF, 122KB]

(2006) Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, & Victor Sizov: A syntactically and semantically tagged corpus of Russian: state of the art and prospects.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1378-1381 [PDF, 596KB]

Russian to Bulgarian

(2009) Svetlin Nakov, Preslav Nakov, & Elena Paskaleva: Unsupervised extraction of false friends from parallel bi-texts using the web as a corpus.  [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp. 292-298. [PDF, 235KB]

Russian to English

(2009) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Evaluation-guided pre-editing of source text: improving MT-tractability of light verb constructions. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.36-43 [PDF, 336KB]

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2008) Bogdan Babych, Serge Sharoff, & Anthony Hartley: Generalising lexical translation strategies for MT using comparable corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 412KB]

(2007) Bogdan Babych, Anthony Hartley, Serge Sharoff, & Olga Mudraya: Assisting translators in indirect lexical transfer.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 136-143 [PDF, 285KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: A dynamic dictionary for discovering indirect translation equivalents.  Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 10pp. [PDF, 150KB]

(2006) Serge Sharoff, Bogdan Babych, & Anthony Hartley: Using comparable corpora to solve problems difficult for human translators. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.739-746. [PDF, 250KB]

(2006) Serge Sharoff: Translation as problem-solving: uses of comparable corpora.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.23-28. [PDF, 914KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

Russian to German

(2008) Igor Mel’čuk & Leo Wanner: Morphological mismatches in machine translation [abstract]. Machine Translation 22 (3), September 2008; pp.101-152.

Sami

(2009) Francis M.Tyers, Linda Wiechetek, & Trond Trosterud: Developing prototypes for machine translation between two Sámi languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.120-127. [PDF, 340KB]

Sanskrit

(2008) Anurag Bhardwaj, Damien Jose, & Venu Govindaraju: Script independent word spotting in multilingual documents. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.48-54. [PDF, 122KB]

Semitic languages

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

Serbian to English

(2006) Maja Popović & Hermann Ney: Statistical machine translation with a small amount of bilingual training data. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.25-29. [PDF, 115KB]

(2005) Maja Popović, David Vilar, Hermann Ney, Slobodan Jovičić, & Zoran Šarić: Augmenting a small parallel text with morpho-syntactic language resources for Serbian-English statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 41-48. [PDF, 211KB]

Slavic languages

(2009) Petr Homola, Vladislav Kuboň, & Jernej Vičič: Shallow transfer between Slavic languages. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.219-232. [PDF, 390KB]

Slovenian

(2008) Špela Vintar: Corpora in translation: a Slovene perspective. Journal of Specialised Translation, issue 10, July 2008; pp.40-55. [PDF, 187KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Gilles Sérasset, Francis Brunet-Manquat, & Elena Chiocchetti: Multilingual legal terminology on the Jibiki platform: the LexALP project. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.937-944. [PDF, 433KB]

Slovenian to English

(2009) Jan Rupnik: Multi-view CCA and regression CCA. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 26 slides. [PDF of PPT,2578KB]

(2006) Mirjam Sepesy Maučec & Zdravko Kačič: Statistical machine translation using the IJS-ELAN corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.87-90. [PDF, 117KB]

(2005) J. Žganec Gros, A. Mihelič, M. Žganec, F. Mihelič, S. Dobrišek, J. Žibert, Š. Vintar, T. Korošec, T. Erjavec, & M. Romih: Initial considerations in building a speech-to-speech translation system for the Slovenian-English language pair. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 288-293. [PDF, 531KB]

Slovenian to Serbian

(2007) Jernej Vicic: Rapid development of RBMT systems for related languages. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 12pp. [PDF, 154KB]

Spanish

(2009) Xavier Blanco: Remarks about linguistic analysis, normalization and translation of Spanish “What to do in case of fire” texts. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.43-48 [abstract]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Eleni Papadopoulou & Marcel Puig Portella: Abduction alerts in Greek and Spanish.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.185-189 [abstract]

(2008) Gloria Corpas Pastor, Ruslan Mitkov, Naveed Afzal, & Viktor Pekar: Translation universals: do they exist? A corpus-based NLP study of convergence and simplification. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.75-81. [PDF, 566KB]

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2006) A.Bonafonte, H.Höge, I.Kiss, A.Moreno, U.Ziegenhain, H.van den Heuvel, H.-U.Hain, X.S.Wang, M.N.Garcia: TC-STAR: specifications of language resources and evaluation for speech synthesis. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.311-314 [PDF, 272KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Jesús Giménez & Lluís Màrquez: Low-cost enrichment of Spanish WordNet with automatically translated glosses: combing general and specialized models. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.287-294. [PDF, 108KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2006) Djamal Mostefa, Olivier Hamon, & Khalid Choukri: Evaluation of automatic speech recognition and speech language translation within TC-STAR: results from the first evaluation campaign.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.149-154 [PDF, 388KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2006) Doaa Samy, Antonio Moreno Sandoval, Jose M. Guirao, & Enrique Alfonseca: Building a parallel multilingual corpus (Arabic-Spanish-English).  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2176-2181 [PDF, 473KB]

(2006) E. Saquete, P.Martínez-Barco, R.Muñoz, M.Negri, M.Speranza, & R.Sprugnoli: Multilingual extension of a temporal expression normalizer using annotated corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 261 KB]

(2006) Violeta Seretan & Eric Wehrli: Accurate collocation extraction using a multilingual parser. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.953-960. [PDF, 168KB]

(2005) Carme Colominas: BancTrad: a web interface for integrated access to annotated corpora. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.7-8 [PDF, 114KB]

(2005) Thamar Solorio: Exploiting named entity taggers in a second language. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.25-30.[PDF, 70KB]

Spanish to Basque

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Relevance of different segmentation options on Spanish-Basque SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.74-80. [PDF, 390KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Reordering on Spanish-Basque SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 207-213. [PDF, 327KB]

(2009) Aingeru Mayor & Francis M.Tyers: Matxin: moving towards language independence. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 11-17. [PDF, 205KB]

(2009) Kepa Sarasola: Matxin: developing sustainable machine translation for a less-resourced language [abstract]. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 1.

(2008) I.Alegria, A.Casillas, A.Diaz de Ilarraza, J.Igartua, G.Labaka, M.Lersundi, A.Mayor, K.Sarasola, X.Saralegi, & B.Laskurain: Mixing approaches to MT for Basque: selecting the best output from RBMT, EBMT and SMT. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 27-34. [PDF, 357KB]

(2008) Iñaki Alegria, Arantza Casillas, Arantza Diaz de Ilarraza, Jon Igartua, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Spanish-to-Basque multiengine machine translation for a restricted domain. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.37-45. [PDF, 575KB]

(2008) A. Diaz de Ilarraza, G. Labaka, & K. Sarasola: Statistical post-editing: a valuable method in domain adaptation of RBMT systems for less-resourced languages. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 35-40. [PDF, 364KB]

(2007) Gorka Labaka, Nicolas Stroppa, Andy Way, & Kepa Sarasola: Comparing rule-based and data-driven approaches to Spanish-to-Basque machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.297-304 [PDF, 558KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: Speech-input multi-target machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 56-63 [PDF, 208KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: An integrated architecture for speech-input multi-target machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.133-136 [PDF, 130KB]

(2007) Germán Sanchis & Francisco Casacuberta: Reordering via n-best lists for Spanish-Basque translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.191-198 [PDF, 309KB]; poster [PDF, 115KB]

(2006) A.Casillas, A. Díaz de Illarraza, J.Igartua, R. Martínez, & K. Sarasola: Compilation and structuring of a Spanish-Basque parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.55-58. [PDF, 172KB]

(2006) Alicia Pérez, Inés Torres, Francisco Casacuberta, & Víctor Guijarrubia: A Spanish-Basque weather forecast corpus for probabilistic speech translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.99-102. [PDF, 227KB]

(2005) Iñaki Alegria, Arantza Diaz de Ilarraza, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, Kepa Sarasola, Mikel L. Forcada, Sergio Ortiz-Rojas, & Lluís Padró: An open architecture for transfer-based machine translation between Spanish and Basque  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.7-14. [PDF, 155KB]

Spanish to Catalan

(2009) Marcel Puig Portella & Eleni Papadopoulou: Treatment of the imperative forms in the machine translation between Catalan, Spanish and Greek.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.198-202 [abstract]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Luis Villarejo Muñoz, Sergio Ortiz Rojas, & Mireia Ginestí Rosell: Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortíz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 75-82. [PDF, 270KB]

(2007) Felipe Sánchez-Martínez & Mikel L.Forcada: Automatic induction of shallow-transfer rules for open-source machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.181-190 [PDF, 336KB]; poster [PDF, 29KB]

(2006) Rafael Banchs, Antonio Bonafonte, & Javier Pérez: Acceptance testing of a spoken language translation system. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2074-2079 [PDF, 262KB]

(2006) J.González, A.L.Lagarda, J.R.Navarro, L.Eliodoro, A.Giménez, F.Casacuberta, J.M.de Val, & F.Fabregat: SisHiTra: a Spanish-to-Catalan hybrid machine translation system.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.69-73. [PDF, 158KB]

(2006) Maite Melero, Antoni Oliver, & Toni Badia: Automatic multilingual subtitling in the eTITLE project. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 18pp. [PDF, 149KB]

(2006) Joaquim Moré & Salvador Climent: A cheap MT-evaluation method based on Internet searches. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.19-26 [PDF, 260KB]

(2005) Carme Armentano-Oller, Antonio M. Corbí-Bellot, Mikel L. Forcada, Mireia Ginestí-Rosell, Boyan Bonev, Sergio Ortiz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, & Felipe Sánchez-Martínez: An open-source shallow-transfer machine translation toolbox: consequences of its release and availability  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.23-30. [PDF, 255KB]

(2005) Victoria Arranz, Elisabet Comelles, & David Farwell: The FAME speech-to-speech translation system for Catalan, English, and Spanish. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.195-202. [PDF, 185KB]

(2005) Juan Alberto Alonso: Machine translation for Catalan↔Spanish: the real case for productive MT. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 23-26. [PDF, 147KB]

Spanish to English

(2009) Julia Aymerich & Hermes Camelo: The machine translation maturity model at PAHO. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 403-409. [PDF, 44KB]

(2009) Sergio Barrachina, Oliver Bender, Francisco Casacuberta, Jorge Civera, Elsa Cubel, Shahram Khadivi, Antonio Lagarda, Hermann Ney, Jesús Tomás, Enrique Vidal, & Juan-Miguel Vilar: Statistical approaches to computer-assisted translation. Computational Linguistics 35 (1), pp.3-28. [PDF, 252KB]

(2009) Nicola Bertoldi & Marcello Federico: Domain adaptation for statistical machine translation with monolingual resources. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.182-189. [PDF, 146KB]

(2009) E.Boldrini, S.Ferrández, R.Izquierdo, D.Tomás, & J.L.Vicedo: A parallel corpus labeled using open and restricted domain ontologies [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 65KB]

(2009) Chris Callison-Burch: Fast, cheap, and creative: evaluating translation quality using Amazon’s Mechanical Turk. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.286-295. [PDF, 289KB]

(2009) Chris Callison-Burch, Philipp Koehn, Christof Monz, & Josh Schroeder: Findings of the 2009 Workshop on Statistical Machine Translation. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.1-28. [PDF, 544KB]

(2009) Yu Chen, Martin Kay, & Andreas Eisele: Intersecting multilingual data for faster and better statistical translations.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.128-136. [PDF, 159KB]

(2009) Neil Coffey: Machine translation – how it works, what users expect, and what they get. Ezine Articles, 8 May 2009; 5pp. [PDF, 41KB]

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

(2009) José A.R.Fonollosa, Maxim Khalilov, Marta R.Costa-jussà, José B.Mariño, Carlos A. Henríquez Q., Adolfo Hernández H., & Rafael E.Banchs: The TALP-UPC phrase-based translation system for EACL-WMT 2009. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.85-89. [PDF, 393KB]

(2009) Nikesh Garera, Chris Callison-Burch & David Yarowsky: Improving translation lexicon induction from monolingual corpora via dependency contexts and part-of-speech equivalences. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.129-137. [PDF, 450KB]

(2009) Philipp Koehn & Barry Haddow: Edinburgh’s submission to all tracks of the WMT2009 shared task with reordering and speed improvements to Moses.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.160-164. [PDF, 105KB]

(2009) Patrik Lambert, Yanjun Ma, Sylwia Ozdowska & Andy Way: Tracking relevant alignment characteristics for machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.268-275. [PDF, 443KB]

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Gregor Leusch, Evgeny Matusov, & Hermann Ney: The RWTH system combination system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.51-55. [PDF, 173KB]

(2009) Preslav Nakov & Hwee Tou Ng: Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1358-1367. [PDF, 209KB]

(2009) Michael Paul, Andrew Finch & Eiichiro Sumita: NICT@WMT09: model adaptation and transliteration for Spanish-English SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.105-109. [PDF, 95KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Markus Saers & Dekai Wu: Improving phrase-based translation via word alignments from stochastic inversion transduction grammars.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.28-36. [PDF, 554KB]

(2009) Markus Saers, Joakim Nivre, & Dekai Wu: Learning stochastic bracketing inversion transduction grammars with a cubic time biparsing algorithm. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 29-32. [PDF, 126KB]

(2009) Felipe Sánchez-Martínez, Mikel L. Forcada, & Andy Way: Hybrid rule-based – example-based MT: feeding Apertium with sub-sentential translation units. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.11-18. [PDF, 150KB]; presentation [PDF of PPT, 192KB]

(2009) Felipe Sánchez-Martínez & Andy Way: Marker-based filtering of bilingual phrase pairs for SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.144-151. [PDF , 386KB]

(2009) Zhuoran Wang & John Shawe-Taylor: Kernel-based machine translation.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.169-184.

(2008) Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154. [PDF, 607KB]

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) Graeme Blackwood, Adrià de Gispert, Jamie Brunning, & William Byrne: European language translation with weighted finite string transducers: the CUED MT system for the 2008 ACL workshop on SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.131-134. [PDF, 77KB]

(2008) Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, & Josh Schroeder: Further meta-evaluation of machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.70-106. [PDF, 320KB]

(2008) Michael Carl, Maite Melero, Toni Badia, Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, & Olga Yannoutsou: METIS-II: low resource machine translation [abstract]. Machine Translation 22 (1/2), March-June 2008; pp.67-99.

(2008) Marta R. Costa-jussà & José A.R.Fonollosa: Computing multiple weighted reordering hypotheses for a statistical machine translation phrase-based system. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.82-88. [PDF, 623KB]

(2008) Daniel Déchelotte, Gilles Adda, Alexandre Allauzen, Hélène Bonneau-Maynard, Olivier Galibert, Jean-Luc Gauvain, Philippe Langlais, & François Yvon: LIMSI’s statistical translation systems for WMT’08. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.107-110. [PDF, 104KB]

(2008) Matthias Eck, Stephan Vogel, & Alex Waibel: Communicating unknown words in machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 303KB]

(2008) Andreas Eisele, Christian Federmann, Hans Uszkoreit, Hervé Saint-Amand, Martin Kay, Michael Jellinghaus, Sabine Hunsicker, Teresa Herrmann, & Yu Chen: Hybrid architectures for multi-engine machine translation. Translating and the Computer 30, 27-28 November 2008, London; 12pp. [PDF, 170KB]

(2008) Saša Hasan, Juri Ganitkevitch, Hermann Ney, & Jesús Andrés-Ferrer:  Triplet lexicon models for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.372-381. [PDF, 181KB]

(2008) Gareth J.F.Jones, Fabio Fantino, Eamonn Newman, & Ying Zhang: Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.34-41. [PDF, 401KB]

(2008) Maxim Khalilov, Adolfo Hernández H., Marta R. Costa-jussà, Josep M. Crego, Carlos A. Henríquez Q., Patrik Lambert, José A.R. Fonollosa, José B. Mariño, & Rafael E. Banchs: The TALP-UPC ngram-based statistical machine translation systems for ACL-WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.127-130. [PDF, 128KB]

(2008) Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, & Robert Frederking: NineOneOne: recognizing and classifying speech for handling minority language emergency calls.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 95KB]

(2008) Udhyakumar Nallasamy, Alan W.Black, Tanja Schultz, Robert Frederking, & Jerry Weltman: Speech translation for triage of emergency phonecalls in minority languages. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.48-53. [PDF, 229KB]

(2008) Jan Niehues & Stephan Vogel: Discriminative word alignment via alignment matrix modelling. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.18-25. [PDF, 576KB]

(2008) Michael Paul: Overview of the IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 1-17. [PDF, 262KB]; presentation [PDF, 765KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Manny Rayner, Pierrette Bouillon, Glenn Flores, Farzad Ehsani, Marianne Starlander, Beth Ann Hockey, Jane Brotanek, & Lukas Biewald: A small-vocabulary shared task for medical speech translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.60-63. [PDF, 62KB]

(2008) Germán Sanchis & Joan A. Sanchis: Vocabulary extension via PoS information for SMT. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 63-70. [PDF, 379KB]

(2008) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jorge Civera, Francisco Casacuberta, Enrique Vidal, & Hieu Hoang: Improving interactive machine translation via mouse actions. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.485-494. [PDF, 148KB]

(2008) Mark Seligman & Mike Dillinger: Rapid portability among domains in an interactive spoken language translation system. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.40-47. [PDF, 129KB]

(2008) John Tinsley, Yanjun Ma, Sylwia Ozdowska, & Andy Way: MaTrEx: the DCU MT system for WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.171-174. [PDF, 89KB]

(2008) Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl, & Paul Schmidt: Evaluation of a machine translation system for low resource languages: METIS-II.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 78KB]

(2007) Vicente Alabau, Alberto Sanchis, & Francisco Casacuberta: Improving speech-to-speech translation using word posterior probabilities. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.9-14 [PDF, 87KB]

(2007) Julia Aymerich & Hermes Camelo: Automatic extraction of entries for a machine translation dictionary using bitexts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.21-27 [PDF, 88KB]

(2007) Khalid Choukri, Olivier Hamon, & Djamel Mostefa: MT evaluation & TC-STAR. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 33pp. [PDF of PPT presentation, 184KB]

(2007) Jorge Civera & Alfons Juan: Domain adaptation in statistical machine translation with mixture modelling.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 177-180 [PDF, 94KB]

(2007) Marta R. Costa-jussà, Josep M.Crego, Patrik Lambert, Maxim Khalilov, José A.R.Fonollosa, José B.Mariño, & Rafael E.Banchs: Ngram-based statistical machine translation enhanced with multiple weighted reordering hypotheses.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 167-170 [PDF, 208KB]

(2007) Marta R.Costa-jussà, Josep M.Crego, David Vilar, José A.R.Fonollosa, José B.Mariño, & Hermann Ney: Analysis and system combination of phrase- and N-gram-based statistical machine translation systems. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.137-140 [PDF, 169KB]

(2007) Josep M. Crego & José B. Mariño: Extending MARIE: an n-gram-based SMT decoder. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 213-216 [PDF, 334KB]

(2007) D. Déchelotte, H.Schwenk, H.Bonneau-Maynard, A.Allauzen, & G.Adda: A state-of-the-art statistical machine translation system based on Moses. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.127-133 [PDF, 142KB]

(2007) Stefan Dowd: Machine translation – how to get a good machine translation. Ezine Articles, 18 March 2007; 4pp. [PDF, 28KB]

(2007) Loïc Dugast, Jean Senellart, & Philipp Koehn: Statistical post-editing on  SYSTRAN’s rule-based translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 220-223 [PDF, 128KB]

(2007) Nicola Ehling, Richard Zens, & Hermann Ney: Minimum Bayes risk decoding for BLEU.  ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 101-104 [PDF, 334KB]

(2007) Ariadna Font Llitjós & Stephan Vogel: A walk on the other side: adding statistical components to a transfer-based translation system. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.72-79 [PDF, 113KB]

(2007) Pablo Gamallo Otero: Learning bilingual lexicons from comparable English and Spanish corpora. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.191-197 [PDF, 509KB]

(2007) Olivier Hamon, Djamel Mostefa, & Khalid Choukri: End-to-end evaluation of a speech-to-speech translation system in TC-STAR. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.223-230 [PDF, 116KB]

(2007) Philipp Koehn & Chris Callison-Burch: Evaluating evaluation – lessons from the WMT 2007 shared task. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 38pp. [PDF of PPT presentation, 425KB]

(2007) Stella Markantonatou, Sokratis Sofianopoulos, Vassiliki Spilioti, Marina Vassiliou, & Olga Yannoutsou: An MT system embedding pattern knowledge.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 195KB]

(2007) Maite Melero, Antoni Oliver, Toni Badia & Teresa Suñol: Dealing with bilingual divergences in MT using target language n-gram models.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 63KB]

(2007) Maite Melero & Toni Badia: Demonstration of the Spanish to English METIS-II MT system. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.132-133 [PDF, 220KB]

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Dependency-based automatic evaluation for machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.80-87 [PDF, 245KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: An integrated architecture for speech-input multi-target machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.133-136 [PDF, 130KB]

(2007) M. Paulik, K.Rottmann, J.Niehues, S.Hildebrand, & S.Vogel: The ISL phrase-based MT system for the 2007 ACL Workshop on statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 197-202 [PDF, 175KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: Speech-input multi-target machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 56-63 [PDF, 208KB]

(2007) Maja Popovic & Hermann Ney: Word error rates: decomposition over POS classes and applications for error analysis. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 48-55 [PDF, 159KB]

(2006) Evan Ratliff: Me translate pretty one day. Wired 14 (12), 2006; pp.210-213. [PDF, 32KB]

(2007) Marcus Sammer & Stephen Soderland: Building a sense-distinguished multilingual lexicon from monolingual corpora and bilingual lexicons. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.399-406 [PDF, 263KB]

(2007) Nicola Ueffing, Michel Simard, Samuel Larkin, & Howard Johnson: NRC’s PORTAGE system for WMT 2007.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 185-188 [PDF, 356KB]

(2007) Andreas Zollmann, Ashish Venugopal, Matthias Paulik, & Stephan Vogel: The syntax augmented MT (SAMT) system for the shared task in the 2007 ACL workshop on statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 216-219 [PDF, 163KB]

(2006) Jaime Carbonell, Steve Klein, David Miller, Michael Steinbaum, Tomer Grassiany, & Jochen Frei: Context-based machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.19-28 [PDF, 338KB]

(2006) Roldano Cattoni, Nicola Bertoldi, Mauro Cettolo, Boxing Chen, & Marcello Federico: A web-based demonstrator of a multi-lingual phrase-based translation system. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.91-94 [PDF, 184KB]

(2006) Jorge Civera & Alfons Juan: Mixtures of IBM Model 2. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.159-167 [PDF, 142KB]

(2006) Lisa Cochran: Reaching across the language barrier.  Speech Technology, November/December 2006. [PDF, 86KB]

(2006) Josep Maria Crego & José B.Mariño: Improving statistical MT by coupling reordering and decoding [abstract]. Machine Translation 20 (3),2006; pp.199-215.

(2006) Josep M.Crego & José B.Mariño: Integration of POStag-based source reordering into SMT decoding by an extended search graph. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.29-36 [PDF, 287KB]

(2006) Josep M. Crego, Adrià de Gispert, Patrik Lambert, Marta R. Costa-jussà, Maxim Khalilov, Rafael E.Banchs, José B.Mariño, & José A.R.Fonollosa: N-gram-based SMT system enhanced with reordering patterns.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 162-165 [PDF, 209KB]

(2006) Marcello Federico & Nicola Bertoldi: How many bits are needed to store probabilities for phrase-based translation?  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 94-101 [PDF, 204KB]

(2006) Miguel Á. Garcia Cumbreras, L. Alfonso Ureña López, & Fernando Martínez Santiago: BRUJA: question classification for Spanish using machine translation and an English classifier. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.39-44 [PDF, 412 KB]

(2006) Jesús Giménez & Lluís Màrquez: The LDV-COMBO system for SMT.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 166-169 [PDF, 183KB]

(2006) Deepa Gupta & Marcello Federico: Exploiting word transformation in statistical machine translation from Spanish to English. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.75-80 [PDF, 140KB]

(2006) Philipp Koehn & Christof Monz: Manual and automatic evaluation of machine translation between European languages.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 102-121 [PDF, 199KB]

(2006) Patrik Lambert & Rafael Banchs: Grouping multi-word expressions according to part-of-speech in statistical machine translation. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.9-16 [PDF, 318KB]

(2006) Michael C. McCord: MT for social impact (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 1022KB]

(2006) José B.Mariño, Rafael E.Banchs, Josep M.Crego, Adrià de Gispert, Patrik Lambert, José A.R.Fonollosa, & Marta R.Costa-jussà: N-gram-based machine translation.  Computational Linguistics 32 (4), pp. 528-549. [PDF, 277KB]

(2006) Evgeny Matusov, Nicola Ueffing, & Hermann Ney: Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.33-40 [PDF, 91KB]

(2006) Maite Melero, Antoni Oliver, & Toni Badia: Automatic multilingual subtitling in the eTITLE project. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 18pp. [PDF, 149KB]

(2006) Maja Popović & Hermann Ney: POS-based reorderings for statistical machine translation. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1278-1283 [PDF, 255KB]

(2006) Maja Popović & Hermann Ney: Statistical machine translation with a small amount of bilingual training data. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.25-29. [PDF, 115KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Ari Rappoport & Tsahi Levent-Levi: Induction of cross-language affix and letter sequence correspondence. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 332 KB]

(2006) Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff & Oren Etzioni: Ambiguity reduction for machine translation: human-computer collaboration.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.193-202 [PDF, 247KB]

(2006) Jesús Tomás & Francisco Casacuberta: Statistical phrase-based models for interactive computer-assisted translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.835-841. [PDF, 358KB]

(2005) Victoria Arranz, Elisabet Comelles, & David Farwell: The FAME speech-to-speech translation system for Catalan, English, and Spanish. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.195-202. [PDF, 185KB]

(2005) Necip Fazil Ayan, Bonnie J. Dorr, & Christof Monz: NeurAlign: combining word alignments using neural networks.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 65-72. [PDF, 139KB]

(2005) Necip Fazil Ayan, Bonnie J.Dorr, & Christof Monz: Alignment link projection using transformation-based learning.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 185-192. [PDF, 132KB]

(2005) Toni Badia, Gemma Boleda, Maite Melero, & Antoni Oliver: An n-gram approach to exploiting a monolingual corpus for machine translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.1-7. [PDF, 1877KB]

(2005) Rafael E. Banchs, Josep M. Crego, Adrià de Gispert, Patrik Lambert & José B. Mariño: Statistical machine translation of Euparl data by using bilingual n-grams. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 133-136. [PDF, 82KB]

(2005) Oliver Bender, Saša Hasan, David Vilar, Richard Zens, & Hermann Ney: Comparison of generation strategies for interactive machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 33-40. [PDF, 83KB]

(2005) Marta Ruiz Costa-jussà & José A.R.Fonollosa: Improving phrase-based statistical machine translation by modifying phrase extraction and including several features.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 149-154. [PDF, 185KB]

(2005) Andreas Eisele: First steps towards multi-engine machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 155-158. [PDF, 43KB]

(2005) Jesús Giménez & Lluís Màrquez: Combining linguistic data views for phrase-based SMT. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 145-148. [PDF, 73KB]

(2005) Philipp Koehn & Christof Monz: Shared task: statistical machine translation between European languages. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 119-124. [PDF, 41KB]

(2005) Patrik Lambert & Rafael Banchs: Data inferred multi-word expressions for statistical machine translation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.396-403. [PDF, 182KB]

(2005) José B. Mariño, Rafael E. Banchs, Josep María Crego, Adrià de Gispert, Patrik Lambert, José A. R. Fonollosa, & Marta Ruiz: Bilingual n-gram statistical machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.275-282. [PDF, 197KB]

(2005) Maja Popovic & Hermann Ney: Exploiting phrasal lexica and additional morpho-syntactic language resources for statistical machine translation with scarce training data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 212-218. [PDF, 65KB]

(2005) Fatiha Sadat, Howard Johnson, Akakpo Agbago, George Foster, Roland Kuhn, Joel Martin, & Aaron Tikuisis: PORTAGE: a phrase-based machine translation system.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.129-132. [PDF, 54KB]

(2005) Juan Miguel Vilar & Enrique Vidal: A recursive statistical translation model. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 199-207. [PDF, 183KB]

(2005) Chengqing Zong & Mark Seligman: Toward practical spoken language translation [abstract]. Machine Translation 19 (2), 2005; pp.113-137.

Spanish to French

(2009) Eric Wehrli, Luka Nerima, Violeta Seretan, & Yves Scherrer: On-line and off-line translation aids for non-native readers. Proceedings of the International Multiconference on Computer Science and Information Technology, Mrągowo, Poland, 12-14 October 2009; pp.299-303. [PDF, 77KB]

Spanish to Galician

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2005) Carme Armentano-Oller, Antonio M. Corbí-Bellot, Mikel L. Forcada, Mireia Ginestí-Rosell, Boyan Bonev, Sergio Ortiz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, & Felipe Sánchez-Martínez: An Open-Source shallow-transfer machine translation toolbox: consequences of its release and availability  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.23-30. [PDF, 255KB]

Spanish to German

(2006) Kisuh Ahn & Matthew Frampton: Automatic generation of translation dictionaries using intermediary languages. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.41-44 [PDF, 278KB]

(2006) Maja Popović & Hermann Ney: POS-based reorderings for statistical machine translation. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1278-1283 [PDF, 255KB]

Spanish to Greek

(2009) Marcel Puig Portella & Eleni Papadopoulou: Treatment of the imperative forms in the machine translation between Catalan, Spanish and Greek.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.198-202 [abstract]

Spanish to Portuguese

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2006) Alberto Simões & José João Almeida: Combinatory examples extraction for machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.27-32 [PDF, 130KB]

(2005) Carme Armentano-Oller, Antonio M. Corbí-Bellot, Mikel L. Forcada, Mireia Ginestí-Rosell, Boyan Bonev, Sergio Ortiz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, & Felipe Sánchez-Martínez: An Open-Source shallow-transfer machine translation toolbox: consequences of its release and availability  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.23-30. [PDF, 255KB]

Spanish to Quechua

(2008) Michael Mohler & Rada Mihalcea: BABYLON parallel text builder: gathering parallel texts for low-density languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 63KB]

Spanish to UNL

(2005) Luis Iraola: Using WordNet for linking UWs to the UNL UW system. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.370-379 [abstract, PDF, 59KB]

Swahili to English

(2009) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: The SAWA corpus: a parallel corpus English – Swahili. Proceedings of the EACL Workshop on Language Technologies for African Languages (AfLaT 2009), Athens, Greece, 31 March 2009; pp.9-16. [PDF, 129KB]

Swedish

(2009) Krister Lindén: Guessers for finite-state transducer lexicons [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 103KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Martin Volk: How bad is the problem of PP-attachment? A comparison of English, German and Swedish. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.81-88. [PDF, 133KB]

Swedish to Danish

(2009) Christian Hardmeier & Martin Volk: Using linguistic annotations in statistical machine translation of film subtitles. In: Kristina Jokinen and Eckhard Bick (eds.) Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA 2009), 15-16 May 2009, Odense, Denmark; pp.57-64. [PDF,  158KB]

(2009) Francis M.Tyers & Jacob Nordfalk: Shallow-transfer rule-based machine translation for Swedish to Danish. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 27-33. [PDF, 178KB]

(2008) Martin Volk: The automatic translation of film subtitles: a machine translation success story? In: J.Nivre, M.Dahllöf, and B.Megyesi (eds.) Resourceful language technology: Festschrift in honor of Anna Sågvall Hein, Uppsala University, 2008; pp. 202-214. [PDF, 44KB]

(2007) Sami Virpioja, Jaako J.Väyrynen, Mathias Creutz, & Markus Sadeniemi: Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.491-498 [PDF, 105KB]

(2007) Martin Volk & Søren Harder: Evaluating MT with translations or translators: what is the difference? MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.499-506 [PDF, 61KB]

Swedish to English

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2008) Sara Stymne & Maria Holmqvist: Processing of Swedish compounds for phrase-based statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.182-191. [PDF, 532KB]

(2007) Anna Sågvall Hein: Rule-based and statistical machine translation with a focus on Swedish [abstract]. Invited talk at TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; 1p. [PDF, 15KB]

(2007) Martin Volk, Joakim Lundborg, & Maël Mettler: A search tool for parallel treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.85-92 [PDF, 270KB]

(2006) Anna Sågvall Hein & Per Weijnitz: Approaching a new language in machine translation – considerations in choosing a strategy. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.7-12. [PDF, 95KB]

Swedish to Finnish

(2007) Sami Virpioja, Jaako J.Väyrynen, Mathias Creutz, & Markus Sadeniemi: Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.491-498 [PDF, 105KB]

Swedish to German

(2007) Martin Volk, Joakim Lundborg, & Maël Mettler: A search tool for parallel treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.85-92 [PDF, 270KB]

(2006) Anna Sågvall Hein & Per Weijnitz: Approaching a new language in machine translation – considerations in choosing a strategy. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.7-12. [PDF, 95KB]

Swedish to Norwegian

(2008) Martin Volk: The automatic translation of film subtitles: a machine translation success story? In: J.Nivre, M.Dahllöf, and B.Megyesi (eds.) Resourceful language technology: Festschrift in honor of Anna Sågvall Hein, Uppsala University, 2008; pp. 202-214. [PDF, 44KB]

Swedish to Turkish

(2008) Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, & Joakim Nivre: Swedish-Turkish parallel treebank. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 80KB]

(2006) Beáta Bandmann Megyesi, Anna Sågvall Hein, & Éva Csató Johanson: Building a Swedish-Turkish parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2130-2133 [PDF, 692KB]

Tagalog to English

(2008) Brianna Laugher & Ben MacLeod: Machine translation for Indonesian and Tagalog. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.397-401. [PDF, 518KB]

Tajik

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

Tamil

(2009) Raghavendra Udupa, K.Saravanan, A.Kumaran, & Jagadeesh Jagarlamudi: MINT: a method for effective and scalable mining of named entity transliterations from large comparable corpora.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.799-807. [PDF, 1120KB]

Tamil to English

(2008) K.Saravanan & A.Kumaran: Some experiments in mining named entitiy transliteration pairs from comparable corpora. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.26-33. [PDF, 805KB]

Telugu to English

(2008) Sivaji Bandyopadhyay, Tapabrata Mondal, Sudip Kumar Naskar, Asif Ekbal, Rejwanul Haque, & Srinivasa Rao Godavarthy: Bengali, Hindi and Telegu to English ad-hoc bilingual task. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; abstract, p.66. [PDF, 12KB]

(2008) Prasad Pingali & Vasudeva Varma: Hindi and Telugu to English CLIR using query expansion. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; abstract, p.68. [PDF, 12KB]

Thai

(2005) Krisda Khankasikam & Nuttanart Muansuwan: Thai word segmentation a lexical semantic approach . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.331-338. [PDF, 155KB]

(2005) Patanakul Sathapornrungkij & Charnyote Pluempitiwiriyawej: Construction of Thai WordNet lexical database from machine readable dictionaries. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.87-92 [PDF, 322KB]

Thai to English

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorical grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hammam Riza, Hitoshi Isahara, & Purev Jaimal: Synset assignment for bi-lingual dictionary with limited resource. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.673-678. [PDF, 549KB]

(2005) Teerapong Modhiran, Krit Kosawat, Supon Klaithin, Monthika Boriboon & Thepchai Supnithi: PARSITTE: online Thai-English machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.404-411. [PDF, 413KB]

Tigrinya

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

Turkish

(2007) Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arısoy, Murat Saraçlar, & Andreas Stolcke: Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.380-387 [PDF, 136KB]

Turkish to English

(2009) Nicola Bertoldi, Arianna Bisazza, Mauro Cettolo, Germán Sanchis-Trilles, & Marcello Federico: FBK @ IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 37-44. [PDF, 346KB]; poster [PDF, 411KB]

(2009) Arianna Bisazza & Marcello Federico: Morphological pre-processing for Turkish to English statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 129-135. [PDF, 373KB]; presentation [PDF of PPT, 302KB]

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Selçuk Köprü: AppTek Turkish-English machine translation system description for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 19-23. [PDF, 309KB]

(2009) Yves Lepage, Adrien Lardilleux, & Julien Gosme: The GREYC translation memory for the IWSLT 2009 evaluation campaign: one step beyond translation memory. IWSLT 2009: Proceedings of the International Workshop on Spoke n Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 45-49. [PDF, ]; presentation [PDF of PPT,

(2009) Yanjun Ma, Tsuyoshi Okita, Özlem Çetinoğlu, Jinhua Du, & Andy Way: Low-resource machine translation using MaTrEx: the DCU machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 29-36. [PDF, 305KB]; presentation [PDF of PPT, 466KB]

(2009) Coşkun Mermer, Hamza Kaya, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 113-117. [PDF, 457KB]; poster [PDF, 2530KB]

(2009) Michael Paul: Overview of the IWSLT 2009 evaluation campaign. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 1-18. [PDF, 398KB]; presentation [PDF of PPT, 481KB]

(2009) Wade Shen, Brian Delaney, A.Ryan Aminzadeh, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2009 MT system. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 71-78. [PDF, 398KB]; presentation [PDF of PPT, 1730KB]

(2005) Ilyas Cicekli: Inducing translation templates with type constraints [abstract]. Machine Translation 19 (3-4), 2005; pp.283-299.

Turkish to Swedish

(2008) Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, & Joakim Nivre: Swedish-Turkish parallel treebank. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 80KB]

(2006) Beáta Bandmann Megyesi, Anna Sågvall Hein, & Éva Csató Johanson: Building a Swedish-Turkish parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2130-2133 [PDF, 692KB]

Turkmen to Turkish

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: Machine translation between Turkic languages. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 189-192 [PDF, 135KB]

Uighur

(2005) Polat Kadir, Koichi Yamada & Hiroshi Kinukawa: Comparative study on Japanese and Uyghur grammars for an English-Uyghur machine translation system . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.432-437. [PDF, 304KB]

Ukrainian to English

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

UNL

(2008) Rajat Kumar Mohanty & Pushpak Bhattacharyya: Lexical resources for semantic extraction.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 148KB]

UNL to Hindi

(2007) Smriti Singh, Mrugank Dalal, Vishal Vachhani, Pushpak Bhattacharyya, & Om P.Damani: Hindi generation from interlingua. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.421-428 [PDF, 206KB]

Urdu to English

(2009) Kathy Baker, Steven Bethard, Michael Bloodgood, Ralf Brown, Chris Callison-Burch, Glen Coppersmith, Bonnie Dorr, Wes Filardo, Kendall Giles, Anni Irvine, Mike Kayser, Lori Levin, Justin Martineau, Jim Mayfield, Scott Miller, Aaron Phillips, Andrew Philpot, Christine Piatko, Lane Schwartz, & David Zajic: Semantically informed machine translation (SIMT). Final report of the 2009 Summer Camp for Applied Language Exploration [John Hopkins University], November 2009; 152pp. [PDF, 5705KB]

(2009) Chris Callison-Burch: Fast, cheap, and creative: evaluating translation quality using Amazon’s Mechanical Turk. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.286-295. [PDF, 289KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2009) Benjamin Snyder, Tahira Naseem, & Regina Barzilay: Unsupervised multilingual grammar induction. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.73-81. [PDF, 273KB]

(2009) The 2009 NIST machine translation evaluation plan (MT09). [NIST, 2009]; 8pp. [PDF, 360KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Urdu-English. [NIST, 2009]; [HTML, 125KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Combination tests. [NIST, 2009]; [HTML, 114KB]

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2008) Andreas Zollmann, Ashish Venugopal, Franz Och, & Jay Ponte: A systematic comparison of phrase-based, hierarchical and syntax-augmented statistical MT. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1145-1152. [PDF, 153KB]

(2008) The 2008 NIST open machine translation evaluation plan (MT08). [NIST, 2008]; 7pp. [PDF, 360KB]

(2008) NIST 2008 open machine translation evaluation – (MT08). Official evaluation results. [NIST, 2008] [HTML]

Urdu to Hindi

 (2009) Abbas Malik, Laurent Besacier, Christian Boitet, & Pushpak Bhattacharyya: A hybrid model for Urdu Hindi transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.177-185. [PDF, 377KB]

Veneto to English

(2009) Rodolfo Delmonte, Antonella Bristot, Sara Tonelli, & Emanuele Pianta: English/Veneto resource poor machine translation with STILVEN.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.82-89 [abstract]

Vietnamese

(2009) Chiori Hori, Sakriani Sakti, Michael Paul, Noriyuki Kimura, Yutaka Ashikari, Ryosuke Isotani, Eiichiro Sumita, & Satoshi Nakamura: Network-based speech-to-speech translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; p. 168. [PDF, 286KB]; presentation [PDF of PPT, 485KB]

Vietnamese to French

(2009) Thi-Ngoc-Diep Do, Viet-Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Mining a comparable text corpus for a Vietnamese-French statistical machine translation system.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.165-172. [PDF, 152KB]

Welsh

(2009) proceedings of SALTMIL 2009, “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7 2009. [PDF, 16458KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

Welsh to English

(2009) Llio Humphreys: Automated English subtitling of Welsh TV programs. [SEPLN 2009] SALTMIL 2009, Donostia-San Sebastián, Spain. “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7; pp.63-72. [PDF, 322KB]

(2009) Francis M. Tyers & Kevin Donnelly: apertium-cy: a collaboratively-developed RBMT system for Welsh to English. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.57-66. [PDF, 134KB]  [presentation at MT Marathon 2009, 1376KB]

(2006) Dafydd Jones & Andreas Eisele: Phrase-based statistical machine translation between English and Welsh.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.75-77. [PDF, 54KB]

Yiddish to English

(2009) Dmitriy Genzel, Klaus Machery & Jakob Uszkoreit: Creating a high-quality machine translation system for a low-resource language: Yiddish. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 41-48. [PDF, 157KB]