Franz Josef Och

Also published as: F. J. Och, Franz J. Och, Franz Och


2015

pdf bib
Unsupervised Morphology Induction Using Word Embeddings
Radu Soricut | Franz Och
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

2012

pdf bib
Improved Domain Adaptation for Statistical Machine Translation
Wei Wang | Klaus Macherey | Wolfgang Macherey | Franz Och | Peng Xu
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers

We present a simple and effective infrastructure for domain adaptation for statistical machine translation (MT). To build MT systems for different domains, it trains, tunes and deploys a single translation system that is capable of producing adapted domain translations and preserving the original generic accuracy at the same time. The approach unifies automatic domain detection and domain model parameterization into one system. Experiment results on 20 language pairs demonstrate its viability.

2011

pdf bib
Language-independent compound splitting with morphological operations
Klaus Macherey | Andrew Dai | David Talbot | Ashok Popat | Franz Och
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

pdf bib
A Lightweight Evaluation Framework for Machine Translation Reordering
David Talbot | Hideto Kazawa | Hiroshi Ichikawa | Jason Katz-Brown | Masakazu Seno | Franz Och
Proceedings of the Sixth Workshop on Statistical Machine Translation

pdf bib
Training a Parser for Machine Translation Reordering
Jason Katz-Brown | Slav Petrov | Ryan McDonald | Franz Och | David Talbot | Hiroshi Ichikawa | Masakazu Seno | Hideto Kazawa
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing

pdf bib
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation.
Ashish Venugopal | Jakob Uszkoreit | David Talbot | Franz Och | Juri Ganitkevitch
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing

2010

pdf bib
“Poetic” Statistical Machine Translation: Rhyme and Meter
Dmitriy Genzel | Jakob Uszkoreit | Franz Och
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing

pdf bib
Model Combination for Machine Translation
John DeNero | Shankar Kumar | Ciprian Chelba | Franz Och
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

2009

pdf bib
Using a Dependency Parser to Improve SMT for Subject-Object-Verb Languages
Peng Xu | Jaeho Kang | Michael Ringgaard | Franz Och
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics

pdf bib
Efficient Minimum Error Rate Training and Minimum Bayes-Risk Decoding for Translation Hypergraphs and Lattices
Shankar Kumar | Wolfgang Macherey | Chris Dyer | Franz Och
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP

2008

pdf bib
A Systematic Comparison of Phrase-Based, Hierarchical and Syntax-Augmented Statistical MT
Andreas Zollmann | Ashish Venugopal | Franz Och | Jay Ponte
Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)

pdf bib
Lattice Minimum Bayes-Risk Decoding for Statistical Machine Translation
Roy Tromble | Shankar Kumar | Franz Och | Wolfgang Macherey
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing

pdf bib
Lattice-based Minimum Error Rate Training for Statistical Machine Translation
Wolfgang Macherey | Franz Och | Ignacio Thayer | Jakob Uszkoreit
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing

2007

pdf bib
Improving Word Alignment with Bridge Languages
Shankar Kumar | Franz J. Och | Wolfgang Macherey
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

pdf bib
Large Language Models in Machine Translation
Thorsten Brants | Ashok C. Popat | Peng Xu | Franz J. Och | Jeffrey Dean
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

pdf bib
An Empirical Study on Computing Consensus Translations from Multiple Machine Translation Systems
Wolfgang Macherey | Franz J. Och
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

2005

bib
Statistical Machine Translation: Foundations and Recent Advances
Franz Josef Och
Proceedings of Machine Translation Summit X: Tutorial notes

2004

pdf bib
Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics
Chin-Yew Lin | Franz Josef Och
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04)

pdf bib
The ISI/USC MT system
Emil Ettelaie | Kevin Knight | Daniel Marcu | Dragos Stefan Munteanu | Franz J. Och | Ignacio Thayer | Quamrul Tipu
Proceedings of the First International Workshop on Spoken Language Translation: Evaluation Campaign

pdf bib
ORANGE: a Method for Evaluating Automatic Evaluation Metrics for Machine Translation
Chin-Yew Lin | Franz Josef Och
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

pdf bib
The Alignment Template Approach to Statistical Machine Translation
Franz Josef Och | Hermann Ney
Computational Linguistics, Volume 30, Number 4, December 2004

pdf bib
A Smorgasbord of Features for Statistical Machine Translation
Franz Josef Och | Daniel Gildea | Sanjeev Khudanpur | Anoop Sarkar | Kenji Yamada | Alex Fraser | Shankar Kumar | Libin Shen | David Smith | Katherine Eng | Viren Jain | Zhen Jin | Dragomir Radev
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004

pdf bib
Discriminative Reranking for Machine Translation
Libin Shen | Anoop Sarkar | Franz Josef Och
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004

2003

pdf bib
Statistical Phrase-Based Translation
Philipp Koehn | Franz J. Och | Daniel Marcu
Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics

pdf bib
Minimum Error Rate Training in Statistical Machine Translation
Franz Josef Och
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics

pdf bib
Efficient Search for Interactive Statistical Machine Translation
Franz Josef Och | Richard Zens | Hermann Ney
10th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Comparison of Alignment Templates and Maximum Entropy Models for NLP
Oliver Bender | Klaus Macherey | Franz Josef Och | Hermann Ney
10th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Maximum Entropy Models for Named Entity Recognition
Oliver Bender | Franz Josef Och | Hermann Ney
Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003

pdf bib
Statistical QA - Classifier vs. Re-ranker: What’s the difference?
Deepak Ravichandran | Eduard Hovy | Franz Josef Och
Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering

pdf bib
Rapid-response machine translation for unexpected languages
Douglas W. Oard | Franz Josef Och
Proceedings of Machine Translation Summit IX: Papers

Statistical techniques for machine translation offer promise for rapid development in response to unexpected requirements, but realizing that potential requires rapid acquisition of required resources as well. This paper reports the results of experiments with resources collected in ten days; about 1.3 million words of parallel text from five types of sources and a bilingual term list with about 20,000 term pairs. Systems were trained with resources individually and in combination, using an approach based on alignment templates. The use of all available resources was found to yield the best results in an automatic evaluation using the BLEU measure, but a single resource (the Bible) coupled with a small amount of in-domain manual translation (less than 6,000 words) achieved more than 85% of that upper baseline. With a concerted effort, such a system could be built in a single day.

pdf bib
A Systematic Comparison of Various Statistical Alignment Models
Franz Josef Och | Hermann Ney
Computational Linguistics, Volume 29, Number 1, March 2003

2002

pdf bib
Generation of Word Graphs in Statistical Machine Translation
Nicola Ueffing | Franz Josef Och | Hermann Ney
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002)

pdf bib
Improving Alignment Quality in Statistical Machine Translation Using Context-dependent Maximum Entropy Models
Ismael García Varea | Franz J. Och | Hermann Ney | Francisco Casacuberta
COLING 2002: The 19th International Conference on Computational Linguistics

pdf bib
Discriminative Training and Maximum Entropy Models for Statistical Machine Translation
Franz Josef Och | Hermann Ney
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics

pdf bib
Efficient integration of maximum entropy lexicon models within the training of statistical alignment models
Ismael García-Varea | Franz J. Och | Hermann Ney | Francisco Casacuberta
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: Technical Papers

Maximum entropy (ME) models have been successfully applied to many natural language problems. In this paper, we show how to integrate ME models efficiently within a maximum likelihood training scheme of statistical machine translation models. Specifically, we define a set of context-dependent ME lexicon models and we present how to perform an efficient training of these ME models within the conventional expectation-maximization (EM) training of statistical translation models. Experimental results are also given in order to demonstrate how these ME models improve the results obtained with the traditional translation models. The results are presented by means of alignment quality comparing the resulting alignments with manually annotated reference alignments.

2001

pdf bib
An Efficient A* Search Algorithm for Statistical Machine Translation
Franz Josef Och | Nicola Ueffing | Hermann Ney
Proceedings of the ACL 2001 Workshop on Data-Driven Methods in Machine Translation

pdf bib
Robust Knowledge Discovery from Parallel Speech and Text Sources
F. Jelinek | W. Byrne | S. Khudanpur | B. Hladká | H. Ney | F. J. Och | J. Cuřín | J. Psutka
Proceedings of the First International Conference on Human Language Technology Research

pdf bib
The RWTH System for Statistical Translation of Spoken Dialogues
H. Ney | F. J. Och | S. Vogel
Proceedings of the First International Conference on Human Language Technology Research

pdf bib
Statistical multi-source translation
Franz Josef Och | Hermann Ney
Proceedings of Machine Translation Summit VIII

We describe methods for translating a text given in multiple source languages into a single target language. The goal is to improve translation quality in applications where the ultimate goal is to translate the same document into many languages. We describe a statistical approach and two specific statistical models to deal with this problem. Our method is generally applicable as it is independent of specific models, languages or application domains. We evaluate the approach on a multilingual corpus covering all eleven official European Union languages that was collected automatically from the Internet. In various tests we show that these methods can significantly improve translation quality. As a side effect, we also compare the quality of statistical machine translation systems for many European languages in the same domain.

pdf bib
What can machine translation learn from speech recognition?
Franz Josef Och | Hermann Ney
Workshop on MT2010: Towards a Road Map for MT

The performance of machine translation technology after 50 years of development leaves much to be desired. There is a high demand for well performing and cheap MT systems for many language pairs and domains, which automatically adapt to rapidly changing terminology. We argue that for successful MT systems it will be crucial to apply data-driven methods, especially statistical machine translation. In addition, it will be very important to establish common test environments. This includes the availability of large parallel training corpora, well defined test corpora and standardized evaluation criteria. Thereby research results can be compared and this will open the possibility for more competition in MT research.

pdf bib
Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach
Ismael García-Varea | Franz J. Och | Hermann Ney | Francisco Casacuberta
Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics

2000

pdf bib
Statistical Machine Translation
Franz Josef Och | Hermann Ney
5th EAMT Workshop: Harvesting Existing Resources

pdf bib
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research
Sonja Nießen | Franz Josef Och | Gregor Leusch | Hermann Ney
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

pdf bib
Improved Statistical Alignment Models
Franz Josef Och | Hermann Ney
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics

pdf bib
A Comparison of Alignment Models for Statistical Machine Translation
Franz Josef Och | Hermann Ney
COLING 2000 Volume 2: The 18th International Conference on Computational Linguistics

1999

pdf bib
Improved Alignment Models for Statistical Machine Translation
Franz Josef Och | Christoph Tillmann | Hermann Ney
1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora

pdf bib
An Efficient Method for Determining Bilingual Word Classes
Franz Josef Och
Ninth Conference of the European Chapter of the Association for Computational Linguistics

1998

pdf bib
Improving Statistical Natural Language Translation with Categories and Rules
Franz Josef Och | Hans Weber
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

pdf bib
Improving Statistical Natural Language Translation with Categories and Rules
Franz Josef Och | Hans Weber
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics