Jakob Elming


2014

Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems
Natalie Schluter | Anders Søgaard | Jakob Elming | Dirk Hovy | Barbara Plank | Héctor Martínez Alonso | Anders Johannsen | Sigrid Klerke
Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)

Robust Cross-Domain Sentiment Analysis for Low-Resource Languages
Jakob Elming | Barbara Plank | Dirk Hovy
Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

2013

Using Crowdsourcing to get Representations based on Regular Expressions
Anders Søgaard | Hector Martinez | Jakob Elming | Anders Johannsen
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

Down-stream effects of tree-to-dependency conversions
Jakob Elming | Anders Johannsen | Sigrid Klerke | Emanuele Lapponi | Hector Martinez Alonso | Anders Søgaard
Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

2010

Incremental Re-training for Post-editing SMT
Daniel Hardt | Jakob Elming
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers

A method is presented for incremental re-training of an SMT system, in which a local phrase table is created and incrementally updated as a file is translated and post-edited. It is shown that translation data from within the same file has higher value than other domain-specific data. In two technical domains, within-file data increases BLEU score by several full points. Furthermore, a strong recency effect is documented; nearby data within the file has greater value than more distant data. It is also shown that the value of translation data is strongly correlated with a metric defined over new occurrences of n-grams. Finally, it is argued that the incremental re-training prototype could serve as the basis for a practical system which could be interactively updated in real time in a post-editing setting. Based on the results here, such an interactive system has the potential to dramatically improve translation quality.

2009

Syntactic Reordering for English-Arabic Phrase-Based Machine Translation
Jakob Elming | Nizar Habash
Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages

2008

Syntactic Reordering Integrated with Phrase-Based SMT
Jakob Elming
Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)

Syntactic Reordering Integrated with Phrase-Based SMT
Jakob Elming
Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)

2007

Combination of Statistical Word Alignments Based on Multiple Preprocessing Schemes
Jakob Elming | Nizar Habash
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers

2006

Transformation-Based Correction of Rule-Based MT
Jakob Elming
Proceedings of the 11th Annual Conference of the European Association for Machine Translation

2005

A Corpus-Based Approach to Topic in Danish Dialog
Philip Diderichsen | Jakob Elming
Proceedings of the ACL Student Research Workshop