Elaine Farrow
2022
The EuroPat Corpus: A Parallel Corpus of European Patent Data
Kenneth Heafield | Elaine Farrow | Jelmer van der Linde | Gema Ramírez-Sánchez | Dion Wiggins
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Kenneth Heafield | Elaine Farrow | Jelmer van der Linde | Gema Ramírez-Sánchez | Dion Wiggins
Proceedings of the Thirteenth Language Resources and Evaluation Conference
We present the EuroPat corpus of patent-specific parallel data for 6 official European languages paired with English: German, Spanish, French, Croatian, Norwegian, and Polish. The filtered parallel corpora range in size from 51 million sentences (Spanish-English) to 154k sentences (Croatian-English), with the unfiltered (raw) corpora being up to 2 times larger. Access to clean, high quality, parallel data in technical domains such as science, engineering, and medicine is needed for training neural machine translation systems for tasks like online dispute resolution and eProcurement. Our evaluation found that the addition of EuroPat data to a generic baseline improved the performance of machine translation systems on in-domain test data in German, Spanish, French, and Polish; and in translating patent data from Croatian to English. The corpus has been released under Creative Commons Zero, and is expected to be widely useful for training high-quality machine translation systems, and particularly for those targeting technical documents such as patents and contracts.
2013
Improving interpretation robustness in a tutorial dialogue system
Myroslava Dzikovska | Elaine Farrow | Johanna Moore
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications
Myroslava Dzikovska | Elaine Farrow | Johanna Moore
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications
2010
Beetle II: A System for Tutoring and Computational Linguistics Experimentation
Myroslava O. Dzikovska | Johanna D. Moore | Natalie Steinhauser | Gwendolyn Campbell | Elaine Farrow | Charles B. Callaway
Proceedings of the ACL 2010 System Demonstrations
Myroslava O. Dzikovska | Johanna D. Moore | Natalie Steinhauser | Gwendolyn Campbell | Elaine Farrow | Charles B. Callaway
Proceedings of the ACL 2010 System Demonstrations
2009
Context-Dependent Regression Testing for Natural Language Processing
Elaine Farrow | Myroslava O. Dzikovska
Proceedings of the Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing (SETQA-NLP 2009)
Elaine Farrow | Myroslava O. Dzikovska
Proceedings of the Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing (SETQA-NLP 2009)
Dealing with Interpretation Errors in Tutorial Dialogue
Myroslava Dzikovska | Charles Callaway | Elaine Farrow | Johanna Moore | Natalie Steinhauser | Gwendolyn Campbell
Proceedings of the SIGDIAL 2009 Conference
Myroslava Dzikovska | Charles Callaway | Elaine Farrow | Johanna Moore | Natalie Steinhauser | Gwendolyn Campbell
Proceedings of the SIGDIAL 2009 Conference
2007
Adaptive Tutorial Dialogue Systems Using Deep NLP Techniques
Myroslava O. Dzikovska | Charles B. Callaway | Elaine Farrow | Manuel Marques-Pita | Colin Matheson | Johanna D. Moore
Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT)
Myroslava O. Dzikovska | Charles B. Callaway | Elaine Farrow | Manuel Marques-Pita | Colin Matheson | Johanna D. Moore
Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT)