OCR, Classification& Machine Translation (OCCAM)
Joachim Van den Bogaert, Arne Defauw, Frederic Everaert, Koen Van Winckel, Alina Kramchaninova, Anna Bardadym, Tom Vanallemeersch, Pavel Smrž, Michal Hradiš
Abstract
The OCCAM project (Optical Character recognition, ClassificAtion & Machine Translation) aims at integrating the CEF (Connecting Europe Facility) Automated Translation service with image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT). It will support the automated translation of scanned business documents (a document format that, currently, cannot be processed by the CEF eTranslation service) and will also lead to a tool useful for the Digital Humanities domain.- Anthology ID:
- 2020.eamt-1.62
- Volume:
- Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
- Month:
- November
- Year:
- 2020
- Address:
- Lisboa, Portugal
- Editors:
- André Martins, Helena Moniz, Sara Fumega, Bruno Martins, Fernando Batista, Luisa Coheur, Carla Parra, Isabel Trancoso, Marco Turchi, Arianna Bisazza, Joss Moorkens, Ana Guerberof, Mary Nurminen, Lena Marg, Mikel L. Forcada
- Venue:
- EAMT
- SIG:
- Publisher:
- European Association for Machine Translation
- Note:
- Pages:
- 481–482
- Language:
- URL:
- https://aclanthology.org/2020.eamt-1.62
- DOI:
- Bibkey:
- Cite (ACL):
- Joachim Van den Bogaert, Arne Defauw, Frederic Everaert, Koen Van Winckel, Alina Kramchaninova, Anna Bardadym, Tom Vanallemeersch, Pavel Smrž, and Michal Hradiš. 2020. OCR, Classification& Machine Translation (OCCAM). In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, pages 481–482, Lisboa, Portugal. European Association for Machine Translation.
- Cite (Informal):
- OCR, Classification& Machine Translation (OCCAM) (Van den Bogaert et al., EAMT 2020)
- Copy Citation:
- PDF:
- https://aclanthology.org/2020.eamt-1.62.pdf
Export citation
@inproceedings{van-den-bogaert-etal-2020-ocr, title = "{OCR}, Classification{\&} Machine Translation ({OCCAM})", author = "Van den Bogaert, Joachim and Defauw, Arne and Everaert, Frederic and Van Winckel, Koen and Kramchaninova, Alina and Bardadym, Anna and Vanallemeersch, Tom and Smr{\v{z}}, Pavel and Hradi{\v{s}}, Michal", editor = "Martins, Andr{\'e} and Moniz, Helena and Fumega, Sara and Martins, Bruno and Batista, Fernando and Coheur, Luisa and Parra, Carla and Trancoso, Isabel and Turchi, Marco and Bisazza, Arianna and Moorkens, Joss and Guerberof, Ana and Nurminen, Mary and Marg, Lena and Forcada, Mikel L.", booktitle = "Proceedings of the 22nd Annual Conference of the European Association for Machine Translation", month = nov, year = "2020", address = "Lisboa, Portugal", publisher = "European Association for Machine Translation", url = "https://aclanthology.org/2020.eamt-1.62", pages = "481--482", abstract = "The OCCAM project (Optical Character recognition, ClassificAtion {\&} Machine Translation) aims at integrating the CEF (Connecting Europe Facility) Automated Translation service with image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT). It will support the automated translation of scanned business documents (a document format that, currently, cannot be processed by the CEF eTranslation service) and will also lead to a tool useful for the Digital Humanities domain.", }
<?xml version="1.0" encoding="UTF-8"?> <modsCollection xmlns="http://www.loc.gov/mods/v3"> <mods ID="van-den-bogaert-etal-2020-ocr"> <titleInfo> <title>OCR, Classification& Machine Translation (OCCAM)</title> </titleInfo> <name type="personal"> <namePart type="given">Joachim</namePart> <namePart type="family">Van den Bogaert</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Arne</namePart> <namePart type="family">Defauw</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Frederic</namePart> <namePart type="family">Everaert</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Koen</namePart> <namePart type="family">Van Winckel</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Alina</namePart> <namePart type="family">Kramchaninova</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Anna</namePart> <namePart type="family">Bardadym</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tom</namePart> <namePart type="family">Vanallemeersch</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Pavel</namePart> <namePart type="family">Smrž</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Michal</namePart> <namePart type="family">Hradiš</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2020-11</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Proceedings of the 22nd Annual Conference of the European Association for Machine Translation</title> </titleInfo> <name type="personal"> <namePart type="given">André</namePart> <namePart type="family">Martins</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Helena</namePart> <namePart type="family">Moniz</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Sara</namePart> <namePart type="family">Fumega</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bruno</namePart> <namePart type="family">Martins</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Fernando</namePart> <namePart type="family">Batista</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Luisa</namePart> <namePart type="family">Coheur</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Carla</namePart> <namePart type="family">Parra</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Isabel</namePart> <namePart type="family">Trancoso</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Marco</namePart> <namePart type="family">Turchi</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Arianna</namePart> <namePart type="family">Bisazza</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Joss</namePart> <namePart type="family">Moorkens</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ana</namePart> <namePart type="family">Guerberof</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Mary</namePart> <namePart type="family">Nurminen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Lena</namePart> <namePart type="family">Marg</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Mikel</namePart> <namePart type="given">L</namePart> <namePart type="family">Forcada</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>European Association for Machine Translation</publisher> <place> <placeTerm type="text">Lisboa, Portugal</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <abstract>The OCCAM project (Optical Character recognition, ClassificAtion & Machine Translation) aims at integrating the CEF (Connecting Europe Facility) Automated Translation service with image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT). It will support the automated translation of scanned business documents (a document format that, currently, cannot be processed by the CEF eTranslation service) and will also lead to a tool useful for the Digital Humanities domain.</abstract> <identifier type="citekey">van-den-bogaert-etal-2020-ocr</identifier> <location> <url>https://aclanthology.org/2020.eamt-1.62</url> </location> <part> <date>2020-11</date> <extent unit="page"> <start>481</start> <end>482</end> </extent> </part> </mods> </modsCollection>
%0 Conference Proceedings %T OCR, Classification& Machine Translation (OCCAM) %A Van den Bogaert, Joachim %A Defauw, Arne %A Everaert, Frederic %A Van Winckel, Koen %A Kramchaninova, Alina %A Bardadym, Anna %A Vanallemeersch, Tom %A Smrž, Pavel %A Hradiš, Michal %Y Martins, André %Y Moniz, Helena %Y Fumega, Sara %Y Martins, Bruno %Y Batista, Fernando %Y Coheur, Luisa %Y Parra, Carla %Y Trancoso, Isabel %Y Turchi, Marco %Y Bisazza, Arianna %Y Moorkens, Joss %Y Guerberof, Ana %Y Nurminen, Mary %Y Marg, Lena %Y Forcada, Mikel L. %S Proceedings of the 22nd Annual Conference of the European Association for Machine Translation %D 2020 %8 November %I European Association for Machine Translation %C Lisboa, Portugal %F van-den-bogaert-etal-2020-ocr %X The OCCAM project (Optical Character recognition, ClassificAtion & Machine Translation) aims at integrating the CEF (Connecting Europe Facility) Automated Translation service with image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT). It will support the automated translation of scanned business documents (a document format that, currently, cannot be processed by the CEF eTranslation service) and will also lead to a tool useful for the Digital Humanities domain. %U https://aclanthology.org/2020.eamt-1.62 %P 481-482
Markdown (Informal)
[OCR, Classification& Machine Translation (OCCAM)](https://aclanthology.org/2020.eamt-1.62) (Van den Bogaert et al., EAMT 2020)
- OCR, Classification& Machine Translation (OCCAM) (Van den Bogaert et al., EAMT 2020)
ACL
- Joachim Van den Bogaert, Arne Defauw, Frederic Everaert, Koen Van Winckel, Alina Kramchaninova, Anna Bardadym, Tom Vanallemeersch, Pavel Smrž, and Michal Hradiš. 2020. OCR, Classification& Machine Translation (OCCAM). In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, pages 481–482, Lisboa, Portugal. European Association for Machine Translation.