Neural Speech Translation at AppTek

Evgeny Matusov; Patrick Wilken; Parnia Bahar; Julian Schamper; Pavel Golik; Albert Zeyer; Joan Albert Silvestre-Cerdà; Adrià Martínez-Villaronga; Hendrik Pesch; Jan-Thorsten Peter

Neural Speech Translation at AppTek

Evgeny Matusov, Patrick Wilken, Parnia Bahar, Julian Schamper, Pavel Golik, Albert Zeyer, Joan Albert Silvestre-Cerda, Adrià Martínez-Villaronga, Hendrik Pesch, Jan-Thorsten Peter

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

This work describes AppTek’s speech translation pipeline that includes strong state-of-the-art automatic speech recognition (ASR) and neural machine translation (NMT) components. We show how these components can be tightly coupled by encoding ASR confusion networks, as well as ASR-like noise adaptation, vocabulary normalization, and implicit punctuation prediction during translation. In another experimental setup, we propose a direct speech translation approach that can be scaled to translation tasks with large amounts of text-only parallel training data but a limited number of hours of recorded and human-translated speech.

Anthology ID:: 2018.iwslt-1.15
Volume:: Proceedings of the 15th International Conference on Spoken Language Translation
Month:: October 29-30
Year:: 2018
Address:: Brussels
Editors:: Marco Turchi, Jan Niehues, Marcello Frederico
Venue:: IWSLT
SIG:: SIGSLT
Publisher:: International Conference on Spoken Language Translation
Note:
Pages:: 104–111
Language:
URL:: https://aclanthology.org/2018.iwslt-1.15/
DOI:
Bibkey:
Cite (ACL):: Evgeny Matusov, Patrick Wilken, Parnia Bahar, Julian Schamper, Pavel Golik, Albert Zeyer, Joan Albert Silvestre-Cerda, Adrià Martínez-Villaronga, Hendrik Pesch, and Jan-Thorsten Peter. 2018. Neural Speech Translation at AppTek. In Proceedings of the 15th International Conference on Spoken Language Translation, pages 104–111, Brussels. International Conference on Spoken Language Translation.
Cite (Informal):: Neural Speech Translation at AppTek (Matusov et al., IWSLT 2018)
Copy Citation:
PDF:: https://aclanthology.org/2018.iwslt-1.15.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{matusov-etal-2018-neural,
    title = "Neural Speech Translation at {A}pp{T}ek",
    author = "Matusov, Evgeny  and
      Wilken, Patrick  and
      Bahar, Parnia  and
      Schamper, Julian  and
      Golik, Pavel  and
      Zeyer, Albert  and
      Silvestre-Cerda, Joan Albert  and
      Mart{\'i}nez-Villaronga, Adri{\`a}  and
      Pesch, Hendrik  and
      Peter, Jan-Thorsten",
    editor = "Turchi, Marco  and
      Niehues, Jan  and
      Frederico, Marcello",
    booktitle = "Proceedings of the 15th International Conference on Spoken Language Translation",
    month = oct # " 29-30",
    year = "2018",
    address = "Brussels",
    publisher = "International Conference on Spoken Language Translation",
    url = "https://aclanthology.org/2018.iwslt-1.15/",
    pages = "104--111",
    abstract = "This work describes AppTek{'}s speech translation pipeline that includes strong state-of-the-art automatic speech recognition (ASR) and neural machine translation (NMT) components. We show how these components can be tightly coupled by encoding ASR confusion networks, as well as ASR-like noise adaptation, vocabulary normalization, and implicit punctuation prediction during translation. In another experimental setup, we propose a direct speech translation approach that can be scaled to translation tasks with large amounts of text-only parallel training data but a limited number of hours of recorded and human-translated speech."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="matusov-etal-2018-neural">
    <titleInfo>
        <title>Neural Speech Translation at AppTek</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Evgeny</namePart>
        <namePart type="family">Matusov</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Patrick</namePart>
        <namePart type="family">Wilken</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Parnia</namePart>
        <namePart type="family">Bahar</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Julian</namePart>
        <namePart type="family">Schamper</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Pavel</namePart>
        <namePart type="family">Golik</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Albert</namePart>
        <namePart type="family">Zeyer</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Joan</namePart>
        <namePart type="given">Albert</namePart>
        <namePart type="family">Silvestre-Cerda</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Adrià</namePart>
        <namePart type="family">Martínez-Villaronga</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Hendrik</namePart>
        <namePart type="family">Pesch</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Jan-Thorsten</namePart>
        <namePart type="family">Peter</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2018-oct 29-30</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 15th International Conference on Spoken Language Translation</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Marco</namePart>
            <namePart type="family">Turchi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Jan</namePart>
            <namePart type="family">Niehues</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Marcello</namePart>
            <namePart type="family">Frederico</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>International Conference on Spoken Language Translation</publisher>
            <place>
                <placeTerm type="text">Brussels</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>This work describes AppTek’s speech translation pipeline that includes strong state-of-the-art automatic speech recognition (ASR) and neural machine translation (NMT) components. We show how these components can be tightly coupled by encoding ASR confusion networks, as well as ASR-like noise adaptation, vocabulary normalization, and implicit punctuation prediction during translation. In another experimental setup, we propose a direct speech translation approach that can be scaled to translation tasks with large amounts of text-only parallel training data but a limited number of hours of recorded and human-translated speech.</abstract>
    <identifier type="citekey">matusov-etal-2018-neural</identifier>
    <location>
        <url>https://aclanthology.org/2018.iwslt-1.15/</url>
    </location>
    <part>
        <date>2018-oct 29-30</date>
        <extent unit="page">
            <start>104</start>
            <end>111</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Neural Speech Translation at AppTek
%A Matusov, Evgeny
%A Wilken, Patrick
%A Bahar, Parnia
%A Schamper, Julian
%A Golik, Pavel
%A Zeyer, Albert
%A Silvestre-Cerda, Joan Albert
%A Martínez-Villaronga, Adrià
%A Pesch, Hendrik
%A Peter, Jan-Thorsten
%Y Turchi, Marco
%Y Niehues, Jan
%Y Frederico, Marcello
%S Proceedings of the 15th International Conference on Spoken Language Translation
%D 2018
%8 oct 29 30
%I International Conference on Spoken Language Translation
%C Brussels
%F matusov-etal-2018-neural
%X This work describes AppTek’s speech translation pipeline that includes strong state-of-the-art automatic speech recognition (ASR) and neural machine translation (NMT) components. We show how these components can be tightly coupled by encoding ASR confusion networks, as well as ASR-like noise adaptation, vocabulary normalization, and implicit punctuation prediction during translation. In another experimental setup, we propose a direct speech translation approach that can be scaled to translation tasks with large amounts of text-only parallel training data but a limited number of hours of recorded and human-translated speech.
%U https://aclanthology.org/2018.iwslt-1.15/
%P 104-111

Download as File

Markdown (Informal)

[Neural Speech Translation at AppTek](https://aclanthology.org/2018.iwslt-1.15/) (Matusov et al., IWSLT 2018)

Neural Speech Translation at AppTek (Matusov et al., IWSLT 2018)

ACL

Evgeny Matusov, Patrick Wilken, Parnia Bahar, Julian Schamper, Pavel Golik, Albert Zeyer, Joan Albert Silvestre-Cerda, Adrià Martínez-Villaronga, Hendrik Pesch, and Jan-Thorsten Peter. 2018. Neural Speech Translation at AppTek. In Proceedings of the 15th International Conference on Spoken Language Translation, pages 104–111, Brussels. International Conference on Spoken Language Translation.