Prosody Labelled Dataset for Hindi

Esha Banerjee; Atul Kr. Ojha; Girish Nath Jha

Prosody Labelled Dataset for Hindi

Esha Banerjee, Atul Kr. Ojha, Girish Jha

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

This study aims to develop an intonation labelled database for Hindi, for enhancing prosody in ASR and TTS systems, which is also helpful for building Speech to Speech Machine Translation systems. Although no single standard for prosody labelling exists in Hindi, researchers in the past have employed perceptual and statistical methods in literature to draw inferences about the behaviour of prosody patterns in Hindi. Based on such existing research and largely agreed upon intonational theories in Hindi, this study attempts to develop a manually annotated prosodic corpus of Hindi speech data, which can be used for training speech models for natural-sounding speech in the future. 500 sentences (2,550 words) for declarative and interrogative types have been labelled using Praat.

Anthology ID:: 2021.smp-1.2
Volume:: Proceedings of the Workshop on Speech and Music Processing 2021
Month:: December
Year:: 2021
Address:: NIT Silchar, India
Editors:: Anupam Biswas, Rabul Hussain Laskar, Pinki Roy
Venue:: SMP
SIG:
Publisher:: NLP Association of India (NLPAI)
Note:
Pages:: 14–19
Language:
URL:: https://aclanthology.org/2021.smp-1.2/
DOI:
Bibkey:
Cite (ACL):: Esha Banerjee, Atul Kr. Ojha, and Girish Jha. 2021. Prosody Labelled Dataset for Hindi. In Proceedings of the Workshop on Speech and Music Processing 2021, pages 14–19, NIT Silchar, India. NLP Association of India (NLPAI).
Cite (Informal):: Prosody Labelled Dataset for Hindi (Banerjee et al., SMP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.smp-1.2.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{banerjee-etal-2021-prosody-labelled,
    title = "Prosody Labelled Dataset for {H}indi",
    author = "Banerjee, Esha  and
      Ojha, Atul Kr.  and
      Jha, Girish",
    editor = "Biswas, Anupam  and
      Laskar, Rabul Hussain  and
      Roy, Pinki",
    booktitle = "Proceedings of the Workshop on Speech and Music Processing 2021",
    month = dec,
    year = "2021",
    address = "NIT Silchar, India",
    publisher = "NLP Association of India (NLPAI)",
    url = "https://aclanthology.org/2021.smp-1.2/",
    pages = "14--19",
    abstract = "This study aims to develop an intonation labelled database for Hindi, for enhancing prosody in ASR and TTS systems, which is also helpful for building Speech to Speech Machine Translation systems. Although no single standard for prosody labelling exists in Hindi, researchers in the past have employed perceptual and statistical methods in literature to draw inferences about the behaviour of prosody patterns in Hindi. Based on such existing research and largely agreed upon intonational theories in Hindi, this study attempts to develop a manually annotated prosodic corpus of Hindi speech data, which can be used for training speech models for natural-sounding speech in the future. 500 sentences (2,550 words) for declarative and interrogative types have been labelled using Praat."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="banerjee-etal-2021-prosody-labelled">
    <titleInfo>
        <title>Prosody Labelled Dataset for Hindi</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Esha</namePart>
        <namePart type="family">Banerjee</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Atul</namePart>
        <namePart type="given">Kr.</namePart>
        <namePart type="family">Ojha</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Girish</namePart>
        <namePart type="family">Jha</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2021-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Workshop on Speech and Music Processing 2021</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Anupam</namePart>
            <namePart type="family">Biswas</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Rabul</namePart>
            <namePart type="given">Hussain</namePart>
            <namePart type="family">Laskar</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Pinki</namePart>
            <namePart type="family">Roy</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>NLP Association of India (NLPAI)</publisher>
            <place>
                <placeTerm type="text">NIT Silchar, India</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>This study aims to develop an intonation labelled database for Hindi, for enhancing prosody in ASR and TTS systems, which is also helpful for building Speech to Speech Machine Translation systems. Although no single standard for prosody labelling exists in Hindi, researchers in the past have employed perceptual and statistical methods in literature to draw inferences about the behaviour of prosody patterns in Hindi. Based on such existing research and largely agreed upon intonational theories in Hindi, this study attempts to develop a manually annotated prosodic corpus of Hindi speech data, which can be used for training speech models for natural-sounding speech in the future. 500 sentences (2,550 words) for declarative and interrogative types have been labelled using Praat.</abstract>
    <identifier type="citekey">banerjee-etal-2021-prosody-labelled</identifier>
    <location>
        <url>https://aclanthology.org/2021.smp-1.2/</url>
    </location>
    <part>
        <date>2021-12</date>
        <extent unit="page">
            <start>14</start>
            <end>19</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Prosody Labelled Dataset for Hindi
%A Banerjee, Esha
%A Ojha, Atul Kr.
%A Jha, Girish
%Y Biswas, Anupam
%Y Laskar, Rabul Hussain
%Y Roy, Pinki
%S Proceedings of the Workshop on Speech and Music Processing 2021
%D 2021
%8 December
%I NLP Association of India (NLPAI)
%C NIT Silchar, India
%F banerjee-etal-2021-prosody-labelled
%X This study aims to develop an intonation labelled database for Hindi, for enhancing prosody in ASR and TTS systems, which is also helpful for building Speech to Speech Machine Translation systems. Although no single standard for prosody labelling exists in Hindi, researchers in the past have employed perceptual and statistical methods in literature to draw inferences about the behaviour of prosody patterns in Hindi. Based on such existing research and largely agreed upon intonational theories in Hindi, this study attempts to develop a manually annotated prosodic corpus of Hindi speech data, which can be used for training speech models for natural-sounding speech in the future. 500 sentences (2,550 words) for declarative and interrogative types have been labelled using Praat.
%U https://aclanthology.org/2021.smp-1.2/
%P 14-19

Download as File

Markdown (Informal)

[Prosody Labelled Dataset for Hindi](https://aclanthology.org/2021.smp-1.2/) (Banerjee et al., SMP 2021)

Prosody Labelled Dataset for Hindi (Banerjee et al., SMP 2021)

ACL

Esha Banerjee, Atul Kr. Ojha, and Girish Jha. 2021. Prosody Labelled Dataset for Hindi. In Proceedings of the Workshop on Speech and Music Processing 2021, pages 14–19, NIT Silchar, India. NLP Association of India (NLPAI).