Character Set Construction for Chinese Language Learning

Chak Yan Yeung; John S. Y. Lee

Character Set Construction for Chinese Language Learning

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

To promote efficient learning of Chinese characters, pedagogical materials may present not only a single character, but a set of characters that are related in meaning and in written form. This paper investigates automatic construction of these character sets. The proposed model represents a character as averaged word vectors of common words containing the character. It then identifies sets of characters with high semantic similarity through clustering. Human evaluation shows that this representation outperforms direct use of character embeddings, and that the resulting character sets capture distinct semantic ranges.

Anthology ID:: 2021.bea-1.6
Volume:: Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications
Month:: April
Year:: 2021
Address:: Online
Editors:: Jill Burstein, Andrea Horbach, Ekaterina Kochmar, Ronja Laarmann-Quante, Claudia Leacock, Nitin Madnani, Ildikó Pilán, Helen Yannakoudakis, Torsten Zesch
Venue:: BEA
SIG:: SIGEDU
Publisher:: Association for Computational Linguistics
Note:
Pages:: 59–63
Language:
URL:: https://aclanthology.org/2021.bea-1.6/
DOI:
Bibkey:
Cite (ACL):: Chak Yan Yeung and John Lee. 2021. Character Set Construction for Chinese Language Learning. In Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications, pages 59–63, Online. Association for Computational Linguistics.
Cite (Informal):: Character Set Construction for Chinese Language Learning (Yeung & Lee, BEA 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.bea-1.6.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{yeung-lee-2021-character,
    title = "Character Set Construction for {C}hinese Language Learning",
    author = "Yeung, Chak Yan  and
      Lee, John",
    editor = "Burstein, Jill  and
      Horbach, Andrea  and
      Kochmar, Ekaterina  and
      Laarmann-Quante, Ronja  and
      Leacock, Claudia  and
      Madnani, Nitin  and
      Pil{\'a}n, Ildik{\'o}  and
      Yannakoudakis, Helen  and
      Zesch, Torsten",
    booktitle = "Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications",
    month = apr,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.bea-1.6/",
    pages = "59--63",
    abstract = "To promote efficient learning of Chinese characters, pedagogical materials may present not only a single character, but a set of characters that are related in meaning and in written form. This paper investigates automatic construction of these character sets. The proposed model represents a character as averaged word vectors of common words containing the character. It then identifies sets of characters with high semantic similarity through clustering. Human evaluation shows that this representation outperforms direct use of character embeddings, and that the resulting character sets capture distinct semantic ranges."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="yeung-lee-2021-character">
    <titleInfo>
        <title>Character Set Construction for Chinese Language Learning</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Chak</namePart>
        <namePart type="given">Yan</namePart>
        <namePart type="family">Yeung</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">John</namePart>
        <namePart type="family">Lee</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2021-04</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Jill</namePart>
            <namePart type="family">Burstein</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Andrea</namePart>
            <namePart type="family">Horbach</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ekaterina</namePart>
            <namePart type="family">Kochmar</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ronja</namePart>
            <namePart type="family">Laarmann-Quante</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Claudia</namePart>
            <namePart type="family">Leacock</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Nitin</namePart>
            <namePart type="family">Madnani</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ildikó</namePart>
            <namePart type="family">Pilán</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Helen</namePart>
            <namePart type="family">Yannakoudakis</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Torsten</namePart>
            <namePart type="family">Zesch</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>To promote efficient learning of Chinese characters, pedagogical materials may present not only a single character, but a set of characters that are related in meaning and in written form. This paper investigates automatic construction of these character sets. The proposed model represents a character as averaged word vectors of common words containing the character. It then identifies sets of characters with high semantic similarity through clustering. Human evaluation shows that this representation outperforms direct use of character embeddings, and that the resulting character sets capture distinct semantic ranges.</abstract>
    <identifier type="citekey">yeung-lee-2021-character</identifier>
    <location>
        <url>https://aclanthology.org/2021.bea-1.6/</url>
    </location>
    <part>
        <date>2021-04</date>
        <extent unit="page">
            <start>59</start>
            <end>63</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Character Set Construction for Chinese Language Learning
%A Yeung, Chak Yan
%A Lee, John
%Y Burstein, Jill
%Y Horbach, Andrea
%Y Kochmar, Ekaterina
%Y Laarmann-Quante, Ronja
%Y Leacock, Claudia
%Y Madnani, Nitin
%Y Pilán, Ildikó
%Y Yannakoudakis, Helen
%Y Zesch, Torsten
%S Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications
%D 2021
%8 April
%I Association for Computational Linguistics
%C Online
%F yeung-lee-2021-character
%X To promote efficient learning of Chinese characters, pedagogical materials may present not only a single character, but a set of characters that are related in meaning and in written form. This paper investigates automatic construction of these character sets. The proposed model represents a character as averaged word vectors of common words containing the character. It then identifies sets of characters with high semantic similarity through clustering. Human evaluation shows that this representation outperforms direct use of character embeddings, and that the resulting character sets capture distinct semantic ranges.
%U https://aclanthology.org/2021.bea-1.6/
%P 59-63

Download as File

Markdown (Informal)

[Character Set Construction for Chinese Language Learning](https://aclanthology.org/2021.bea-1.6/) (Yeung & Lee, BEA 2021)

Character Set Construction for Chinese Language Learning (Yeung & Lee, BEA 2021)

ACL

Chak Yan Yeung and John Lee. 2021. Character Set Construction for Chinese Language Learning. In Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications, pages 59–63, Online. Association for Computational Linguistics.