MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning

Aina Garí Soler; Marianna Apidianaki

doi:10.18653/v1/2020.semeval-1.18

MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

We present the MULTISEM systems submitted to SemEval 2020 Task 3: Graded Word Similarity in Context (GWSC). We experiment with injecting semantic knowledge into pre-trained BERT models through fine-tuning on lexical semantic tasks related to GWSC. We use existing semantically annotated datasets, and propose to approximate similarity through automatically generated lexical substitutes in context. We participate in both GWSC subtasks and address two languages, English and Finnish. Our best English models occupy the third and fourth positions in the ranking for the two subtasks. Performance is lower for the Finnish models which are mid-ranked in the respective subtasks, highlighting the important role of data availability for fine-tuning.

Anthology ID:: 2020.semeval-1.18
Volume:: Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:: December
Year:: 2020
Address:: Barcelona (online)
Editors:: Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:: SemEval
SIG:: SIGLEX
Publisher:: International Committee for Computational Linguistics
Note:
Pages:: 158–165
Language:
URL:: https://aclanthology.org/2020.semeval-1.18/
DOI:: 10.18653/v1/2020.semeval-1.18
Bibkey:
Cite (ACL):: Aina Garí Soler and Marianna Apidianaki. 2020. MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 158–165, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):: MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning (Garí Soler & Apidianaki, SemEval 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.semeval-1.18.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{gari-soler-apidianaki-2020-multisem,
    title = "{MULTISEM} at {S}em{E}val-2020 Task 3: Fine-tuning {BERT} for Lexical Meaning",
    author = "Gar{\'i} Soler, Aina  and
      Apidianaki, Marianna",
    editor = "Herbelot, Aurelie  and
      Zhu, Xiaodan  and
      Palmer, Alexis  and
      Schneider, Nathan  and
      May, Jonathan  and
      Shutova, Ekaterina",
    booktitle = "Proceedings of the Fourteenth Workshop on Semantic Evaluation",
    month = dec,
    year = "2020",
    address = "Barcelona (online)",
    publisher = "International Committee for Computational Linguistics",
    url = "https://aclanthology.org/2020.semeval-1.18/",
    doi = "10.18653/v1/2020.semeval-1.18",
    pages = "158--165",
    abstract = "We present the MULTISEM systems submitted to SemEval 2020 Task 3: Graded Word Similarity in Context (GWSC). We experiment with injecting semantic knowledge into pre-trained BERT models through fine-tuning on lexical semantic tasks related to GWSC. We use existing semantically annotated datasets, and propose to approximate similarity through automatically generated lexical substitutes in context. We participate in both GWSC subtasks and address two languages, English and Finnish. Our best English models occupy the third and fourth positions in the ranking for the two subtasks. Performance is lower for the Finnish models which are mid-ranked in the respective subtasks, highlighting the important role of data availability for fine-tuning."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="gari-soler-apidianaki-2020-multisem">
    <titleInfo>
        <title>MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Aina</namePart>
        <namePart type="family">Garí Soler</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Marianna</namePart>
        <namePart type="family">Apidianaki</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2020-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Fourteenth Workshop on Semantic Evaluation</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Aurelie</namePart>
            <namePart type="family">Herbelot</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Xiaodan</namePart>
            <namePart type="family">Zhu</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Alexis</namePart>
            <namePart type="family">Palmer</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Nathan</namePart>
            <namePart type="family">Schneider</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Jonathan</namePart>
            <namePart type="family">May</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ekaterina</namePart>
            <namePart type="family">Shutova</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>International Committee for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Barcelona (online)</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>We present the MULTISEM systems submitted to SemEval 2020 Task 3: Graded Word Similarity in Context (GWSC). We experiment with injecting semantic knowledge into pre-trained BERT models through fine-tuning on lexical semantic tasks related to GWSC. We use existing semantically annotated datasets, and propose to approximate similarity through automatically generated lexical substitutes in context. We participate in both GWSC subtasks and address two languages, English and Finnish. Our best English models occupy the third and fourth positions in the ranking for the two subtasks. Performance is lower for the Finnish models which are mid-ranked in the respective subtasks, highlighting the important role of data availability for fine-tuning.</abstract>
    <identifier type="citekey">gari-soler-apidianaki-2020-multisem</identifier>
    <identifier type="doi">10.18653/v1/2020.semeval-1.18</identifier>
    <location>
        <url>https://aclanthology.org/2020.semeval-1.18/</url>
    </location>
    <part>
        <date>2020-12</date>
        <extent unit="page">
            <start>158</start>
            <end>165</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
%A Garí Soler, Aina
%A Apidianaki, Marianna
%Y Herbelot, Aurelie
%Y Zhu, Xiaodan
%Y Palmer, Alexis
%Y Schneider, Nathan
%Y May, Jonathan
%Y Shutova, Ekaterina
%S Proceedings of the Fourteenth Workshop on Semantic Evaluation
%D 2020
%8 December
%I International Committee for Computational Linguistics
%C Barcelona (online)
%F gari-soler-apidianaki-2020-multisem
%X We present the MULTISEM systems submitted to SemEval 2020 Task 3: Graded Word Similarity in Context (GWSC). We experiment with injecting semantic knowledge into pre-trained BERT models through fine-tuning on lexical semantic tasks related to GWSC. We use existing semantically annotated datasets, and propose to approximate similarity through automatically generated lexical substitutes in context. We participate in both GWSC subtasks and address two languages, English and Finnish. Our best English models occupy the third and fourth positions in the ranking for the two subtasks. Performance is lower for the Finnish models which are mid-ranked in the respective subtasks, highlighting the important role of data availability for fine-tuning.
%R 10.18653/v1/2020.semeval-1.18
%U https://aclanthology.org/2020.semeval-1.18/
%U https://doi.org/10.18653/v1/2020.semeval-1.18
%P 158-165

Download as File

Markdown (Informal)

[MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning](https://aclanthology.org/2020.semeval-1.18/) (Garí Soler & Apidianaki, SemEval 2020)

MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning (Garí Soler & Apidianaki, SemEval 2020)

ACL

Aina Garí Soler and Marianna Apidianaki. 2020. MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 158–165, Barcelona (online). International Committee for Computational Linguistics.