The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection

Sondre Wold

The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

This paper studies the problem of injecting factual knowledge into large pre-trained language models. We train adapter modules on parts of the ConceptNet knowledge graph using the masked language modeling objective and evaluate the success of the method by a series of probing experiments on the LAMA probe. Mean P@K curves for different configurations indicate that the technique is effective, increasing the performance on sub-sets of the LAMA probe for large values of k by adding as little as 2.1% additional parameters to the original models.

Anthology ID:: 2022.textgraphs-1.6
Volume:: Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing
Month:: October
Year:: 2022
Address:: Gyeongju, Republic of Korea
Editors:: Dmitry Ustalov, Yanjun Gao, Alexander Panchenko, Marco Valentino, Mokanarangan Thayaparan, Thien Huu Nguyen, Gerald Penn, Arti Ramesh, Abhik Jana
Venue:: TextGraphs
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 54–59
Language:
URL:: https://aclanthology.org/2022.textgraphs-1.6/
DOI:
Bibkey:
Cite (ACL):: Sondre Wold. 2022. The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection. In Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, pages 54–59, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):: The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection (Wold, TextGraphs 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.textgraphs-1.6.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{wold-2022-effectiveness,
    title = "The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection",
    author = "Wold, Sondre",
    editor = "Ustalov, Dmitry  and
      Gao, Yanjun  and
      Panchenko, Alexander  and
      Valentino, Marco  and
      Thayaparan, Mokanarangan  and
      Nguyen, Thien Huu  and
      Penn, Gerald  and
      Ramesh, Arti  and
      Jana, Abhik",
    booktitle = "Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.textgraphs-1.6/",
    pages = "54--59",
    abstract = "This paper studies the problem of injecting factual knowledge into large pre-trained language models. We train adapter modules on parts of the ConceptNet knowledge graph using the masked language modeling objective and evaluate the success of the method by a series of probing experiments on the LAMA probe. Mean P@K curves for different configurations indicate that the technique is effective, increasing the performance on sub-sets of the LAMA probe for large values of k by adding as little as 2.1{\%} additional parameters to the original models."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="wold-2022-effectiveness">
    <titleInfo>
        <title>The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Sondre</namePart>
        <namePart type="family">Wold</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2022-10</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Dmitry</namePart>
            <namePart type="family">Ustalov</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Yanjun</namePart>
            <namePart type="family">Gao</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Alexander</namePart>
            <namePart type="family">Panchenko</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Marco</namePart>
            <namePart type="family">Valentino</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Mokanarangan</namePart>
            <namePart type="family">Thayaparan</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Thien</namePart>
            <namePart type="given">Huu</namePart>
            <namePart type="family">Nguyen</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Gerald</namePart>
            <namePart type="family">Penn</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Arti</namePart>
            <namePart type="family">Ramesh</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Abhik</namePart>
            <namePart type="family">Jana</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Gyeongju, Republic of Korea</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>This paper studies the problem of injecting factual knowledge into large pre-trained language models. We train adapter modules on parts of the ConceptNet knowledge graph using the masked language modeling objective and evaluate the success of the method by a series of probing experiments on the LAMA probe. Mean P@K curves for different configurations indicate that the technique is effective, increasing the performance on sub-sets of the LAMA probe for large values of k by adding as little as 2.1% additional parameters to the original models.</abstract>
    <identifier type="citekey">wold-2022-effectiveness</identifier>
    <location>
        <url>https://aclanthology.org/2022.textgraphs-1.6/</url>
    </location>
    <part>
        <date>2022-10</date>
        <extent unit="page">
            <start>54</start>
            <end>59</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection
%A Wold, Sondre
%Y Ustalov, Dmitry
%Y Gao, Yanjun
%Y Panchenko, Alexander
%Y Valentino, Marco
%Y Thayaparan, Mokanarangan
%Y Nguyen, Thien Huu
%Y Penn, Gerald
%Y Ramesh, Arti
%Y Jana, Abhik
%S Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing
%D 2022
%8 October
%I Association for Computational Linguistics
%C Gyeongju, Republic of Korea
%F wold-2022-effectiveness
%X This paper studies the problem of injecting factual knowledge into large pre-trained language models. We train adapter modules on parts of the ConceptNet knowledge graph using the masked language modeling objective and evaluate the success of the method by a series of probing experiments on the LAMA probe. Mean P@K curves for different configurations indicate that the technique is effective, increasing the performance on sub-sets of the LAMA probe for large values of k by adding as little as 2.1% additional parameters to the original models.
%U https://aclanthology.org/2022.textgraphs-1.6/
%P 54-59

Download as File

Markdown (Informal)

[The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection](https://aclanthology.org/2022.textgraphs-1.6/) (Wold, TextGraphs 2022)

The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection (Wold, TextGraphs 2022)

ACL

Sondre Wold. 2022. The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection. In Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, pages 54–59, Gyeongju, Republic of Korea. Association for Computational Linguistics.