oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes

Daniel Campos; Alexandre Marques; Mark Kurtz; Cheng Xiang Zhai

doi:10.18653/v1/2023.sustainlp-1.3

Anthology ID: 2023.sustainlp-1.3
Volume: Proceedings of the Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)
Month: July
Year: 2023
Address: Toronto, Canada (Hybrid)
Editors: Nafise Sadat Moosavi, Iryna Gurevych, Yufang Hou, Gyuwan Kim, Young Jin Kim, Tal Schuster, Ameeta Agrawal
Venue: sustainlp
Publisher: Association for Computational Linguistics
Pages: 39–58
URL: https://aclanthology.org/2023.sustainlp-1.3/
DOI: 10.18653/v1/2023.sustainlp-1.3
Bibkey: campos-etal-2023-oberta
Cite (ACL): Daniel Campos, Alexandre Marques, Mark Kurtz, and Cheng Xiang Zhai. 2023. oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes. In Proceedings of the Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP), pages 39–58, Toronto, Canada (Hybrid). Association for Computational Linguistics.
Cite (Informal): oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes (Campos et al., sustainlp 2023)
PDF: https://aclanthology.org/2023.sustainlp-1.3.pdf
Video: https://aclanthology.org/2023.sustainlp-1.3.mp4

Export citation

BibTeX

@inproceedings{campos-etal-2023-oberta,
    title = "o{BERT}a: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes",
    author = "Campos, Daniel  and
      Marques, Alexandre  and
      Kurtz, Mark  and
      Zhai, Cheng Xiang",
    editor = "Moosavi, Nafise Sadat  and
      Gurevych, Iryna  and
      Hou, Yufang  and
      Kim, Gyuwan  and
      Kim, Young Jin  and
      Schuster, Tal  and
      Agrawal, Ameeta",
    booktitle = "Proceedings of the Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)",
    month = jul,
    year = "2023",
    address = "Toronto, Canada (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.sustainlp-1.3/",
    doi = "10.18653/v1/2023.sustainlp-1.3",
    pages = "39--58"
}

MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="campos-etal-2023-oberta">
    <titleInfo>
        <title>oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Daniel</namePart>
        <namePart type="family">Campos</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Alexandre</namePart>
        <namePart type="family">Marques</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Mark</namePart>
        <namePart type="family">Kurtz</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Cheng</namePart>
        <namePart type="given">Xiang</namePart>
        <namePart type="family">Zhai</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2023-07</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Nafise</namePart>
            <namePart type="given">Sadat</namePart>
            <namePart type="family">Moosavi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Iryna</namePart>
            <namePart type="family">Gurevych</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Yufang</namePart>
            <namePart type="family">Hou</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Gyuwan</namePart>
            <namePart type="family">Kim</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Young</namePart>
            <namePart type="given">Jin</namePart>
            <namePart type="family">Kim</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Tal</namePart>
            <namePart type="family">Schuster</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ameeta</namePart>
            <namePart type="family">Agrawal</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Toronto, Canada (Hybrid)</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <identifier type="citekey">campos-etal-2023-oberta</identifier>
    <identifier type="doi">10.18653/v1/2023.sustainlp-1.3</identifier>
    <location>
        <url>https://aclanthology.org/2023.sustainlp-1.3/</url>
    </location>
    <part>
        <date>2023-07</date>
        <extent unit="page">
            <start>39</start>
            <end>58</end>
        </extent>
    </part>
</mods>
</modsCollection>

Endnote

%0 Conference Proceedings
%T oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes
%A Campos, Daniel
%A Marques, Alexandre
%A Kurtz, Mark
%A Zhai, Cheng Xiang
%Y Moosavi, Nafise Sadat
%Y Gurevych, Iryna
%Y Hou, Yufang
%Y Kim, Gyuwan
%Y Kim, Young Jin
%Y Schuster, Tal
%Y Agrawal, Ameeta
%S Proceedings of the Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)
%D 2023
%8 July
%I Association for Computational Linguistics
%C Toronto, Canada (Hybrid)
%F campos-etal-2023-oberta
%R 10.18653/v1/2023.sustainlp-1.3
%U https://aclanthology.org/2023.sustainlp-1.3/
%U https://doi.org/10.18653/v1/2023.sustainlp-1.3
%P 39-58

Markdown (Informal)

[oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes](https://aclanthology.org/2023.sustainlp-1.3/) (Campos et al., sustainlp 2023)
