Contrastive Loss is All You Need to Recover Analogies as Parallel Lines

Narutatsu Ri, Fei-Tzin Lee, Nakul Verma


Abstract
While static word embedding models are known to represent linguistic analogies as parallel lines in high-dimensional space, the underlying mechanism that gives rise to these geometric structures remains obscure. We find that an elementary contrastive-style method applied to distributional information performs competitively with popular word embedding models on analogy recovery tasks, while achieving dramatic speedups in training time. Further, we demonstrate that a contrastive loss is sufficient to create these parallel structures in word embeddings, and we establish a precise relationship between the co-occurrence statistics and the geometric structure of the resulting word embeddings.
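The parallel-lines phenomenon the abstract refers to can be illustrated with the standard vector-offset (3CosAdd) analogy test: for "man : woman :: king : queen", the offsets woman − man and queen − king should be nearly parallel, and the fourth word is recovered by a nearest-neighbor search around king − man + woman. Below is a minimal sketch of that test in Python; the vocabulary, dimensionality, and noise level are illustrative assumptions, not embeddings produced by the paper's method.

    # Minimal sketch of the vector-offset (3CosAdd) analogy test.
    # The embeddings are toy placeholders, not outputs of the paper's method.
    import numpy as np

    rng = np.random.default_rng(0)
    dim = 5

    # Hypothetical embeddings: a shared "gender" offset and a shared "royalty"
    # offset make (queen - king) roughly parallel to (woman - man).
    base = rng.normal(size=dim)
    gender = rng.normal(size=dim)
    royalty = rng.normal(size=dim)
    noise = lambda: 0.05 * rng.normal(size=dim)
    emb = {
        "man": base + noise(),
        "woman": base + gender + noise(),
        "king": base + royalty + noise(),
        "queen": base + royalty + gender + noise(),
    }

    def cos(u, v):
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

    # Parallelism of the two analogy offsets (close to 1.0 for parallel lines).
    print("offset cosine:", cos(emb["woman"] - emb["man"], emb["queen"] - emb["king"]))

    # 3CosAdd: predict the word completing "man : woman :: king : ?".
    query = emb["king"] - emb["man"] + emb["woman"]
    scores = {w: cos(query, v) for w, v in emb.items() if w not in {"man", "woman", "king"}}
    print("predicted:", max(scores, key=scores.get))  # expected: "queen"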
Anthology ID: 2023.repl4nlp-1.14
Volume: Proceedings of the 8th Workshop on Representation Learning for NLP (RepL4NLP 2023)
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Burcu Can, Maximilian Mozes, Samuel Cahyawijaya, Naomi Saphra, Nora Kassner, Shauli Ravfogel, Abhilasha Ravichander, Chen Zhao, Isabelle Augenstein, Anna Rogers, Kyunghyun Cho, Edward Grefenstette, Lena Voita
Venue: RepL4NLP
Publisher: Association for Computational Linguistics
Pages: 164–173
URL: https://aclanthology.org/2023.repl4nlp-1.14
DOI: 10.18653/v1/2023.repl4nlp-1.14
Cite (ACL): Narutatsu Ri, Fei-Tzin Lee, and Nakul Verma. 2023. Contrastive Loss is All You Need to Recover Analogies as Parallel Lines. In Proceedings of the 8th Workshop on Representation Learning for NLP (RepL4NLP 2023), pages 164–173, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): Contrastive Loss is All You Need to Recover Analogies as Parallel Lines (Ri et al., RepL4NLP 2023)
PDF: https://aclanthology.org/2023.repl4nlp-1.14.pdf