BERT Goes Shopping: Comparing Distributional Models for Product Representations

Federico Bianchi; Bingqing Yu; Jacopo Tagliabue

doi:10.18653/v1/2021.ecnlp-1.1

BERT Goes Shopping: Comparing Distributional Models for Product Representations

Federico Bianchi, Bingqing Yu, Jacopo Tagliabue

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model - Prod2BERT - is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of Prod2BERT and prod2vec embeddings: while Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.

Anthology ID:: 2021.ecnlp-1.1
Volume:: Proceedings of the 4th Workshop on e-Commerce and NLP
Month:: August
Year:: 2021
Address:: Online
Editors:: Shervin Malmasi, Surya Kallumadi, Nicola Ueffing, Oleg Rokhlenko, Eugene Agichtein, Ido Guy
Venue:: ECNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1–12
Language:
URL:: https://aclanthology.org/2021.ecnlp-1.1/
DOI:: 10.18653/v1/2021.ecnlp-1.1
Bibkey:
Cite (ACL):: Federico Bianchi, Bingqing Yu, and Jacopo Tagliabue. 2021. BERT Goes Shopping: Comparing Distributional Models for Product Representations. In Proceedings of the 4th Workshop on e-Commerce and NLP, pages 1–12, Online. Association for Computational Linguistics.
Cite (Informal):: BERT Goes Shopping: Comparing Distributional Models for Product Representations (Bianchi et al., ECNLP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.ecnlp-1.1.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{tagliabue-etal-2021-bert,
    title = "{BERT} Goes Shopping: Comparing Distributional Models for Product Representations",
    author = "Bianchi, Federico  and
      Yu, Bingqing  and
      Tagliabue, Jacopo",
    editor = "Malmasi, Shervin  and
      Kallumadi, Surya  and
      Ueffing, Nicola  and
      Rokhlenko, Oleg  and
      Agichtein, Eugene  and
      Guy, Ido",
    booktitle = "Proceedings of the 4th Workshop on e-Commerce and NLP",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.ecnlp-1.1/",
    doi = "10.18653/v1/2021.ecnlp-1.1",
    pages = "1--12",
    abstract = "Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model - Prod2BERT - is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of Prod2BERT and prod2vec embeddings: while Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="tagliabue-etal-2021-bert">
    <titleInfo>
        <title>BERT Goes Shopping: Comparing Distributional Models for Product Representations</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Federico</namePart>
        <namePart type="family">Bianchi</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Bingqing</namePart>
        <namePart type="family">Yu</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Jacopo</namePart>
        <namePart type="family">Tagliabue</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2021-08</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 4th Workshop on e-Commerce and NLP</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Shervin</namePart>
            <namePart type="family">Malmasi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Surya</namePart>
            <namePart type="family">Kallumadi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Nicola</namePart>
            <namePart type="family">Ueffing</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Oleg</namePart>
            <namePart type="family">Rokhlenko</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Eugene</namePart>
            <namePart type="family">Agichtein</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ido</namePart>
            <namePart type="family">Guy</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model - Prod2BERT - is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of Prod2BERT and prod2vec embeddings: while Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.</abstract>
    <identifier type="citekey">tagliabue-etal-2021-bert</identifier>
    <identifier type="doi">10.18653/v1/2021.ecnlp-1.1</identifier>
    <location>
        <url>https://aclanthology.org/2021.ecnlp-1.1/</url>
    </location>
    <part>
        <date>2021-08</date>
        <extent unit="page">
            <start>1</start>
            <end>12</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T BERT Goes Shopping: Comparing Distributional Models for Product Representations
%A Bianchi, Federico
%A Yu, Bingqing
%A Tagliabue, Jacopo
%Y Malmasi, Shervin
%Y Kallumadi, Surya
%Y Ueffing, Nicola
%Y Rokhlenko, Oleg
%Y Agichtein, Eugene
%Y Guy, Ido
%S Proceedings of the 4th Workshop on e-Commerce and NLP
%D 2021
%8 August
%I Association for Computational Linguistics
%C Online
%F tagliabue-etal-2021-bert
%X Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model - Prod2BERT - is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of Prod2BERT and prod2vec embeddings: while Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.
%R 10.18653/v1/2021.ecnlp-1.1
%U https://aclanthology.org/2021.ecnlp-1.1/
%U https://doi.org/10.18653/v1/2021.ecnlp-1.1
%P 1-12

Download as File

Markdown (Informal)

[BERT Goes Shopping: Comparing Distributional Models for Product Representations](https://aclanthology.org/2021.ecnlp-1.1/) (Bianchi et al., ECNLP 2021)

BERT Goes Shopping: Comparing Distributional Models for Product Representations (Bianchi et al., ECNLP 2021)

ACL

Federico Bianchi, Bingqing Yu, and Jacopo Tagliabue. 2021. BERT Goes Shopping: Comparing Distributional Models for Product Representations. In Proceedings of the 4th Workshop on e-Commerce and NLP, pages 1–12, Online. Association for Computational Linguistics.