SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models

Flor Miriam Plaza-del-Arco; M. Dolores Molina-González; L. Alfonso Urena Lopez; M. Teresa Martín-Valdivia

doi:10.18653/v1/2020.semeval-1.211

SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models

Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, Maite Martin

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

This paper describes the participation of SINAI team at Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, the participation in Sub-task A in English which consists of identifying tweets as offensive or not offensive. We preprocess the dataset according to the language characteristics used on social media. Then, we select a small set from the training set provided by the organizers and fine-tune different Transformerbased models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Subtask-A using the XLNet model.

Anthology ID:: 2020.semeval-1.211
Volume:: Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:: December
Year:: 2020
Address:: Barcelona (online)
Editors:: Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:: SemEval
SIG:: SIGLEX
Publisher:: International Committee for Computational Linguistics
Note:
Pages:: 1622–1627
Language:
URL:: https://aclanthology.org/2020.semeval-1.211/
DOI:: 10.18653/v1/2020.semeval-1.211
Bibkey:
Cite (ACL):: Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, and Maite Martin. 2020. SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1622–1627, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):: SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models (Plaza del Arco et al., SemEval 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.semeval-1.211.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{plaza-del-arco-etal-2020-sinai,
    title = "{SINAI} at {S}em{E}val-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models",
    author = "Plaza del Arco, Flor Miriam  and
      Molina Gonz{\'a}lez, M. Dolores  and
      Ure{\~n}a-L{\'o}pez, Alfonso  and
      Martin, Maite",
    editor = "Herbelot, Aurelie  and
      Zhu, Xiaodan  and
      Palmer, Alexis  and
      Schneider, Nathan  and
      May, Jonathan  and
      Shutova, Ekaterina",
    booktitle = "Proceedings of the Fourteenth Workshop on Semantic Evaluation",
    month = dec,
    year = "2020",
    address = "Barcelona (online)",
    publisher = "International Committee for Computational Linguistics",
    url = "https://aclanthology.org/2020.semeval-1.211/",
    doi = "10.18653/v1/2020.semeval-1.211",
    pages = "1622--1627",
    abstract = "This paper describes the participation of SINAI team at Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, the participation in Sub-task A in English which consists of identifying tweets as offensive or not offensive. We preprocess the dataset according to the language characteristics used on social media. Then, we select a small set from the training set provided by the organizers and fine-tune different Transformerbased models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Subtask-A using the XLNet model."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="plaza-del-arco-etal-2020-sinai">
    <titleInfo>
        <title>SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Flor</namePart>
        <namePart type="given">Miriam</namePart>
        <namePart type="family">Plaza del Arco</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">M</namePart>
        <namePart type="given">Dolores</namePart>
        <namePart type="family">Molina González</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Alfonso</namePart>
        <namePart type="family">Ureña-López</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Maite</namePart>
        <namePart type="family">Martin</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2020-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Fourteenth Workshop on Semantic Evaluation</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Aurelie</namePart>
            <namePart type="family">Herbelot</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Xiaodan</namePart>
            <namePart type="family">Zhu</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Alexis</namePart>
            <namePart type="family">Palmer</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Nathan</namePart>
            <namePart type="family">Schneider</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Jonathan</namePart>
            <namePart type="family">May</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ekaterina</namePart>
            <namePart type="family">Shutova</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>International Committee for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Barcelona (online)</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>This paper describes the participation of SINAI team at Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, the participation in Sub-task A in English which consists of identifying tweets as offensive or not offensive. We preprocess the dataset according to the language characteristics used on social media. Then, we select a small set from the training set provided by the organizers and fine-tune different Transformerbased models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Subtask-A using the XLNet model.</abstract>
    <identifier type="citekey">plaza-del-arco-etal-2020-sinai</identifier>
    <identifier type="doi">10.18653/v1/2020.semeval-1.211</identifier>
    <location>
        <url>https://aclanthology.org/2020.semeval-1.211/</url>
    </location>
    <part>
        <date>2020-12</date>
        <extent unit="page">
            <start>1622</start>
            <end>1627</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models
%A Plaza del Arco, Flor Miriam
%A Molina González, M. Dolores
%A Ureña-López, Alfonso
%A Martin, Maite
%Y Herbelot, Aurelie
%Y Zhu, Xiaodan
%Y Palmer, Alexis
%Y Schneider, Nathan
%Y May, Jonathan
%Y Shutova, Ekaterina
%S Proceedings of the Fourteenth Workshop on Semantic Evaluation
%D 2020
%8 December
%I International Committee for Computational Linguistics
%C Barcelona (online)
%F plaza-del-arco-etal-2020-sinai
%X This paper describes the participation of SINAI team at Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, the participation in Sub-task A in English which consists of identifying tweets as offensive or not offensive. We preprocess the dataset according to the language characteristics used on social media. Then, we select a small set from the training set provided by the organizers and fine-tune different Transformerbased models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Subtask-A using the XLNet model.
%R 10.18653/v1/2020.semeval-1.211
%U https://aclanthology.org/2020.semeval-1.211/
%U https://doi.org/10.18653/v1/2020.semeval-1.211
%P 1622-1627

Download as File

Markdown (Informal)

[SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models](https://aclanthology.org/2020.semeval-1.211/) (Plaza del Arco et al., SemEval 2020)

SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models (Plaza del Arco et al., SemEval 2020)

ACL

Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, and Maite Martin. 2020. SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1622–1627, Barcelona (online). International Committee for Computational Linguistics.