The SlangTrack Dataset: Supporting the Detection of Words Used in Slang Senses

Afnan Mohammed Aloraini; Riza Theresa Batista-Navarro; Goran Nenadic; Viktor Schlegel

The SlangTrack Dataset: Supporting the Detection of Words Used in Slang Senses

Afnan Mohammed Aloraini, Riza Batista-Navarro, Goran Nenadic, Viktor Schlegel

Abstract

Slang is widespread in informal communication, yet its fluidity poses challenges for natural language processing (NLP), especially when words alternate between slang and non-slang senses. While prior work has examined slang through dictionaries, sentiment analysis, and lexicon building, little attention has been given to detecting slang usage in context. We address this gap by reframing slang detection as distinguishing slang from non-slang senses of the same lexical item. To support this task, we introduce SlangTrack (ST), a diachronically structured dataset of dual-meaning words annotated at the sentence level with high inter-annotator agreement. We benchmark (1) deep learning models with static and contextual embeddings, (2) transformer-based models, and (3) large language models evaluated in zero-shot, few-shot, and fine-tuned settings. Fine-tuned transformers, especially BERT-large enriched with sentiment and emotion features, achieve the strongest performance, reaching an F1-score of 72% for slang and 92% for non-slang usage. Our findings highlight both the difficulty of contextual slang detection and the value of affective cues for improving model robustness.

Anthology ID:: 2026.lchange-1.1
Volume:: The Proceedings for the 6th International Workshop on Computational Approaches to Language Change (LChange’26)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Nina Tahmasebi, Pierluigi Cassotti, Syrielle Montariol, Andrey Kutuzov, Netta Huebscher, Elena Spaziani, Naomi Baes
Venues:: LChange | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1–19
Language:
URL:: https://aclanthology.org/2026.lchange-1.1/
DOI:
Bibkey:
Cite (ACL):: Afnan Mohammed Aloraini, Riza Batista-Navarro, Goran Nenadic, and Viktor Schlegel. 2026. The SlangTrack Dataset: Supporting the Detection of Words Used in Slang Senses. In The Proceedings for the 6th International Workshop on Computational Approaches to Language Change (LChange’26), pages 1–19, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: The SlangTrack Dataset: Supporting the Detection of Words Used in Slang Senses (Aloraini et al., LChange 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.lchange-1.1.pdf

PDF Cite Search Fix data