Scent Mining: Extracting Olfactory Events, Smell Sources and Qualities

Stefano Menini, Teresa Paccosi, Serra Sinem Tekiroğlu, Sara Tonelli


Abstract
Olfaction is understudied compared to the other senses. In NLP, however, there have been recent attempts to develop taxonomies and benchmarks specifically designed to capture smell-related information. In this work, we extend this line of research by presenting a supervised system for olfactory information extraction in English. We cast this problem as a token classification task and build a system that identifies smell words, smell sources and qualities. The classifier is then applied to a set of English historical corpora covering different domains and written between the 15th and 20th centuries. A qualitative analysis of the extracted data shows that they can be used to infer interesting information about smelly items such as tea and tobacco from a diachronic perspective, supporting historical investigation with corpus-based evidence.
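To illustrate the token classification framing mentioned in the abstract, here is a minimal sketch of how per-token predictions could be grouped into labeled spans. It assumes a BIO tagging scheme and the label names SMELL_WORD, SMELL_SOURCE and QUALITY; both are illustrative assumptions, not the annotation scheme or code of the paper, and the example sentence and tags are hand-written rather than system output.

```python
from typing import List, Tuple


def decode_bio(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (label, text) spans.

    Label names and the BIO scheme are assumptions for illustration only.
    """
    spans: List[Tuple[str, str]] = []
    current_label = None
    current_tokens: List[str] = []

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new span starts: close any span that is still open.
            if current_label is not None:
                spans.append((current_label, " ".join(current_tokens)))
            current_label, current_tokens = tag[2:], [token]
        elif tag.startswith("I-") and current_label == tag[2:]:
            # Continuation of the currently open span.
            current_tokens.append(token)
        else:
            # "O" tag or an inconsistent "I-" tag: close the open span, if any.
            if current_label is not None:
                spans.append((current_label, " ".join(current_tokens)))
            current_label, current_tokens = None, []

    # Close a span that runs to the end of the sentence.
    if current_label is not None:
        spans.append((current_label, " ".join(current_tokens)))
    return spans


# Hand-written example; the tags are not predictions from any real model.
tokens = ["The", "fragrant", "smell", "of", "fresh", "tea"]
tags = ["O", "B-QUALITY", "B-SMELL_WORD", "O", "B-SMELL_SOURCE", "I-SMELL_SOURCE"]
spans = decode_bio(tokens, tags)
```

In this sketch, a token classifier would supply the `tags` list, and `decode_bio` would turn it into the smell word, source and quality mentions that a downstream corpus analysis could count over time.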
Anthology ID:
2023.latechclfl-1.15
Volume:
Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Stefania Degaetano-Ortlieb, Anna Kazantseva, Nils Reiter, Stan Szpakowicz
Venue:
LaTeCHCLfL
Publisher:
Association for Computational Linguistics
Pages:
135–140
URL:
https://aclanthology.org/2023.latechclfl-1.15
DOI:
10.18653/v1/2023.latechclfl-1.15
Cite (ACL):
Stefano Menini, Teresa Paccosi, Serra Sinem Tekiroğlu, and Sara Tonelli. 2023. Scent Mining: Extracting Olfactory Events, Smell Sources and Qualities. In Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 135–140, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Scent Mining: Extracting Olfactory Events, Smell Sources and Qualities (Menini et al., LaTeCHCLfL 2023)
PDF:
https://aclanthology.org/2023.latechclfl-1.15.pdf
Video:
https://aclanthology.org/2023.latechclfl-1.15.mp4