Semi-Automatic Topic Discovery and Classification for Epidemic Intelligence via Large Language Models

Federico Borazio, Danilo Croce, Giorgio Gambosi, Roberto Basili, Daniele Margiotta, Antonio Scaiella, Martina Del Manso, Daniele Petrone, Andrea Cannone, Alberto M. Urdiales, Chiara Sacco, Patrizio Pezzotti, Flavia Riccardo, Daniele Mipatrini, Federica Ferraro, Sobha Pilati


Abstract
This paper introduces a novel framework to harness Large Language Models (LLMs) for Epidemic Intelligence, focusing on identifying and categorizing emergent socio-political phenomena within health crises, with a spotlight on the COVID-19 pandemic. Our approach diverges from traditional methods, such as Topic Models, by providing explicit support to analysts through the identification of distinct thematic areas and the generation of clear, actionable statements for each topic. This supports a Zero-shot Classification mechanism, enabling effective matching of news articles to fine-grain topics without the need for model fine-tuning. The framework is designed to be as transparent as possible, producing linguistically informed insights to make the analysis more accessible to analysts who may not be familiar with every subject matter of inherently emerging phenomena. This process not only enhances the precision and relevance of the extracted Epidemic Intelligence but also fosters a collaborative environment where system linguistic abilities and the analyst’s domain expertise are integrated.
Anthology ID:
2024.politicalnlp-1.8
Volume:
Proceedings of the Second Workshop on Natural Language Processing for Political Sciences @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Haithem Afli, Houda Bouamor, Cristina Blasi Casagran, Sahar Ghannay
Venues:
PoliticalNLP | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
68–84
Language:
URL:
https://aclanthology.org/2024.politicalnlp-1.8
DOI:
Bibkey:
Cite (ACL):
Federico Borazio, Danilo Croce, Giorgio Gambosi, Roberto Basili, Daniele Margiotta, Antonio Scaiella, Martina Del Manso, Daniele Petrone, Andrea Cannone, Alberto M. Urdiales, Chiara Sacco, Patrizio Pezzotti, Flavia Riccardo, Daniele Mipatrini, Federica Ferraro, and Sobha Pilati. 2024. Semi-Automatic Topic Discovery and Classification for Epidemic Intelligence via Large Language Models. In Proceedings of the Second Workshop on Natural Language Processing for Political Sciences @ LREC-COLING 2024, pages 68–84, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Semi-Automatic Topic Discovery and Classification for Epidemic Intelligence via Large Language Models (Borazio et al., PoliticalNLP-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.politicalnlp-1.8.pdf