Classifying Organized Criminal Violence in Mexico using ML and LLMs

Javier Osorio, Juan Vasquez


Abstract
Natural Language Processing (NLP) tools have been rapidly adopted in political science for the study of conflict and violence. In this paper, we present an application to analyze various lethal and non-lethal events conducted by organized criminal groups and state forces in Mexico. Based on a large corpus of news articles in Spanish and a set of high-quality annotations, the application evaluates different Machine Learning (ML) algorithms and Large Language Models (LLMs) to classify documents and individual sentences, and to identify specific behaviors related to organized criminal violence and law enforcement efforts. Our experiments support the growing evidence that BERT-like models achieve outstanding classification performance for the study of organized crime. This application amplifies the capacity of conflict scholars to provide valuable information related to important security challenges in the developing world.
Anthology ID:
2023.case-1.1
Volume:
Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text
Month:
sEPTEMBER
Year:
2023
Address:
Varna, Bulgaria
Editors:
Ali Hürriyetoğlu, Hristo Tanev, Vanni Zavarella, Reyyan Yeniterzi, Erdem Yörük, Milena Slavcheva
Venues:
CASE | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
1–10
Language:
URL:
https://aclanthology.org/2023.case-1.1
DOI:
Bibkey:
Cite (ACL):
Javier Osorio and Juan Vasquez. 2023. Classifying Organized Criminal Violence in Mexico using ML and LLMs. In Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, pages 1–10, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Classifying Organized Criminal Violence in Mexico using ML and LLMs (Osorio & Vasquez, CASE-WS 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.case-1.1.pdf