Innovative Approaches to Enhancing Safety and Ethical AI Interactions in Digital Environments

Zachary Yang


Abstract
Ensuring safe online environments is a formidable but important challenge now that people are chronically online. People's growing online presence, paired with the prevalence of harmful content such as toxicity, hate speech, misinformation, and disinformation across social media platforms and within video games, calls for stronger detection and prevention methods. My research interests lie primarily in applied natural language processing for social good. Previously, I focused on measuring partisan polarization on social media during the COVID-19 pandemic and its societal impacts. Currently, at Ubisoft La Forge, I am dedicated to enhancing player safety within in-game chat systems by developing methods to detect toxicity, evaluating the biases in these detection systems, and assessing the current ecological state of online interactions. Additionally, I am simulating social media environments using LLMs to ethically test detection methods, evaluate the effectiveness of current mitigation strategies, and potentially introduce new, successful strategies. My suggested topics for discussion: 1. Understanding and mitigating social harms through high-fidelity simulated social media environments 2. Enhancing safety in online environments such as in-game chats (text and speech) 3. Personification of LLM agents 4. Ethically simulating social media sandbox environments at scale with LLM agents 5. Re-balancing the playing field between good and bad actors: strategies for countering societal-scale manipulation.
Anthology ID:
2024.yrrsds-1.24
Volume:
Proceedings of the 20th Workshop of Young Researchers' Roundtable on Spoken Dialogue Systems
Month:
September
Year:
2024
Address:
Kyoto, Japan
Editors:
Koji Inoue, Yahui Fu, Agnes Axelsson, Atsumoto Ohashi, Brielen Madureira, Yuki Zenimoto, Biswesh Mohapatra, Armand Stricker, Sopan Khosla
Venues:
YRRSDS | WS
Publisher:
Association for Computational Linguistics
Pages:
64–67
URL:
https://aclanthology.org/2024.yrrsds-1.24
Cite (ACL):
Zachary Yang. 2024. Innovative Approaches to Enhancing Safety and Ethical AI Interactions in Digital Environments. In Proceedings of the 20th Workshop of Young Researchers' Roundtable on Spoken Dialogue Systems, pages 64–67, Kyoto, Japan. Association for Computational Linguistics.
Cite (Informal):
Innovative Approaches to Enhancing Safety and Ethical AI Interactions in Digital Environments (Yang, YRRSDS-WS 2024)
PDF:
https://aclanthology.org/2024.yrrsds-1.24.pdf