Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future

Minzhi Li, Weiyan Shi, Caleb Ziems, Diyi Yang


Abstract
As Natural Language Processing (NLP) systems become increasingly integrated into human social life, these technologies will need to increasingly rely on social intelligence. Although there are many valuable datasets that benchmark isolated dimensions of social intelligence, there does not yet exist any body of work to join these threads into a cohesive subfield in which researchers can quickly identify research gaps and future directions. Towards this goal, we build a Social AI Data Infrastructure, which consists of a comprehensive social AI taxonomy and a data library of 480 NLP datasets. Our infrastructure allows us to analyze existing dataset efforts, and also evaluate language models’ performance in different social intelligence aspects. Our analyses demonstrate its utility in enabling a thorough understanding of current data landscape and providing a holistic perspective on potential directions for future dataset development. We show there is a need for multifaceted datasets, increased diversity in language and culture, more long-tailed social situations, and more interactive data in future social intelligence data efforts.
Anthology ID:
2024.findings-acl.163
Volume:
Findings of the Association for Computational Linguistics: ACL 2024
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2789–2805
Language:
URL:
https://aclanthology.org/2024.findings-acl.163
DOI:
10.18653/v1/2024.findings-acl.163
Bibkey:
Cite (ACL):
Minzhi Li, Weiyan Shi, Caleb Ziems, and Diyi Yang. 2024. Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future. In Findings of the Association for Computational Linguistics: ACL 2024, pages 2789–2805, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future (Li et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-acl.163.pdf