Red-Teaming LLM Multi-Agent Systems via Communication Attacks

Pengfei He; Yuping Lin; Shen Dong; Han Xu; Yue Xing; Hui Liu

doi:10.18653/v1/2025.findings-acl.349

Red-Teaming LLM Multi-Agent Systems via Communication Attacks

Pengfei He, Yuping Lin, Shen Dong, Han Xu, Yue Xing, Hui Liu

Abstract

Large Language Model-based Multi-Agent Systems (LLM-MAS) have revolutionized complex problem-solving capability by enabling sophisticated agent collaboration through message-based communications. While the communication framework is crucial for agent coordination, it also introduces a critical yet unexplored security vulnerability. In this work, we introduce Agent-in-the-Middle (AiTM), a novel attack that exploits the fundamental communication mechanisms in LLM-MAS by intercepting and manipulating inter-agent messages. Unlike existing attacks that compromise individual agents, AiTM demonstrates how an adversary can compromise entire multi-agent systems by only manipulating the messages passing between agents. To enable the attack under the challenges of limited control and role-restricted communication format, we develop an LLM-powered adversarial agent with a reflection mechanism that generates contextually-aware malicious instructions. Our comprehensive evaluation across various frameworks, communication structures, and real-world applications demonstrates that LLM-MAS is vulnerable to communication-based attacks, highlighting the need for robust security measures in multi-agent systems.

Anthology ID:: 2025.findings-acl.349
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6726–6747
Language:
URL:: https://aclanthology.org/2025.findings-acl.349/
DOI:: 10.18653/v1/2025.findings-acl.349
Bibkey:
Cite (ACL):: Pengfei He, Yuping Lin, Shen Dong, Han Xu, Yue Xing, and Hui Liu. 2025. Red-Teaming LLM Multi-Agent Systems via Communication Attacks. In Findings of the Association for Computational Linguistics: ACL 2025, pages 6726–6747, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Red-Teaming LLM Multi-Agent Systems via Communication Attacks (He et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.349.pdf

PDF Cite Search Fix data