DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization

Jeonghun Kang; Soonmok Kwon; Joonseok Lee; Byung-Hak Kim

doi:10.18653/v1/2025.realm-1.28

DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization

Jeonghun Kang, Soonmok Kwon, Joonseok Lee, Byung-Hak Kim

Abstract

Highlight summarization in baseball requires balancing statistical analysis with narrative coherence. Traditional approaches—such as Win Probability Added (WPA)-based ranking or computer vision-driven event detection—can identify scoring plays but often miss strategic depth, momentum shifts, and storyline progression. Manual curation remains the gold standard but is resource-intensive and not scalable.We introduce DIAMOND, an LLM-driven agent for context-aware baseball highlight summarization that integrates structured sports analytics with natural language reasoning. DIAMOND leverages sabermetric features—Win Expectancy, WPA, and Leverage Index—to quantify play importance, while an LLM module enhances selection based on contextual narrative value. This hybrid approach ensures both quantitative rigor and qualitative richness, surpassing the limitations of purely statistical or vision-based systems.Evaluated on five diverse Korean Baseball Organization League games, DIAMOND improves F1-score from 42.9% (WPA-only) to 84.8%, outperforming both commercial and statistical baselines. Though limited in scale, our results highlight the potential of modular, interpretable agent-based frameworks for event-level summarization in sports and beyond.

Anthology ID:: 2025.realm-1.28
Volume:: Proceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Ehsan Kamalloo, Nicolas Gontier, Xing Han Lu, Nouha Dziri, Shikhar Murty, Alexandre Lacoste
Venues:: REALM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 386–400
Language:
URL:: https://aclanthology.org/2025.realm-1.28/
DOI:: 10.18653/v1/2025.realm-1.28
Bibkey:
Cite (ACL):: Jeonghun Kang, Soonmok Kwon, Joonseok Lee, and Byung-Hak Kim. 2025. DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization. In Proceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025), pages 386–400, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization (Kang et al., REALM 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.realm-1.28.pdf

PDF Cite Search Fix data