Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information

Hojun Cho; Donghu Kim; Soyoung Yang; Chan Lee; Hunjoo Lee; Jaegul Choo

doi:10.18653/v1/2025.emnlp-industry.191

Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information

Hojun Cho, Donghu Kim, Soyoung Yang, Chan Lee, Hunjoo Lee, Jaegul Choo

Abstract

Language agents powered by large language models (LLMs) face significant deployment challenges in resource-constrained environments, particularly for specialized domains and less-common languages. This paper presents Tox-chat, a Korean chemical toxicity information agent devised within these limitations. We propose two key innovations: a context-efficient architecture that reduces token consumption through hierarchical section search, and a scenario-based dialogue generation methodology that effectively distills tool-using capabilities from larger models. Experimental evaluations demonstrate that our fine-tuned 8B parameter model substantially outperforms both untuned models and baseline approaches, in terms of DB faithfulness and preference. Our work offers valuable insights for researchers developing domain-specific language agents under practical constraints.

Anthology ID:: 2025.emnlp-industry.191
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:: November
Year:: 2025
Address:: Suzhou (China)
Editors:: Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2852–2880
Language:
URL:: https://aclanthology.org/2025.emnlp-industry.191/
DOI:: 10.18653/v1/2025.emnlp-industry.191
Bibkey:
Cite (ACL):: Hojun Cho, Donghu Kim, Soyoung Yang, Chan Lee, Hunjoo Lee, and Jaegul Choo. 2025. Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2852–2880, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):: Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information (Cho et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-industry.191.pdf

PDF Cite Search Fix data