Chia-Hui Chang


2025

This study proposes a system architecture named the Collision Care Guide (CCG), a traffic-accident information collection agent focused on structured information gathering in the initial stage of an accident. CCG integrates three modules: question generation, information extraction, and accident reconstruction. Through multi-turn dialogue, it guides users to describe accident details, converts the narrative into a structured data format (TARF), and generates a readable summary for verification. To meet requirements for cost efficiency, privacy protection, and deployment flexibility, this study compares open-source Llama models (3B/8B parameters, with full fine-tuning and 4-bit PEFT) against the commercial baseline GPT-4o-mini. Results show that the information extraction module achieves field accuracy above 0.94 and JSON semantic similarity of 0.995; the question generation module reaches semantic similarity between 0.85 and 0.88 with more concise question phrasing. The fine-tuned models score above 4 (out of 5) in LLM-based evaluations of both dialogue quality and information extraction, within 0.5 points of the commercial baseline. The study confirms that fine-tuned open-source models can approach commercial-model performance, and that quantized versions offer strong efficiency and deployment potential in resource-constrained settings. The CCG design fills a technical gap in interactive information collection during the initial stage of an accident and provides an efficient, cost-effective solution for traffic accident handling.
Despite recent advances in AI, ASR systems still struggle with real-world errors arising from pronunciation variation and homophones. To address this issue, we propose a verbal-command-based correction system that enables users to utter natural-language instructions to refine recognition outputs with minimal effort. The system consists of three modules: an input classifier, a command classifier, and a correction labeler. To support training and evaluation, we simulate potential ASR errors via a TTS-then-ASR pipeline, followed by verbal correction commands generated from linguistic features or LLMs. Experiments show that the overall system achieves over 80% correction accuracy and delivers stable performance. Compared to manual correction, the system also demonstrates highly competitive correction speed, indicating its feasibility for practical deployment.

2023

Retelling a story is one way to develop narrative skills in students, but it may present challenges for English as a Second Language (ESL) students who are learning new stories and vocabulary at the same time. The goal of this research is to develop a dialogue module for story co-telling that helps ESL students co-narrate an English story and enhance their narrative skills. However, story co-telling is a relatively underexplored and novel task. To understand the story content and select the right plot to continue the co-telling based on the current dialogue, we utilize open-domain information extraction techniques to construct a knowledge graph, and adopt multi-agent reinforcement learning to train two agents to select relevant facts from the knowledge graph and generate responses, jointly accomplishing the story co-telling task. Compared to models that rely on chronological order, our model improves performance from 67.0% to 70.8% through self-training with reward evaluation, an increase of approximately 3.8 percentage points.

2022

For educators, generating high-quality question-answer pairs from story text is a time-consuming and labor-intensive task. The goal is not to stump students, but to ensure through the generated question-answer pairs that they understand the story text. In this paper, we improve the FairyTaleQA question generation method by incorporating the question type and its definition into the input for fine-tuning the BART (Lewis et al., 2020) model. Furthermore, we make use of the entity and relation extraction from (Zhong and Chen, 2021) as an element of template-based question generation.
Due to a lack of conversation practice, the main challenge for second-language learners is speaking. Our goal is to develop a chatbot that encourages individuals to reflect on, describe, analyse, and communicate what they read, as well as improve students' English expression skills. In this paper, we exploit COMET, an inferential commonsense knowledge generator, as background knowledge to improve generation diversity. We consider two approaches to increase the diversity of empathetic response generation. For non-pretrained models, we apply AdaLabel (Wang et al., 2021) to the Commonsense-aware Empathetic model (Sabour et al., 2022) and improve the Distinct-2 score from 2.99 to 4.08 on EMPATHETIC DIALOGUES (ED). Furthermore, we augment the pretrained BART model with various commonsense knowledge to generate more informative empathetic responses. Not only does the Distinct-2 score improve from 9.11 to 11.21 in automatic evaluation, but a manual case study also shows that CE-BART significantly outperforms CEM-AdaLabel.
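The Distinct-2 scores reported above are commonly computed as the number of unique bigrams divided by the total number of bigrams across all generated responses. A minimal sketch (whitespace tokenization here is illustrative; the papers may use their own tokenizers):

```python
def distinct_n(texts, n=2):
    """Distinct-n: ratio of unique n-grams to total n-grams across responses."""
    total = 0
    unique = set()
    for text in texts:
        tokens = text.split()
        ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(ngrams)
        unique.update(ngrams)
    return len(unique) / total if total else 0.0
```

A higher score indicates less repetitive generation across the response set.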

2021

For manufacturers of home appliances, studying product discussions on social media can help improve their products. Opinions provided through online reviews immediately reflect whether a product is accepted by consumers and which aspects of the product are most discussed. In this article, we divide the analysis of home appliances into three tasks: named entity recognition (NER), aspect category extraction (ACE), and aspect category sentiment classification (ACSC). To improve the performance of ACSC, we combine the Reptile algorithm from meta-learning with the concept of domain adversarial training to form the Adversarial Reptile algorithm. We show that macro-F1 improves from 68.6% (fine-tuned BERT) to 70.3% (p-value 0.04).
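The Reptile meta-update underlying the approach above repeatedly adapts the shared parameters on one task, then moves them toward the adapted weights: θ ← θ + ε(φ − θ). A toy sketch of that update rule on linear-regression tasks (the task setup, hyperparameters, and function names are illustrative, not the paper's Adversarial Reptile):

```python
import numpy as np

def sgd_task(theta, xs, ys, lr=0.02, steps=10):
    """A few inner-loop SGD steps on one task's linear-regression loss."""
    w = theta.copy()
    for _ in range(steps):
        grad = 2 * xs.T @ (xs @ w - ys) / len(xs)
        w -= lr * grad
    return w

def reptile(theta, tasks, outer_lr=0.5, rounds=20):
    """Reptile meta-update: move theta toward each task's adapted weights."""
    for _ in range(rounds):
        for xs, ys in tasks:
            adapted = sgd_task(theta, xs, ys)
            theta = theta + outer_lr * (adapted - theta)  # theta <- theta + eps*(phi - theta)
    return theta
```

The adversarial variant described in the abstract would additionally pass features through a domain classifier with a gradient reversal layer, which this sketch omits.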
When we are interested in a certain domain, we can collect and analyze data from the Internet. Since newly collected data is unlabeled, we hope that existing labeled data can help with the new data. We perform named entity recognition (NER) and aspect-based sentiment analysis (ABSA) in a multi-task learning setting, combining a parameter generation network with the DANN architecture to build the model. In the NER task, data is labeled with a Tie/Break scheme, and task weights are adjusted according to each task's loss change rate using Dynamic Weight Average (DWA). This study uses two different source-domain datasets. Experimental results show that the Tie/Break scheme improves model results, DWA yields better performance, and combining the parameter generation network with a gradient reversal layer enables effective learning across different domains.
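The DWA weighting mentioned above, following the standard formulation, weights each task by the softmax of its recent loss ratio r_k = L_k(t−1)/L_k(t−2) at temperature T, scaled so the weights sum to the number of tasks. A minimal sketch (the task names and temperature value are illustrative):

```python
import math

def dwa_weights(loss_history, T=2.0):
    """Dynamic Weight Average: weight each task by its recent loss descent rate.

    loss_history: dict mapping task name -> list of per-epoch losses (>= 2 entries).
    Returns a dict of weights summing to the number of tasks.
    """
    ratios = {k: v[-1] / v[-2] for k, v in loss_history.items()}
    exps = {k: math.exp(r / T) for k, r in ratios.items()}
    total = sum(exps.values())
    n_tasks = len(loss_history)
    return {k: n_tasks * e / total for k, e in exps.items()}
```

A task whose loss is descending more slowly (higher ratio) receives a larger weight, nudging training effort toward it.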

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2006