Design and Evaluation of a Courtroom Examination AI Simulation System with Behavioral Fidelity

Hsien-Jyh Liao

Abstract

This paper presents a courtroom examination AI simulation system centered on Behavioral Fidelity, with speech interaction included as a design feature to enhance immersion. For standardization and reproducibility, the present pilot evaluation uses transcripts. The system integrates pragmatic–psychological rules with Taiwanese criminal case files to simulate witness behavior under cross-examination pressure. We conduct a pilot study using an optimized Expert Turing Test framework with four dimensions: professional accuracy, situational adaptability, human-likeness, and logical consistency. Under identical prompts and knowledge sources, the customized GPT condition received higher ratings than GPT-Vanilla on adaptability and human-likeness. Applying the same framework to another mainstream model (Gemini 2.5 Flash) yielded comparable performance, although differences remain inconclusive at this sample size. Overall, the results provide preliminary evidence that Behavioral Fidelity is a feasible evaluation target and suggest that generative AI can scale to legal training; speech-condition evaluation and multi-case, multi-role extensions are left for future work.

Anthology ID: 2025.rocling-main.3
Volume: Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Month: November
Year: 2025
Address: National Taiwan University, Taipei City, Taiwan
Editors: Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
Venue: ROCLING
Publisher: Association for Computational Linguistics
Pages: 20–28
URL: https://aclanthology.org/2025.rocling-main.3/
Cite (ACL): Hsien-Jyh Liao. 2025. Design and Evaluation of a Courtroom Examination AI Simulation System with Behavioral Fidelity. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 20–28, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
Cite (Informal): Design and Evaluation of a Courtroom Examination AI Simulation System with Behavioral Fidelity (Liao, ROCLING 2025)
PDF: https://aclanthology.org/2025.rocling-main.3.pdf
