Structure Modeling Approach for UD Parsing of Historical Modern Japanese

Hiroaki Ozaki; Mai Omura; Kanako Komiya; Masayuki Asahara; Toshinobu Ogiso

doi:10.18653/v1/2025.xllm-1.12

Abstract

This study shows that structure modeling improves transfer ability in diachronic syntactic parsing. Syntactic parsing of historical languages matters for the humanities and quantitative linguistics because it enables annotation support and analysis of unannotated documents. We compared the zero-shot transfer ability of Transformer-based Biaffine UD parsers with that of our structure modeling approach. The structure modeling approach is a pipeline consisting of dictionary-based morphological analysis (MeCab), deep learning-based phrase (bunsetsu) analysis (Monaka), SVM-based phrase dependency parsing (CaboCha), and a rule-based conversion from phrase dependencies to UD. This pipeline closely follows the methodology used in constructing Japanese UD corpora. Experimental results showed that the structure modeling approach outperformed the Transformer-based parsers in zero-shot transfer from contemporary to modern Japanese. Moreover, the structure modeling approach outperformed several existing UD parsers on contemporary Japanese. Overall, the structure modeling approach outperformed the compared parsers in the diachronic transfer of Japanese by a wide margin and is useful for applications in digital humanities and quantitative linguistics.
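The last pipeline stage, converting phrase (bunsetsu) dependencies to word-level UD heads, can be illustrated with a minimal sketch. This is not the authors' rule set: it assumes each bunsetsu already comes with the position of its head word and the index of its governing bunsetsu, and it assigns heads only (no UD relation labels). The names `Bunsetsu`, `head_index`, `parent`, and `bunsetsu_to_word_heads` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Bunsetsu:
    """One phrase; all fields are assumed to come from upstream analysis."""
    words: List[str]   # surface forms of the words in this phrase
    head_index: int    # position of the head word within `words`
    parent: int        # index of the governing phrase, -1 for the root phrase


def bunsetsu_to_word_heads(phrases: List[Bunsetsu]) -> List[int]:
    """Return a 1-based head id for every word (0 marks the root)."""
    # Offset of each phrase's first word in the flattened sentence.
    offsets = []
    total = 0
    for phrase in phrases:
        offsets.append(total)
        total += len(phrase.words)

    # 1-based id of the head word of each phrase.
    head_word_id = [
        offsets[i] + phrase.head_index + 1 for i, phrase in enumerate(phrases)
    ]

    heads = []
    for i, phrase in enumerate(phrases):
        for j in range(len(phrase.words)):
            if j != phrase.head_index:
                # Non-head words attach to the head word of their own phrase.
                heads.append(head_word_id[i])
            elif phrase.parent == -1:
                # The head word of the root phrase is the sentence root.
                heads.append(0)
            else:
                # A phrase head attaches to the head word of its parent phrase.
                heads.append(head_word_id[phrase.parent])
    return heads


# Toy sentence: 太郎 が | 本 を | 読む ("Taro reads a book")
example = [
    Bunsetsu(words=["太郎", "が"], head_index=0, parent=2),
    Bunsetsu(words=["本", "を"], head_index=0, parent=2),
    Bunsetsu(words=["読む"], head_index=0, parent=-1),
]
print(bunsetsu_to_word_heads(example))
```

In this toy case the particles attach to the nouns they mark, and both nouns attach to the verb, which becomes the root. A real conversion would additionally need rules for relation labels and for constructions that this sketch ignores.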

Anthology ID: 2025.xllm-1.12
Volume: Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025)
Month: August
Year: 2025
Address: Vienna, Austria
Editors: Hao Fei, Kewei Tu, Yuhui Zhang, Xiang Hu, Wenjuan Han, Zixia Jia, Zilong Zheng, Yixin Cao, Meishan Zhang, Wei Lu, N. Siddharth, Lilja Øvrelid, Nianwen Xue, Yue Zhang
Venues: XLLM | WS
Publisher: Association for Computational Linguistics
Pages: 106–114
URL: https://aclanthology.org/2025.xllm-1.12/
DOI: 10.18653/v1/2025.xllm-1.12
Cite (ACL): Hiroaki Ozaki, Mai Omura, Kanako Komiya, Masayuki Asahara, and Toshinobu Ogiso. 2025. Structure Modeling Approach for UD Parsing of Historical Modern Japanese. In Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025), pages 106–114, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Structure Modeling Approach for UD Parsing of Historical Modern Japanese (Ozaki et al., XLLM 2025)
PDF: https://aclanthology.org/2025.xllm-1.12.pdf
