Masanori Oya
2025
The propositional idea densities of different languages in multi-lingual parallel corpus
Masanori Oya | Yuka Kaise | Yuto Tsuchiya
Proceedings of the 39th Pacific Asia Conference on Language, Information and Computation
Masanori Oya | Yuka Kaise | Yuto Tsuchiya
Proceedings of the 39th Pacific Asia Conference on Language, Information and Computation
Degree centrality as a measure of robustness of dependency structures of the sentences in a large-scale learner corpus of English
Masanori Oya
Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025)
Masanori Oya
Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025)
This paper examines the differences in the robustness of syntactic dependency structures in written English produced by learners of varying proficiency levels and by native English speakers. The robustness of these dependency structures is represented by their degree centralities, and corpus-based investigation revealed that learners with higher proficiency levels tend to produce sentences with lower degree centralities. This means that they produce more robust, and more embedded sentences. It is also revealed that the sentences produced by native speakers of English tend to produce more embedded sentences than non-native speakers.
UD Treebanks for Esperanto as a natural language
Masanori Oya
Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
Masanori Oya
Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
This paper describes the details of UD-based morphological and syntactic annotations on Esperanto texts to construct its small-scale UD treebank. Though it was created as an international auxiliary language, Esperanto has increasingly been studied as a natural language both in linguistics and in NLP. This paper introduces the detail of manual annotation of UD morphological and relational tags and describes how the frequencies of these tags differ across the treebanks and discusses the possibility of future research of Esperanto as a natural language.
2024
Cross-Linguistic Variances of Dependency Distances in Multi-Lingual Parallel Corpus
Masanori Oya
Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation
2023
Low-Frequency Long-Distance Dependencies as Long Tails
Masanori Oya
Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
2022
Curve-fitting of frequency distributions of dependency distances in a multi-lingual parallel corpus
Masanori Oya
Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation
2021
Three Types of Average Dependency Distances of Sentences in a Multilingual Parallel Corpus
Masanori Oya
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation
2020
Syntactic similarity of the sentences in a multi-lingual parallel corpus based on the Euclidean distance of their dependency trees
Masanori Oya
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
2018
Utilization of Dependency Type per Sentence to Identify Differences among Genres of English Texts
Masanori Oya
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation
Masanori Oya
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation