Yueh-Yin Shih
2022
Converting the Sinica Treebank of Mandarin Chinese to Universal Dependencies
Yu-Ming Hsieh | Yueh-Yin Shih | Wei-Yun Ma
Proceedings of the 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022
This paper describes the conversion of the Sinica Treebank, one of the major Mandarin Chinese treebanks, to Universal Dependencies (UD). The conversion is rule-based and involves POS tag mapping, head adjustment in line with the UD scheme, and dependency conversion. Linguistic insights into Mandarin Chinese arising from the conversion are also discussed. The resulting corpus, the UD Chinese Sinica Treebank, contains more than fifty thousand tree structures annotated according to the UD scheme. The dataset can be downloaded at https://github.com/ckiplab/ud.
2018
Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model
Wei-Yun Ma | Yueh-Yin Shih
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
2007
Knowledge Representation for Interrogatives in E-HowNet
Shu-Ling Huang | You-Shan Chung | Yueh-Yin Shih | Keh-Jiann Chen
Proceedings of the 19th Conference on Computational Linguistics and Speech Processing
2006
Semantic Representation and Composition for Unknown Compounds in E-HowNet
Yueh-Yin Shih | Shu-Ling Huang | Keh-Jiann Chen
Proceedings of the 20th Pacific Asia Conference on Language, Information and Computation