Xuefeng Gao


2020

Ciron: a New Benchmark Dataset for Chinese Irony Detection
Rong Xiang | Xuefeng Gao | Yunfei Long | Anran Li | Emmanuele Chersoni | Qin Lu | Chu-Ren Huang
Proceedings of the Twelfth Language Resources and Evaluation Conference

Automatic Chinese irony detection is a challenging task, and it has a strong impact on linguistic research. However, Chinese irony detection often lacks labeled benchmark datasets. In this paper, we introduce Ciron, the first Chinese benchmark dataset available for irony detection for machine learning models. Ciron includes more than 8.7K posts, collected from Weibo, a microblogging platform. Most importantly, Ciron is collected with no preconditions to ensure much wider coverage. Evaluation on seven different machine learning classifiers proves the usefulness of Ciron as an important resource for Chinese irony detection.

2018

Exclamative Sentences in Emotion Expressions in Mandarin Chinese: A Corpus-based Approach
Xuefeng Gao | Sophia Yat Mei Lee
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation

2017

A Corpus-based Analysis of Near-Synonymous Sentence-final Particles in Mandarin Chinese: “bale” and “eryi”
Xuefeng Gao | Yat-Mei Lee
Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation