Rint Sybesma
2021
Using BERT for choosing classifiers in Mandarin
Jani Järnfors
|
Guanyi Chen
|
Kees van Deemter
|
Rint Sybesma
Proceedings of the 14th International Conference on Natural Language Generation
Choosing the most suitable classifier in a linguistic context is a well-known problem in the production of Mandarin and many other languages. The present paper proposes a solution based on BERT, compares this solution to previous neural and rule-based models, and argues that the BERT model performs particularly well on those difficult cases where the classifier adds information to the text.
2017
Investigating the content and form of referring expressions in Mandarin: introducing the Mtuna corpus
Kees van Deemter
|
Le Sun
|
Rint Sybesma
|
Xiao Li
|
Bo Chen
|
Muyun Yang
Proceedings of the 10th International Conference on Natural Language Generation
East Asian languages are thought to handle reference differently from languages such as English, particularly in terms of the marking of definiteness and number. We present the first Data-Text corpus for Referring Expressions in Mandarin, and we use this corpus to test some initial hypotheses inspired by the theoretical linguistics literature. Our findings suggest that function words deserve more attention in Referring Expressions Generation than they have so far received, and they have a bearing on the debate about whether different languages make different trade-offs between clarity and brevity.
Search
Co-authors
- Kees van Deemter 2
- Jani Järnfors 1
- Guanyi Chen 1
- Le Sun 1
- Xiao Li 1
- show all...
Venues
- inlg2