How Should a Large Corpus Be Built?-A Comparative Study of Closure in Annotated Newspaper Corpora from Two Chinese Sources, Towards Building a Larger Representative Corpus Merged from Representative Sublanguage Collections John J Kovarik author 2000-10 text Second Chinese Language Processing Workshop Association for Computational Linguistics Hong Kong, China conference publication kovarik-2000-large 10.3115/1117769.1117788 https://aclanthology.org/W00-1217/ 2000-10 116 123