Creating Japanese Political Corpus from Local Assembly Minutes of 47 prefectures

Yasutomo Kimura, Keiichi Takamaru, Takuma Tanaka, Akio Kobayashi, Hiroki Sakaji, Yuzu Uchida, Hokuto Ototake, Shigeru Masuyama


Abstract
This paper describes a Japanese political corpus created for interdisciplinary political research. The corpus contains the local assembly minutes of 47 prefectures from April 2011 to March 2015. This four-year period coincides with the term of office for assembly members in most autonomies. We analyze statistical data, such as the number of speakers, characters, and words, to clarify the characteristics of local assembly minutes. In addition, we identify problems associated with the different web services used by the autonomies to make the minutes available to the public.
Anthology ID:
W16-5410
Volume:
Proceedings of the 12th Workshop on Asian Language Resources (ALR12)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
ALR | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
78–85
Language:
URL:
https://aclanthology.org/W16-5410
DOI:
Bibkey:
Copy Citation:
PDF:
https://aclanthology.org/W16-5410.pdf