Extraction of the Argument Structure of Tokyo Metropolitan Assembly Minutes: Segmentation of Question-and-Answer Sets
Keiichi Takamaru | Yasutomo Kimura | Hideyuki Shibuki | Hokuto Ototake | Yuzu Uchida | Kotaro Sakamoto | Madoka Ishioroshi | Teruko Mitamura | Noriko Kando
Proceedings of the Twelfth Language Resources and Evaluation Conference
In this study, we construct a corpus of Japanese local assembly minutes. All speeches in an assembly were transcribed into a local assembly minutes based on the local autonomy law. Therefore, the local assembly minutes form an extremely large amount of text data. Our ultimate objectives were to summarize and present the arguments in the assemblies, and to use the minutes as primary information for arguments in local politics. To achieve this, we structured all statements in assembly minutes. We focused on the structure of the discussion, i.e., the extraction of question and answer pairs. We organized the shared task “QA Lab-PoliInfo” in NTCIR 14. We conducted a “segmentation task” to identify the scope of one question and answer in the minutes as a sub task of the shared task. For the segmentation task, 24 runs from five teams were submitted. Based on the obtained results, the best recall was 1.000, best precision was 0.940, and best F-measure was 0.895.
Creating Japanese Political Corpus from Local Assembly Minutes of 47 prefectures
Yasutomo Kimura | Keiichi Takamaru | Takuma Tanaka | Akio Kobayashi | Hiroki Sakaji | Yuzu Uchida | Hokuto Ototake | Shigeru Masuyama
Proceedings of the 12th Workshop on Asian Language Resources (ALR12)
This paper describes a Japanese political corpus created for interdisciplinary political research. The corpus contains the local assembly minutes of 47 prefectures from April 2011 to March 2015. This four-year period coincides with the term of office for assembly members in most autonomies. We analyze statistical data, such as the number of speakers, characters, and words, to clarify the characteristics of local assembly minutes. In addition, we identify problems associated with the different web services used by the autonomies to make the minutes available to the public.
- Yasutomo Kimura 2
- Yuzu Uchida 2
- Hokuto Ototake 2
- Takuma Tanaka 1
- Akio Kobayashi 1
- show all...