Takuma Tanaka


2016

Creating Japanese Political Corpus from Local Assembly Minutes of 47 prefectures
Yasutomo Kimura | Keiichi Takamaru | Takuma Tanaka | Akio Kobayashi | Hiroki Sakaji | Yuzu Uchida | Hokuto Ototake | Shigeru Masuyama
Proceedings of the 12th Workshop on Asian Language Resources (ALR12)

This paper describes a Japanese political corpus created for interdisciplinary political research. The corpus contains the local assembly minutes of 47 prefectures from April 2011 to March 2015. This four-year period coincides with the term of office for assembly members in most local governments. We analyze statistical data, such as the number of speakers, characters, and words, to clarify the characteristics of local assembly minutes. In addition, we identify problems associated with the different web services used by the local governments to make the minutes available to the public.