2024
Collection of Japanese Route Information Reference Expressions Using Maps as Stimuli
Yoshiko Kawabata | Mai Omura | Hikari Konishi | Masayuki Asahara | Johane Takeuchi
Proceedings of the 4th Workshop on Spatial Language Understanding and Grounded Communication for Robotics (SpLU-RoboNLP 2024)
We constructed a database of Japanese expressions based on route information. Using 20 maps as stimuli, we requested descriptions of routes between two points on each map from 40 individuals per route, collecting 1600 route information reference expressions. We determined whether the expressions were based solely on relative reference expressions by using landmarks on the maps. In cases in which only relative reference expressions were used, we labeled the presence or absence of information regarding the starting point, waypoints, and destination. Additionally, we collected clarity ratings for each expression using a survey.
2023
Spatial Information Annotation Based on the Double Cross Model
Yoshiko Kawabata | Mai Omura | Masayuki Asahara | Johane Takeuchi
Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
2022
Design and Evaluation of the Corpus of Everyday Japanese Conversation
Hanae Koiso | Haruka Amatani | Yasuharu Den | Yuriko Iseki | Yuichi Ishimoto | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda | Yuka Watanabe
Proceedings of the Thirteenth Language Resources and Evaluation Conference
We constructed the Corpus of Everyday Japanese Conversation (CEJC) and published it in March 2022. The CEJC is designed to contain various kinds of everyday conversations in a balanced manner to capture their diversity. The CEJC features not only audio but also video data to facilitate precise understanding of the mechanism of real-life social behavior. The publication of a large-scale corpus of everyday conversations that includes video data is a new approach. The CEJC contains 200 hours of speech, 577 conversations, about 2.4 million words, and a total of 1675 conversants. In this paper, we present an overview of the corpus, including the recording method and devices, structure of the corpus, formats of video and audio files, transcription, and annotations. We then report some results of the evaluation of the CEJC in terms of conversant and conversation attributes. We show that the CEJC includes a good balance of adult conversants in terms of gender and age, as well as a variety of conversations in terms of conversation forms, places, activities, and numbers of conversants.
2018
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
Hanae Koiso | Yasuharu Den | Yuriko Iseki | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)