%0 Conference Proceedings
%T Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme
%A Den, Yasuharu
%A Koiso, Hanae
%A Maruyama, Takehiko
%A Maekawa, Kikuo
%A Takanashi, Katsuya
%A Enomoto, Mika
%A Yoshida, Nao
%Y Calzolari, Nicoletta
%Y Choukri, Khalid
%Y Maegaard, Bente
%Y Mariani, Joseph
%Y Odijk, Jan
%Y Piperidis, Stelios
%Y Rosner, Mike
%Y Tapias, Daniel
%S Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10)
%D 2010
%8 May
%I European Language Resources Association (ELRA)
%C Valletta, Malta
%F den-etal-2010-two
%X In this paper, we propose a scheme for annotating utterance-level units in Japanese dialogs, which emerged from an analysis of the interrelationship among four schemes, i) inter-pausal units, ii) intonation units, iii) clause units, and iv) pragmatic units. The associations among the labels of these four units were illustrated by multiple correspondence analysis and hierarchical cluster analysis. Based on these results, we prescribe utterance-unit identification rules, which identify two sorts of utterance-units with different granularities: short and long utterance-units. Short utterance-units are identified by acoustic and prosodic disjuncture, and they are considered to constitute units of speaker’s planning and hearer’s understanding. Long utterance-units, on the other hand, are recognized by syntactic and pragmatic disjuncture, and they are regarded as units of interaction. We explore some characteristics of these utterance-units, focusing particularly on unit duration and syntactic property, other participants’ responses, and mismatch between the two-levels. We also discuss how our two-level utterance-units are useful in analyzing cognitive and communicative aspects of spoken dialogs.
%U http://www.lrec-conf.org/proceedings/lrec2010/pdf/391_Paper.pdf