Kiyohiro Shikano
2006
Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
Tomoyuki Kato | Tomoki Toda | Hiroshi Saruwatari | Kiyohiro Shikano
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
This paper describes a novel method for reducing the transcription effort needed to construct task-adapted acoustic models for a practical automatic speech recognition (ASR) system. Training such models requires collecting actual data samples from the deployed system and transcribing them, but transcribing utterances is a time-consuming and laborious process. In the proposed method, we first adapt initial models to the acoustic environment of the system using a small number of collected data samples with transcriptions. We then automatically select informative training data samples to be transcribed from a large speech corpus based on the acoustic likelihoods of the models. We perform several experimental evaluations in the framework of Takemarukun, a practical speech-oriented guidance system. Experimental results show that 1) utterance sets with low likelihoods yield better task-adapted models than those with high likelihoods, although the set with the lowest likelihoods degrades performance because it includes outliers, and 2) MLLR adaptation is effective for training the task-adapted models when the amount of transcribed data is small, while EM training outperforms MLLR once more than around 10,000 utterances are transcribed.
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Hiromichi Kawanami | Takahiro Kitamura | Kiyohiro Shikano
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
As a practical information guidance system, we have been developing a speech-oriented system named "Takemaru-kun". The system has been operating in a public space since Nov. 2002. It answers users' questions about the hall facilities, sightseeing, transportation, weather information around the city, etc. All triggered inputs to the system have been recorded since operation started, and all system inputs over 22 months have been manually transcribed and labelled with the speaker's gender and age category. In this paper, we conduct a long-term prosody analysis of user speech to find clues for inferring a user's attitude from the user's speech. In this preliminary analysis, we observe that F0 decreases regardless of age and gender category when the stability of the dialogue system is not established.
2004
Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification
Kazuki Adachi | Tomoki Toda | Hiromichi Kawanami | Hiroshi Saruwatari | Kiyohiro Shikano
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
2002
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models
Akinobu Lee | Tatsuya Kawahara | Kazuya Takeda | Masato Mimura | Atsushi Yamada | Akinori Ito | Katsunobu Itou | Kiyohiro Shikano
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)
Designing speech database with prosodic variety for expressive TTS system
Hiromichi Kawanami | Tsuyoshi Masuda | Tomoki Toda | Kiyohiro Shikano
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)
2000
IPA Japanese Dictation Free Software Project
Katsunobu Itou | Kiyohiro Shikano | Tatsuya Kawahara | Kazuya Takeda | Atsushi Yamada | Akinori Itou | Takehito Utsuro | Tetsunori Kobayashi | Nobuaki Minematsu | Mikio Yamamoto | Shigeki Sagayama | Akinobu Lee
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)
1994
A Large-Vocabulary Continuous Speech Recognition Algorithm and Its Application to a Multi-Modal Telephone Directory Assistance System
Yasuhiro Minami | Kiyohiro Shikano | Osamu Yoshioka | Satoshi Takahashi | Tomokazu Yamada | Sadaoki Furui
Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994
1992
Recent Topics in Speech Recognition Research at NTT Laboratories
Sadaoki Furui | Kiyohiro Shikano | Shoichi Matsunaga | Tatsuo Matsuoka | Satoshi Takahashi | Tomokazu Yamada
Speech and Natural Language: Proceedings of a Workshop Held at Harriman, New York, February 23-26, 1992
1990
Co-authors
- Hiromichi Kawanami 3
- Tomoki Toda 3
- Sadaoki Furui 2
- Katsunobu Itou 2
- Tatsuya Kawahara 2
- Akinobu Lee 2
- Hiroshi Saruwatari 2
- Satoshi Takahashi 2
- Kazuya Takeda 2
- Atsushi Yamada 2
- Tomokazu Yamada 2
- Kazuki Adachi 1
- Akinori Ito 1
- Akinori Itou 1
- Tomoyuki Kato 1
- Takeshi Kawabata 1
- Takahiro Kitamura 1
- Tetsunori Kobayashi 1
- Katsuteru Maruyama 1
- Tsuyoshi Masuda 1
- Shoichi Matsunaga 1
- Tatsuo Matsuoka 1
- Masato Mimura 1
- Yasuhiro Minami 1
- Nobuaki Minematsu 1
- Masami Nakamura 1
- Shigeki Sagayama 1
- Takehito Utsuro 1
- Mikio Yamamoto 1
- Osamu Yoshioka 1