Video Retrieval System Using Automatic Speech Recognition for the Japanese Diet

Mikitaka Masuyama, Tatsuya Kawahara, Kenjiro Matsuda


Abstract
The Japanese House of Representatives, one of the two houses of the Diet, has adopted an Automatic Speech Recognition (ASR) system that directly transcribes parliamentary speech with an accuracy of 95 percent. The ASR system also provides a timestamp for every word, which makes it possible to retrieve the corresponding video segments of parliamentary meetings. The video retrieval system we have developed lets users pinpoint and play the video clips that correspond to passages in the meeting minutes by keyword search. In this paper, we give an overview of the system and suggest various ways it can be used. The system is currently being extended to cover meetings of local governments, which will allow us to investigate dialectal linguistic variation.
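The idea of using word-level timestamps to jump from a keyword to a video position can be illustrated with a minimal sketch. It is not the authors' implementation: the `Word` record, the `find_clips` function, the `padding` parameter, and the sample transcript are all hypothetical, and exact string matching stands in for whatever search the actual system performs.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One recognized word with its position in the video (hypothetical schema)."""
    text: str
    start: float  # seconds from the beginning of the video
    end: float    # seconds from the beginning of the video


def find_clips(words, keyword, padding=2.0):
    """Return a (start, end) playback window for each exact match of keyword.

    `padding` widens each window so that playback includes some context;
    the start is clamped so it never falls before the beginning of the video.
    """
    clips = []
    for word in words:
        if word.text == keyword:
            clips.append((max(0.0, word.start - padding), word.end + padding))
    return clips


# Invented transcript fragment, for illustration only.
transcript = [
    Word("budget", 12.5, 13.0),
    Word("committee", 13.0, 13.75),
    Word("budget", 47.0, 47.5),
]

clips = find_clips(transcript, "budget")
for start, end in clips:
    print(f"play from {start:.2f}s to {end:.2f}s")
```

Each returned window could be handed to a video player as a seek position and a stop position. A production system would presumably also need morphological analysis for Japanese text and merging of overlapping windows, neither of which is attempted here.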
Anthology ID:
2024.parlaclarin-1.21
Volume:
Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Darja Fiser, Maria Eskevich, David Bordon
Venues:
ParlaCLARIN | WS
Publisher:
ELRA and ICCL
Pages:
145–148
URL:
https://aclanthology.org/2024.parlaclarin-1.21
Cite (ACL):
Mikitaka Masuyama, Tatsuya Kawahara, and Kenjiro Matsuda. 2024. Video Retrieval System Using Automatic Speech Recognition for the Japanese Diet. In Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024, pages 145–148, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Video Retrieval System Using Automatic Speech Recognition for the Japanese Diet (Masuyama et al., ParlaCLARIN-WS 2024)
PDF:
https://aclanthology.org/2024.parlaclarin-1.21.pdf