Relevance of ASR for the Automatic Generation of Keywords Suggestions for TV programs

Véronique Malaisé, Luit Gazendam, Willemijn Heeren, Roeland Ordelman, Hennie Brugman


Abstract
Semantic access to multimedia content in audiovisual archives is to a large extent dependent on quantity and quality of the metadata, and particularly the content descriptions that are attached to the individual items. However, the manual annotation of collections puts heavy demands on resources. A large number of archives are introducing (semi) automatic annotation techniques for generating and/or enhancing metadata. The NWO funded CATCH-CHOICE project has investigated the extraction of keywords from textual resources related to TV programs to be archived (context documents), in collaboration with the Dutch audiovisual archives, Sound and Vision. This paper investigates the suitability of Automatic Speech Recognition transcripts produced in the CATCH-CHoral project for generating such keywords, which we evaluate against manual annotations of the documents, and against keywords automatically generated from context documents describing the TV programs’ content.
Anthology ID:
2009.jeptalnrecital-court.34
Volume:
Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles courts
Month:
June
Year:
2009
Address:
Senlis, France
Venue:
JEP/TALN/RECITAL
SIG:
Publisher:
ATALA
Note:
Pages:
311–320
Language:
URL:
https://aclanthology.org/2009.jeptalnrecital-court.34
DOI:
Bibkey:
Cite (ACL):
Véronique Malaisé, Luit Gazendam, Willemijn Heeren, Roeland Ordelman, and Hennie Brugman. 2009. Relevance of ASR for the Automatic Generation of Keywords Suggestions for TV programs. In Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, pages 311–320, Senlis, France. ATALA.
Cite (Informal):
Relevance of ASR for the Automatic Generation of Keywords Suggestions for TV programs (Malaisé et al., JEP/TALN/RECITAL 2009)
Copy Citation:
PDF:
https://aclanthology.org/2009.jeptalnrecital-court.34.pdf