MEDIAPI-SKEL - A 2D-Skeleton Video Database of French Sign Language With Aligned French Subtitles

Hannah Bull, Annelies Braffort, Michèle Gouiffès


Abstract
This paper presents MEDIAPI-SKEL, a 2D-skeleton database of French Sign Language videos aligned with French subtitles. The corpus contains 27 hours of video of body, face and hand keypoints, aligned to subtitles with a vocabulary size of 17k tokens. In contrast to existing sign language corpora such as videos produced under laboratory conditions or translations of TV programs into sign language, this database is constructed using original sign language content largely produced by deaf journalists at the media company Média-Pi. Moreover, the videos are accurately synchronized with French subtitles. We propose three challenges appropriate for this corpus that are related to processing units of signs in context: automatic alignment of text and video, semantic segmentation of sign language, and production of video-text embeddings for cross-modal retrieval. These challenges deviate from the classic task of identifying a limited number of lexical signs in a video stream.
Anthology ID:
2020.lrec-1.743
Volume:
Proceedings of the 12th Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
6063–6068
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.743
DOI:
Bibkey:
Cite (ACL):
Hannah Bull, Annelies Braffort, and Michèle Gouiffès. 2020. MEDIAPI-SKEL - A 2D-Skeleton Video Database of French Sign Language With Aligned French Subtitles. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 6063–6068, Marseille, France. European Language Resources Association.
Cite (Informal):
MEDIAPI-SKEL - A 2D-Skeleton Video Database of French Sign Language With Aligned French Subtitles (Bull et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.743.pdf