Integrating Auslan Resources into the Language Data Commons of Australia

River Tae Smith, Louisa Willoughby, Trevor Johnston


Abstract
This paper describes a project to secure Auslan (Australian Sign Language) resources within a national language data network called the Language Data Commons of Australia (LDaCA). The resources are Auslan Signbank, a web-based multimedia dictionary, and the Auslan Corpus, a collection of video recordings of the language in use in various contexts, with time-aligned ELAN annotation files. We aim to make these resources accessible to the language community, encourage community participation in the curation of the data, and facilitate and extend their use in language teaching and linguistic research. The software platforms of both resources will be made compatible with other LDaCA resources, and the two will also be aggregated and linked so that (i) users of the dictionary can view attested corpus examples for an entry; and (ii) users of the corpus can instantly view the dictionary entry for an already glossed sign to check phonological, lexical and grammatical information about it, and/or to ensure that the correct annotation gloss (also known as the ‘ID-gloss’) for a sign token has been chosen. This will improve future additions to the annotations in the Auslan Corpus and to the entries in Auslan Signbank, and strengthen the integrity of research based on both.
Anthology ID:
2022.signlang-1.28
Volume:
Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Jette Kristoffersen, Johanna Mesch, Marc Schulder
Venue:
SignLang
Publisher:
European Language Resources Association
Pages:
181–186
URL:
https://aclanthology.org/2022.signlang-1.28
Cite (ACL):
River Tae Smith, Louisa Willoughby, and Trevor Johnston. 2022. Integrating Auslan Resources into the Language Data Commons of Australia. In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, pages 181–186, Marseille, France. European Language Resources Association.
Cite (Informal):
Integrating Auslan Resources into the Language Data Commons of Australia (Smith et al., SignLang 2022)
PDF:
https://aclanthology.org/2022.signlang-1.28.pdf