Resources for computer-based sign recognition from video, and the criticality of consistency of gloss labeling across multiple large ASL video corpora
Files
Published version
Date
2022-06-25
DOI
Authors
Neidle, Carol
Opoku, Augustine
Ballard, Carey
Dafnis, Konstantinos M.
Chroni, Evgenia
Metaxas, Dimitri
Version
Published version
OA Version
Citation
C. Neidle, A. Opoku, C. Ballard, K. Dafnis, E. Chroni, D. Metaxas. 2022. "Resources for Computer-Based Sign Recognition from Video, and the Criticality of Consistency of Gloss Labeling across Multiple Large ASL Video Corpora." LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources. LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources. Marseille, France, 2022-06-25 - 2022-06-25.
Abstract
The WLASL purports to be “the largest video dataset for Word-Level American Sign Language (ASL) recognition.” It brings together various publicly shared video collections that could be quite valuable for sign recognition research, and it has been used extensively for such research. However, a critical problem with the accompanying annotations has heretofore not been recognized by the authors, nor by those who have exploited these data: There is no 1-1 correspondence between sign productions and gloss labels. Here we describe a large, linguistically annotated, video corpus of citation-form ASL signs shared by the ASLLRP—with 23,452 sign tokens and an online Sign Bank—in which such correspondences are enforced. We furthermore provide annotations for 19,672 of the WLASL video examples consistent with ASLLRP glossing conventions. For those wishing to use WLASL videos, this provides a set of annotations making it possible: (1) to use those data reliably for computational research; and/or (2) to combine the WLASL and ASLLRP datasets, creating a combined resource that is larger and richer than either of those datasets individually, with consistent gloss labeling for all signs. We also offer a summary of our own sign recognition research to date that exploits these data resources.
Description
License
© European Language Resources Association (ELRA). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License