Jul 1, 2024

YouTube-ASL Clip Keypoint Dataset


Zelezny, Tomas
;
Hruz, Marek
;
Straka, Jaub
;
Gueuwou, Shester
University of Western Bohemia, Pilsen  (Publisher)
Text
sign language
sign language translation
machine translation
pose estimation
keypoint detection
visual language
inclusive technology
Descriptions
The YouTube-ASL Clip Keypoint Dataset is a curated collection of sentence-level American Sign Language (ASL) keypoint sequences derived from publicly available YouTube videos. Rather than providing raw video files, the dataset consists solely of JSON files containing frame-by-frame 2D keypoints extracted from segmented clips of individual signed sentences. Each frame has been processed using MediaPipe, which generates 208 2D keypoints representing body, face, hands, and pose landmarks. These keypoint sequences provide a compact, privacy-preserving representation of ASL visual-linguistic content, enabling research in sign language recognition, gesture analysis, and multimodal communication. The dataset consists of 390 547 json files zipped in 10 separate zip files for easier manipulation. Beside the keypoint files, we also provide the annotation json files.

Additional details

Primary language American sign language
Related resources
This dataset is source of
GitHub - zeleznyt/T5_for_SLT

Files

Name Size
MD5: 0019b603f9ebd7594fce8fae8dc65167
37.1 GB
MD5: c891a51901ca17fa6f42529e5657df67
37.3 GB
MD5: 403958646966402245f69f9d473c4346
37.4 GB
MD5: 34b884205edf609baa97f92bc64cbd4c
58.9 MB
MD5: 75223dd5e7e9b6ccb9f34c5792fc1e6d
37.5 GB
MD5: 9d52caab1aa2db4218188819485f92ab
37.4 GB
MD5: 786fe0067d1e4d8665b1ddbb8628a17c
37.5 GB
MD5: 7b7b36e50f384f09aea7cf818c00b83c
53.1 MB
MD5: 890dd2fa779b56cd90c6ae39fa67faef
37.3 GB
MD5: 0713086c142d52c31cff1b4be9f4f82a
37.4 GB
MD5: b6bcc2e8517c2dcf8347347bbd74800c
37.3 GB
MD5: e935642e8bb5f9b594af74a8ba75d97f
37.4 GB
MD5: 26e96e563f0c50f933b021c8089eeece
5.8 MB