YouTube-ASL Clip Keypoint Dataset

Jul 1, 2024

Qualified relations

Zelezny, Tomas

;

Hruz, Marek

;

Straka, Jaub

;

Gueuwou, Shester

University of Western Bohemia, Pilsen (Publisher)

Resource type

Text

Subjects

sign language

sign language translation

machine translation

pose estimation

keypoint detection

visual language

inclusive technology

Descriptions

The YouTube-ASL Clip Keypoint Dataset is a curated collection of sentence-level American Sign Language (ASL) keypoint sequences derived from publicly available YouTube videos. Rather than providing raw video files, the dataset consists solely of JSON files containing frame-by-frame 2D keypoints extracted from segmented clips of individual signed sentences. Each frame has been processed using MediaPipe, which generates 208 2D keypoints representing body, face, hands, and pose landmarks. These keypoint sequences provide a compact, privacy-preserving representation of ASL visual-linguistic content, enabling research in sign language recognition, gesture analysis, and multimodal communication. The dataset consists of 390 547 json files zipped in 10 separate zip files for easier manipulation. Beside the keypoint files, we also provide the annotation json files.

Additional details

Primary language	American sign language
Related resources	This dataset is source of GitHub - zeleznyt/T5_for_SLT IRI https://github.com/zeleznyt/T5_for_SLT

Files

Name	Size
raw_keypoints_10.zip MD5: 0019b603f9ebd7594fce8fae8dc65167	37.1 GB
raw_keypoints_2.zip MD5: c891a51901ca17fa6f42529e5657df67	37.3 GB
raw_keypoints_6.zip MD5: 403958646966402245f69f9d473c4346	37.4 GB
YT.translations.all.json MD5: 34b884205edf609baa97f92bc64cbd4c	58.9 MB
raw_keypoints_1.zip MD5: 75223dd5e7e9b6ccb9f34c5792fc1e6d	37.5 GB
raw_keypoints_9.zip MD5: 9d52caab1aa2db4218188819485f92ab	37.4 GB
raw_keypoints_7.zip MD5: 786fe0067d1e4d8665b1ddbb8628a17c	37.5 GB
YT.translations.train.json MD5: 7b7b36e50f384f09aea7cf818c00b83c	53.1 MB
raw_keypoints_4.zip MD5: 890dd2fa779b56cd90c6ae39fa67faef	37.3 GB
raw_keypoints_3.zip MD5: 0713086c142d52c31cff1b4be9f4f82a	37.4 GB
raw_keypoints_8.zip MD5: b6bcc2e8517c2dcf8347347bbd74800c	37.3 GB
raw_keypoints_5.zip MD5: e935642e8bb5f9b594af74a8ba75d97f	37.4 GB
YT.translations.dev.json MD5: 26e96e563f0c50f933b021c8089eeece	5.8 MB

GitHub - zeleznyt/T5_for_SLT