Jul 1, 2024
YouTube-ASL Clip Keypoint Dataset
- Zelezny, Tomas; Hruz, Marek; Straka, Jakub; Gueuwou, Shester
- University of Western Bohemia, Pilsen (Publisher)
- Text
- sign language; sign language translation; machine translation; pose estimation; keypoint detection; visual language; inclusive technology
- Descriptions
The YouTube-ASL Clip Keypoint Dataset is a curated collection of sentence-level American Sign Language (ASL) keypoint sequences derived from publicly available YouTube videos. The dataset contains no raw video; it consists solely of JSON files with frame-by-frame 2D keypoints extracted from clips segmented into individual signed sentences. Each frame was processed with MediaPipe, yielding 208 2D keypoints that cover body pose, face, and hand landmarks. These keypoint sequences provide a compact, privacy-preserving representation of ASL visual-linguistic content, enabling research in sign language recognition, gesture analysis, and multimodal communication. The dataset comprises 390,547 JSON files, packed into 10 separate zip archives for easier handling. Besides the keypoint files, we also provide the annotation JSON files.
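Because the keypoints ship as JSON files inside zip archives, a first sanity check after downloading is to count the JSON members of an archive and parse one of them. The following is a minimal sketch using only the Python standard library. It makes no assumption about the internal JSON schema, which is not documented here; the function names and the archive path in the usage comment are hypothetical.

```python
import json
import zipfile


def count_json_members(zip_path):
    """Return the number of .json entries stored in one zip archive."""
    with zipfile.ZipFile(zip_path) as archive:
        return sum(1 for name in archive.namelist() if name.lower().endswith(".json"))


def load_first_json(zip_path):
    """Parse the first .json entry (in sorted name order) without extracting the archive.

    Returns a (member_name, parsed_data) tuple. The structure of parsed_data
    depends on the dataset's schema, which this sketch does not assume.
    """
    with zipfile.ZipFile(zip_path) as archive:
        names = sorted(n for n in archive.namelist() if n.lower().endswith(".json"))
        if not names:
            raise ValueError(f"no JSON files found in {zip_path}")
        with archive.open(names[0]) as handle:
            return names[0], json.load(handle)


# Hypothetical usage; replace the path with a downloaded archive:
# print(count_json_members("keypoints_part_01.zip"))
# name, data = load_first_json("keypoints_part_01.zip")
# print(name, type(data))
```

Reading members directly from the archive avoids unpacking tens of gigabytes just to inspect the layout of a single file.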
Files
MD5                               Size
0019b603f9ebd7594fce8fae8dc65167  37.1 GB
c891a51901ca17fa6f42529e5657df67  37.3 GB
403958646966402245f69f9d473c4346  37.4 GB
34b884205edf609baa97f92bc64cbd4c  58.9 MB
75223dd5e7e9b6ccb9f34c5792fc1e6d  37.5 GB
9d52caab1aa2db4218188819485f92ab  37.4 GB
786fe0067d1e4d8665b1ddbb8628a17c  37.5 GB
7b7b36e50f384f09aea7cf818c00b83c  53.1 MB
890dd2fa779b56cd90c6ae39fa67faef  37.3 GB
0713086c142d52c31cff1b4be9f4f82a  37.4 GB
b6bcc2e8517c2dcf8347347bbd74800c  37.3 GB
e935642e8bb5f9b594af74a8ba75d97f  37.4 GB
26e96e563f0c50f933b021c8089eeece  5.8 MB
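With archives of roughly 37 GB each, it is worth comparing every download against the MD5 value listed above before unpacking. A minimal sketch using Python's `hashlib` follows. It reads the file in chunks so the whole archive never has to fit in memory; the function names and the file name in the usage comment are hypothetical, since the archive names are not shown here.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (end of file)
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected_md5):
    """Return True if the file's MD5 matches the expected hex digest (case-insensitive)."""
    return md5_of_file(path) == expected_md5.strip().lower()


# Hypothetical usage; the file name is a placeholder, the checksum is the first one listed:
# ok = verify_md5("keypoints_part_01.zip", "0019b603f9ebd7594fce8fae8dc65167")
# print("checksum OK" if ok else "checksum mismatch, download again")
```

MD5 is adequate here for detecting truncated or corrupted downloads; it is not a guarantee against deliberate tampering.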