Curated Dataset Structure
Once a dataset has been curated, you can download it as a ZIP file or view it in your S3 storage location. The contents of a curated dataset are described below.
| File/Folder Name | Description |
|---|---|
| | Directory containing the curated clips |
| | Directory containing the embeddings for the clips |
| | Directory containing the captions for each clip. A clip and its captions share the same UUID in their file names. |
| | Directory containing the WebP previews of the curated clips |
| | Directory containing processed video file segment names |
| | Directory containing processed video file names |
| | JSON file containing a detailed metadata summary of the videos processed and clips generated, some statistics, and per-video and per-clip metadata |
| | Directory containing the same captions as |
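
Because a clip and its captions share a UUID in their file names, the two directories can be joined on that identifier. The following is a minimal sketch, not part of the product: `pair_clips_with_captions` is a hypothetical helper, the directory paths are supplied by the caller, and it assumes that the file name without its extension is exactly the shared UUID. Adjust the matching if your file names carry extra prefixes or suffixes.

```python
from pathlib import Path


def pair_clips_with_captions(clips_dir, captions_dir):
    """Map each shared file stem to its (clip_path, caption_path) pair.

    Assumption: the stem (file name without extension) is the clip UUID
    and is identical for a clip and its caption file.
    """
    clips = {p.stem: p for p in Path(clips_dir).iterdir() if p.is_file()}
    captions = {p.stem: p for p in Path(captions_dir).iterdir() if p.is_file()}

    # Keep only the identifiers present in both directories.
    shared = sorted(clips.keys() & captions.keys())
    return {uuid: (clips[uuid], captions[uuid]) for uuid in shared}
```

Call it with the paths to the clips directory and the captions directory from the extracted ZIP file. Clips without a matching caption file are left out of the result, which makes missing captions easy to spot by comparing counts.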