Curated Dataset Structure
Once a Dataset has been curated, you can download it as a ZIP file or view it in your S3 storage location. The contents of a curated dataset are described below.
| File/Folder Name | Description |
|---|---|
| | Directory containing the curated clips |
| | Directory containing the embeddings for the clips |
| | Directory containing the captions for each clip. A clip and its captions share the same UUID in their file names. |
| | Directory containing the WebP previews of the curated clips |
| | Directory containing processed video file segment names |
| | Directory containing processed video file names |
| | JSON file containing a detailed metadata summary of the videos processed, the clips generated, and some statistics, along with video/clip metadata |
| | Directory containing the same captions as |
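
Because a clip and its captions share the same UUID in their file names, the two directories can be joined on that UUID. The following is a minimal sketch, not an official utility: `pair_clips_with_captions` is a hypothetical helper, the directory paths are passed in by the caller because the folder names are not listed here, and it assumes that each directory is flat and that the UUID is the file name without its extension.

```python
from pathlib import Path


def pair_clips_with_captions(clips_dir, captions_dir):
    """Map each shared file stem (assumed to be the clip UUID) to (clip, caption) paths.

    Hypothetical helper: the caller supplies both directory paths, and files
    are matched purely on their name without the extension.
    """
    # Index caption files by stem, i.e. the file name without its extension.
    captions = {p.stem: p for p in Path(captions_dir).iterdir() if p.is_file()}

    pairs = {}
    for clip in sorted(Path(clips_dir).iterdir()):
        # Keep only clips that have a caption file with the same stem.
        if clip.is_file() and clip.stem in captions:
            pairs[clip.stem] = (clip, captions[clip.stem])
    return pairs
```

Clips without a matching caption file are skipped rather than reported, which keeps the sketch short; a stricter version might collect them separately to flag incomplete output.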