Kinetics Dataset
This dataset is available on Hyak Klone and Tillicum.
Sponsoring groups are Rajesh Rao, Abhishek Gupta, and Ali Farhadi. Student users are Vishwas Sathish, Chuning Zhu, and Aditya Kusupati. The dataset was first deployed on Klone in April 2024 and on Tillicum in September 2025.
What is this?
Kinetics is a collection of large-scale, high-quality datasets of URL links to up to 650,000 video clips covering 400, 600, or 700 human action classes, depending on the dataset version. You can learn more from the project's GitHub repository or from the paper cited at the end of this page.
How to prepare for use?
Install decord in your Python environment to read the video files. Other libraries can also read video frames; popular ones include OpenCV, Python bindings for ffmpeg, and skvideo.
from decord import VideoReader, cpu
# load a single file of the dataset
video_clip_path = "/<path>/<to>/<video_folder>/LABEL/IDX-start_time-end_time.mp4"
# open the clip for frame-level access, decoding on the CPU
vr = VideoReader(video_clip_path, num_threads=-1, ctx=cpu(0))
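Once a clip is open, a common next step is to decode only a handful of evenly spaced frames instead of the whole video. The sketch below shows one way to do that. The helper names uniform_frame_indices and load_sampled_frames and the default of 8 frames are illustrative assumptions, not part of the dataset or of decord; the decord calls used are VideoReader, len(vr), and get_batch(...).asnumpy().

```python
from typing import List


def uniform_frame_indices(num_frames: int, num_samples: int) -> List[int]:
    """Return num_samples frame indices spread evenly over [0, num_frames)."""
    if num_frames <= 0 or num_samples <= 0:
        return []
    # Take the centre of each of num_samples equal-width segments.
    step = num_frames / num_samples
    return [min(num_frames - 1, int(step * i + step / 2)) for i in range(num_samples)]


def load_sampled_frames(video_clip_path: str, num_samples: int = 8):
    """Decode num_samples evenly spaced frames as a (T, H, W, 3) uint8 array."""
    # Imported here so the index helper above works without decord installed.
    from decord import VideoReader, cpu

    vr = VideoReader(video_clip_path, ctx=cpu(0))
    indices = uniform_frame_indices(len(vr), num_samples)
    return vr.get_batch(indices).asnumpy()
```

If a clip has fewer frames than num_samples, some indices repeat, so the returned array always has num_samples frames.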
How to access?
The Kinetics dataset is located at /data/kinetics on Klone.
The Kinetics dataset is located at /gpfs/datasets/kinetics on Tillicum.
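To work with many clips at once, it can help to build an index of clips per action label. The sketch below assumes the layout implied by the example path in the preparation section, namely a video folder containing one directory per label with clips named IDX-start_time-end_time.mp4. Where that video folder sits under the dataset root is not stated here, so check it on the cluster first. The function names index_clips_by_label and parse_clip_name are hypothetical helpers.

```python
from pathlib import Path
from typing import Dict, List, Tuple


def index_clips_by_label(video_folder: str) -> Dict[str, List[Path]]:
    """Map each label directory name to its sorted list of .mp4 clips."""
    index: Dict[str, List[Path]] = {}
    for label_dir in sorted(Path(video_folder).iterdir()):
        if label_dir.is_dir():
            index[label_dir.name] = sorted(label_dir.glob("*.mp4"))
    return index


def parse_clip_name(clip_path: Path) -> Tuple[str, str, str]:
    """Split 'IDX-start_time-end_time.mp4' into (idx, start_time, end_time) strings."""
    # Split from the right, since the video ID may itself contain '-'.
    # Assumes the naming pattern above; other names raise ValueError.
    idx, start_time, end_time = clip_path.stem.rsplit("-", 2)
    return idx, start_time, end_time
```

For example, index_clips_by_label("/<path>/<to>/<video_folder>") returns a dictionary keyed by label, and parse_clip_name can then be applied to any clip path in it. The times are returned as strings because their exact format is not specified here.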
The Kinetics dataset is licensed by Google Inc. under a Creative Commons Attribution 4.0 International License. It was published on May 22, 2017.
How to cite?
If you use the Kinetics dataset or any of its components, please cite:
@article{kay2017kinetics,
title={The Kinetics Human Action Video Dataset},
author={Will Kay and Joao Carreira and Karen Simonyan and Brian Zhang and Chloe Hillier and Sudheendra Vijayanarasimhan and Fabio Viola and Tim Green and Trevor Back and Apostol Natsev and Mustafa Suleyman and Andrew Zisserman},
journal={ArXiv},
year={2017},
volume={abs/1705.06950},
}