Giter Site home page Giter Site logo

kinetics-dataset's Introduction

Kinetics Datasets Downloader

Kinetics is a collection of large-scale, high-quality datasets of URL links of up to 650,000 video clips that cover 400/600/700 human action classes, depending on the dataset version. The videos include human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 400/600/700 video clips. Each clip is human annotated with a single action class and lasts around 10 seconds.

The Kinetics project publications can be found here: https://deepmind.com/research/open-source/kinetics.

Note that it may not be safe to train / test across different versions of the dataset, for example the k400 validation set is largely part of the k700 training set so results will not be meaningful.

Updates

5th of May: fixed k400/train/part_120.tar.gz, it was a tar file before

10th of December: add two downloader scripts for datasets automatic setup. (k400_downloader.sh and k700_2020_downloader.sh)

13th of September. K400: replaced corrupted mountain climber validation file and made available 1300+ replacement videos for existing corrupted training videos from various classes. K600: added list of links for held out test set.

4th of August 2021 -- replaced corrupted videos in the kinetics-700-2020 test set (these were typically shorter than 9s as well). There are still 5% of the videos in the test set that are shorter than 9s, but genuinely so (e.g. because they are like that in youtube).

Download Videos

CVDF currently hosts the videos in the Kinetics-400 and Kinetics-700-2020 datasets.

Kinetics-400

Kinetics-400 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k400 and k400_targz. Tar gzips will be in k400_targz, you can delete k400_targz directory after extraction.

bash ./k400_downloader.sh
Extract tar gzip files
bash ./k400_extractor.sh

Kinetics-400 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/400/train/k400_train_path.txt

https://s3.amazonaws.com/kinetics/400/val/k400_val_path.txt

https://s3.amazonaws.com/kinetics/400/test/k400_test_path.txt

It is easy to obtain a specific split (e.g. train), by:

bash download.sh k400_train_path.txt

Then, extract *.tar.gz files by:

bash extract.sh k400_train_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/400/annotations/train.csv

https://s3.amazonaws.com/kinetics/400/annotations/val.csv

https://s3.amazonaws.com/kinetics/400/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/400/readme.md

News: users found ~1400 corrupted videos. A replacement for the vast majority can be found here:

https://s3.amazonaws.com/kinetics/400/replacement_for_corrupted_k400.tgz

Kinetics-600

Kinetics-600 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k600 and k600_targz. Tar gzips will be in k600_targz, you can delete k600_targz directory after extraction.

bash ./k600_downloader.sh
Extract tar gzip files
bash ./k600_extractor.sh

Kinetics-600 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/600/train/k600_train_path.txt

https://s3.amazonaws.com/kinetics/600/val/k600_val_path.txt

https://s3.amazonaws.com/kinetics/600/test/k600_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/600/annotations/train.txt

https://s3.amazonaws.com/kinetics/600/annotations/train.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/val.txt

https://s3.amazonaws.com/kinetics/600/annotations/val.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/test.csv

https://s3.amazonaws.com/kinetics/600/annotations/kinetics600_holdout_test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/600/readme.md

Kinetics-700-2020

Kinetics-700-2020 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k700-2020 and k700-2020_targz. Tar gzips will be in k700-2020_targz, you can delete k700-2020_targz directory after extraction.

bash ./k700_2020_downloader.sh
Extract tar gzip files
bash ./k700_2020_extractor.sh

Kinetics-700-2020 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/700_2020/train/k700_2020_train_path.txt

https://s3.amazonaws.com/kinetics/700_2020/val/k700_2020_val_path.txt

https://s3.amazonaws.com/kinetics/700_2020/test/k700_2020_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/700_2020/annotations/train.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/val.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/700_2020/K700_2020_readme.txt

Downstream annotations

We also host annotations for AVA-Kinetics and Countix, which both use Kinetics-700 videos.

To download annotations for AVA-Kinetics: https://s3.amazonaws.com/kinetics/700_2020/annotations/ava_kinetics_v1_0.tar.gz

To download annotations for countix: https://s3.amazonaws.com/kinetics/700_2020/annotations/countix.tar.gz

kinetics-dataset's People

Contributors

abhinav95 avatar deepsworld avatar jerryx1110 avatar kinetics-cvdf avatar maoyunyao avatar pace577 avatar sawyermade avatar shoufachen avatar tylin avatar yakhyo avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.