
JTA Dataset

License: CC BY-NC 4.0

JTA (Joint Track Auto) is a huge dataset for pedestrian pose estimation and tracking in urban scenarios created by exploiting the highly photorealistic video game Grand Theft Auto V. We collected a set of 512 full-HD videos (256 for training and 256 for testing), 30 seconds long, recorded at 30 fps. The dataset was created with this tool.


Obtain the Dataset

To obtain the Dataset, you first need the JTA-Key: please send an email to Ing. Matteo Fabbri (with the subject "JTA Dataset Download") stating:

  • Your name, title and affiliation

  • Your intended use of the data

  • The following statement:

    With this email we declare that we will use the JTA Dataset for research and educational purposes only, since we are aware that commercial use is prohibited. We also undertake to purchase a copy of Grand Theft Auto V.

We will promptly reply with the JTA-Key.

How to Start

  • Clone the JTA-Dataset repo and run download_data.py to download videos and annotations.

    git clone https://github.com/fabbrimatteo/JTA-Dataset.git
    cd JTA-Dataset
    pip install -r requirements.txt
    python download_data.py
  • To start the download, you will be asked to enter the JTA-Key we sent you by email.

  • If you want to manually download the data, use the following link:

JTA-Dataset Contents

After the data download, your JTA-Dataset directory will contain the following files:

  • annotations: directory with dataset annotations

    • annotations/train: directory with 256 JSON files, one for each training sequence
    • annotations/test: directory with 128 JSON files, one for each testing sequence
    • annotations/val: directory with 128 JSON files, one for each validation sequence
  • videos: directory with dataset videos

    • videos/train: directory with 256 videos (.MP4), one for each training sequence
    • videos/test: directory with 128 videos (.MP4), one for each testing sequence
    • videos/val: directory with 128 videos (.MP4), one for each validation sequence
  • to_imgs.py: Python script that splits each video into its frames and saves them in a specified directory with the desired format (default = JPG)

    • requires Python >= 3.6 (see requirements.txt for more details)
    • use python to_imgs.py --help to read the help message
    • use option --img_format='png' for better quality
    • each frame has a size of 1920×1080 px
    • usage example:
      python to_imgs.py --out_dir_path='frames' --img_format='jpg'
  • to_poses.py: Python script that splits the per-sequence annotations into per-frame annotations and saves them in a specified directory with the desired format (default = numpy)

    • requires Python >= 3.6 (see requirements.txt for more details)
    • use python to_poses.py --help to read the help message
    • use option --format=torch to save the annotations using torch
    • usage example:
      python to_poses.py --out_dir_path='poses' --format='numpy'

    WARNING: before loading these annotations you have to import Joint from joint.py and Pose from pose.py (see the loading sketch after this list).

  • visualize.py: Python script that provides a visual representation of the annotations.

    • requires Python >= 3.6 (see requirements.txt for more details)
    • use python visualize.py --help to read the help message
    • usage example:
      python visualize.py --in_mp4_file_path='videos/train/seq_42.mp4' --json_file_path='annotations/train/seq_42.json' --out_mp4_file_path='vis_ann/seq_42.mp4'
  • coco_style_convert.py: Python script for annotation conversion (from JTA format to COCO format).

    • requires Python >= 3.6 (see requirements.txt for more details)
    • use python coco_style_convert.py --help to read the help message
    • usage example:
      python coco_style_convert.py --out_dir_path='coco_annotations'
  • joint.py and pose.py: support classes for the scripts.
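
The warning under to_poses.py implies that the per-frame files contain pickled objects, so Joint and Pose must be importable before loading them. A minimal loading sketch (the output directory comes from the usage example above; the per-frame file naming inside it is an assumption):

    import numpy as np

    # Joint and Pose must be on the import path before loading (see the warning above)
    from joint import Joint
    from pose import Pose

    # hypothetical path: 'poses' is the --out_dir_path from the usage example;
    # the sequence/frame file naming inside it is an assumption
    frame_poses = np.load('poses/seq_42/1.npy', allow_pickle=True)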

Annotations

Each annotation file refers to a specific sequence (e.g. seq_42.json is the annotation file of seq_42.mp4). An annotation file consists of a list of lists, i.e. a matrix with N rows and 10 columns. Each row of the matrix contains the data of a single joint, organized as follows:

Element   Name            Description
row[0]    frame number    number of the frame to which the joint belongs
row[1]    person ID       unique identifier of the person to which the joint belongs
row[2]    joint type      identifier of the type of joint; see 'Joint Types' subsection
row[3]    x2D             2D x coordinate of the joint in pixels
row[4]    y2D             2D y coordinate of the joint in pixels
row[5]    x3D             3D x coordinate of the joint in meters
row[6]    y3D             3D y coordinate of the joint in meters
row[7]    z3D             3D z coordinate of the joint in meters
row[8]    occluded        1 if the joint is occluded; 0 otherwise
row[9]    self-occluded   1 if the joint is occluded by its owner; 0 otherwise
  • Note #1: 2D coordinates are relative to the top left corner of the frame, while 3D coordinates are in the standard camera coordinate system.
  • Note #2: frames are counted starting from 1.
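
Using the column layout above, for example, visible joints can be selected with a simple boolean mask once the annotation matrix has been loaded as a numpy array (see the 'Annotation Management' section below):

    import json
    import numpy as np

    with open('annotations/train/seq_42.json', 'r') as json_file:
        matrix = np.array(json.load(json_file))

    # joints that are neither occluded (row[8]) nor self-occluded (row[9])
    visible = matrix[(matrix[:, 8] == 0) & (matrix[:, 9] == 0)]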

Camera

Each sequence was recorded with the same camera, which has the following intrinsic matrix:

WARNING: on 21 February 2019 we changed the annotations, converting the 3D coordinates to the standard camera coordinate system. Here you can download the new annotations separately.
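
Since the 3D coordinates are in the standard camera coordinate system, a joint can be projected onto the image plane with the usual pinhole model. A minimal sketch (fx, fy, cx, cy below are placeholders, not the actual JTA intrinsics; substitute the real values from the matrix above):

    # placeholder intrinsics: substitute the actual values from the matrix above
    fx, fy = 1000.0, 1000.0
    cx, cy = 960.0, 540.0  # principal point assumed at the center of a 1920x1080 frame

    def project(x3d, y3d, z3d):
        """Project a 3D joint (camera coordinates, meters) to 2D pixel coordinates."""
        u = fx * x3d / z3d + cx
        v = fy * y3d / z3d + cy
        return u, v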

Joint Types

The associations between numerical identifier and type of joint are the following:

 0: head_top
 1: head_center
 2: neck
 3: right_clavicle
 4: right_shoulder
 5: right_elbow
 6: right_wrist
 7: left_clavicle
 8: left_shoulder
 9: left_elbow
10: left_wrist
11: spine0
12: spine1
13: spine2
14: spine3
15: spine4
16: right_hip
17: right_knee
18: right_ankle
19: left_hip
20: left_knee
21: left_ankle
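
For convenience, the same mapping can be kept as a plain Python list indexed by the numerical identifier (a small helper, not part of the provided scripts):

    # joint names indexed by their numerical identifier
    JOINT_NAMES = [
        'head_top', 'head_center', 'neck',
        'right_clavicle', 'right_shoulder', 'right_elbow', 'right_wrist',
        'left_clavicle', 'left_shoulder', 'left_elbow', 'left_wrist',
        'spine0', 'spine1', 'spine2', 'spine3', 'spine4',
        'right_hip', 'right_knee', 'right_ankle',
        'left_hip', 'left_knee', 'left_ankle',
    ]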

Annotation Management - Example

  • Read the data of an annotation file and convert it to a numpy array:

    import json
    import numpy as np
    
    json_file_path = 'annotations/train/seq_42.json'
    
    with open(json_file_path, 'r') as json_file:
        matrix = json.load(json_file)
        matrix = np.array(matrix)
  • Get data of frame #118:

    frame_data = matrix[matrix[:, 0] == 118]
  • Get data of all the joints of person with ID=3 in frame #118:

    person_data = frame_data[frame_data[:, 1] == 3]
  • Note: we have chosen to save our annotations in JSON format for portability reasons. However, if you want to speed up data loading in Python, we suggest converting the annotations to numpy format:

    np.save(json_file_path.replace('json', 'npy'), matrix)
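
  • The saved file can then be reloaded directly, skipping the JSON parsing step:

    matrix = np.load(json_file_path.replace('json', 'npy'))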
    

COCO-Style Annotations

You can convert our annotations to COCO format using the coco_style_convert.py script, but note that occlusion, tracking and 3D information is not available in that format.
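
Once converted, the output can be inspected with the standard pycocotools API. A minimal sketch (the file name inside the output directory is an assumption; check what coco_style_convert.py actually writes):

    from pycocotools.coco import COCO

    # hypothetical file name inside the chosen --out_dir_path
    coco = COCO('coco_annotations/jta_train.json')
    img_ids = coco.getImgIds()
    anns = coco.loadAnns(coco.getAnnIds(imgIds=[img_ids[0]]))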

Important Note

This dataset was introduced in the paper "Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World" (ECCV 2018). In the experimental section of the paper, when referring to the "test set", we mean the set consisting of the test and val directories of the JTA Dataset.

Citation

We believe in open research and we are happy if you find this data useful.
If you use it, please cite our work.

@inproceedings{fabbri2018learning,
   title     = {Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World},
   author    = {Fabbri, Matteo and Lanzi, Fabio and Calderara, Simone and Palazzi, Andrea and Vezzani, Roberto and Cucchiara, Rita},
   booktitle = {European Conference on Computer Vision (ECCV)},
   year      = {2018}
 }

License

JTA-Dataset is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
