Giter Site home page Giter Site logo

modanet's Introduction

License: CC BY-NC 4.0

ModaNet

logo

Table of Contents

ModaNet is a street fashion images dataset consisting of annotations related to RGB images. ModaNet provides multiple polygon annotations for each image. This dataset is described in a technical paper with the title ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations. Each polygon is associated with a label from 13 meta fashion categories. The annotations are based on images in the PaperDoll image set, which has only a few hundred images annotated by the superpixel-based tool. The contribution of ModaNet is to provide new and extra polygon annotations for the images.

Thanks to the EvalAI team, we have ModaNet challenge leaderboard, please submit your best results there. It provides leaderboards for semantic segmentation, instance segmentation, and object detection, all in COCO-style. We also provide the per-class IOU score for semantic segmentation task in stdout File. You might need to register an account in EvalAI.

Why we made ModaNet

ModaNet is intended to serve an educational purpose by providing a benchmark annotation set for emerging computer vision research including semantic segmentation, object detection, instance segmentation, polygon detection, and etc.

Access to the annotations git repo

Please first install Git Large File Storage by following the below instructions. You might need to apply this step before you download the sql file from paperdoll repo.

https://help.github.com/articles/installing-git-large-file-storage/

Then you should git clone the repo along with the annotation files, otherwise the annotation files would not be downloaded directly using git clone.

git clone [email protected]:eBay/modanet.git 

Before you start working on the dataset, please verify the annotation files by using md5sum. In Mac, this requires you to run the following command:

md5 modanet2018_instances_train.json

you should expect to see the following md5 value:

MD5 (modanet2018_instances_train.json) = 96478657d20e322e9d3282c6d73c0c4c

If you apply the same command to check the val annotation, you should expect to see:

MD5 (modanet2018_instances_val.json) = 900b24b7d6c0c48203e6244f45d65499

Notice the file size:

modanet2018_instances_train.json (101MB)
modanet2018_instances_val.json (210KB)

Labels

Each polygon (bounding box, segmentation mask) annotation is assigned to one of the following labels:

Label Description Fine-Grained-categories
1 bag bag
2 belt belt
3 boots boots
4 footwear footwear
5 outer coat/jacket/suit/blazers/cardigan/sweater/Jumpsuits/Rompers/vest
6 dress dress/t-shirt dress
7 sunglasses sunglasses
8 pants pants/jeans/leggings
9 top top/blouse/t-shirt/shirt
10 shorts shorts
11 skirt skirt
12 headwear headwear
13 scarf & tie scartf & tie

The annotation data format of ModaNet follows the same style as COCO-dataset.

Data format

{
'info' : info, 'images' : [image], 'annotations' : [annotation], 'licenses' : [license],'year': year, 'categories': [category], 'type': type
}

info{
'version' : str, 'description' : str, 'contributor' : str, 'date_created' : datetime,
}

image{
'id' : int, 'width' : int, 'height' : int, 'file_name' : str, 'license' : int
}

license{
'id' : int, 'name' : str, 'url' : str,
}

annotation{
  'area': int, 
  'bbox': [x,y,width,height],
  'segmentation': [polygon],
  'image_id': int,
  'id': int,
  'category_id': int,
  'iscrowd': int
}
category{
  'supercategory': str, 'id': int, 'name': str,
}

Submitting results to leaderboard

You can participate only the Object Detection task by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'bbox' : [x,y,width,height], 'score' : float,
}]

Example

[{'bbox': [192, 30, 20, 28],
  'category_id': 13,
  'image_id': 100014,
  'score': 0.8}]

You can participate only the Instance Segmentation/Semantic Segmentation/Polygon prediction tasks by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'segmentation' : polygon, 'score' : float,
}]

Example

[{'segmentation': [[210,
    31,
    212,
    35,
    204,
    37,
    204,
    45,
    205,
    54,
    199,
    58,
    194,
    52,
    198,
    42,
    192,
    32,
    194,
    30,
    201,
    33]],
  'category_id': 13,
  'image_id': 100014,
  'score': 0.8 }]

You can participate the task of joint detection and segmentation by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'segmentation' : polygon, 'score' : float, 'bbox' : [x,y,width,height]
}]

Example

[{'bbox': [192, 30, 20, 28],
  'category_id': 13,
  'image_id': 100014,
  'segmentation': [[210,
    31,
    212,
    35,
    204,
    37,
    204,
    45,
    205,
    54,
    199,
    58,
    194,
    52,
    198,
    42,
    192,
    32,
    194,
    30,
    201,
    33]],
  'score': 0.8}]

We acknowledge the contribution of COCOdataset team and all the format would follow the same style as those in the COCOdataset. Check COCOAPI.

Contributing

You are more than welcome to contribute to this github repo! Either by submitting a bug report, or providing feedback about this dataset. Please open issues for specific tasks or post to the contact Google group below.

Contact

To discuss the dataset, please contact Moda-net Google Group.

Citing ModaNet

If you use ModaNet, we would appreciate reference to the following paper:

Shuai Zheng, Fan Yang, M. Hadi Kiapour, Robinson Piramuthu. ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations. ACM Multimedia, 2018. PDF 5.7MB

Biblatex entry:

@inproceedings{zheng/2018acmmm,
  author       = {Shuai Zheng and Fan Yang and M. Hadi Kiapour and Robinson Piramuthu},
  title        = {ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations},
  booktitle    = {ACM Multimedia},
  year         = {2018},
}

License

This annotation data is released under the Creative Commons Attribution-NonCommercial license 4.0.

modanet's People

Contributors

bittnt avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

modanet's Issues

Valid dataset without annotations

Hi Guys,

The validation json doesn't have any annotation for the validation images, is that right? How can I get the corresponding annotations?

Bests

Can not download annotations

Hi Thanks for your work
But I can not download annotations because of it has been bush long time ago,
Could you please offer another method to download dataset like google drive?

Careful! pictures (image_urls) in metadata might not corespond to pictures that were actually labeled

I was exploring the dataset and found a very huge (5-20%) amount of misslabeled samples.
From my quick research of the problem i found out that the image_url provided in the metadata does not link to the image that was actually labeled.
I think it happens when post_url contains multiple pictures. But im not yet sure if this is true 100% of the time.
Here is a notebook pic with examples of the problem:

https://imgur.com/a/gLUxxR0
The notebook should make it clear.

I will try to check out if actually getting the images by scraping the post_url (and always getting the first image ) instead of downloading using the url will solve the problem

Quality issue

The dataset is poorly labelled. And doesn't stand to the standards mentioned in the paper!

How to evaluate the result?

I submitted my validation set result to the EvalAI website but I cannot find the evaluation result. What should I do?

repository not able to download

I am getting below error
[email protected]: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.

Can anyone please help.

Subset of data

Hi,
Is there any subset of data? Or a version with a lower volume? it's more than 40 GB?

Matching with Paperdoll images

I am exploring the dataset, and I can not find all training images in the paperdoll dataset.
I am assuming that the matching between Modanet and Paperdoll is based on the id of the images.

Among the 52377 training images, 32413 of them are included in paperdoll, for example

However 19964 of them are missing, including for following lines

  • {'file_name': '0000003.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 3}
  • {'file_name': '0393220.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 393220}
  • {'file_name': '0655363.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 655363}
  • {'file_name': '0131080.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131080}
  • {'file_name': '0262155.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 262155}
  • {'file_name': '0000013.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 13}
  • {'file_name': '0524311.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 524311}
  • {'file_name': '0786458.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 786458}
  • {'file_name': '0131098.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131098}
  • {'file_name': '0131102.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131102}

My paperdoll metadata is "data/paperdoll_dataset.mat" from data-v1.0.tar, downloaded from http://vision.is.tohoku.ac.jp/~kyamagu/research/paperdoll/

Thank you

No segmentation masks in validation set

Hey,

i'm currently trying to train a Mask R-CNN model on the ModaNet dataset
and realised that the validation set contains no polygon annotations.
For that reason i'm not able to complete validation at the end of each epoch.

Were they left out on purpose?

Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.