ebay / modanet Goto Github PK

View Code? Open in Web Editor NEW

325.0 22.0 84.0 28.47 MB

ModaNet: A large-scale street fashion dataset with polygon annotations

License: Other

deep-learning computer-vision

modanet's Introduction

ModaNet

Table of Contents

Why we made ModaNet
Labels
Contributing
Contact
Citing ModaNet

ModaNet is a street fashion images dataset consisting of annotations related to RGB images. ModaNet provides multiple polygon annotations for each image. This dataset is described in a technical paper with the title ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations. Each polygon is associated with a label from 13 meta fashion categories. The annotations are based on images in the PaperDoll image set, which has only a few hundred images annotated by the superpixel-based tool. The contribution of ModaNet is to provide new and extra polygon annotations for the images.

Thanks to the EvalAI team, we have ModaNet challenge leaderboard, please submit your best results there. It provides leaderboards for semantic segmentation, instance segmentation, and object detection, all in COCO-style. We also provide the per-class IOU score for semantic segmentation task in stdout File. You might need to register an account in EvalAI.

Why we made ModaNet

ModaNet is intended to serve an educational purpose by providing a benchmark annotation set for emerging computer vision research including semantic segmentation, object detection, instance segmentation, polygon detection, and etc.

Access to the annotations git repo

Please first install Git Large File Storage by following the below instructions. You might need to apply this step before you download the sql file from paperdoll repo.

https://help.github.com/articles/installing-git-large-file-storage/

Then you should git clone the repo along with the annotation files, otherwise the annotation files would not be downloaded directly using git clone.

git clone [email protected]:eBay/modanet.git

Before you start working on the dataset, please verify the annotation files by using md5sum. In Mac, this requires you to run the following command:

md5 modanet2018_instances_train.json

you should expect to see the following md5 value:

MD5 (modanet2018_instances_train.json) = 96478657d20e322e9d3282c6d73c0c4c

If you apply the same command to check the val annotation, you should expect to see:

MD5 (modanet2018_instances_val.json) = 900b24b7d6c0c48203e6244f45d65499

Notice the file size:

modanet2018_instances_train.json (101MB)
modanet2018_instances_val.json (210KB)

Labels

Each polygon (bounding box, segmentation mask) annotation is assigned to one of the following labels:

Label	Description	Fine-Grained-categories
1	bag	bag
2	belt	belt
3	boots	boots
4	footwear	footwear
5	outer	coat/jacket/suit/blazers/cardigan/sweater/Jumpsuits/Rompers/vest
6	dress	dress/t-shirt dress
7	sunglasses	sunglasses
8	pants	pants/jeans/leggings
9	top	top/blouse/t-shirt/shirt
10	shorts	shorts
11	skirt	skirt
12	headwear	headwear
13	scarf & tie	scartf & tie

The annotation data format of ModaNet follows the same style as COCO-dataset.

Data format

{
'info' : info, 'images' : [image], 'annotations' : [annotation], 'licenses' : [license],'year': year, 'categories': [category], 'type': type
}

info{
'version' : str, 'description' : str, 'contributor' : str, 'date_created' : datetime,
}

image{
'id' : int, 'width' : int, 'height' : int, 'file_name' : str, 'license' : int
}

license{
'id' : int, 'name' : str, 'url' : str,
}

annotation{
  'area': int, 
  'bbox': [x,y,width,height],
  'segmentation': [polygon],
  'image_id': int,
  'id': int,
  'category_id': int,
  'iscrowd': int
}
category{
  'supercategory': str, 'id': int, 'name': str,
}

Submitting results to leaderboard

You can participate only the Object Detection task by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'bbox' : [x,y,width,height], 'score' : float,
}]

Example

[{'bbox': [192, 30, 20, 28],
  'category_id': 13,
  'image_id': 100014,
  'score': 0.8}]

You can participate only the Instance Segmentation/Semantic Segmentation/Polygon prediction tasks by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'segmentation' : polygon, 'score' : float,
}]

Example

[{'segmentation': [[210,
    31,
    212,
    35,
    204,
    37,
    204,
    45,
    205,
    54,
    199,
    58,
    194,
    52,
    198,
    42,
    192,
    32,
    194,
    30,
    201,
    33]],
  'category_id': 13,
  'image_id': 100014,
  'score': 0.8 }]

You can participate the task of joint detection and segmentation by submitting results as follows

[{
'image_id' : int, 'category_id' : int, 'segmentation' : polygon, 'score' : float, 'bbox' : [x,y,width,height]
}]

Example

[{'bbox': [192, 30, 20, 28],
  'category_id': 13,
  'image_id': 100014,
  'segmentation': [[210,
    31,
    212,
    35,
    204,
    37,
    204,
    45,
    205,
    54,
    199,
    58,
    194,
    52,
    198,
    42,
    192,
    32,
    194,
    30,
    201,
    33]],
  'score': 0.8}]

We acknowledge the contribution of COCOdataset team and all the format would follow the same style as those in the COCOdataset. Check COCOAPI.

Contributing

You are more than welcome to contribute to this github repo! Either by submitting a bug report, or providing feedback about this dataset. Please open issues for specific tasks or post to the contact Google group below.

Contact

To discuss the dataset, please contact Moda-net Google Group.

Citing ModaNet

If you use ModaNet, we would appreciate reference to the following paper:

Shuai Zheng, Fan Yang, M. Hadi Kiapour, Robinson Piramuthu. ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations. ACM Multimedia, 2018. PDF 5.7MB

Biblatex entry:

@inproceedings{zheng/2018acmmm,
  author       = {Shuai Zheng and Fan Yang and M. Hadi Kiapour and Robinson Piramuthu},
  title        = {ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations},
  booktitle    = {ACM Multimedia},
  year         = {2018},
}

License

This annotation data is released under the Creative Commons Attribution-NonCommercial license 4.0.

modanet's People

Contributors

Stargazers

Watchers

Forkers

bittnt fyang26 ssssssssssss annxoik i008 hrsma2i threadsstyling johndpope dalavancloud eyebies amoliu dreadlord1984 jdc08161063 xjmeng001 ghoshaw pchank hhgxx123 stonexjr zwcdp alyzandestudent obodroid mtkshu m-khosravi ferasbasha cm-jsw asteur lavanyashukla sunn-e pkusnail qa276390 simaiden daniel-m mxs30443 ahoquegh dgks0n fosterleejoe chenying99 kr1za ashwinrajendraprasad ivanliu1989 peternara ducthangqd1998 chaechunjae isazheng888 phamhe thiagoneves holyhao junetrbl vitaly63723 macshkim isabella232 tsvs chaoso nicolass03 kleinmind jaspermorrison mithi21 alyzande rayhung1122 hugotong6425 ardagun evitself jelledv4 salaheddinekouiri shabazbelim andrej1a unicornofdata otaviokamel qq1323 interpause waterwheel31 omkarwalve hmd78 rimihazra qpc-github quantum-platinum-cloud shizidushu linux-devil jerry-maurice

modanet's Issues

Valid dataset without annotations

Hi Guys,

The validation json doesn't have any annotation for the validation images, is that right? How can I get the corresponding annotations?

Bests

Can not download annotations

Hi Thanks for your work
But I can not download annotations because of it has been bush long time ago,
Could you please offer another method to download dataset like google drive?

Facing issue where to get this file'../PaperDoll/labels/paperdoll.json'

where to get this paperdoll.json in https://github.com/hrsma2i/modanet/blob/master/ModaNet.ipynb

json_file = '../PaperDoll/labels/paperdoll.json'
df_ppdl = pd.read_json(json_file)
df_ppdl

How to get source images？

How to get the source images? How to convert this annotations into COCO format, such as PNG???

Some masks go over the size of the image

Especially with the footwear images, i found that they often go over by about 20px down, so I had to resize them to the end of the image (600px in height).

Careful! pictures (image_urls) in metadata might not corespond to pictures that were actually labeled

I was exploring the dataset and found a very huge (5-20%) amount of misslabeled samples.
From my quick research of the problem i found out that the image_url provided in the metadata does not link to the image that was actually labeled.
I think it happens when post_url contains multiple pictures. But im not yet sure if this is true 100% of the time.
Here is a notebook pic with examples of the problem:

https://imgur.com/a/gLUxxR0
The notebook should make it clear.

I will try to check out if actually getting the images by scraping the post_url (and always getting the first image ) instead of downloading using the url will solve the problem

Quality issue

The dataset is poorly labelled. And doesn't stand to the standards mentioned in the paper!

How to evaluate the result?

I submitted my validation set result to the EvalAI website but I cannot find the evaluation result. What should I do?

how to get modanet dataset's images ?I need download all paperdoll dataset(43G)? or other ways

repository not able to download

I am getting below error
[email protected]: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.

Can anyone please help.

Subset of data

Hi,
Is there any subset of data? Or a version with a lower volume? it's more than 40 GB?

Matching with Paperdoll images

I am exploring the dataset, and I can not find all training images in the paperdoll dataset.
I am assuming that the matching between Modanet and Paperdoll is based on the id of the images.

Among the 52377 training images, 32413 of them are included in paperdoll, for example

{'file_name': '0655364.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 655364}
matches with
{'post_url': 'http://www.chictopia.com/photo/show/584805', 'url': 'http://images1.chictopia.com/photos/DailyFashionista/5496701252/6589588537/black-reindeer-vintage-sweater-black-dolce-vita-shoes_400.jpg', 'id': 655364}
{'file_name': '0524293.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 524293}
matches with
{'post_url': 'http://www.chictopia.com/photo/show/471733', 'url': 'http://images0.chictopia.com/photos/LisaMarieF/7769599330/heather-gray-francescas-shirt-camel-rachel-zoe-purse-light-blue-express-shor_400.jpg', 'id': 524293}
{'file_name': '0393224.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 393224}
matches with
{'post_url': 'http://www.chictopia.com/photo/show/457562', 'url': 'http://images0.chictopia.com/photos/courtandhudson/1425342853/yellow-skirt-chartreuse-top_400.jpg', 'id': 393224}

However 19964 of them are missing, including for following lines

{'file_name': '0000003.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 3}
{'file_name': '0393220.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 393220}
{'file_name': '0655363.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 655363}
{'file_name': '0131080.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131080}
{'file_name': '0262155.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 262155}
{'file_name': '0000013.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 13}
{'file_name': '0524311.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 524311}
{'file_name': '0786458.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 786458}
{'file_name': '0131098.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131098}
{'file_name': '0131102.jpg', 'license': 3, 'height': 600, 'width': 400, 'id': 131102}

My paperdoll metadata is "data/paperdoll_dataset.mat" from data-v1.0.tar, downloaded from http://vision.is.tohoku.ac.jp/~kyamagu/research/paperdoll/

Thank you

No segmentation masks in validation set

Hey,

i'm currently trying to train a Mask R-CNN model on the ModaNet dataset
and realised that the validation set contains no polygon annotations.
For that reason i'm not able to complete validation at the end of each epoch.

Were they left out on purpose?

Thanks

ebay / modanet Goto Github PK

modanet's Introduction

ModaNet

Why we made ModaNet

Access to the annotations git repo

Labels

Data format

Submitting results to leaderboard

Contributing

Contact

Citing ModaNet

License

modanet's People

Contributors

Stargazers

Watchers

Forkers

modanet's Issues

Recommend Projects

Recommend Topics

Recommend Org