RIT-18

High-Resolution Multispectral Dataset for Semantic Segmentation

Description

This repository contains the RIT-18 dataset, which we built for semantic segmentation of remote sensing imagery. It was collected with the Tetracam Micro-MCA6 multispectral imaging sensor flown on board a DJI-1000 octocopter. The main contributions of this dataset are 1) very-high-resolution multispectral imagery from a drone, 2) six VNIR (visible and near-infrared) spectral bands, and 3) 18 object classes (plus background) with a severely unbalanced class distribution. Details about its construction can be found in our paper.

If you use this dataset in a publication, please cite:

@article{kemker2017algorithms,
  title={Algorithms for Semantic Segmentation of Multispectral Remote Sensing Imagery using Deep Learning},
  author={Kemker, Ronald and Salvaggio, Carl and Kanan, Christopher},
  journal={arXiv preprint arXiv:1703.06452},
  year={2017}
}

Data Files

This repository contains the following files:

  1. rit18_data_url: The URL of the current location of the data.
  2. evaluate_rit18.py: The evaluation script used to score the prediction map.
  3. read_rit18.py: A script that opens all of the data in the dataset.

The data, once downloaded, is ~3.0 GB (1.58 GB compressed). It is a .mat file containing a dictionary of elements, including:

  • 'train_data' : (7 x 9,393 x 5,642) numpy array containing the training ortho. The first six bands are the VNIR spectral bands and the 7th band is the mask of the orthomosaic.
  • 'train_labels': (9,393 x 5,642) numpy array containing the training labels.
  • 'val_data' : (7 x 8,833 x 6,918) numpy array containing the validation ortho. The first six bands are the VNIR spectral bands and the 7th band is the mask of the orthomosaic.
  • 'val_labels' : (8,833 x 6,918) numpy array containing the validation labels.
  • 'test_data' : (7 x 12,446 x 7,654) numpy array containing the testing ortho. The first six bands are the VNIR spectral bands and the 7th band is the mask of the orthomosaic.
  • 'band_centers' : Spectral band centers
  • 'band_center_units' : Units for 'band_centers'
  • 'sensor' : Information about the sensor
  • 'classes' : List of object classes
  • 'info' : Various information about the dataset
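As a rough illustration of the band layout described above, the sketch below splits a (7 x H x W) orthomosaic into its six spectral bands and its mask. The function name `split_bands_and_mask` is hypothetical, the tiny array stands in for the real 'train_data' entry, and treating nonzero mask values as valid pixels is an assumption; read_rit18.py remains the reference for how the file is actually opened.

```python
import numpy as np


def split_bands_and_mask(ortho):
    """Split a (7, H, W) orthomosaic into six spectral bands and a boolean mask.

    Hypothetical helper, not part of the repository.
    """
    ortho = np.asarray(ortho)
    if ortho.ndim != 3 or ortho.shape[0] != 7:
        raise ValueError(f"expected shape (7, H, W), got {ortho.shape}")
    bands = ortho[:6]     # first six bands: VNIR spectral data
    mask = ortho[6] > 0   # assumption: nonzero mask values mark valid pixels
    return bands, mask


# Tiny synthetic stand-in for data['train_data'] (the real array is 7 x 9,393 x 5,642).
demo = np.zeros((7, 4, 5), dtype=np.uint16)
demo[6, :2, :] = 65535    # mark the top two rows as inside the orthomosaic
bands, mask = split_bands_and_mask(demo)
print(bands.shape, mask.shape, int(mask.sum()))
```

The same call would apply to 'val_data' and 'test_data', since all three share the 7-band layout.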

Instructions

The dataset contains pixel-wise annotations for both the training and validation folds. Both sets of labels can be used to train a classifier. The two folds are a rough per-class split, but the validation fold does not contain the black and white wooden targets. This is deliberate: we want to evaluate a model's ability to perform low-shot learning.
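One way to see the class imbalance, and to check which classes are missing from a fold, is to count labelled pixels per class. The sketch below assumes the labels are integers from 0 (background) to 18; the helper name `class_pixel_counts` and the toy arrays are illustrative only.

```python
import numpy as np


def class_pixel_counts(labels, mask=None, num_classes=19):
    """Count pixels per class, optionally restricted to a valid-pixel mask.

    num_classes=19 assumes 18 object classes plus background (class 0).
    """
    labels = np.asarray(labels)
    if mask is not None:
        labels = labels[np.asarray(mask, dtype=bool)]
    return np.bincount(labels.ravel().astype(np.int64), minlength=num_classes)


# Toy stand-ins for 'train_labels' and 'val_labels' with four classes.
train = np.array([[0, 1, 2], [2, 2, 3]])
val = np.array([[0, 1, 1], [3, 3, 3]])

train_counts = class_pixel_counts(train, num_classes=4)
val_counts = class_pixel_counts(val, num_classes=4)

# Classes that appear in training but never in validation.
missing = np.flatnonzero((train_counts > 0) & (val_counts == 0))
print(train_counts, val_counts, missing)
```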

The goal is to have the test labels available on the IEEE GRSS evaluation server. Until then, you can e-mail me your test predictions using the following format:

  • Same spatial dimensions as the test image (12,446 x 7,654)
  • uint8 datatype (smaller file)
  • Either .mat (MATLAB) or .npy (Python) file format
  • Compressed (so you don't kill my e-mail account)
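A minimal sketch of packaging a prediction map to match the list above: check the spatial dimensions, cast to uint8, and write a gzip-compressed .npy file. The function name `package_predictions`, the choice of gzip, and the file name are assumptions; any common compression should serve the same purpose.

```python
import gzip
import io
import os
import tempfile

import numpy as np

TEST_SHAPE = (12446, 7654)  # spatial dimensions of the test image


def package_predictions(pred, out_path, expected_shape=TEST_SHAPE):
    """Validate a prediction map and save it as a gzip-compressed .npy file."""
    pred = np.asarray(pred)
    if pred.shape != expected_shape:
        raise ValueError(f"expected shape {expected_shape}, got {pred.shape}")
    if pred.min() < 0 or pred.max() > 255:
        raise ValueError("class indices must fit in uint8")
    buffer = io.BytesIO()
    np.save(buffer, pred.astype(np.uint8))  # serialize in .npy format
    with gzip.open(out_path, "wb") as f:
        f.write(buffer.getvalue())
    return out_path


# Small demonstration with a 2 x 2 map instead of the full test image.
small = np.array([[0, 3], [18, 7]])
with tempfile.TemporaryDirectory() as tmp:
    path = package_predictions(
        small, os.path.join(tmp, "pred.npy.gz"), expected_shape=(2, 2)
    )
    with gzip.open(path, "rb") as f:
        restored = np.load(io.BytesIO(f.read()))
print(restored.dtype, restored.tolist())
```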

I will run your predictions through the evaluate_rit18.py script provided here and send you the output file. I will not score the area outside of the mask, but the background pixels ("class 0") will be scored. Once this is up on the evaluation server, users will be able to do all of this themselves.
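The scoring rule above (ignore pixels outside the mask, but score class 0) can be sketched as follows. This is not the evaluate_rit18.py script; the function name `masked_accuracy` and the two metrics shown (overall accuracy and mean per-class accuracy) are assumptions chosen for illustration.

```python
import numpy as np


def masked_accuracy(pred, truth, mask, num_classes=19):
    """Score predictions only where mask is set; class 0 is scored like any other.

    Returns (overall accuracy, mean per-class accuracy, per-class accuracies).
    Classes with no ground-truth pixels inside the mask are reported as NaN.
    """
    valid = np.asarray(mask, dtype=bool)
    p = np.asarray(pred)[valid]
    t = np.asarray(truth)[valid]
    overall = float(np.mean(p == t))
    per_class = np.full(num_classes, np.nan)
    for c in range(num_classes):
        in_class = t == c
        if in_class.any():
            per_class[c] = np.mean(p[in_class] == c)
    mean_class = float(np.nanmean(per_class))  # average over classes present
    return overall, mean_class, per_class


# Toy example: the bottom-right pixel lies outside the mask and is ignored.
truth = np.array([[0, 0, 1], [1, 2, 2]])
pred = np.array([[0, 1, 1], [1, 2, 0]])
mask = np.array([[1, 1, 1], [1, 1, 0]])
overall, mean_class, per_class = masked_accuracy(pred, truth, mask, num_classes=3)
print(overall, mean_class, per_class)
```

With a severely unbalanced class distribution, the mean per-class figure is less dominated by large classes than overall accuracy, which is why the sketch reports both.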

Contact

Author: Ronald Kemker

E-mail: [email protected]

Website: http://www.cis.rit.edu/~rmk6217/
