Giter Site home page Giter Site logo

hardevkhandhar / building-segmentation-image-processing Goto Github PK

View Code? Open in Web Editor NEW
8.0 2.0 3.0 6.44 MB

Building Segmentation for Aerial Image Processing using Machine Learning.

Jupyter Notebook 100.00%
machine-learning image-processing deep-learning u-net pytorch python image-segmentation cnn tensorflow

building-segmentation-image-processing's Introduction

Image Processing

  • Image processing is a method to perform some operations on an image, in order to get an enhanced image or to extract some useful information from it. It is a type of signal processing in which input is an image and output may be image or characteristics / features associated with that image.

Image Segmentation

  • Image Segmentation is a pixel level classification of an image. Road area or Building feature extraction can be considered as a segmentation or pixel-level classification problem.

  • Neural Networks have already been used in various remote sensing tasks, like damage assessment of washed-away building from pre and post tsunami aerial images. Furthermore, researchers are using Deep Learning to solve the problems of interpretation and understanding of remote sensing data.

Building Segmentation

  • Building Segmentation from Aerial Imagery is a challenging task. Obstruction from nearby trees, shadows of adjacent buildings, varying texture and color of rooftops, varying shapes and dimensions of buildings are among other challenges that hinder present day models in segmenting sharp building boundaries.

  • High-quality aerial imagery datasets facilitate comparisons of existing methods and lead to increased interest in aerial imagery applications in the machine learning and computer vision communities.

Dataset

Massachusetts Buildings Dataset

  • The Massachusetts Buildings Dataset consists of 151 aerial images of Boston area, with each of the images being 1500 Γ— 1500 pixels for an area of 2.25 square kilometers. Hence, the entire dataset covers roughly 340 square kilometers.

  • The data is split into a training set of 137 images, a test set of 10 images and a validation set of 4 images. The target maps were obtained by rasterizing building footprints obtained from the OpenStreetMap project. The data was restricted to regions with an average omission noise level of roughly 5% or less.

  • The large amount of high quality building footprint data was possible to collect because the City of Boston contributed building footprints for the entire city to the OpenStreetMap project. The dataset covers mostly urban and suburban areas and buildings of all sizes, including individual houses and garages, are included in the labels.

  • The datasets make use of imagery released by the state of Massachusetts. All imagery is rescaled to a resolution of 1 pixel per square meter. The target maps for the dataset were generated using data from the OpenStreetMap project. Target maps for the test and validation portions of the dataset were hand-corrected to make the evaluations more accurate.

Acknowledgements

Final Thoughts

  • Given enough training time on GPU, larger datasets can be used for training the network. This can highly improve the accuracy and allow leveraging such image / semantic segmentation techniques in production Enterprise GIS environments. Rapid advances in Image Understanding using Computer Vision techniques have brought us many state-of-the-art deep learning models across various benchmark datasets.

  • Deep learning is a machine learning technique that teaches computers to do what comes naturally to humans: learn by example. It is getting lots of attention lately and for good reason. It’s achieving results that were not possible before. These models can achieve state-of-the-art accuracy, sometimes exceeding human-level performance.


Thank you!

building-segmentation-image-processing's People

Contributors

hardevkhandhar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.