Giter Site home page Giter Site logo

zoomspn's Introduction

Scale spn

The purpose of this project is to investigate the use of spartial transformer networks on very large images with spartial structures that can be seen, but not nessesarily distinquished, from smaller scalings of the same image.

The algorithms will take a large image, scale it to different sizes and the progressively use transformer networks to crop "into" the larger images. E.g This will allow the algorithm to take crops of 2048x2048 images without ever working on anything above 256x256.

The essense of this project is to test two types of scaling algorithms; Maxpool and scipys resize. Maxpool is currently the algorithm of choice in many deep neural networks, but it is not nessesarily the best algorithm for scaling.

I have(so far) made two contributions to test this: Firstly, coded a network working on a self-made variant of the mnist.

python mnist_zoom_spn.py -h

Either it uses maxpooling to device three different sizes of the mnist variant, and uses transformer networks twice to zoom "into" the larger image. Or it uses scipy to supply images of three different sizes and performs the same computation.

Secondly, whaleresize.py will try take a 2048x2048 image of a whale and rescale it using maxpool and scipy for illustrating the difference between the algorithms. Here are a few examples, which illustrates how maxpool performs whose the more severe the scaling is.

Downscaling with a factor of 2 - from 2048 to 1024(scipy left, maxpool right)

alt text

Downscaling with a factor of 8 - from 2048 to 256(scipy left, maxpool right)

alt text

Downscaling with a factor of 32 - from 2048 to 64(scipy left, maxpool right)

alt text

Downscaling with a factor of 64 - from 2048 to 32(scipy left, maxpool right)

alt text

zoomspn's People

Contributors

alrojo avatar

Stargazers

Ash avatar Andrew Kabrit avatar

Watchers

James Cloos avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.