Giter Site home page Giter Site logo

mkaczkow / stereo_net Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 7.54 MB

Repo for implementation of StereoNet

License: MIT License

Python 87.65% Jupyter Notebook 12.35%
disparity-estimation kitti-dataset neural-network pytorch siamese-neural-network sceneflow-dataset stereo-net pytorch-lightning

stereo_net's Introduction

Stereo Net

Repo for implementation of StereoNet

TODO

  • Research and fix why KITTIs and dome FlyingThing are not working
  • Heavy refactor and more DRY
  • Get models .ckpt from original repo
  • Create gradio UI with multiple IO and option to select best model
  • Integrate UI with rest of the code (inference function)

First of all, I would like to acknowledge the authors of the paper. I am not the author of the paper, I am was just trying to re-implement it. Link to the paper:
https://arxiv.org/abs/1807.08865.

I also relied heavy on this awesome repos:
https://github.com/andrewlstewart/StereoNet_PyTorch
https://github.com/zhixuanli/StereoNet

Contary to mentioned sources I also used both KITTI datasets (KITTI 2012 and KITTI 2015). This was a little chalenging, because of the new policy regarding usage of said datasets, as specified on:
https://www.cvlibs.net/datasets/kitti/eval_scene_flow.php?benchmark=stereo

Important Policy Update: As more and more non-published work and re-implementations 
of existing work is submitted to KITTI, we have established a new policy: from now 
on, only submissions with significant novelty that are leading to a peer-reviewed 
paper in a conference or journal are allowed. Minor modifications of existing 
algorithms or student research projects are not allowed. Such work must be evaluated 
on a split of the training set. To ensure that our policy is adopted, new users must 
detail their status, describe their work and specify the targeted venue during 
registration. Furthermore, we will regularly delete all entries that are 6 months 
old but are still anonymous or do not have a paper associated with them. For 
conferences, 6 month is enough to determine if a paper has been accepted and to add 
the bibliography information. For longer review cycles, you need to resubmit your 
results.

This resulted in c.a. 200 pictures from each KITTI dataset, for training, validation and testing. Such little number of pictures resulted in overfitting problems for both KITTI datasets.

Some of the solutions are probably clunky and not really elegant, but generally gets the job done.

README's for specific folders are in the folders themselves:

stereo_net's People

Contributors

mkaczkow avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.