camera-pose_estimation's Introduction

Camera pose estimation using Deep Learning

Camera pose estimation is the problem of estimating the position and orientation of a camera in 3D space from the pictures it takes. This work is still in progress.

Why camera pose estimation?

  • Determining where the camera is relative to the object it is looking at is one of the fundamental problems in computer vision. Doing so accurately is key to applications such as augmented reality (AR), autonomous navigation and robot vision.
  • Most existing solutions require complicated mathematical modelling and are very scene-specific, i.e., a method or framework developed for one scene is likely to fail in another.

Dependencies

Training and testing of the deep networks were carried out on Google Colab.

  • Python (<=3.6)
  • TensorFlow (1.1x)
  • CoppeliaSim (robotics simulator): because labelled datasets are scarce, this project uses CoppeliaSim to simulate various viewpoints of a scene while tracking the camera position transformation needed to reach each one. The player version or the educational version may be downloaded from here.

Basic methodology

Using the robotics simulator, several viewpoints of some common indoor and outdoor scenes were simulated, and the change in camera position needed to reach each viewpoint was tracked in 6 degrees of freedom about a known origin. Deep learning techniques are then used to regress the position of the camera from the images. This method is inspired by this CVPR paper.
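As an illustration of what regressing a 6-degrees-of-freedom pose involves, here is a minimal sketch of a regression objective in plain Python. It is not taken from this project: the pose layout (x, y, z, roll, pitch, yaw), the function names and the weighting factor `beta` are all assumptions made for the example, and the loss actually used here may differ.

```python
import math


def angle_diff(a, b):
    """Smallest signed difference between two angles in radians, in [-pi, pi)."""
    return (a - b + math.pi) % (2.0 * math.pi) - math.pi


def pose_loss(pred, target, beta=1.0):
    """Weighted squared error between two 6-DoF poses.

    Each pose is assumed to be (x, y, z, roll, pitch, yaw), with the
    position in scene units and the angles in radians.
    `beta` is a hypothetical weight balancing rotation against position.
    """
    # Squared Euclidean distance between the two positions.
    pos_err = sum((p - t) ** 2 for p, t in zip(pred[:3], target[:3]))
    # Squared angular error, wrapped so that -pi and +pi count as the same angle.
    rot_err = sum(angle_diff(p, t) ** 2 for p, t in zip(pred[3:], target[3:]))
    return pos_err + beta * rot_err
```

For example, `pose_loss(predicted_pose, tracked_pose, beta=0.5)` would score a network output against the pose recorded by the simulator. Comparing Euler angles term by term is a simplification; other rotation representations, such as quaternions, may behave better near angle wrap-around.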

To do

Add references and results
