RNN_video_object_detection

This is the implementation of our BMVC 2016 paper "Context Matters: Refining Object Detection in Video with Recurrent Neural Networks".

The repository includes:

  1. Conversion of the YOLO binary weight file to Python weights.
  2. Fine-tuning of YOLO with Theano and Lasagne on the YouTube-Objects dataset. The result is DA-YOLO, the pseudo-label generator referred to in the paper.
  3. The compatible labels/annotations I needed to create for training and evaluation in Theano.
  4. Pre-calculated features for training and testing are available from me as NumPy arrays. However, they are larger than the allowable file size for uploading.
  5. The fine-tuned DA-YOLO weights are 1.1 GB; however, the conversion Python file can be used to generate them. The YOLO weight bin file can be downloaded from the Darknet website.
  6. GRU training and evaluation code and visual results. This notebook is large. If it doesn't open directly here, go to http://nbviewer.jupyter.org/, enter SubarnaTripathi/RNN_video_object_detection in the box, and open
    RNN_Object_Detection_GRU_Smoothness_visual_results.ipynb
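
To give a rough idea of what the weight conversion in item 1 involves, here is a minimal sketch, not the conversion script from this repository. It assumes a Darknet-style weight file laid out as a short int32 header followed by a flat run of float32 values. The function names `read_darknet_weights` and `take`, and the default header length of 4, are assumptions for illustration only; the header length differs between Darknet versions and should be checked against the file you download.

```python
import numpy as np


def read_darknet_weights(path, header_ints=4):
    """Read a Darknet-style weight file into (header, flat float32 array).

    header_ints is an assumption: the number of int32 header values
    depends on the Darknet version that wrote the file.
    """
    with open(path, "rb") as f:
        # Header: a few little-endian int32 values (version info, etc.).
        header = np.fromfile(f, dtype="<i4", count=header_ints)
        # Everything after the header is read as one flat float32 array.
        weights = np.fromfile(f, dtype="<f4")
    return header, weights


def take(weights, offset, shape):
    """Slice the next block of values off the flat array and reshape it."""
    size = int(np.prod(shape))
    block = weights[offset:offset + size].reshape(shape)
    return block, offset + size
```

A converter would then call `take` repeatedly, in the order the layers were written, to carve the flat array into per-layer arrays, and could store them with `np.savez` for loading into a Theano/Lasagne model. The per-layer order and shapes depend on the network definition and are not shown here.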

** The code requires cleaning, which I'll eventually do. **
