Giter Site home page Giter Site logo

akshatsh / videosearchengine Goto Github PK

View Code? Open in Web Editor NEW
64.0 64.0 22.0 178.7 MB

Semantically be able to search through a database of videos (using generated summaries)

License: MIT License

Python 96.02% Shell 2.25% CSS 0.72% HTML 1.01%
computer-vision deep-learning flask natural-language-generation natural-language-processing search-engine video-processing

videosearchengine's People

Contributors

abbygray avatar akshatsh avatar sarahyu17 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

videosearchengine's Issues

Group Video Frames into semantically Different Components

Frame Grouping

Given a video, break apart sequences of frames into different components, and parallelize/distribute the work.

Need to implement

  • video_utils.py

  • VideoDistributer.py

  • any other necessary files

  • Basic Version

  • More complex things

Create a database for Videos

Store video name and summary in a no SQL store

Probably following an API with

 { 
    "name" : "the name of the video",
    "summary": "the summary of the video",
    "url" : "video_url"
}

Adding more information about how to train and evaluate the captioning network

Hi,
Thanks a lot for this amazing work. Can you please add more details on the readme file about the training and evaluation steps of your network with and without YOLO ?
The current readme file lacks of information on how to do that.
It will be very useful if you can provide the trained model in order to test it using some raw videos.
many thanks again

Filter Noisy Frames

GIven a set of object detections, drop the noisy frames to help language generation

  • Look into descrimintory networks

Extract Useful Frames

Given a set of frames, select a subset that are meaningful and have no repititions.

Basic Heuristic: select every 10th frame

Use TaCos Dataset

This dataset is incredibly large, and incredibly helpful (30GB in size). We want to use this, but attu has a file limit and this is way too big for it. We need some solution to be able to train on this end to end.

  • Sent email to TA asking for help

Educational credits for different cloud services (AWS, Google Could Compute, Azure)

We have access to one of the research servers for running experiments.

[Errno 61] Connection refused on mac

Connection is being refused when all the proper permissions are granted.

Error:
Sending cluster to worker: localhost:24448
[Errno 61] Connection refused


24449 1


Sending cluster to worker: localhost:24449
[Errno 61] Connection refused

Screen Shot 2019-06-06 at 2 49 47 PM

Update README for contributors

On the README update it to include a link to something about you (Github, Linkedin, etc.)

  • Akshat Shrivastava
  • Kevin Bi
  • Sarah Yu
  • Abby Gray

Combine generated summaries of frames to one large summary

The current system, can/will split up work to different workers if the videos are too large and generate summaries.

Each worker would generate a summary for its portion.

When all finished, each worker would return a summary for its portion, all the summaries need to be blended together for one coherent summary of the video.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.