akshatsh / videosearchengine Goto Github PK
View Code? Open in Web Editor NEWSemantically be able to search through a database of videos (using generated summaries)
License: MIT License
Semantically be able to search through a database of videos (using generated summaries)
License: MIT License
Frame Grouping
Given a video, break apart sequences of frames into different components, and parallelize/distribute the work.
Need to implement
video_utils.py
VideoDistributer.py
any other necessary files
Basic Version
More complex things
Store video name and summary in a no SQL store
Probably following an API with
{
"name" : "the name of the video",
"summary": "the summary of the video",
"url" : "video_url"
}
Generate an image caption for each frame
Obj2Text is probably helpful
Hi,
Thanks a lot for this amazing work. Can you please add more details on the readme file about the training and evaluation steps of your network with and without YOLO ?
The current readme file lacks of information on how to do that.
It will be very useful if you can provide the trained model in order to test it using some raw videos.
many thanks again
GIven a set of object detections, drop the noisy frames to help language generation
Given a set of frames, select a subset that are meaningful and have no repititions.
Basic Heuristic: select every 10th frame
This dataset is incredibly large, and incredibly helpful (30GB in size). We want to use this, but attu has a file limit and this is way too big for it. We need some solution to be able to train on this end to end.
Educational credits for different cloud services (AWS
, Google Could Compute
, Azure
)
Detect objects in each frame using YOLO
Be able to monitor status of models as they train using tensor board
On the README update it to include a link to something about you (Github, Linkedin, etc.)
Some Google like UI for the video database
The current system, can/will split up work to different workers if the videos are too large and generate summaries.
Each worker would generate a summary for its portion.
When all finished, each worker would return a summary for its portion, all the summaries need to be blended together for one coherent summary of the video.
A basic version would be what we did for 333
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.