Video to Text Description Generator

The Video to Text Description Generator is a Python application that automatically generates textual descriptions for videos. It utilizes scene detection techniques to extract keyframes from a video and employs the OpenAI API to generate descriptive text for each keyframe.

Features

Automatically detects scenes in a video and extracts keyframes
Generates textual descriptions for each keyframe using the OpenAI API
Provides a user-friendly graphical interface for easy interaction
Allows adjusting the sensitivity threshold for scene detection
Supports processing multiple videos and saving the results in an Excel file

Installation

Clone the repository:

git clone https://github.com/noobAIcoder/video-to-text.git

Install the required dependencies:

pip install -r requirements.txt

Set up the necessary environment variables:

Create a .env file in the project directory.

Add the following variables to the .env file:

OPENAI_API_KEY=your-api-key
OPENAI_MODEL=gpt-4-vision-preview
OPENAI_MAX_TOKENS=300

Replace your-api-key with your actual OpenAI API key.

Usage

Run the application:

python .\main.py

Use the graphical interface to select a video file and adjust the sensitivity threshold (higher = lower).
Click the "Run Video Processing" button to start the scene detection and keyframe extraction process.
Once the video processing is complete, select folder where screenshots were saved and click the "Run Screenshot Processing" button to generate textual descriptions for each keyframe.
The generated descriptions will be saved in an Excel file in the same directory as the selected screenshots folder.

Contributing

Contributions are welcome! If you would like to contribute to this project, please follow the guidelines in CONTRIBUTING.md.

Authors

noobAIcoder - Prompting and copy-pasting
Claude (Anthropic AI) - Advisory, code generation, debugging

License

This project is licensed under the MIT License. See the LICENSE file for more information

noobaicoder / video-to-text Goto Github PK

video-to-text's Introduction

Video to Text Description Generator

Features

Installation

Usage

Contributing

Authors

License

video-to-text's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent