Giter Site home page Giter Site logo

youtube-video-summarization-speech-to-text-using-asr's Introduction

YouTube-Video-Summarization-Speech-to-text-using-ASR

YouTube Video Summarizer is a Python project that aims to provide a convenient way to summarize the content of YouTube videos by converting their audio into text. The project utilizes automatic speech recognition (ASR) techniques, fine-tuned models, and audio processing to generate concise summaries of the video's content.

Features

  • Downloads YouTube videos and extracts the audio.
  • Converts audio format from .mp4 to .wav for further processing.
  • Uses a popular fine-tuned CTC model, "jonatasgrosman/wav2vec2-large-xlsr-53-english," for English speech recognition.
  • Audio chunking: Splits the audio into smaller segments of 30 seconds each for efficient processing.
  • Summarizes the audio transcript using pipeline techniques for a concise summary.
  • Provides the summarized text as the output.

Installation

  1. Clone the repository:
git clone https://github.com/Piyush4455/YouTube-Video-Summarization-Speech-to-text-using-ASR.git
  1. Change to the project directory:
cd YouTube-Video-Summarization-Speech-to-text-using-ASR
  1. Install the required dependencies:
pip install -r requirements.txt

USAGE

1.Run the YouTube_Video_Summarizer_Speech_to_Text_ASR.ipynb notebook.
2.Replace video_link with the URL of the YouTube video you want to summarize.
3.It will download the video, convert the audio to .wav format, perform automatic speech recognition (ASR) using the fine-tuned model, chunk the audio, and generate a summarized transcript.
4.The summarized text will be displayed.

youtube-video-summarization-speech-to-text-using-asr's People

Contributors

piyush4455 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.