Giter Site home page Giter Site logo

cstollw / podcast-chapterize Goto Github PK

View Code? Open in Web Editor NEW

This project forked from stereolith/podcast-chapterize

0.0 0.0 0.0 148.91 MB

segment a podcast episode automatically and add chapter markers

Python 66.76% JavaScript 6.03% HTML 1.84% Vue 25.29% CSS 0.09%

podcast-chapterize's Introduction

podcast-chapterize

This project aims to automatically provide longform audio podcast episodes with chapter markers. This is achieved with statistical natrual language processing algorithms that try to subdivide transcribed podcast episodes into topically cohesive parts.

[work in progress]

Requirements

CLI:

Web interface:

  • node.js
  • npm

Usage

This program can be used in the command line or as an HTTP API with a web interface.

CLI

Usage: python3 main.py [subcommand] [options] ...

  • Help: python3 main.py --help
  • Subcommand help: python3 main.py [subcommand] --help
  • Possible subcommands:
    • python3 main.py run: Start chapterization process from podcast RSS feed URL
    • python3 main.py transcribe: Transcribe podcast episode from RSS feed URL
    • python3 main.py chapterize: Chapterize transcript

Web interface

API:

  • Create python3 venv: python3 -m venv venv
  • Activate venv: source venv/bin/activate
  • (optional) Set environment variables for IP address and Port in the ´.flaskenv´ file
  • start API server with flask run

Frontend:

  • Serve frontend files 'web/client/dist' on web server

If the server is not running on the same machine:

  1. specify API host in web/client/.env
  2. cd into 'web/client'
  3. install dependencies with npm install
  4. build files with npm run build
  5. serve built files from (web/client/dist) on web server

podcast-chapterize's People

Contributors

lmoench avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.