Giter Site home page Giter Site logo

vmeng / spindle-code Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ox-it/spindle-code

0.0 2.0 0.0 1.46 MB

Speech-to-text for podcasting prototypes

Home Page: http://openspires.oucs.ox.ac.uk/spindle/

OpenEdge ABL 83.09% Python 6.05% Shell 0.33% JavaScript 10.53%

spindle-code's Introduction

Introduction

Welcome to the SPINDLE project [blog] [website] code repository. This repository contains three main folders:

  • keywords: this directory contains keywords.py, a python script developed during the SPINDLE project that generates keywords from a text. It has been used during the project to [generate keywords from automatic transcriptions] (http://blogs.oucs.ox.ac.uk/openspires/2012/09/12/spindle-automatic-keyword-generation-step-by-step/).

  • speechToText: this directory contains instructions to set up CMU Sphinx4 in Large Vocabulary Continuous Speech Recognition mode. It also contains a config.xml file for the Transcribe.java application.

  • web: A prototype web interface for the above two systems, which can also do other useful things relating to podcasts, transcription and speech to text:

    • import audio and video podcasts from RSS, matching up audio and video recordings of the same material where possible
    • import transcripts from formats including SRT/WebVTT, Adobe Premiere XMP files, and others
    • edit imported transcripts in sync with the audio or video, or start with a blank transcript and create one from scratch
    • queue items for automated speech-to-text transcription, either using the freely available CMU Sphinx4 system or interfacing with the Koemei commercial ASR service (http://koemei.com)
    • automatically extract keywords for items with associated transcript text
    • export edited captions and transcripts in SRT, plain text, and HTML form
    • publish exported captions and transcripts to a directory of static files, with control over which items are published and in what form
    • publish a copy of the incoming RSS feed, including automatically extracted keywords as <category> tags

Some of these features are closer to completion than others. The web interface is written in Python with the Django framework. See web/README.md for more details on how to install and test it.

Tags

#spindle #openspires #ukoer #oerri

spindle-code's People

Contributors

joddie avatar vmeng avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.