Giter Site home page Giter Site logo

speech2text's Introduction

Speech to text

A container which translates audio files to text files and places those text files to designated locations.

/audio          - folder to which files appear
/target         - folder to which files should be created
/targets.json   - a text file describing the magic word which is used to move files
/config.json    - simple email integration - supports only very simple SMTP
/var/models     - location of the Whisper models, recommended to be cached with a volume

targets.json

JSON structure for text file creation. Options are, keep the source audio file or not , prepend a time stamp in the form of 2024-01-07 21:04 to the text, and to append to a specific file - if not stated then the file will be the audiofile with .md file ending.

default describes the minimal requirement JSON structure requirement

{
    "default": { "keepaudiofile": "path/to/folder", "transcript": "path/to/folder" }, 
    "magic word": { "keepaudiofile": "path/to/folder", "transcript": "path/to/folder", "filename":"filename_to_append_transcripts_to" },
    "magic word": { "keepaudiofile": "path/to/folder", "transcript": "path/to/folder", "filename":"filename_to_append_transcripts_to", "timestamp": true },
}

email definition (expets config.json)

required parameters and valid values:

  • keepaudiofile: true or false (if true, file is put as a message attachment, if false file is destroyed)
  • email: valid email address the receiver
  • transcript subject or body (the transcripted text will place in this part of the email)

example:

{ "magic word": {"keepaudiofile": true, "email": "[email protected]", "transcript": "subject"}}

Running the container

docker run -it -v /host/system/audiofolder:/audio -v whisper_models:/var/models \
    -v /host/filedestinations:/target -v /host/user/targets.json:/targets.json \
    -v /host/user/config.json./config.json  -u $(id -u ${USER}):$(id -g ${USER}) speech2text

speech2text's People

Contributors

jleivo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.