ElevenLabs S4TS

Speech to text, text to Speech - STTTTS - S4TS

ElevenLabs S4TS is a PySide6 (Qt) application that does speech to text and then text to speech using eleven labs. The automatic speech recognition (ASR) model used for this application is OpenAI’s Whisper.

At startup, the application will use the whisper-base model for faster audio transcription. However, if your hardware supports cuda, you can change it to whisper-medium by checking Use Medium Model. ElevenLabs S4TS will automatically use cuda if your hardware supports cuda and your PyTorch is installed to support it.

How to Run ElevenLabs S4TS

Install Dependencies

Make sure Python 3.9 > is installed
Make your conda or pip env
Activate the virtual environment
Install PyTorch by following the instructions here

Install ElevenLabs S4TS dependencies

# Pip
pip install -U -r requirements.txt

# Conda
conda install pip
pip install -U -r requirements.txt

Run the application

Once you have all of the dependencies installed. We simply need to run ui.py by doing the following (assuming the virtual environment is activated):

python3 ui.py

How to use ElevenLabs S4TS

First of all, you need to have a plan for ElevenLabs. It does not matter what plan tier you have as long as you have one. Go here to check out plans that they offer.
When you’re signed up, go to your profile icon on the top left and click profile and copy your API Key.
Paste your API key on the input field labeled API Key on the window
Select your desired input and output device
Select desired ElevenLabs voice
Hold the Record button and speak
Once released, the audio will be processed using whisper for transcription
After transcription, the text will be sent to ElevenLabs using their API
The request returns an audio data that ElevenLabsS4TS plays through the set output device

Future plans

Package application
Add ability to voice clone using mic

cyr1en / elevenlabss4ts Goto Github PK

elevenlabss4ts's Introduction

ElevenLabs S4TS

How to Run ElevenLabs S4TS

Install Dependencies

Run the application

How to use ElevenLabs S4TS

Future plans

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent