Yeshua WB III's Projects
A simple web UI for Suno-AI Bark
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
A voice-based ChatGPT clone that can search on the Internet and also in local files
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
NVIDIA's TalkNET - Train on colab
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
Atmospheric adventure chat for AI language models (KoboldAI, NovelAI, Pygmalion, OpenAI chatgpt, gpt-4)
Teach any questions in seconds (by OpenAI)
这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读 ,还有自动重试,备用配置,文本替换等更多功能。
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Venom-Tool-Installer is a Kali Linux hacking tools installer for Termux and linux system. Venom-Tool-Installer was developed for Termux and linux based systems. Using Venom-Tool-Installer, you can install almost 370+ hacking tools in Termux (android) and other Linux based distributions. Now Venom-Tool-Installer is available for Ubuntu, Debian etc.
The "vicuna-installation-guide" provides step-by-step instructions for installing and configuring Vicuna-13B
Opinionated fork/implementation of Stable Diffusion
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my thesis if you're curious or if you're looking for info I haven't documented.
an improved version of Real-time-voice-cloning
無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン
VTube Studio API Development Page
Quickly download the abstracts for arxiv papers related to a given topic and render with markdown
Project that allows one to use a microphone with OpenAI whisper.
Convert live audio input to text on screen using WhisperAI