sharrnah Goto Github PK
Type: User
Type: User
Voice Cloning CLI tool for Bark. - mainly used for the Bark TTS Whispering Tiger (https://github.com/Sharrnah/whispering-ui) Plugin, but can be used standalone as well.
π Text-prompted Generative Audio Model - With the ability to clone voices
A generative speech model for daily dialogue.
This is a Docker Setup for Dreambooth to train personalized stable diffusion models.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Faster Whisper transcription with CTranslate2
Cross platform GUI toolkit in Go inspired by Material Design
lightweight, standalone C++ inference engine for Google's Gemma models.
Compresses Images to reach a maximum Filesize
Karras et al. (2022) diffusion models for PyTorch
Large Language Model API with short chat-history memory
Voice data <= 10 mins can also be used to train a good VC model!
Minimal Python downloader with robustness in mind - resumable downloads, retries, and more
Foundational Models for State-of-the-Art Speech and Text Translation
Stable Diffusion Docker Project with WebUI
Fixes Steam/Windows not finding the Audio Device of the VR Headset if connected through NVIDIA High Definition Audio. See manual fix https://www.reddit.com/r/ValveIndex/comments/ca78vn/whenever_i_start_up_my_index_i_have_to_always/
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
A graphical terminal emulator for Linux using Fyne
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Dockerfile with FastAPI to run Text-to-Speech
General Speech Restoration
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
Plugins for Whispering Tiger
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.