whitefu
Type: User
Bio: speech synthesis & voice conversion & speech enhancement
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Essential fundamentals of algorithm theory that everyone should know
Model and code for RepCodec: A Speech Representation Codec for Speech Tokenization
An AI that plays Honor of Kings (王者荣耀), built with Resnet101 + GPT
Voice data <= 10 mins can also be used to train a good VC model!
Rich-Text-to-Image Generation
Speech2Vec Reality Check
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
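(CVPR 2023) SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This project builds on SadTalker to implement Wav2lip-style lip synthesis for video. Speech drives lip generation on a video file; a configurable enhancement of the facial region sharpens the synthesized lip (face) area and improves the clarity of the generated lips. The DAIN deep-learning frame-interpolation algorithm fills in frames of the generated video, adding motion transitions between frames so the synthesized lips look smoother, more realistic, and more natural.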
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
Foundational Models for State-of-the-Art Speech and Text Translation
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Sentence Embeddings with BERT & XLNet
Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.
Deep learning for Arabic text vocalization (automatic diacritization of Arabic text)
Examples from the Linux Command Line and Shell Scripting Bible
Official implementation of the source-filter HiFiGAN vocoder
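Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; written in python3 and usable out of the box.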
A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
A collection of models and applications related to so-vits-svc, TTS, SD, and LLMs, along with models for text, audio, images, and video.
SoftVC VITS Singing Voice Conversion
A toolkit and documentation version of so-vits-svc.