segmentationfaults
Type: User
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
This repository will illustrate the use of some different backends on NIST SRE 2019.
ChatTTS is a generative speech model for daily dialogue.
LLM based TTS model, providing inference/training/deployment full-stack ability.
All things C++
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
The official implementation of HierSpeech++
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
This is the official location of the Kaldi project.
Learn Go with test-driven development
The road to learning Go: Go developer blogs, Go WeChat official accounts, and Go learning resources (documentation, books, videos)
High-speed Deep learning API Server with Libtorch (C++) and Gin (Golang)
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Compile script for OpenBLAS and Android binaries
pycorrector is a toolkit for text error correction. It implements models including Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, and Transformer, ready to use out of the box.
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
An Open Source Machine Learning Framework for Everyone
A TensorFlow implementation of DeepMind's WaveNet paper
TensorflowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A high-throughput and memory-efficient inference and serving engine for LLMs
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Zero-Shot Speech Editing and Text-to-Speech in the Wild
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolutional GAN, and other hands-on code.