segmentationfaults,github

awesome-speech-enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

backends-for-sre19

This repository will illustrate the use of some different backends on NIST SRE 2019.

chattts

ChatTTS is a generative speech model for daily dialogue.

cosyvoice

LLM-based TTS model providing full-stack inference, training, and deployment capabilities.

end-to-end-asr-pytorch

This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.

hierspeechpp

The official implementation of HierSpeech++

k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

kaldi

This is the official location of the Kaldi project.

learning-golang

The road to learning Go: Go developer blogs, Go WeChat official accounts, and Go learning materials (documentation, books, videos).

libtorch-gin-api-server

High-speed Deep learning API Server with Libtorch (C++) and Gin (Golang)

mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

mimic-recording-studio

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

openblasbuildforandroid

Compile script for OpenBLAS and Android binaries

pycorrector

pycorrector is a toolkit for text error correction. It provides implementations of Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, Transformer, and other models, ready to use out of the box.

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

tensorflow

An Open Source Machine Learning Framework for Everyone

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

tensorflowtts

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented in Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolutional GAN, and other hands-on code.

segmentationfaults's Projects
