sunnnnnnnny,github

academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

adaspeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

attention_onnx_exp

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

automatic-prosody-annotation

auxiliaryasr

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

awesome-singing-voice-synthesis-and-singing-voice-conversion

A list of papers and projects on cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related work of interest (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).

bark

🔊 Text-Prompted Generative Audio Model

bbdown

Bilibili Downloader. A command-line downloader for Bilibili.

cdfse_fastspeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”

chattts

ChatTTS is a generative speech model for daily dialogue.

chinese-fastspeech2

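Continues training on the Biaobei (Databaker) dataset and improves the original FastSpeech2 model by introducing prosody representation and prosody prediction modules, making Chinese pronunciation more vivid and rhythmic.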

chinese_speech_pretrain

Chinese speech pretrained models

clap

Learning audio concepts from natural language supervision

dall-e

PyTorch package for the discrete VAE used for DALL·E.

dalle-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder archi
