markyouyuren Goto Github PK
Type: User
Type: User
尝试使用神经网络生成音乐游戏Malody的谱面。
An application of vocal melody extraction.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
voice conversion system
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
This repository contains the implementation of the AI-based "BeatNet" Joint beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. 2021's state-of-the-art online model - (ISMIR 2021).
vits2 backbone with multilingual-bert
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Deep Learning Examples
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
PyTorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
This is the GitHub page for publicly available emotional speech data.
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Chinese version of GPT2 training code, using BERT tokenizer.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
MixGAN-TTS: End-to-End Speech Synthesis Based on Diffusion Model
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.