silyfox,github

adain-style

Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization

adain-vc

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

again-vc

This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

arbitrary_style_transfer

Fast Neural Style Transfer with Arbitrary Style using AdaIN Layer - Based on Huang et al. "Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization"

artflow

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

audiomlproject3

Emotion recognition from speakers' speech data. Employs speaker detection classifiers for emotion recognition, a multiclass classification problem. Emotion classes: Happy, Sad, Neutral, Relaxed, and Angry.

awesome-contrastive-self-supervised-learning

A comprehensive list of awesome contrastive self-supervised learning papers.

awesome-diffusion-models

A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models

awesome-normalizing-flows

Awesome resources on normalizing flows.

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation, Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

bark

🔊 Text-prompted Generative Audio Model

beta-tcvae

Code for "Isolating Sources of Disentanglement in Variational Autoencoders".

beta-vae

PyTorch implementation of β-VAE

beta_tcvae_v1

Playing around with Beta-TCVAE implementation

cdfse_fastspeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”

chattts

A generative speech model for daily dialogue.

chinese_song_generation

circe

Efficient Conditionally Invariant Representation Learning (ICLR 2023, Oral)

clari_wavenet_vocoder

clarinet

A PyTorch Implementation of ClariNet

club

Code for ICML 2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

conference-acceptance-rate

Acceptance rates for the major AI conferences

contrastive-predictive-coding

Keras implementation of Representation Learning with Contrastive Predictive Coding

controlspeech

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

cr-gan

Yu Tian et al. "CR-GAN: Learning Complete Representations for Multi-view Generation", IJCAI 2018

cross-speaker-emotion-transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

silyfox's Projects
