ahmeftah Goto Github PK
Type: User
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
Generated Audio Samples by ALGAN-VC model are available in the folder
A Curated List Of Programming Books For C, C++ , Python, JavaScript, NodeJs, ReactJs, Web, JQuery, Flask, Dom, Angular, CSS, HTML for beginners, intermediate, advanced and experts
Automated speech recognition system for the Arabic language for customer query classification, with adaptive learning and merged learning models trained with Weka
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Project done as part of the Audio Processing course at Tampere University. The topic was the separation of harmonic and percussive elements according to the paper "SEPARATION OF A MONAURAL AUDIO SIGNAL INTO HARMONIC/PERCUSSIVE COMPONENTS BY COMPLEMENTARY DIFFUSION ON SPECTROGRAM" by Nobutaka Ono, Kenichi Miyamoto, Jonathan Le Roux, Hirokazu Kameoka, and Shigeki Sagayama.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a TensorFlow implementation of the paper Nonparallel Emotional Speech Conversion (https://arxiv.org/pdf/1811.01174.pdf). It is an end-to-end voice conversion system that can change the speaker's emotion, for example neutral to angry or sad to happy. The model aims to generate speech with the desired emotion while keeping the original linguistic content and speaker identity. It first extracts acoustic features from raw audio, then learns the mapping from source emotion to target emotion in the feature space, and finally puts those features together to rebuild the waveform. In our approach, the following features are considered. Features: fundamental frequency (log F_0), converted by logarithm Gaussian normalized transformation; power envelope, converted by logarithm Gaussian normalized transformation; mel-cepstral coefficients (MCEPs), a representation of the spectral envelope, trained by CycleGAN; aperiodicities (APs), used directly without modification. Dependencies: Python 3.5, Numpy 1.15, TensorFlow 1.8, LibROSA 0.6, FFmpeg 4.0, PyWorld
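The logarithm Gaussian normalized transformation mentioned for log F_0 is usually understood as matching the mean and standard deviation of the source log-F0 to those of the target. Below is a minimal sketch under that assumption. The function names `log_f0_stats` and `convert_f0` are hypothetical and are not taken from the repository, and unvoiced frames are assumed to be marked with an F0 of 0.

```python
import math


def log_f0_stats(f0):
    """Return (mean, std) of log-F0 over voiced frames (F0 > 0)."""
    logs = [math.log(v) for v in f0 if v > 0]
    mean = sum(logs) / len(logs)
    var = sum((x - mean) ** 2 for x in logs) / len(logs)
    return mean, math.sqrt(var)


def convert_f0(f0, src_stats, tgt_stats):
    """Map a source F0 contour onto target log-F0 statistics.

    Each voiced frame is standardized in the log domain with the source
    statistics, rescaled with the target statistics, and exponentiated.
    Unvoiced frames (F0 <= 0) are kept at 0 (an assumption of this sketch).
    """
    src_mean, src_std = src_stats
    tgt_mean, tgt_std = tgt_stats
    converted = []
    for v in f0:
        if v <= 0:
            converted.append(0.0)
        else:
            z = (math.log(v) - src_mean) / src_std
            converted.append(math.exp(z * tgt_std + tgt_mean))
    return converted
```

For instance, a target whose log-F0 mean is higher by log 2 with the same standard deviation raises every voiced frame by one octave, while unvoiced frames stay at 0.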
Compilation of R and Python programming codes on the Data Professor YouTube channel.
This is the code for a controllable EVC framework for seen and unseen emotion generation.
My notes / works on deep learning from Coursera
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Reproducing PARALLEL-DATA-FREE VOICE CONVERSION USING CYCLE-CONSISTENT ADVERSARIAL NETWORKS (https://arxiv.org/pdf/1711.11293.pdf)
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
EMOTIONAL VOICE CONVERSION WITH CYCLE-CONSISTENT ADVERSARIAL NETWORK
Tensorflow implementation for learning an image-to-image translation without input-output pairs. https://arxiv.org/pdf/1703.10593.pdf
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
Dive into Deep Learning Compiler
Course: Deep Learning
Converter for deep learning models between different deep learning frameworks and software.
📡 All You Need to Know About Deep Learning - A kick-starter
Deep Learning Examples
DeepNude's algorithm and general image generation theory and practice research, including pix2pix, CycleGAN, UGATIT, DCGAN, SinGAN, ALAE, mGANprior, StarGAN-v2 and VAE models (TensorFlow2 implementation).
💻 🤖 A summary of our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈
demo page https://MingjieChen.github.io/dygan-vc
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Implementation of Emo-StarGAN