ductho9799,Trần Nguyễn Đức Thọ,github

100daysofsystemdesign

Documenting resources and notes for learning system design.

asr-with-dfcnn-and-transformer

Speech Recognition with DFCNN and Transformer

awesome-computer-vision

A curated list of awesome computer vision resources

awesome-document-understanding

A curated list of resources on the Document Understanding (DU) topic

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

bigvgan

Official implementation of BigVGAN in PyTorch

cheetah

On-device streaming speech-to-text engine powered by deep learning

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.

comprehensive-transformer-tts

A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.

contextnet

TensorFlow 2 based implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recognition using global context

convnext

Code release for ConvNeXt model

coqui-tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

course

The Hugging Face course

cross-lingual-voice-cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

crunker

Simple way to merge or concatenate audio files with the Web Audio API.

deepbeat

DeepBeat: Multi-task deep learning for cardiac rhythm detection in wearable devices

deepecg

ECG classification programs based on ML/DL methods

deeplearningexamples

Deep Learning Examples

deepperformer

Deep Performer: Score-to-audio music performance synthesis

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

Trần Nguyễn Đức Thọ's Projects
