jinmingche Goto Github PK
Type: User
Experiments on speech recognition robustness to accents and dialects
The PyTorch-based audio source separation toolkit for researchers
Open-source tools for speaker recognition
Keras Layer implementation of Attention for Sequential models
Implementation of "Attention Is Off By One" by Evan Miller
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
A Convolutional Transformer for Keyword Spotting
Deep Learning Neural Networks Final Project
Auto-AVSR: Lip-Reading Sentences Project
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world's resources for speech enhancement and make them universally accessible and useful.
BWE matlab
Simple delay-and-sum, MVDR and CGMM-MVDR
Chinese speech pretrained models
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
Making big AI models cheaper, easier, and scalable
Compare AIRES BSS with TRINICON, ILRMA and AuxIVA (online and offline versions)
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Configuration files, such as repo (for downloading the Android source) and .git-completion.bash (Git autocompletion for Bash)
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
DCCRN with various loss functions
PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally
Noise suppression using deep filtering
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Official DeiT repository
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Conversational Multimodal Emotion Recognition