q-y-tang Goto Github PK
Name: David zhang
Type: User
Name: David zhang
Type: User
Pitch detection and pitch tracking, voicing unvoicing detection (VAD),基音检测
Caffe: a fast framework for deep learning. For the most recent version checkout the dev branch. For the latest stable release checkout the master branch.
Programming assignments for the class
implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch
Noise supression using deep filtering
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
A stacked self-attention network for two-dimensional direction-of-arrival estimation in hands-free speech communication
A easy HMM program written with Python, including the full codes of training, prediction and decoding.
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Harmonic plus noise synthesis
Augmenting Room Impulse Response
microphone array speech generator (MASG) in room acoustic
beam_doa
This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D scenes represented using a mesh.
MESSL wrappers etc for JSALT 2015, including CHiME3
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.
Different implementations of "Weighted Prediction Error" for speech dereverberation
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
It's a repository for implementations of neural speech editing algorithms.
estimation of steering vector for beamforming
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
Learning Efficient Representations for Keyword Spotting with Triplet Loss
A statistical model-based Voice Activity Detection
This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
Noise Suppression Module Port From WebRTC
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.