Giter Site home page Giter Site logo

David zhang's Projects

caffe icon caffe

Caffe: a fast framework for deep learning. For the most recent version checkout the dev branch. For the latest stable release checkout the master branch.

cs228 icon cs228

Programming assignments for the class

dccrn icon dccrn

implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch

deepxi icon deepxi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

dns-challenge icon dns-challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

easy_hmm icon easy_hmm

A easy HMM program written with Python, including the full codes of training, prediction and decoding.

ecapa-tdnn icon ecapa-tdnn

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

fast-rir icon fast-rir

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

fullsubnet-plus icon fullsubnet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

hnm icon hnm

Harmonic plus noise synthesis

ir-gan icon ir-gan

Augmenting Room Impulse Response

masg icon masg

microphone array speech generator (MASG) in room acoustic

mesh2ir icon mesh2ir

This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D scenes represented using a mesh.

mmagic icon mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.

nara_wpe icon nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

realbasicvsr icon realbasicvsr

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

tc-resnet icon tc-resnet

Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices

triplet_loss_kws icon triplet_loss_kws

Learning Efficient Representations for Keyword Spotting with Triplet Loss

voiceflow-tts icon voiceflow-tts

This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.