jinmingche,github

accentedspeechrecognition

Experiments on speech recognition robustness to accents and dialects

asteroid

The PyTorch-based audio source separation toolkit for researchers

asv-subtools

Open-source tools for speaker recognition

attention_keras

Keras Layer implementation of Attention for Sequential models

attentionisoffbyone

Implementation of "Attention Is Off By One" by Evan Miller

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor/tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

audiomer-pytorch

A Convolutional Transformer for Keyword Spotting

audiotagger

Deep Learning Neural Networks Final Project

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

awesome-keyword-spotting

A curated list of resources on speech keyword spotting (wake-up word detection).

awesome-speech-enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

bandwidth_extension

Bandwidth extension (BWE) in MATLAB

beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR beamformers

chinese_speech_pretrain

Chinese speech pretrained models

cif-hieradist

[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation

colossalai

Making big AI models cheaper, easier, and scalable

comparison-of-blind-source-separation-techniques

Compare AIRES BSS with TRINICON, ILRMA and AuxIVA (online and offline versions)

comprehensive-transformer-tts

A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

configurationfiles

Configuration files, such as repo (for downloading the Android source) and .git-completion.bash (Bash autocompletion for Git)

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

dacidian

DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)

darcn

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

dccrn-with-various-loss-functions

DCCRN with various loss functions

deep-compression-pytorch

PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally

deepcomplexcrn

deepfilternet

Noise suppression using deep filtering

deepxi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

deit

Official DeiT repository

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper "Real Time Speech Enhancement in the Waveform Domain", in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

diasenti

Conversational Multimodal Emotion Recognition
