birdyfun,github

alimeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

ant-design

An enterprise-class UI design language and React UI library

av_hubert

A self-supervised learning framework for audio-visual speech

beamformit

BeamformIt acoustic beamforming software

causal-u-net

unofficial PyTorch implementation of 《A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement》

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

deepcomplexunetpytorch

Implementation of Deep Complex UNet Using PyTorch

eabnet

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.

espnet

End-to-End Speech Processing Toolkit

faf-net

funasr

A Fundamental End-to-End Speech Recognition Toolkit

kaldi_for_chime

kaldi-asr/kaldi is the official location of the Kaldi project.

l3das22_challenge

L3DAS22_Challenge for icassp 2021

lingvo

Lingvo

mimo-speech

Multi-channel (MI) Multi-speaker (MO) speech recognition.

multi-channel-asr-se-toolkit

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

multiiris-demo

demo page for multiIRIS: End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation

multimodal-speech-emotion-recognition

Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)

nbss

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

birdyfun Goto Github PK

birdyfun's Projects

Recommend Projects

Recommend Topics

Recommend Org