normonisping,Qibaba,github

rocblas

Next generation BLAS implementation for ROCm platform

s3prl

Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

segnext

Official PyTorch implementation of "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)

self-supervised-speech-recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

sherpa

Streaming and non-streaming ASR server in Python

sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

simclue

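A large-scale semantic understanding and matching dataset. It can be used with unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for the Chinese domain.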

sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

small-matrix-inverse

SIMD optimised library for matrix inversion of 2x2, 3x3, and 4x4 matrices.

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

sndfile-tools

A collection of tools (written in C) to do interesting things with sound files

sota-backbones

A collection of SOTA Image Classification Models in PyTorch

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

soundsourceseparation

The code for multi-channel speech enhancement and source separation such as MNMF, MNMF_DP, ILRMA, ILRMA_DP, FastMNMF, FastMNMF_DP, FCA, FastFCA

source_separation

Deep-learning-based speech source separation using PyTorch

specaugment

An implementation of SpecAugment, introduced by Google Brain, in TensorFlow and PyTorch

speech-aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

speech-backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speech-denoising-wavenet

A neural network for end-to-end speech denoising

speech-enhancement-using-gsc

An implementation of the Generalized Sidelobe Canceller, with a fixed beamformer, parallel blocking matrix, and adaptive interference canceller, for effective speech enhancement.

speech-enhancement-wgan

Speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN

speech-separation-paper-tutorial

A must-read paper for speech separation based on neural networks

speech-transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

speech2singing-demo

Demo for cycleGAN-based speech to singing conversion
