
Qibaba's Projects

rocblas

Next generation BLAS implementation for ROCm platform

s3prl

Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit

segnext

Official PyTorch implementation of "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

sherpa

Streaming and non-streaming ASR server in Python

sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn

sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, and Kotlin.

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

simclue

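A large-scale semantic understanding and matching dataset. It can be used for unsupervised contrastive learning, semi-supervised learning, and similar approaches to build the best-performing pre-trained models for Chinese.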

sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

sndfile-tools

A collection of tools (written in C) to do interesting things with sound files

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

soundsourceseparation

Code for multi-channel speech enhancement and source separation methods such as MNMF, MNMF_DP, ILRMA, ILRMA_DP, FastMNMF, FastMNMF_DP, FCA, and FastFCA

specaugment

An implementation of SpecAugment, introduced by Google Brain, in TensorFlow and PyTorch

speech-aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

speech-backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speech-enhancement-using-gsc

An implementation of the Generalized Sidelobe Canceller, with a fixed beamformer, a parallel blocking matrix, and an adaptive interference canceller, for effective speech enhancement.

speech-transformer

A PyTorch implementation of Speech Transformer, an end-to-end ASR model based on the Transformer network, for Mandarin Chinese.
