normonisping
Name: Qibaba
Type: User
Next generation BLAS implementation for ROCm platform
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
Foundational Models for State-of-the-Art Speech and Text Translation
Official PyTorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Streaming and non-streaming ASR server in Python
Real-time speech recognition using next-gen Kaldi with ncnn
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
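A large-scale dataset for semantic understanding and matching. It can be used with unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for the Chinese domain.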
SincNet is a neural architecture for efficiently processing raw audio samples.
SIMD optimised library for matrix inversion of 2x2, 3x3, and 4x4 matrices.
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
A collection of tools (written in C) to do interesting things with sound files
A collection of SOTA Image Classification Models in PyTorch
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
The code for multi-channel speech enhancement and source separation such as MNMF, MNMF_DP, ILRMA, ILRMA_DP, FastMNMF, FastMNMF_DP, FCA, FastFCA
Deep-learning-based speech source separation using PyTorch
An implementation of SpecAugment (introduced by Google Brain) with TensorFlow and PyTorch
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
A neural network for end-to-end speech denoising
An implementation of the Generalized Sidelobe Canceller, with a fixed beamformer, parallel blocking matrix, and adaptive interference canceller, for effective speech enhancement.
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
A must-read paper for speech separation based on neural networks
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Demo for cycleGAN-based speech to singing conversion