makinglong Goto Github PK
Type: User
Bio: VOIP: WEBRTC FFMPEG SPEEX PJSIP ASR: KALDI HTK CMU_SPHINX LFR_DFMNS
Type: User
Bio: VOIP: WEBRTC FFMPEG SPEEX PJSIP ASR: KALDI HTK CMU_SPHINX LFR_DFMNS
《机器学习》(西瓜书)公式推导解析,在线阅读地址:https://datawhalechina.github.io/pumpkin-book
A Python wrapper for Kaldi
Example code for interfacing with C and C++ from Python using Cython, SWIG, CFFI, PyPy, and pybind11
Image-to-image translation in PyTorch (e.g., horse2zebra, edges2cats, and more)
Turn Chinese natural language into structured data 中文自然语言理解
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Recurrent neural network for audio noise reduction
Robotics Toolbox for MATLAB
deep learning based speech enhancement using keras python, make it easy to use
Unsupervised text tokenizer for Neural Network-based text generation.
similarity:相似度计算工具包,java编写。用于词语、短语、句子、词法分析、情感分析、语义分析等相关的相似度计算。
国内外为数不多致力于极致体验的超强全自研跨平台(windows/android/iOS)流媒体内核,通过模块化自由组合,支持实时RTMP推流、RTSP推流、RTMP播放器、RTSP播放器、录像、多路流媒体转发、音视频导播、动态视频合成、音频混音、直播互动、内置轻量级RTSP服务等,比快更快,业界真正靠谱的超低延迟直播SDK(1秒内,低延迟模式下200~400ms)。
Snips Python library to extract meaning from text
Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://www.aclweb.org/anthology/2020.acl-main.68/
Simple library to speed up or slow down speech
Android NDK wrapper for libsonic
An iOS Application written in Objective-C use to record and play sounds on iPhone/iPad. 📱 🔊
基于SoundTouch的变音Demo,AndroidStudio工程,支持实时语音变音处理
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
A neural network for end-to-end speech denoising
2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告
Deep learning for audio denoising
This is a single channel speech dereverberation method based on DOI: 10.1109/TSA.2005.858066; implemented in MATLAB
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A local auto speech recognition project based on Kaldi and ALSA.
Pure Java speech recognition library
Spoken Dialogue System: ASR, NLU, DM, NLG, and TTS
Voice Conversion Tool Kit
SRS is a simple, high efficiency and realtime video server, supports RTMP, WebRTC, HLS, HTTP-FLV, SRT and GB28181.
Voice Activity Detector (VAD) for NIST Speaker Recognition Evaluation
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.