ishine's Projects
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Ximalaya audio download tool
A PyTorch implementation of target speaker extraction.
An implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
Multi-dimensional arrays with broadcasting and lazy computing
being a multi-speaker video-to-speech network
A chatbot about 血色衣冠, built on Rasa, with support for voice conversations with the bot
Extremely fast non-cryptographic hash algorithm
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
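YouKeFu (优客服) is a multi-channel customer support platform (intelligent customer service system) and telephone sales platform (telesales system), with access channels including WebIM, WeChat, phone, email, and SMS. http://www.youkefu.cn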
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Audiobook crawler for the 恋听网 website, built with the Scrapy framework
Building a KG from scratch
Open tools and data for cloudless automatic speech recognition
Tacotron based speech synthesizer
A BERT-based Chinese Text Encoder Enhanced by N-gram Representations
Zero -- A neural machine translation system
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
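Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system, comprising a speech encoder, a speech synthesizer, a vocoder, and a visualization module.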