ishine's Projects
Keyword Spotting suitable for embedded devices.
为 AAAI-2018 Neural Networks Incorporating Dictionaries for Chinese Word Segmentation 添加大量注释
基于行块分布函数的通用网页正文抽取算法的Python版本实现,添加了英文支持/ Web page content extraction algorithm, support both Chinese and English
cxxnet port to windows
his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Cynical data selection
Compile a python package in one `.so` file, and package without source code
缠中说禅技术分析工具;缠论;股票;期货
C++ port of ZXing for Android
PyTorch implementation of Densely Connected Time Delay Neural Network
D2M-GAN for music generation from dance videos
Dual-Stage Attention-Based Recurrent Neural Net for Time Series Prediction
📃 **Unofficial** PyTorch Implementation of DA-RNN (arXiv:1704.02971)
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech
Multiple DOA estimation & delay-and-sum beamforming
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
darknet yolov3 tiny train model demo
Deep Audio Segmenter
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language"
PyTorch implementation of Data2Vec self-supervised approach for vision use cases.
The codebase for Data-driven general-purpose voice activity detection.
Streamlit app to visualize and edit TTS datasets
A dual-branch attention-in-attention transformer (dubbed DB-AIAT) to focus on both coarse and fine-grained regions of spectrum in parallel, i.e., spectral magnitude and lost complex spectral details. The source code will be released soon
A scoring neural backend for x-vector based speaker verification.
Code for DCASE 2020 task 1a and task 1b.