ishine's Projects
语音文本时间打点,voice alignment ,Conversion Time Mark
Master thesis Fall 2018: Neural Network based Audio Blind source Separation for Noise Suppression @ EPFL & Logitech
Audio Diarization Annotation tool
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Unofficial PyTorch implementation of paper Masked Autoencoders that Listen
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
A Convolutional Transformer for Keyword Spotting
JavaScript library to sync audio with text based on a timing file
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
A Tiny Project For ASR model training and Deployment
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
Train custom adaptive filter optimizers without hand tuning or extra labels.
Voice conversion
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
Auto-KWS 2021 Challenge 1st place solution.
新闻人物言论自动提取
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
auto generate srt subtitles for any video or audio file and translate it for free using googletrans-4.0.0-rc1
[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file
Python+flask+selenium 搭建UI自动化测试平台
Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
Avocodo: Generative Adversarial Network for Artifact-free Vocoder [WIP]