ishine Goto Github PK

followers: 109.0 following: 111.0 repos: 3.2K gists: 1.0

Type: User

Company: gerzz.inc

Bio: speech asr/speech-recognition tts/text-to-speech vc/voice-conversion

Location: shanghai

Blog: dubbing-ai.com

Hi 👋, I'm ishine.

🔭 I’m currently working on TTS, VC, SVS, ASR.
voice conversion/changer @ dubbing-ai.com

ishine's Projects

da-rnn

Dual-Stage Attention-Based Recurrent Neural Net for Time Series Prediction

da-rnn-1

📃 **Unofficial** PyTorch Implementation of DA-RNN (arXiv:1704.02971)

daft-exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

dailytalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

dap_project

Multiple DOA estimation & delay-and-sum beamforming

darcn

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language"

data2vec-vision

PyTorch implementation of Data2Vec self-supervised approach for vision use cases.

datadriven-gpvad

The codebase for Data-driven general-purpose voice activity detection.

dataset_viewer

Streamlit app to visualize and edit TTS datasets

A dual-branch attention-in-attention transformer (dubbed DB-AIAT) to focus on both coarse and fine-grained regions of spectrum in parallel, i.e., spectral magnitude and lost complex spectral details. The source code will be released soon

dbenet

A scoring neural backend for x-vector based speaker verification.

dcase2020_task1

Code for DCASE 2020 task 1a and task 1b.

dcasenet

Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, and sound event detection. Implemented using PyTorch.