ishine

followers: 114 following: 132 repos: 3.3K gists: 1

Type: User

Company: gerzz.inc

Bio: speech asr/speech-recognition tts/text-to-speech vc/voice-conversion

Location: shanghai

Blog: dubbing-ai.com

Hi 👋, I'm ishine.

🔭 I’m currently working on TTS, VC, SVS, ASR.
voice conversion/changer @ dubbing-ai.com

ishine's Projects

transformer-tts

A PyTorch Implementation of "Neural Speech Synthesis with Transformer Network"

transformer-tts-1

A TensorFlow implementation modeled on "Neural Speech Synthesis with Transformer Network", ported from OpenSeq2Seq

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.

transformertts-1

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

transfusion-asr

Transcribing Speech with Multinomial Diffusion, training code and models.

translate-python

Online translation as a Python module & command line tool. No key, no authentication needed.

transphone

phone tokenizer and grapheme-to-phoneme model for 8k languages

transtacos-retunegan

A toy Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.

transummar

Transformer for abstractive summarization on CNN/Daily Mail and Gigaword

triplenet

TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots (CoNLL2019)

triplet-loss-train-for-speaker-recognition

A complete voiceprint (speaker) recognition project. Earlier, I uploaded a classic VGG-based model for speaker recognition that was trained simply with softmax loss. During testing, however, we found that model unreliable: it easily distinguishes man-man and man-woman groups but struggles with woman-woman groups. So we retrained the model with a method called triplet-group, using triplet loss as the loss for backpropagation. This repository contains our core code and the training curves for the two training stages. Why "two training stages"? To answer that, you need to understand the triplet-group method. Questions are very welcome at my mailbox: [email protected]
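For readers unfamiliar with triplet loss, here is a minimal sketch of the standard formulation in plain Python. It is not the repository's code and does not implement the triplet-group method, which the description does not spell out. The function names, the Euclidean distance, and the `margin` value of 0.2 are all assumptions chosen for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet loss for one (anchor, positive, negative) triple.

    anchor and positive are embeddings of the same speaker; negative is an
    embedding of a different speaker. The loss is zero once the negative is
    farther from the anchor than the positive by at least `margin`
    (a hypothetical default, not taken from the repository).
    """
    d_pos = euclidean(anchor, positive)
    d_neg = euclidean(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


if __name__ == "__main__":
    # Well-separated triple: the negative is far away, so the loss is zero.
    print(triplet_loss([0.0, 0.0], [0.0, 0.0], [3.0, 4.0]))
    # Hard triple: the negative sits on the anchor, so the loss is positive.
    print(triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 0.0]))
```

The loss only penalizes triples where a different speaker is embedded too close to the anchor, which is why it can help with confusable cases such as the woman-woman groups mentioned above.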