Giter Site home page Giter Site logo

Hi 👋, I'm ishine.

  • 🔭 I’m currently working on TTS, VC, SVS, ASR.
  • voice conversion/changer @ dubbing-ai.com

ishine's Projects

alibi icon alibi

PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

aligntts icon aligntts

Implementation of ALIGNTTS: EFFICIENT FEED-FORWARD TEXT-TO-SPEECH SYSTEMWITHOUT EXPLICIT ALIGNMENT (arXiv:2003.01950v1 [eess.AS] 4 Mar 2020)

alimeeting icon alimeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

alta icon alta

An Automatic Lyrics Transcription Framework using Dilated Convolutional Neural Networks with Self-Attention based on kaldi

amazonqa icon amazonqa

Evidence-based QA system for community question answering.

amtl-bnfs-i-vector-based-lr icon amtl-bnfs-i-vector-based-lr

This is the implementation of an unpublished paper: Adversarial Multi-task deep feature and unsupervised back-end adpatation for language recognition

animateportrait icon animateportrait

Code for "Animating Portrait Line Drawings from a Single Face Photo and a Speech Signal"

animeganv2 icon animeganv2

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

animeganv3 icon animeganv3

Use AnimeGANv3 to make your own animation works, including turning photos or videos into anime.

anofed icon anofed

用于标注、查看和修改结果的中文NLP分词工具

answer icon answer

基于本体的语义搜索引擎-Answer-V0.9 问答系统

apb2facev2 icon apb2facev2

An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment

aps icon aps

A workspace for single/multi-channel speech recognition & enhancement & separation.

ash-ir-dataset icon ash-ir-dataset

An impulse response dataset for binaural synthesis of spatial audio systems on headphones

asr-hybrid-decoding icon asr-hybrid-decoding

This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and for further recovery of OOV words.

asr-score icon asr-score

simple, dependency-free Word Error Rate calculator in python3

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.