Giter Site home page Giter Site logo

Katsuya Iida's Projects

all-en icon all-en

Chromium extension to replace ja-jp with en-us in the URL so that you can easily move from Japanese page to English page if the site is multilingual.

fairseq icon fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

kokoro-align icon kokoro-align

Kokoro-Align is a PyTorch speech-transcript alignment tool for LibriVox. It splits audio files in silent positions and find CTC best path to align transcript texts with the audio files.

nemo icon nemo

NeMo: a toolkit for conversational AI

nemoonnxsharp icon nemoonnxsharp

Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime for .NET Core.

soundstream-pytorch icon soundstream-pytorch

Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint

tacotron icon tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

torchsharp icon torchsharp

A .NET library that provides access to the library that powers PyTorch.

tts icon tts

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

voice100 icon voice100

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

voice100-runtime icon voice100-runtime

Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

voice100androidapp icon voice100androidapp

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and Voice100 neural TTS/ASR models on Xamarin. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

voice100sharp icon voice100sharp

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

wav2vec2_ja icon wav2vec2_ja

wav2vec 2.0 finetuned with Common Voice 12.0 Japanese

world icon world

A high-quality speech analysis, manipulation and synthesis system

yamnetunitydemo icon yamnetunitydemo

This is prediction demo of TensorFlow YamNet model on Unity Barracuda.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.