yuekaizhang Goto Github PK

followers: 94.0 following: 25.0 repos: 28.0 gists: 1.0

Name: Yuekai Zhang

Type: User

Company: @Nvidia

Location: Shanghai, CN

Blog: https://scholar.google.com/citations?user=YGmuq3UAAAAJ&hl=en

Yuekai Zhang's Projects

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio-adversarial-examples-papers

ctc_decoder

A ctc decoder for both online and offline asr model

espnet

End-to-End Speech Processing Toolkit

fastchat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

fastertransformer

Transformer related optimization, including BERT, GPT

funasr

A Fundamental End-to-End Speech Recognition Toolkit

gss

A simple package for Guided source separation (GSS)

icefall

instructglm

GLM model SFT

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

lhotse

Tools for handling speech data in machine learning projects.

minutes

Podcast Summarizer with LLM Technology

nemo

NeMo: a toolkit for conversational AI

nemo-guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

sherpa

Streaming and non-streaming ASR server in Python

sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

tensorrt-hackthon-wenet

triton-asr-client

ASR client for Triton ASR Service

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

yuekaizhang Goto Github PK

Yuekai Zhang's Projects

Recommend Projects

Recommend Topics

Recommend Org