ishine Goto Github PK

largest-ever Automatic Speech Recognition leaderboard, periodically benchmarks SOTA commercial ASR APIs from Alibaba, Baidu, Google, IFlytek, Microsoft and so on.

leaf-audio

learn2sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

learnableupsamplinglayer-pytorch

Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)

lebert

Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"

leveraging-self-supervised-learning-for-avsr

Official PyTorch implementation of paper Leveraging Uni-Modal Self Supervised Learning for Multimodal Audio-visual Speech Recognition

lexiconaugmentedner

Reject complicated operations for incorporating lexicon for Chinese NER.

lfc

lhotse

Tools for handling speech data in machine learning projects.

libaudio3d

binaural 3D sound synthesis using HRTFs

libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000FPS.

libfvad

Voice activity detection (VAD) library, based on WebRTC's VAD engine

libritts-rtvc

libsio

libtorch_mobile_build

libtorch mobile build script

libweicws

A micro blog oriented Chinese word segmentation system. Code for 'Micro blogs Oriented Word Segmentation System'

ishine Goto Github PK

Hi 👋, I'm ishine.

ishine's Projects

Recommend Projects

Recommend Topics

Recommend Org