liusongxiang Goto Github PK

followers: 336.0 following: 97.0 repos: 87.0 gists: 3.0

Name: Songxiang Liu

Type: User

Bio: Work on spoken language processing: General Audio synthesis, TTS, VC, SVS & SVC etc.

Hi there 👋

My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.

My homepage

Google scholar profile

Songxiang Liu's Projects

g2p

g2p: English Grapheme To Phoneme Conversion

gelp

glow

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

hiddenmarkovmodel_pytorch

Pytorch: Viterbi, Forward-Backward and Baum Welch with a Hidden Markov Model (HMM)

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

hn-unifiedsourcefiltergan

jd-nmf

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion

kaldi-io-for-python

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

librittslabel

Alignment files of LibriTTS.

liusongxiang

liusongxiang.github.io

Personal homepage:

lpcnet

Efficient neural speech synthesis

merlin

This is now the official location of the Merlin project.

mfcc-dtw

Simple MFCC extractor and an speech recognition algorithm (Dynamic Time Warping)

montreal-forced-aligner

Command line utility for forced alignment using Kaldi

mos-render-test

mtts

A Demo of Mandarin/Chinese TTS frontend

nonparaseq2seqvc_code

Implementation code of non-parallel sequence-to-sequence VC

pages-themes-cayman

Custom Cayman is a Jekyll theme for GitHub Pages

parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

parselmouth

Praat in Python, the Pythonic way

piano-synthesis

Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".

pika-vim

My vim comfiguration

ppg-vc

PPG-Based Voice Conversion

python

All Algorithms implemented in Python

python-100-days

Python - 100天从新手到大师

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

rayeren.github.io

My personal homepage

resemblyzer

A python package to analyze and compare voices with deep learning

liusongxiang Goto Github PK

Hi there 👋

Songxiang Liu's Projects

Recommend Projects

Recommend Topics

Recommend Org