Giter Site home page Giter Site logo

Hi there 👋

My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.

My homepage

Google scholar profile

Songxiang Liu's Projects

academicodec icon academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

advattacksasvspoof icon advattacksasvspoof

This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".

algorithm_interview_notes-chinese icon algorithm_interview_notes-chinese

2018/2019/校招/春招/秋招/自然语言处理(NLP)/深度学习(Deep Learning)/机器学习(Machine Learning)/C/C++/Python/面试笔记,此外,还包括创建者看到的所有机器学习/深度学习面经中的问题。 除了其中 DL/ML 相关的,其他与算法岗相关的计算机知识也会记录。 但是不会包括如前端/测试/JAVA/Android等岗位中有关的问题。

audiolm-pytorch icon audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

blow icon blow

Code to train and run Blow

bne-seq2seqmol-vc icon bne-seq2seqmol-vc

Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"

cceyda icon cceyda

Short profile with some stats and keywords

cnpy icon cnpy

library to read/write .npy and .npz files in C/C++

cpc_audio icon cpc_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

cpp_primer icon cpp_primer

My solutions to C++ Primer(5th edition) exercises.

crystal icon crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

e6870 icon e6870

My solution to course E6870 (Speech Recognition) of Columbia University.

efficient_tts icon efficient_tts

Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

end-to-end-asr-pytorch icon end-to-end-asr-pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

end2endac icon end2endac

Audio samples for the paper "End-to-end Accent Conversion"

espnet icon espnet

End-to-End Speech Processing Toolkit

fac-via-ppg icon fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.