My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.
liusongxiang / cpc_audio Goto Github PK
View Code? Open in Web Editor NEWThis project forked from facebookresearch/cpc_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
License: MIT License