My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.
liusongxiang Goto Github PK
Name: Songxiang Liu
Type: User
Bio: Work on spoken language processing: General Audio synthesis, TTS, VC, SVS & SVC etc.