My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.
liusongxiang / fac-via-ppg Goto Github PK
View Code? Open in Web Editor NEWThis project forked from guanlongzhao/fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
License: Apache License 2.0