Is it possible to fine-tune it using a new voice? about seamless_communication HOT 3 CLOSED

facebookresearch commented on August 15, 2024 5

Is it possible to fine-tune it using a new voice?

from seamless_communication.

Comments (3)

cndn commented on August 15, 2024 1

Hey @rtruszkowski - I imagine the question is about getting a translation model that generates a specific voice? Then you only need to train your own vocoder. We trained the vocoder following the HifiGAN implementation in https://github.com/facebookresearch/speech-resynthesis - our multilingual version is slightly different in aux embedding.

Given a dataset you have with your own voice
(1) With our UPCOMING unit_extraction pipeline (XLSR + kmeans), extract discrete units (WIP #17 by @kauterry )
(2) Train vocoder using the library above
(3) At inference time, replace our multilingual vocoder with your vocoder

from seamless_communication.

kauterry commented on August 15, 2024

#17 is merged, so this should be possible.

from seamless_communication.

blldd commented on August 15, 2024

Hey @rtruszkowski - I imagine the question is about getting a translation model that generates a specific voice? Then you only need to train your own vocoder. We trained the vocoder following the HifiGAN implementation in https://github.com/facebookresearch/speech-resynthesis - our multilingual version is slightly different in aux embedding.

Given a dataset you have with your own voice (1) With our UPCOMING unit_extraction pipeline (XLSR + kmeans), extract discrete units (WIP #17 by @kauterry ) (2) Train vocoder using the library above (3) At inference time, replace our multilingual vocoder with your vocoder

Thanks for the suggestion, I'm curious about the amount of voice data. So, how many seconds of voice data do I need to collect to train a stable vocoder? And how long does the training process takes?
Further, is there any more convenient way?
Thank you for your reply :P

from seamless_communication.

Recommend Projects

Is it possible to fine-tune it using a new voice? about seamless_communication HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent