Comments (3)
@fazlekarim Since it is an unsupervised way to learn the tokens, it's hard to see what the token means when using multi-speakers. I guess it's better to add a speaker embedding to the network as another google's paper.
from gst-tacotron.
The GST paper covered this, too. Section 7.2:
Without using any metadata as labels, we train a baseline
Tacotron and a 1024-token GST model for comparison. As
expected, the baseline fails to learn, since the multi-speaker
data is too varied. The GST model results are presented
in Figure 7. This shows spectrograms for the same phrase
overlaid with F0 tracks, generated by conditioning the model
on two randomly chosen tokens. Examining the trained
GSTs, we find that different tokens correspond to different
speakers. This means that, to synthesize with a specific
speaker’s voice, we can simply feed audio from that speaker
as a reference signal. See Section 7.3 for more quantitative
evaluations
from gst-tacotron.
@rryan With that in mind, can we exploit this and use it for voice conversion?
from gst-tacotron.
Related Issues (20)
- GMM Attention HOT 5
- No clear speech HOT 7
- Some problems when preprocessing ljspeech dataset HOT 1
- Reference Encoder Padding
- where do you insert or import wav file of models voice for training?
- Why use the 'tf.layer.conv1d' for query, key transformation instead of fully connected layer?
- Error in datafeeder.py HOT 1
- Path for Reference Audio HOT 1
- erro in eval.py HOT 1
- Check failed: dnnReLUCreateBackward_F32 HOT 1
- can we synthesis speaker-A's tone with speaker-B's prosody?
- What is in reference audio path?
- Pretrained Weights HOT 1
- Unable to reproduce results
- Mumbling in synthesis HOT 1
- Regarding the trained model
- Using pre-trained model of Keithito's tacotron implementation
- Add style weights when there is no reference audio
- shape of linear_outputs is not same as while training
- training stops many seconds to create new queue of data
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gst-tacotron.