nii-yamagishilab / multi-speaker-tacotron Goto Github PK
View Code? Open in Web Editor NEWVCTK multi-speaker tacotron for ICASSP 2020
License: BSD 3-Clause "New" or "Revised" License
VCTK multi-speaker tacotron for ICASSP 2020
License: BSD 3-Clause "New" or "Revised" License
Hello, I have tried downloading the data from the dropbox link you provided (https://www.dropbox.com/sh/rq4lebus0n8tmso/AACldbmKDPRN9YiXrRROjtTSa?dl=0). But unfortunately, I am getting the "Folder has too many files to download" error. I have also tried using "wget https://www.dropbox.com/sh/rq4lebus0n8tmso/AACldbmKDPRN9YiXrRROjtTSa?dl=1" command but I was still not able to download the data. I would be really grateful if you could provide a link to the dropbox which can be downloaded through wget command, or provide a zip version of the data folder. Thank you very much for the help.
Can you provide examples of runtime in your code, including all txt and wav.
I can not run your example with the path of /gs/hs0/tgh-19IAA/ecooper/data/vctk0.91-preprocessed-phone/source and target-data-root as well, and --hparam-json-file=/gs/hs0/tgh-19IAA/ecooper/data/vctk_hpf_sv56_preprocess_30db/target/hparams.json in scripts folder.Waiting for your letter.
Is there a way to generate the LDE embeddings for new samples that I have?
Hi.
I am exploring about speed of training and inference different multi speaker TTS models on single CPU or on singe GPU.
Thanks for any explanation in this case for current model or any other models of multi speaker TTS.
Hi,
Thanks for the great work and I am interested in synthesis my own voice using your system.
Can you provide a guide on how to do that?
In particular, I can't find the pre-trained model to generate new speaker embedding (pytorch-kaldi-neural-speaker-embeddings repo didn't share the pretrained model). Would you mind provide the embedding model you used?
Thank you very much for providing the source code. May I ask when the nancy model for parameter initialization will be available? Thanks.
Dear authors,
The paper is very interesting. Any news on when the code will be published?
It would be great if this repo's ReadME can clear the air regarding this repo and https://github.com/GSByeon/multi-speaker-tacotron-tensorflow (only Korean presentation) both claiming to be THE "Multi-Speaker Tacotron".
Hello,
I have 2 quick questions about what can be done using Tacotron.
What is the minimum training time (in minutes) required to have a good result ?
Can the processing time (after training data) be instantaneous ? I mean if we can get the cloned voice in real time...
Happy new year by the way !
Thank you !
Hi,
Great work you did. I have a question. Can you provide a script to extract features for a given text that can be used as input for the predictmel script (provided one has the phonemes from flite). I couldnt get it to work (normally i dont use tensorflow). I also wonder how to get from the phonemes to the token ids. Is that part of the feature extraction script?
Best,
Christian
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.