multi-speaker-tacotron's People

Contributors

ecooper7

multi-speaker-tacotron's Issues

problem downloading data from Dropbox

Hello, I tried downloading the data from the Dropbox link you provided (https://www.dropbox.com/sh/rq4lebus0n8tmso/AACldbmKDPRN9YiXrRROjtTSa?dl=0), but I get the "Folder has too many files to download" error. I also tried the command "wget https://www.dropbox.com/sh/rq4lebus0n8tmso/AACldbmKDPRN9YiXrRROjtTSa?dl=1", but I was still not able to download the data. I would be really grateful if you could provide a Dropbox link that can be downloaded with wget, or a zip archive of the data folder. Thank you very much for the help.

Dataset

Can you provide a runnable example for your code, including all the txt and wav files it needs?
I cannot run the examples in the scripts folder, because they use the source path /gs/hs0/tgh-19IAA/ecooper/data/vctk0.91-preprocessed-phone/source, a target-data-root, and --hparam-json-file=/gs/hs0/tgh-19IAA/ecooper/data/vctk_hpf_sv56_preprocess_30db/target/hparams.json. I look forward to your reply.

inference speed

Hi.
I am exploring the training and inference speed of different multi-speaker TTS models on a single CPU or a single GPU.
Thanks for any information on this, for the current model or for any other multi-speaker TTS model.
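
For anyone comparing inference speed across models, one common measure is the real-time factor (RTF): synthesis time divided by the duration of the audio produced. Below is a minimal sketch of how it could be measured. The `synthesize` callable is a hypothetical stand-in for whatever model is being timed; it is not part of this repository.

```python
import time
from typing import Callable, Sequence


def real_time_factor(synthesis_seconds: float, num_samples: int, sample_rate: int) -> float:
    """Return synthesis time divided by audio duration (RTF < 1 is faster than real time)."""
    if num_samples <= 0 or sample_rate <= 0:
        raise ValueError("num_samples and sample_rate must be positive")
    audio_seconds = num_samples / sample_rate
    return synthesis_seconds / audio_seconds


def time_synthesis(
    synthesize: Callable[[str], Sequence[float]],  # hypothetical: text -> waveform samples
    text: str,
    sample_rate: int,
) -> float:
    """Time one call to `synthesize` and return its real-time factor."""
    start = time.perf_counter()
    waveform = synthesize(text)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, len(waveform), sample_rate)
```

For a fair comparison, it usually helps to run a few warm-up calls first and to average over several sentences, since the first call on a GPU often includes one-time setup cost.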

Guide on how to synthesize my own voice

Hi,

Thanks for the great work. I am interested in synthesizing my own voice using your system.

Can you provide a guide on how to do that?

In particular, I can't find the pre-trained model needed to generate a new speaker embedding (the pytorch-kaldi-neural-speaker-embeddings repo doesn't share a pretrained model). Would you mind providing the embedding model you used?

Can we get a cloned voice in real time?

Hello,

I have 2 quick questions about what can be done using Tacotron.

What is the minimum training time (in minutes) required to get a good result?
Can the processing time (after training) be instantaneous? I mean, can we get the cloned voice in real time?
Happy new year, by the way!

Thank you!

My additive attention is not good

The additive attention alignment has always been bad.
According to your paper, this attention helps the forward attention to align, so is it normal for it to always be bad?
My forward attention is well aligned.

I use all of VCTK, and the batch_size is 32.
The figures below show epochs 37, 47, and 48.

[Figures not preserved; image labels: 53000, 67000, 70000]

Feature generation for given text

Hi,

Great work. I have a question: can you provide a script that extracts features for a given text, to be used as input for the predictmel script (provided one has the phonemes from flite)? I couldn't get it to work (I don't normally use TensorFlow). I also wonder how to get from the phonemes to the token IDs. Is that part of the feature extraction script?

Best,
Christian
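
As a general illustration of the phoneme-to-token-ID step asked about above, the mapping is usually a lookup into a fixed symbol table. This is only a sketch under assumptions: the special symbols, the sorted ordering, and the tiny phoneme inventory are all hypothetical and not taken from this repository. For a trained model, the IDs have to match the symbol table that was used at training time.

```python
from typing import Dict, List

# Hypothetical special symbols; the real system may use different ones.
PAD, EOS, UNK = "<pad>", "<eos>", "<unk>"


def build_vocab(phoneme_inventory: List[str]) -> Dict[str, int]:
    """Assign an integer ID to each symbol: specials first, then sorted phonemes."""
    symbols = [PAD, EOS, UNK] + sorted(set(phoneme_inventory))
    return {symbol: index for index, symbol in enumerate(symbols)}


def encode(phonemes: List[str], vocab: Dict[str, int]) -> List[int]:
    """Map phonemes to IDs, using UNK for unseen symbols, and append EOS."""
    ids = [vocab.get(p, vocab[UNK]) for p in phonemes]
    ids.append(vocab[EOS])
    return ids


# Hypothetical inventory and input sequence, for illustration only.
vocab = build_vocab(["hh", "ax", "l", "ow"])
token_ids = encode(["hh", "ax", "l", "ow"], vocab)
```

With this toy inventory, `token_ids` is `[4, 3, 5, 6, 1]`: the phonemes sort to ax=3, hh=4, l=5, ow=6, and the trailing 1 is the end-of-sequence symbol.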
