sungfeng-huang / meta-tts

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Python 100.00%
deep-learning few-shot-learning meta-learning pytorch speech-synthesis

meta-tts's People

Contributors

hhhaaahhhaa, sungfeng-huang, toolacious

meta-tts's Issues

Question for Meta-update in Meta-TTS

I have a question about the meta-update in the outer loop of Meta-TTS.

I compared the Meta-TTS algorithm with the original MAML algorithm. Are they the same in the part marked by the red box below?

image

I cannot find where the query losses of all tasks are summed in the repo. Could you point me to it?

image

For reference, the learn2learn meta-learning example for MNIST also seems to sum the query loss over all tasks:
https://github.com/learnables/learn2learn/blob/0b9d3a3d540646307ca5debf8ad9c79ffe975e1c/examples/vision/meta_mnist.py#L100
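
One general point that may help with this question: an explicit sum of query losses does not have to appear in code, because adding up per-task gradients one task at a time gives the same outer-loop gradient as differentiating the summed loss. The toy sketch below checks this on a scalar parameter with quadratic losses. It is not taken from the Meta-TTS repo; the task targets, `ALPHA`, and all function names are hypothetical, and whether the repo relies on this kind of accumulation is an assumption.

```python
# Toy scalar MAML: each task has a support target c and a query target d.
ALPHA = 0.1  # inner-loop learning rate (hypothetical value)


def adapt(theta, c):
    """One inner-loop SGD step on the support loss 0.5 * (theta - c)**2."""
    return theta - ALPHA * (theta - c)


def query_loss(theta, c, d):
    """Query loss evaluated at the adapted parameter."""
    return 0.5 * (adapt(theta, c) - d) ** 2


def query_grad(theta, c, d):
    """Exact d(query_loss)/d(theta), differentiating through the inner step."""
    return (adapt(theta, c) - d) * (1.0 - ALPHA)


def grad_of_summed_loss(theta, tasks, eps=1e-6):
    """Central finite difference of the query loss summed over all tasks."""
    def total(th):
        return sum(query_loss(th, c, d) for c, d in tasks)
    return (total(theta + eps) - total(theta - eps)) / (2 * eps)


def accumulated_grad(theta, tasks):
    """Per-task gradients added one at a time, like repeated backward passes."""
    g = 0.0
    for c, d in tasks:
        g += query_grad(theta, c, d)
    return g


tasks = [(1.0, 2.0), (-0.5, 0.3), (3.0, 2.5)]  # made-up (support, query) targets
theta = 0.7
g_sum = grad_of_summed_loss(theta, tasks)
g_acc = accumulated_grad(theta, tasks)
print(abs(g_sum - g_acc) < 1e-6)
```

In frameworks that accumulate gradients across backward calls or average them across devices, the sum (or mean) over tasks can therefore be implicit rather than written out as a single expression.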

One more question, about the MAML class from learn2learn (L2L): what does "out of space" mean here?
image

Thank you, SungFeng.

What's the difference between the models in the "pretrained model" link?

Hi. The work is amazing. If I want to test the model, which pretrained model should I use? I ask because the filenames in the pretrained model link are quite similar.
I also noticed that on the demo page (Section 4.3) you only did parallel voice cloning with unseen speakers. Have you tried testing these speakers with different text?
Thank you very much.

Have you tried to finetune only a few parameters?

Hi, thanks for your amazing work! I noticed that you finetune the whole decoder together with other layers in your experiments. Have you ever tried finetuning only a few parameters, for example only the last layer of the model? I would like to know how it performs with very few trainable parameters.
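
For readers who want to try this, here is a minimal PyTorch sketch of last-layer-only finetuning. The small `nn.Sequential` model is a hypothetical stand-in, not the Meta-TTS decoder, and the layer sizes and learning rate are made up for illustration.

```python
import torch
from torch import nn

# Hypothetical stand-in for a decoder; the real Meta-TTS modules differ.
model = nn.Sequential(
    nn.Linear(16, 32),
    nn.ReLU(),
    nn.Linear(32, 32),
    nn.ReLU(),
    nn.Linear(32, 80),  # "last layer", e.g. a projection to 80 mel bins
)

# Freeze everything, then re-enable gradients for the last layer only.
for p in model.parameters():
    p.requires_grad = False
for p in model[-1].parameters():
    p.requires_grad = True

# Hand only the trainable parameters to the optimizer.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.Adam(trainable, lr=1e-3)

n_trainable = sum(p.numel() for p in trainable)
n_total = sum(p.numel() for p in model.parameters())

# One dummy update step on random data to confirm frozen weights stay fixed.
x = torch.randn(4, 16)
target = torch.randn(4, 80)
frozen_before = model[0].weight.clone()
loss = nn.functional.mse_loss(model(x), target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
print(n_trainable, n_total, torch.equal(model[0].weight, frozen_before))
```

The same pattern applies to any submodule: select its parameters, set `requires_grad` accordingly, and build the optimizer from the trainable subset.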

LibriTTS-360 pretrained checkpoints

Hello, thanks for the great work on the MetaTTS paper and repo!

I noticed there are "dev" configs for training on larger datasets, including the LibriTTS 360 and 500 subsets. Do you have plans to release pretrained checkpoints and publish results obtained with more data?

try HiFi-GAN

I noticed that you used MelGAN in this paper. Have you also tried using HiFi-GAN?

incompatible package versions

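Thank you for your contribution. I am a student and would like to use your Meta-TTS model as a baseline or control group for my graduation thesis, so I need to reproduce your code. During reproduction I ran into some unexpected bugs, such as incompatible package versions. If possible, could you provide the versions of the packages listed in requirements.txt? Or should I reproduce an earlier version of the code (one that can use your pretrained models)? I want to run inference with your pretrained models, and I would be very grateful for any advice you can give.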

8gpu CUDA out of memory

I used 8 GPUs for training, with shots and queries set to 3 and the meta batch size set to 8. In config/train/base.yaml, the batch size is set to 48 and grad_acc_step is set to 8. But when the code runs, it still reports CUDA out of memory. What else do I need to change?
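
As general background, peak activation memory usually scales with the number of samples in a single forward/backward pass, not with the number of accumulation steps. The sketch below shows the arithmetic for trading a smaller per-step batch for more accumulation steps at a constant effective batch size. It assumes the batch size in config/train/base.yaml applies per GPU, which is not confirmed here; the function names and the `shrink` factor are hypothetical.

```python
def effective_batch(per_gpu_batch, n_gpus, grad_acc_steps):
    """Samples contributing to one optimizer step under data parallelism."""
    return per_gpu_batch * n_gpus * grad_acc_steps


def rebalance(per_gpu_batch, grad_acc_steps, shrink):
    """Shrink the per-step batch by `shrink` and grow accumulation to match."""
    if per_gpu_batch % shrink != 0:
        raise ValueError("shrink must divide the per-GPU batch size")
    return per_gpu_batch // shrink, grad_acc_steps * shrink


# Values from the report above: batch size 48, 8 GPUs, 8 accumulation steps.
before = effective_batch(48, 8, 8)
small_batch, more_steps = rebalance(48, 8, shrink=4)
after = effective_batch(small_batch, 8, more_steps)
print(before, small_batch, more_steps, after)
```

Under that assumption, raising the accumulation step count alone does not lower memory use; only the per-step batch (or the length of the sequences in it) does.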
