again-melgan-voice-cloning's People

Contributors

vlomme

again-melgan-voice-cloning's Issues

Wavgan as a vocoder for espeak

Dear community.
I want to let you know that I was able to train the wavgan vocoder on the output of espeak (http://espeak.sourceforge.net/), with the following results:
Espeak output en
Wavgan output LJSpeech
Espeak output ru
Wavgan output R012

I also tried to train melgan with the same approach, with this sad result:
Melgan output LJSpeech

This suggests that wavgan is relatively flexible about its input audio, which I hope encourages the community to try out other approaches.

Thank you Vlomme!
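
For anyone who wants to try something similar, here is a rough sketch of how espeak audio could be generated for each LJSpeech transcript so that it can be paired with the original recordings. The issue does not say how the training pairs were actually built or aligned, so the function names and the directory layout below are assumptions made only for illustration. The sketch relies on the espeak command-line options -v (voice) and -w (write a WAV file), and on the LJSpeech metadata.csv layout of id|transcription|normalized transcription.

```python
import csv
import shutil
import subprocess
from pathlib import Path


def build_espeak_command(text, wav_path, voice="en"):
    """Return the espeak invocation that writes `text` to `wav_path`.

    -v selects the voice (e.g. "en" or "ru"), -w writes a WAV file
    instead of playing the audio.
    """
    return ["espeak", "-v", voice, "-w", str(wav_path), text]


def read_ljspeech_metadata(metadata_path):
    """Yield (utterance_id, normalized_text) from an LJSpeech-style metadata.csv."""
    with open(metadata_path, encoding="utf-8", newline="") as f:
        # Fields are separated by "|" and are not quoted.
        for row in csv.reader(f, delimiter="|", quoting=csv.QUOTE_NONE):
            if row:
                yield row[0], row[-1]


def synthesize_pairs(metadata_path, out_dir, voice="en"):
    """Write one espeak WAV per utterance, named after the utterance id.

    Hypothetical helper: the output file names mirror the dataset ids so
    each espeak file can later be matched to the real recording.
    """
    if shutil.which("espeak") is None:
        raise RuntimeError("espeak was not found on PATH")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    for utt_id, text in read_ljspeech_metadata(metadata_path):
        command = build_espeak_command(text, out_dir / f"{utt_id}.wav", voice)
        subprocess.run(command, check=True)
```

Calling synthesize_pairs("LJSpeech-1.1/metadata.csv", "espeak_wavs") would then produce files such as espeak_wavs/LJ001-0001.wav. Note that the espeak audio and the real recording of the same sentence will generally differ in length, so how they are aligned for training remains an open choice.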

Error while model training start

When I try to start training, I get the following error (I added torch.autograd.set_detect_anomaly(True) to get more details):

Loading checkpoints
Checkpoint loaded: Epoch 298, Step 518000
Warning: Error detected in WeightNormCudaInterfaceBackward. Traceback of forward call that caused the error:
  File "melgan.py", line 204, in <module>
    train()
  File "melgan.py", line 114, in train
    d_fake = D(fake_audio.cuda())
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 550, in __call__
    result = self.forward(*input, **kwargs)
  File "/content/MelGan-WavGan/utils.py", line 278, in forward
    result.append(module(x))
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 550, in __call__
    result = self.forward(*input, **kwargs)
  File "/content/MelGan-WavGan/utils.py", line 259, in forward
    x = module(x)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 542, in __call__
    result = hook(self, input)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/utils/weight_norm.py", line 55, in __call__
    setattr(module, self.name, self.compute_weight(module))
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/utils/weight_norm.py", line 18, in compute_weight
    return _weight_norm(v, g, self.dim)
 (print_stack at /pytorch/torch/csrc/autograd/python_anomaly_mode.cpp:60)
Traceback (most recent call last):
  File "melgan.py", line 204, in <module>
    train()
  File "melgan.py", line 154, in train
    g_loss.backward()
  File "/usr/local/lib/python3.6/dist-packages/torch/tensor.py", line 198, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph)
  File "/usr/local/lib/python3.6/dist-packages/torch/autograd/__init__.py", line 100, in backward
    allow_unreachable=True)  # allow_unreachable flag
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [1, 1024, 3]] is at version 2; expected version 1 instead. Hint: the backtrace further above shows the operation that failed to compute its gradient. The variable in question was changed in there or anywhere later. Good luck!

Maybe I need to install a different PyTorch version, or there is some other issue with in-place operations on variables. I was not able to resolve it.

In my experiment I just run:
python melgan.py -r t -f mel
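
A possible cause, offered as a sketch and not as a diagnosis of melgan.py: the traceback shows the discriminator forward pass at line 114 and g_loss.backward() at line 154. If the discriminator's optimizer step runs between those two lines, it updates the weight-normalized parameters in place, and a later backward pass through the old graph fails with this kind of version mismatch. As far as I know, PyTorch 1.5 and later detect this for optimizer updates, while earlier versions let it pass silently. The toy example below reproduces the pattern and shows one common fix, a fresh forward pass through D for the generator loss. The tiny models, the hinge-style loss, and the names broken_step and fixed_step are all hypothetical and not taken from the repository.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-ins for the generator and discriminator (not the repo's models).
G = nn.Conv1d(1, 1, kernel_size=3, padding=1)
D = nn.Sequential(
    nn.utils.weight_norm(nn.Conv1d(1, 4, kernel_size=3, padding=1)),
    nn.LeakyReLU(0.2),
    nn.utils.weight_norm(nn.Conv1d(4, 1, kernel_size=3, padding=1)),
)
opt_g = torch.optim.Adam(G.parameters(), lr=1e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-4)


def broken_step(mel, real):
    """Reuses d_fake after opt_d.step(), which triggers the in-place error."""
    fake = G(mel)
    d_fake = D(fake)  # this graph saves D's current parameters
    d_loss = torch.relu(1 - D(real)).mean() + torch.relu(1 + d_fake).mean()
    opt_d.zero_grad()
    d_loss.backward(retain_graph=True)
    opt_d.step()  # modifies D's parameters in place
    g_loss = -d_fake.mean()  # still points at the graph built before the step
    opt_g.zero_grad()
    g_loss.backward()  # raises RuntimeError on recent PyTorch versions
    opt_g.step()


def fixed_step(mel, real):
    """Updates D on a detached fake, then re-runs D for the generator loss."""
    fake = G(mel)
    # Discriminator update: detach so no gradient flows back into G.
    d_loss = (torch.relu(1 - D(real)).mean()
              + torch.relu(1 + D(fake.detach())).mean())
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()
    # Generator update: new forward pass through the already updated D.
    g_loss = -D(fake).mean()
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
    return d_loss.item(), g_loss.item()


if __name__ == "__main__":
    mel = torch.randn(2, 1, 16)
    real = torch.randn(2, 1, 16)
    try:
        broken_step(mel, real)
    except RuntimeError as err:
        print("broken_step failed:", str(err)[:80])
    print("fixed_step losses:", fixed_step(mel, real))
```

If the training loop in melgan.py follows the broken_step ordering, either recomputing D(fake_audio) after the discriminator update or running the generator backward pass before the discriminator step should avoid the error. Pinning an older PyTorch release may also hide it, but the gradients would then be computed against parameters that have already changed.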

Add new voice

Hi, if I want to train MelGan and WavGan on a new voice, how much input data do I need for a new speaker: 1 hour, 10 hours, or more?
