tensorflow_wavenet_vocoder's People

Contributors

azraelkuan

tensorflow_wavenet_vocoder's Issues

Some questions about my implementation

Hi azraelkuan, thanks for your work and for sharing it!
I ran into three problems while trying it out.

My environment: Windows 10, python==3.6.7, tensorflow==1.11, anaconda==3

First, after "preprocess.py" finished, my LJSpeech-1-mel.npy was only 176 KB and LJSpeech-1-audio.npy was only 281 KB. I think something may be getting overwritten repeatedly, or the problem may be caused by a difference between Windows and Linux. I am not sure about this.

Second, during the Synthesize step I could not find a file called "eval.txt". After preprocessing, the output path contains only three files (LJSpeech-1-audio.npy, LJSpeech-1-mel.npy, train.txt).

Finally, the command-line arguments seem to have changed and no longer match the command in the README.
For '--eval_txt' I just passed the preprocess output folder.

tensorflow_wavenet_vocoder>python mul_generate.py --eval_txt ./FeaPath/ --wav_out_path ./WavOut/ checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050
usage: mul_generate.py [-h] [--logdir LOGDIR] [--temperature TEMPERATURE] [--save_every SAVE_EVERY] [--eval_txt EVAL_TXT] [--hparams HPARAMS] checkpoint
mul_generate.py: error: unrecognized arguments: --wav_out_path checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050

Or can this code not run on Windows?
Tell me if I'm wrong, thanks ^_^
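
The usage line in the error output suggests why the arguments were rejected: it lists no --wav_out_path option, it shows checkpoint as a bare positional argument, and ---hparams has three dashes. The sketch below rebuilds a parser from that usage line alone; the real mul_generate.py was not inspected, so build_parser, the option types and the defaults are assumptions. It is also not stated whether --eval_txt expects a file or a folder, so the value from the post is reused unchanged.

```python
import argparse


def build_parser():
    # Hypothetical reconstruction from the printed usage line only;
    # the real script may declare types, defaults or help texts differently.
    parser = argparse.ArgumentParser(prog="mul_generate.py")
    parser.add_argument("--logdir")
    parser.add_argument("--temperature")
    parser.add_argument("--save_every")
    parser.add_argument("--eval_txt")
    parser.add_argument("--hparams")
    parser.add_argument("checkpoint")  # positional: pass the path itself, no keyword
    return parser


CKPT = "./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999"
HPARAMS = ("gc_enable=False,global_channel=0,global_cardinality=0,"
           "NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050")

# The argument list from the post. parse_known_args collects what the
# parser cannot place instead of exiting with an error.
failing = ["--eval_txt", "./FeaPath/", "--wav_out_path", "./WavOut/",
           "checkpoint", CKPT, "---hparams", HPARAMS]
_, extras = build_parser().parse_known_args(failing)
print("unrecognized:", extras)

# An argument list that matches the usage line: two dashes on --hparams,
# no --wav_out_path, and the checkpoint path given as the positional value.
matching = ["--eval_txt", "./FeaPath/", "--hparams", HPARAMS, CKPT]
args, extras_ok = build_parser().parse_known_args(matching)
print("checkpoint:", args.checkpoint)
print("unrecognized:", extras_ok)
```

If the reconstruction is close to the real parser, a command of the second shape should at least get past argument parsing; where the generated wav files are then written is not covered by this sketch.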

How to increase the batch size?

Hi azraelkuan!
When I use your WaveNet vocoder on my own corpus, keeping the other parameters unchanged, the loss does not converge.
My GPU is a 2080 Ti with 11 GB of memory, but my batch size can only be 1. When I increase it to 2, it runs out of memory.
How can I increase my batch size?
Thanks!

Some questions about my implementation, please help me

Exception in thread Thread-1:
Traceback (most recent call last):
  File "D:\Anaconda3\envs\wtx\lib\threading.py", line 916, in _bootstrap_inner
    self.run()
  File "D:\Anaconda3\envs\wtx\lib\threading.py", line 864, in run
    self._target(*self._args, **self._kwargs)
  File "D:\tf-wavenet_vocoder-master\apps\vocoder\datasets\data_feeder.py", line 145, in thread_main
    assert_ready_for_upsampling(wav, local_condition)
  File "D:\tf-wavenet_vocoder-master\apps\vocoder\datasets\data_feeder.py", line 50, in assert_ready_for_upsampling
    assert len(x) % len(c) == 0 and len(x) // len(c) == audio.get_hop_size()
AssertionError

(The same traceback is repeated for Thread-2 through Thread-8.)
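
The failing assertion requires that the audio length is an exact multiple of the number of conditioning frames and that the ratio equals the hop size. A small checker like the one below can help find which utterances break that rule before training starts. It is only a sketch: upsampling_mismatch is a hypothetical helper, and HOP_SIZE = 256 is an assumed value for illustration; the real value is whatever audio.get_hop_size() returns for the hparams in use.

```python
def upsampling_mismatch(num_samples, num_frames, hop_size):
    """Return None if the pair satisfies the asserted condition, else a reason.

    Mirrors the check in the traceback:
        len(x) % len(c) == 0 and len(x) // len(c) == hop_size
    """
    if num_frames == 0:
        return "no conditioning frames"
    if num_samples % num_frames != 0:
        return "audio length is not a multiple of the frame count"
    if num_samples // num_frames != hop_size:
        return "samples per frame is %d, expected %d" % (
            num_samples // num_frames, hop_size)
    return None


HOP_SIZE = 256  # assumption for illustration, not taken from the repository

# 100 frames with exactly 256 samples each: satisfies the condition.
print(upsampling_mismatch(256 * 100, 100, HOP_SIZE))

# 37 stray samples at the end: the length is no longer a multiple of the frames.
print(upsampling_mismatch(256 * 100 + 37, 100, HOP_SIZE))

# Features extracted with a different hop (200): divisible, but the ratio is wrong.
print(upsampling_mismatch(200 * 100, 100, HOP_SIZE))
```

The third case is the kind of mismatch that can appear when features were extracted with one hop size and training runs with another, for example after changing sample_rate or the frame settings between preprocessing and training.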

Could you please share some trained params?

Dear Kuan Chen,
Could you please share your checkpoints for LJSpeech and CMU_arctic, as well as the corresponding hparams settings? That would be very helpful for me.
Thank you in advance!

Songxiang Liu
