Giter Site home page Giter Site logo

kyubyong / css10 Goto Github PK

View Code? Open in Web Editor NEW
453.0 453.0 60.0 183.26 MB

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

License: Apache License 2.0

Jupyter Notebook 26.91% HTML 41.27% Python 31.82%
dataset speech speech-to-text

css10's Introduction

Hi there ๐Ÿ‘‹

I'm writing to inform that I'm not allowed to introduce myself as "I work for Kakao Brain." any more. I left Kakao Brain, where I've put my heart for the last four years, and began a new journey on my own. Luckily, I'm not alone on that road not taken. I'm with five great co-founding members--all of them were my team mates at Kakao Brain. We named our startup TUNiB, inspired by the popular animation character (https://octonauts.fandom.com/wiki/Tunip_the_Vegimal). We are still at the very early stage, preparing for IR. Please support us as well as Kakao Brain. You can reach us/me at my email: either [email protected] or [email protected].

Best,
Kyubyong

css10's People

Contributors

kyubyong avatar tmulc18 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

css10's Issues

Brazilian Portuguese Model

Could you make a database available in Brazilian Portuguese? If not, could you guide me on how to train one like the databases you made available?

Can't use GPU

Hi,

I was able to test the synthesize function (synthesize.py) on CPU with success.
But when I tried to use GPU, I have faced with different issues.
First, I tried to use tensorflow-gpu==1.3.0, but according to this chart: https://www.tensorflow.org/install/source#gpu, it requires CUDA 8, and according to this list: https://gitlab.com/nvidia/container-images/cuda/blob/master/doc/supported-tags.md, I could use only Ubuntu 16.04 with an nvidia docker base image for CUDA 8.0, but I have failed with the installation of the requirements on Ubuntu 16.04.
As a second step, I have tried to use tensorflow-gpu==1.5.0 with CUDA 9, but the Nvidia base image for Ubuntu 18.04 support only CUDA 9.2, and not 9.0, and those looks uncompatible...
As a third step, I have tried tensorflow-gpu==1.13.1 with CUDA 10.0, with a CUDA 10.0 based Ubuntu 18.04 base docker image.
Finally, tensorflow can detect the GPU, but the session initialization (sess,run()) takes forever, and eats up all the GPU memory.
I have tried to limit the memory usage, and then the session initialization could finish after more than 4 minutes, but the Feed Forward just stuck at the very beginning, no progress at all within a few minutes.

Any ideas or suggestion? What am I doing wrong?

Thanks!

Vocab for japanese

i'm not found vocab in model DCTSS for japanese, you can share with me, thank.

Numbers or ( ) are not considered by the models

In automatically extracted sentences, both can appear. Looks like numbers can be handled by NTLK but "(" seems harder to handle, since they are associated to an "inflection" in the intonation.

Output node names

Hi,

I would like to freeze the pretrained Tacotron model (French) but I can't figure out what the output node names are. I tried various tools to visualize the model but none of them succeeded on my (old) machine because of the model size.

Would you mind sharing this information?

Thank you for your support and for publishing your work publicly.

Synthesis example

Hi there, I am new to this project.

Would you please give an example of using pre-trained model to synthesize a new audio?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.