extract_feats's Introduction

Speech audio feature extraction processor by Kyle Kastner

Extract features with HTK/speech_tools/festival/merlin.

This is a fork of gist https://gist.github.com/kastnerkyle/cc0ac48d34860c5bb3f9112f4d9a0300

The data is processed using Merlin: vocoder features are extracted from each audio clip with the WORLD vocoder. The resulting dataset is written to the data subfolder, laid out as follows:

loop
├── data
    └── vctk
        ├── norm_info
        │   └── norm.dat
        ├── numpy_features
        │   ├── p294_001.npz
        │   ├── p294_002.npz
        │   └── ...
        └── numpy_features_valid
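
After extraction it can be useful to confirm that the layout above was actually produced. The following is a minimal sketch, not part of the repository: the function name `summarize_dataset` is hypothetical, and the directory names are assumptions taken from the tree above (with the training directory spelled `numpy_features`). It only counts files and does not open them.

```python
import os


def summarize_dataset(vctk_dir,
                      train_subdir="numpy_features",
                      valid_subdir="numpy_features_valid"):
    """Count .npz files per split and check that norm_info/norm.dat exists.

    The subdirectory names are assumptions based on the tree shown above.
    """
    def count_npz(subdir):
        path = os.path.join(vctk_dir, subdir)
        if not os.path.isdir(path):
            return 0
        return len([f for f in os.listdir(path) if f.endswith(".npz")])

    norm_path = os.path.join(vctk_dir, "norm_info", "norm.dat")
    return {
        "train_files": count_npz(train_subdir),
        "valid_files": count_npz(valid_subdir),
        "has_norm": os.path.isfile(norm_path),
    }


if __name__ == "__main__":
    print(summarize_dataset(os.path.join("data", "vctk")))
```

A count of zero for the training split, or a missing `norm.dat`, suggests the extraction did not finish.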

Prerequisites

sudo apt-get install tcl-snack sox doxygen xsltproc graphviz texlive texinfo texi2html libncurses5-dev csh
sudo pip install theano matplotlib bandmat wget lxml

Note that the pip modules must be installed for Python 2.7.

Deployment

First, build all the necessary tools and utilities:

python ./install_tts.py

Then run the feature extraction script itself:

python ./extract_feats.py -w ./vctk/VCTK-Corpus/wav48/p225 -t ./vctk/VCTK-Corpus/txt/p225
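
The command above processes a single speaker directory (p225). To cover several speakers, the same invocation can be generated per speaker. This is only a sketch: `build_commands` is a hypothetical helper, `p226` is an illustrative second speaker ID, and it assumes the corpus keeps the `wav48/<speaker>` and `txt/<speaker>` layout seen in the command above. Whether repeated runs accumulate or overwrite their output is not stated in this README, so check that before running a batch.

```python
import os


def build_commands(corpus_root, speakers, script="./extract_feats.py"):
    """Build one extract_feats.py argument list per speaker ID.

    Only the -w and -t flags shown in the README are used.
    """
    commands = []
    for speaker in speakers:
        wav_dir = os.path.join(corpus_root, "wav48", speaker)
        txt_dir = os.path.join(corpus_root, "txt", speaker)
        commands.append(["python", script, "-w", wav_dir, "-t", txt_dir])
    return commands


if __name__ == "__main__":
    # Print the commands instead of running them; pass each list to
    # subprocess.check_call(cmd) to execute it.
    for cmd in build_commands("./vctk/VCTK-Corpus", ["p225", "p226"]):
        print(" ".join(cmd))
```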

extract_feats's People

Contributors

kastnerkyle, dmikushin

extract_feats's Issues

Block on extract_feats

Hi, I am trying to build my own dataset using your code. I must say your code is more convenient than the original repo. However, when I executed it, it somehow blocked at the wav file feature extraction step while consuming all 48 CPUs. I waited for a whole night and nothing happened. Any idea what is going on? I am using Ubuntu 16.04 and tried both Python 3.7 and 2.7. My wav files are sampled at 48000 Hz, 16 bit.
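
When reproducing a report like this, one quick check is to confirm that the input files really have the stated format. The sketch below uses only Python's standard `wave` module; `wav_format` is a hypothetical helper, not part of this repository, and it does not diagnose the hang itself.

```python
import wave


def wav_format(path):
    """Return (sample_rate_hz, bits_per_sample, channels) for a PCM WAV file."""
    reader = wave.open(path, "rb")
    try:
        return (
            reader.getframerate(),
            reader.getsampwidth() * 8,  # sample width is reported in bytes
            reader.getnchannels(),
        )
    finally:
        reader.close()
```

For the files described in the report, the first two values would be expected to be 48000 and 16.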
