extract_feats's Introduction

Speech audio feature extraction processor by Kyle Kastner

Extract features with HTK/speech_tools/festival/merlin.

This is a fork of gist https://gist.github.com/kastnerkyle/cc0ac48d34860c5bb3f9112f4d9a0300

The data is processed using Merlin. From each audio clip the vocoder features are extracted using the WORLD vocoder. The resulting dataset will be located under subfolder data as follows:

loop
├── data
    └── vctk
        ├── norm_info
        │   ├── norm.dat
        ├── numpy_feautres
        │   ├── p294_001.npz
        │   ├── p294_002.npz
        │   └── ...
        └── numpy_features_valid

Prerequsites

sudo apt-get install tcl-snack sox doxygen xsltproc graphviz texlive texinfo texi2html libncurses5-dev csh
sudo pip install theano matplotlib bandmat wget lxml

Note pip modules are to be installed for Python 2.7.

Deployment

First, build all the necessary tools and utilities:

python ./install_tts.py

Then run the feature extraction script itself:

python ./extract_feats.py -w ./vctk/VCTK-Corpus/wav48/p225 -t ./vctk/VCTK-Corpus/txt/p225

extract_feats's People

Contributors

Stargazers

Watchers

extract_feats's Issues

Block on extract_feats

Hi, I am trying build my own dataset use your code. I must say your code is more convenient that the original repo. However, when I excuted this code it somehow blocked at wav file feature extraction and consuming all 48 CPU. I waited for a whole night and nothing happened, any idea about it? I am using Ubuntu 16.04 and tried python 3.7 and 2.7. My wav file is in 48000 sr and 16 bit.

Recommend Projects

dmikushin / extract_feats Goto Github PK

extract_feats's Introduction

Speech audio feature extraction processor by Kyle Kastner

Prerequsites

Deployment

extract_feats's People

Contributors

Stargazers

Watchers

extract_feats's Issues

Block on extract_feats

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent