Giter Site home page Giter Site logo

speech_denoising_tools's Introduction

jsalt2019-diadet

Repository of recipes for the JSALT2019 workshop on "Speaker Detection in Adverse Scenarios with a Single Microphone"

Cloning the repo

  • To clone the repo execute
git clone --recursive https://github.com/jsalt2019-diadet/jsalt2019-diadet.git
  • The recursive option downloads some dependencies:

    • hyperion: python code for spk detection back-end
  • If you want to update the sumodules to the last commit, run

cd jsalt2019-diadet
git submodule sync
git submodule update --init --recursive --remote
  • Dependencies are downloaded in
jsalt2019-diadet/tools

Other Dependencies:

  • The recipes also depend on Anaconda3.5, Kaldi, cuDNN, etc.

  • Recommended: use some preinstalled versions of the dependencies in the grid to avoid each person having its own.

    • To create links to preinstalled kaldi, anaconda and cudnn, run:
    cd jsalt2019-diadet/
    ./make_clsp_links.sh
    • The anaconda that you will link with this has several environments:
      • base: numpy, h5py, pandas, etc.
    • tensorflow1.8g_cpu: tensorflow 1.8 for cpu
    • tensorflow1.8g_gpu: tensorflow 1.8 for gpu
    • pytorch1.0_cuda9.0: pytorch 1.0 with cuda 9.0
    • pyannote: python3.6 with pyannote-metrics installed.
  • Anaconda3.5:

    • Make a link to your anaconda installation in the tools directory:
    cd jsalt2019-diadet/tools/anaconda
    ln -s <your-anaconda-3.5> anaconda3.5
    • or follow instructions in jsalt2019-diadet/tools/anaconda/full_install.sh to install anaconda from scratch
  • Kaldi speech recognition toolkit

    • Make link to an existing kaldi installation
    cd jsalt2019-diadet/tools/kaldi
    ln -s <your-kaldi> kaldi
    • or follow instructions in jsalt2019-diadet/tools/anaconda/install_kaldi.sh to install kaldi from scratch
  • CuDNN: tensorflow and pytorch will need some version of cudnn

    • Make a link to some existing cudnn version that matches the requirements of your tf or pytorch, e.g.:
    cd jsalt2019-diadet/tools/cudnn
    #cudnn v7.4 for cuda 9.0 needed by pytorch 1.0 
    ln -s /home/janto/usr/local/cudnn-9.0-v7.4 cudnn-9.0-v7.4

Directory structure:

  • The directory structure of the repo looks like this:
./jsalt2019-diadet
./jsalt2019-diadet/tools
./jsalt2019-diadet/tools/anaconda
./jsalt2019-diadet/tools/anaconda/anaconda3
./jsalt2019-diadet/tools/cudnn
./jsalt2019-diadet/tools/cudnn/cudnn-9.0-v7.4
./jsalt2019-diadet/tools/kaldi
./jsalt2019-diadet/tools/kaldi/kaldi
./jsalt2019-diadet/tools/hyperion
./jsalt2019-diadet/tools/hyperion/hyperion
./jsalt2019-diadet/tools/speech_denoising_tools
./jsalt2019-diadet/egs
./jsalt2019-diadet/egs/jsalt2019-diadet
./jsalt2019-diadet/egs/jsalt2019-diadet/v1
./jsalt2019-diadet/src
  • Directories:
    • tools: contains external repos and tools like kaldi, python, pyannotate, hyperion, cudnn, etc.
    • src: it can be used to place code that we create specifically for this repo.
      • src/kaldi_augmentation: some scripts to perform data augmentation using the wav-reverberate kaldi tool
    • egs: contains the recipes
      • egs/jsalt2019-diadet: recipe for speaker diarization/detection/tracking for all datasets that we use in the workshop.
        • v1: Version 1 is based on kaldi x-vectors
      • egs/sitw_noisy: recipe for SITW with added noise and reverberation in the dev/eval test. Used to measure performance of enhancement methods at different noise types, noise levels, RT60 reveration times.
        • v1: Based on kaldi x-vectors.

speech_denoising_tools's People

Contributors

mmmaat avatar nryant avatar staplesinla avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.