Giter Site home page Giter Site logo

Comments (3)

Toku11 avatar Toku11 commented on August 12, 2024

In your preprocessing you are creating a "bit mask" which is supposed to be your labels for the silent interval detection however your bitmmask is always 1 because you created it like '1' * file_info[FIELDS[5]], how is that you use it to train

from listening-to-sound-of-silence-for-speech-denoising.

aloha0920 avatar aloha0920 commented on August 12, 2024

Hi, same issue here. Did you figure it out yet?

from listening-to-sound-of-silence-for-speech-denoising.

henryxrl avatar henryxrl commented on August 12, 2024

@Toku11 You are correct. Each 1/0 bit indicates a 1/30 time interval. And yes, 1 means speech, 0 means silence. Each piece of training data is indeed 2 seconds which means 60 of those 1/0 bits.

In the preprocessor_audioonly.py file, the reason why it's setting all 1's is that this generates data for inference job, NOT for training (so all 1's doesn't matter - you'll get the correct 1/0 sequence as a result). For training, you'll have to provide the ground truth correctly labeled 1/0 sequence. You can get creative and create your own. My way of doing so is to look at the power of the speech signal and label each 1/30 time interval 0 (silence) if the power is below a certain threshold, and 1 if above. As a reference, you can take a look at the 'get_bitstream_better' function (commented out) from the util.py file in the preprocessing directory.

from listening-to-sound-of-silence-for-speech-denoising.

Related Issues (13)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.