
moon_voice_activity_detection's Introduction

Jihye Moon's Voice Activity Detection Study in 2015-2016

Motivation

From 2014 to 2016, I worked as an undergraduate researcher on speech pathology detection projects. One of my main tasks was developing robust voice activity detection for elderly people's speech.

In 2015, the most popular method for voice activity detection was pitch-based end-point determination. Pitch computation commonly relied on the autocorrelation function (ACF) and the average magnitude difference function (AMDF). However, when applied to time-series speech, ACF and AMDF compute a pitch value not only for voiced sounds but also for unvoiced sounds.
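To make the ACF and AMDF idea concrete, here is a minimal Python sketch of lag-based pitch estimation on a single frame. It is not the repository's MATLAB code: the function names, the 60-400 Hz search range, and the frame handling are assumptions chosen for illustration only.

```python
import math

def acf(frame, lag):
    """Unnormalized autocorrelation of the frame at the given lag."""
    return sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))

def amdf(frame, lag):
    """Average magnitude difference of the frame at the given lag."""
    count = len(frame) - lag
    return sum(abs(frame[n] - frame[n + lag]) for n in range(count)) / count

def estimate_pitch(frame, sample_rate, f_min=60.0, f_max=400.0, method="acf"):
    """Return a pitch estimate in Hz for one frame.

    The search range (f_min, f_max) is an assumed default, not a value
    taken from the repository. The frame must be longer than the largest
    lag, which is sample_rate / f_min samples.
    """
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    lags = range(lag_min, lag_max + 1)
    if method == "acf":
        # A periodic signal correlates most strongly with itself one period later.
        best_lag = max(lags, key=lambda lag: acf(frame, lag))
    else:
        # A periodic signal differs least from itself one period later.
        best_lag = min(lags, key=lambda lag: amdf(frame, lag))
    return sample_rate / best_lag

# Example: a 200 Hz sine sampled at 8 kHz has a period of 40 samples.
sample_rate = 8000
sine_frame = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(400)]
pitch_hz = estimate_pitch(sine_frame, sample_rate, method="acf")
```

Note that `estimate_pitch` always returns some value inside the search range, even when the frame contains only noise. That is the limitation described above: the lag search by itself does not distinguish voiced from unvoiced frames.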

For speech pathology detection, capturing the voiced sounds produced by the vocal cords is crucial. Elderly people's speech contains various unvoiced sounds, so voice activity detection algorithms that are robust to unvoiced sounds are important for more accurate pathology detection in the elderly population.

I worked on Higher Order Differential Energy Operators (HODEO) to design novel voice activity detection algorithms. Through this work, I delved deeply into speech signal processing and experimented with various voice activity detection algorithms from scratch!
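For readers unfamiliar with differential energy operators, the sketch below shows the commonly cited discrete definition of the k-th order operator, x[n]·x[n+k-2] - x[n-1]·x[n+k-1], which reduces to the Teager-Kaiser energy operator for k = 2. It is an illustration only: the source does not describe which orders, thresholds, or decision rules the repository's HODEO-based detector uses, and the function name `differential_energy` is hypothetical.

```python
import math

def differential_energy(x, k=2):
    """k-th order differential energy operator applied to a sample list.

    Computes x[n] * x[n + k - 2] - x[n - 1] * x[n + k - 1] for every n
    at which all four samples exist. k = 2 gives the Teager-Kaiser
    energy operator x[n]**2 - x[n - 1] * x[n + 1].
    """
    if k < 2:
        raise ValueError("order k must be at least 2")
    return [
        x[n] * x[n + k - 2] - x[n - 1] * x[n + k - 1]
        for n in range(1, len(x) - k + 1)
    ]

# Example: for a pure cosine A*cos(omega*n), the second-order operator
# is constant and equal to A**2 * sin(omega)**2.
amplitude, omega = 2.0, 0.3
cosine = [amplitude * math.cos(omega * n) for n in range(50)]
energy = differential_energy(cosine, k=2)
expected = amplitude ** 2 * math.sin(omega) ** 2
```

Because the operator output depends on both amplitude and frequency, it responds differently to low-frequency periodic sounds and to noise-like sounds. How that property is turned into a voiced/unvoiced decision is left to the repository's code.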

Source Code

This GitHub repository provides ACF-, AMDF-, and HODEO-based voice activity detection code.

Add your audio data (data.wav), then run 'Code/Moon_2016_Scratch_Voice_Activity_Detection.m'.

Results

image

I found that the HODEO-based voice activity detection (VAD) approach might be better suited to the elderly population than the ACF- and AMDF-based methods. However, since I had access to only a limited amount of elderly speech data, further validation would require additional datasets with appropriate ground truth.

The HODEO-based voice activity detection module was integrated into my Speech Analysis Software after 2016!

Please feel free to contact me at [email protected] if you are interested! :)

