jsingh811 / pyaudioprocessing

Audio feature extraction and classification

License: GNU General Public License v3.0

Python 100.00%
audio-data feature-extraction classify classify-audio mfcc mfcc-features mfcc-extractor gfcc gfcc-features gfcc-extractor

pyaudioprocessing's People

Contributors

cclauss, jsingh811

pyaudioprocessing's Issues

Optimize gfcc feature calculations

Description

The gfcc feature calculation can be optimized in terms of computation and scale. This ticket covers the work to improve it.

Acceptance Criteria

  • GFCC feature improvement
  • Chroma and spectral features are separated out into individual features rather than being bundled with gfcc and mfcc

Add visualization for classifier confusion matrix and cross-validation results

Description

Currently, classifier stats for different hyper-parameters and the final confusion matrix are printed to the screen. We want to add functionality that also saves the output to disk and displays it in easy-to-understand visualizations (ideas welcome!).

Acceptance criteria

- [ ] Shortlist visualization tools
- [ ] Implement and integrate the tool selected after review
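
As a starting point for the "save to disk" half of this ticket, here is a minimal sketch that uses only the Python standard library, since the visualization tool has not been chosen yet. The function names (`confusion_matrix`, `save_confusion_matrix_csv`, `format_confusion_matrix`) and the CSV layout are assumptions made for illustration; they are not part of pyAudioProcessing.

```python
import csv
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Count (true, predicted) pairs. Rows are true labels, columns are predictions."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


def save_confusion_matrix_csv(matrix, labels, path):
    """Write the matrix to a CSV file with a header row and a label column."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["true/predicted"] + list(labels))
        for label, row in zip(labels, matrix):
            writer.writerow([label] + list(row))


def format_confusion_matrix(matrix, labels):
    """Return a right-aligned plain-text table for printing or logging."""
    cells = [str(x) for x in labels] + [str(c) for row in matrix for c in row]
    width = max(len(cell) for cell in cells) + 2
    lines = [" " * width + "".join(str(l).rjust(width) for l in labels)]
    for label, row in zip(labels, matrix):
        lines.append(str(label).rjust(width) + "".join(str(c).rjust(width) for c in row))
    return "\n".join(lines)
```

A CSV file can be loaded later by whichever plotting tool is shortlisted, so this does not commit the project to one library.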

Add mel-spectrogram feature

Description

A mel spectrogram is a spectrogram where the frequencies are converted to the mel scale.
The images of mel-spectrograms are widely used with CNNs to classify audio.
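
The definition above can be sketched with NumPy alone: take a short-time power spectrum, then pool the FFT bins with triangular filters whose centers are evenly spaced on the mel scale. The function names, the 512-sample Hann window, the hop of 256 samples and the 40 bands are illustrative assumptions, not pyAudioProcessing's API. The Hz-to-mel conversion is the common `2595 * log10(1 + f / 700)` variant; other variants exist.

```python
import numpy as np


def hz_to_mel(hz):
    """Convert frequency in Hz to mels (2595 * log10(1 + f / 700) variant)."""
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters with centers evenly spaced on the mel scale."""
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    top_mel = float(hz_to_mel(sample_rate / 2.0))
    edges_hz = mel_to_hz(np.linspace(0.0, top_mel, n_mels + 2))
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        lower, center, upper = edges_hz[i:i + 3]
        rising = (fft_freqs - lower) / (center - lower)
        falling = (upper - fft_freqs) / (upper - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def mel_spectrogram(signal, sample_rate, n_fft=512, hop=256, n_mels=40):
    """Return an (n_mels, n_frames) array of mel-band power values."""
    signal = np.asarray(signal, dtype=float)
    if signal.size < n_fft:
        # Zero-pad very short inputs so that at least one frame exists.
        signal = np.pad(signal, (0, n_fft - signal.size))
    window = np.hanning(n_fft)
    starts = range(0, signal.size - n_fft + 1, hop)
    frames = np.stack([signal[s:s + n_fft] * window for s in starts])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2  # (n_frames, n_fft // 2 + 1)
    return mel_filterbank(n_mels, n_fft, sample_rate) @ power.T
```

For CNN input the result is usually converted to a log or decibel scale first, for example `np.log(spec + 1e-10)`.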

Write a method that returns mfcc and gfcc features for an input audio file

Currently, mfcc and gfcc features are computed as an intermediate step of a classification problem, so there is no direct view of the features.
One may want to inspect these features independently and use them with a different classification backend.
Thus, we want a function that outputs mfcc features and a function that outputs gfcc features for an input audio file.

  • implement a function to return mfcc for an input audio file
  • implement a function to return gfcc for an input audio file
  • update documentation to guide the usage of these new functions
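
One possible shape for such functions, sketched under stated assumptions: a single file-level routine takes a filterbank callback, so a mel filterbank would give MFCC-style coefficients and a gammatone filterbank GFCC-style ones. All names here (`read_wav_mono`, `dct_ii`, `cepstral_features`, `filterbank_fn`) are hypothetical and not the library's API. The sketch only reads 16-bit PCM WAV files, and it uses log compression for both feature types, which is a simplification (some GFCC formulations use a cube root instead).

```python
import wave

import numpy as np


def read_wav_mono(path):
    """Read a 16-bit PCM WAV file; return (sample_rate, mono samples in [-1, 1))."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        rate = wav.getframerate()
        channels = wav.getnchannels()
        raw = wav.readframes(wav.getnframes())
    samples = np.frombuffer(raw, dtype="<i2").astype(float) / 32768.0
    # Average the channels so that stereo files become mono.
    return rate, samples.reshape(-1, channels).mean(axis=1)


def dct_ii(x, n_out):
    """Unnormalized DCT-II along the last axis, keeping the first n_out terms."""
    n = x.shape[-1]
    k = np.arange(n_out)[:, None]
    basis = np.cos(np.pi * k * (2 * np.arange(n) + 1) / (2 * n))
    return x @ basis.T


def cepstral_features(path, filterbank_fn, n_fft=512, hop=256, n_ceps=13):
    """Return an (n_frames, n_ceps) matrix of cepstral coefficients for one file.

    filterbank_fn(n_fft, sample_rate) must return an array of shape
    (n_bands, n_fft // 2 + 1): mel-spaced for MFCC, gammatone for GFCC.
    """
    rate, signal = read_wav_mono(path)
    if signal.size < n_fft:
        signal = np.pad(signal, (0, n_fft - signal.size))
    starts = range(0, signal.size - n_fft + 1, hop)
    frames = np.stack([signal[s:s + n_fft] for s in starts]) * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    band_energy = power @ filterbank_fn(n_fft, rate).T
    # The small constant avoids log(0) for silent bands.
    return dct_ii(np.log(band_energy + 1e-10), n_ceps)
```

Because the function returns a plain NumPy matrix, the output can be passed to any classification backend.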

Typos discovered by codespell

% codespell --ignore-words-list="fo" --quiet-level=2

./pyAudioProcessing/extract_features.py:74: stucture ==> structure
./pyAudioProcessing/trainer/audioTrainTest.py:49: containt ==> contain, content
./pyAudioProcessing/trainer/audioTrainTest.py:68: containt ==> contain, content
./pyAudioProcessing/trainer/audioTrainTest.py:242: signle ==> single
./pyAudioProcessing/trainer/audioTrainTest.py:242: seperate ==> separate
./pyAudioProcessing/features/audioFeatureExtraction.py:255: seperate ==> separate
./pyAudioProcessing/features/filters.py:122: eigth ==> eighth, eight
./pyAudioProcessing/features/filters.py:123: eigth ==> eighth, eight

Dependencies need to be updated for Python 3.8 and 3.9

% python3.9 -m pip install --upgrade pip
% python3.9 -m pip install -r requirements/requirements.txt

This fails because the new pip dependency resolver reports that the specified version of scipy is not compatible with Python 3.9.

Install not working on Linux

Install not working on Linux: unfulfilled dependency python-magic-bin==0.4.14

Details: CentOS 8
Clone the repo with git clone https://github.com/jsingh811/pyAudioProcessing.git

Install with pip3 install -e .

Failed dependency

  Downloading https://files.pythonhosted.org/packages/79/db/eaf620b73a1eec3c8c6f8f5b0b236a50f9da88ad57802154b7ba7664d0b8/pydub-0.23.1-py2.py3-none-any.whl
Collecting python-magic-bin==0.4.14 (from pyAudioProcessing==1.1.6)
  Could not find a version that satisfies the requirement python-magic-bin==0.4.14 (from pyAudioProcessing==1.1.6) (from versions: )
No matching distribution found for python-magic-bin==0.4.14 (from pyAudioProcessing==1.1.6)

Reviewing python-magic-bin, I don't see a Linux version.

Can you advise?
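
One possible workaround, not confirmed by the maintainers, is to make the dependency platform-specific with PEP 508 environment markers in `setup.py`. The sketch below assumes that `python-magic-bin` is only needed on Windows and macOS, and that the plain `python-magic` package (which relies on a system-installed libmagic) is an acceptable substitute elsewhere. The helper name `magic_requirements` is hypothetical.

```python
def magic_requirements():
    """Hypothetical helper: choose the libmagic binding per platform.

    Assumption: python-magic-bin is used on Windows and macOS only;
    other platforms fall back to python-magic plus a system libmagic.
    """
    return [
        'python-magic-bin==0.4.14; sys_platform == "win32" or sys_platform == "darwin"',
        'python-magic; sys_platform != "win32" and sys_platform != "darwin"',
    ]


# Intended use inside setup.py (other_requirements is a placeholder):
#     setup(..., install_requires=other_requirements + magic_requirements())
```

pip evaluates the marker after the semicolon at install time and skips any requirement whose marker is false on the current platform.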

LPCC feature integration

LPC and LPCC feature integration.

  • Add feature generation code
  • Integrate with the classification pipeline
  • Add tests
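
As a reference point for the feature generation step, here is a minimal NumPy sketch of LPC using the autocorrelation method with the Levinson-Durbin recursion, followed by the standard recursion from LPC coefficients to cepstral coefficients (LPCC). The function names and the sign convention (the predictor is `s[n] ≈ sum_k a[k] * s[n - k]`) are assumptions of this sketch and not existing pyAudioProcessing code. Windowing, pre-emphasis and the gain term `c[0]` are left out.

```python
import numpy as np


def lpc(frame, order):
    """Autocorrelation-method LPC via the Levinson-Durbin recursion.

    Returns (a, err), where a[k - 1] is the predictor coefficient for lag k
    and err is the final prediction error energy.
    """
    frame = np.asarray(frame, dtype=float)
    r = np.array([frame[:frame.size - lag] @ frame[lag:] for lag in range(order + 1)])
    a = np.zeros(order + 1)  # a[0] is unused so that indices match lags
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or perfectly predicted frame: stop early
        k = (r[i] - a[1:i] @ r[i - 1:0:-1]) / err  # reflection coefficient
        prev = a.copy()
        a[i] = k
        a[1:i] = prev[1:i] - k * prev[i - 1:0:-1]
        err *= 1.0 - k * k
    return a[1:], err


def lpcc(a, n_ceps):
    """Cepstral coefficients c[1..n_ceps] of the all-pole model defined by a."""
    p = len(a)
    c = np.zeros(n_ceps + 1)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]
```

A quick sanity check for the tests: for a first-order model with coefficient `alpha`, the cepstral coefficients should equal `alpha ** n / n`.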