Topic: mfcc
A collection of GitHub repositories related to MFCC (mel-frequency cepstral coefficients).
mfcc,Speaker Recognition System using MFCC and GMM.
User: abhay0899193
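The entry above combines MFCC features with Gaussian mixture models for speaker recognition. A minimal sketch of that idea, assuming MFCC frames are already extracted (synthetic stand-in data here) and using scikit-learn's `GaussianMixture`; names like `identify` are illustrative, not from the repository:

```python
# Sketch of GMM-based speaker identification: one GMM per enrolled
# speaker, utterances scored by average log-likelihood.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)

# Stand-in "MFCC" frames for two enrolled speakers (13 coefficients each).
train = {
    "alice": rng.normal(loc=0.0, scale=1.0, size=(200, 13)),
    "bob":   rng.normal(loc=3.0, scale=1.0, size=(200, 13)),
}

# Enrollment: fit one GMM on each speaker's training frames.
models = {
    name: GaussianMixture(n_components=4, random_state=0).fit(feats)
    for name, feats in train.items()
}

def identify(frames):
    """Score an utterance against every speaker model and pick the one
    with the highest average log-likelihood."""
    return max(models, key=lambda name: models[name].score(frames))

test_utterance = rng.normal(loc=3.0, scale=1.0, size=(50, 13))  # bob-like
print(identify(test_utterance))
```

Real systems would add a universal background model and score normalization, but the enroll-then-score structure is the same.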
mfcc,A C++ Library for Audio Analysis
User: adamstark
Home Page: http://www.adamstark.co.uk/project/gist/
mfcc,Use machine learning models to detect lies based solely on acoustic speech information
User: alicex2020
mfcc,Deep learning using CNN for Mandarin Chinese tone classification
User: alicex2020
mfcc,Detecting emotions using MFCC features of human speech using Deep Learning
User: amanbasu
mfcc,.NET DSP library with a lot of audio processing functions
User: ar1st0crat
mfcc,a library for audio and music analysis
Organization: aubio
Home Page: https://aubio.org
mfcc,aubio plugins for Vamp
Organization: aubio
Home Page: https://aubio.org/vamp-aubio-plugins
mfcc,Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
User: csukuangfj
Home Page: https://csukuangfj.github.io/kaldifeat
mfcc,:fire: ASR tutorial: https://dataxujing.github.io/ASR-paper/
User: dataxujing
mfcc,Machine learning, in numpy
User: ddbourgin
Home Page: https://numpy-ml.readthedocs.io/
mfcc,Speaker Recognition using Neural Network & Linear Regression
User: dydtjr1128
mfcc,stm32-speech-recognition-and-traduction is a project developed for the Advances in Operating Systems exam at the University of Milan (academic year 2020-2021). It implements a speech recognition and speech-to-text translation system using a pre-trained machine learning model running on the stm32f407vg microcontroller.
User: federicapaoli1
Home Page: https://github.com/FedericaPaoli1/stm32-speech-recognition-and-traduction
mfcc,SpeakerVoiceIdentifier learns to recognize the voice of a speaker.
User: fragjage
mfcc,A program for automatic speaker identification using deep learning techniques.
User: gauravwaghmare
mfcc,Lyrics-to-audio alignment system based on machine learning algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of the durations of musical notes. The phonetic models are classified with an MLP deep neural network.
User: georgid
Home Page: http://mtg.upf.edu/node/3751
mfcc,Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step for any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification: developing two-class classifiers which can discriminate between utterances of a subject suffering from, say, vocal fold paralysis and utterances of a healthy subject. The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitude of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) can also be derived. These traditional features will be tested against agnostic features extracted by convolutional neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian mixture model classifiers, K-nearest neighbor classifiers, Bayes classifiers, as well as deep neural networks. The Massachusetts Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources, such as KALDI, will be used toward achieving our goal. Comparisons will be made against [6-8].
User: gionanide
mfcc,Basics of Musical Instruments Classification using Machine Learning
User: guitarsai
mfcc,A RESTful API implementation of an authentication system using voice fingerprint
User: ihabbendidi
mfcc,Voice Alignment and Conversion with Neural Networks and the WORLD codec.
User: javierantoran
mfcc,Audio feature extraction and classification
User: jsingh811
mfcc,Humans speak a language with an accent, and a particular accent reflects a person's linguistic background. This model identifies the accent from an audio recording. Its results could be used to detect accents, help English-learning students reduce their accents, and improve accents through training.
User: k-farruh
mfcc,Deep Learning model for lexical stress detection in spoken English
Organization: lexicalstressdetection
mfcc,A library for audio and music analysis, feature extraction.
Organization: libaudioflux
Home Page: https://audioflux.top
mfcc,In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions. The proposed postprocessor improves speech quality (PESQ) by up to 0.25 MOS-LQO points for G.711, 0.30 points for G.726, 0.82 points for G.722, and 0.26 points for the adaptive multi-rate wideband codec (AMR-WB). In a subjective CCR listening test, the proposed postprocessor on G.711-coded speech exceeds the speech quality of an ITU-T-standardized postfilter by 0.36 CMOS points, and obtains a clear preference of 1.77 CMOS points compared to G.711, even on par with uncoded speech.
Organization: linksense
Home Page: https://ansleliu.github.io/CNN.html
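The cepstral-domain approach in the entry above relies on an analysis-synthesis round trip through cepstral features. A minimal sketch of that round trip for one frame via the real cepstrum, with the CNN enhancement step omitted (the comment marks where it would act); all names here are illustrative:

```python
# Analysis-synthesis round trip through the real cepstrum: without any
# modification in between, the frame is recovered (near-)losslessly.
import numpy as np

frame = np.hamming(512) * np.sin(2 * np.pi * 100 * np.arange(512) / 8000)

# Analysis: log magnitude spectrum -> inverse FFT -> real cepstrum.
spectrum = np.fft.rfft(frame)
log_mag = np.log(np.abs(spectrum) + 1e-12)
cepstrum = np.fft.irfft(log_mag)

# (A CNN postprocessor would enhance `cepstrum` here.)

# Synthesis: FFT of the cepstrum recovers the log magnitude; combine it
# with the original phase and invert to reconstruct the frame.
recovered_log_mag = np.fft.rfft(cepstrum).real
recon = np.fft.irfft(np.exp(recovered_log_mag) * np.exp(1j * np.angle(spectrum)))

print(np.allclose(recon, frame, atol=1e-8))
```

The lossless round trip is what lets the postprocessor operate purely on cepstral features without touching the codec itself.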
mfcc,Personal wake word detector
User: mathquis
mfcc,Spectra extraction tutorials based on torch and torchaudio.
User: mechanicalsea
mfcc,A simple audio feature extraction library
Organization: mycroftai
mfcc,Implement a GRU/LSTM model using Keras, and train it to classify languages using MFCC features
User: nipunmanral
mfcc,Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial
User: pulakk
mfcc,[WIP] Speech recognition toolbox written in Nim. Based on Arraymancer.
User: ringabout
mfcc,Python implementation of papers on emergency vehicle detection using audio signals
User: sheelabhadra
mfcc,A suite of speech signal processing tools
Organization: sp-nitech
Home Page: http://sp-tk.sourceforge.net
mfcc,:sound: spafe: Simplified Python Audio Features Extraction
User: superkogito
Home Page: https://superkogito.github.io/spafe/
mfcc,:sound: :boy: :girl: Voice-based gender recognition using mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
User: superkogito
mfcc,:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
User: superkogito
mfcc,An implementation of Power-Normalized Cepstral Coefficients (PNCC)
User: supikiti
Home Page: https://www.eurasip.org/Proceedings/Eusipco/Eusipco2015/papers/1570104069.pdf
mfcc,Identify the emotion of multiple speakers in an Audio Segment
User: suyashmore
mfcc,Synchronize your subtitles using machine learning
User: tympanix
mfcc,Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
User: x4nth055
mfcc,Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
User: zafarrafii
mfcc,Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
User: zafarrafii
Home Page: http://zafarrafii.com/
mfcc,Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
User: zafarrafii
Home Page: http://zafarrafii.com/
mfcc,Speech recognition of the digits 0-9 based on DTW and MFCC features; covers DTW, MFCC, speech recognition, Chinese and English data, and endpoint detection (Digital Voice Recognition).
User: zhengyima
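The entry above matches MFCC sequences of spoken digits with dynamic time warping (DTW). A minimal sketch of the classic DTW recursion with Euclidean frame distance; the demo data is synthetic, not from the repository:

```python
# Classic DTW: cumulative-cost table with insertion/deletion/match moves.
import numpy as np

def dtw_distance(a, b):
    """DTW cost between feature sequences a (n, d) and b (m, d)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j],      # insertion
                                 cost[i, j - 1],      # deletion
                                 cost[i - 1, j - 1])  # match
    return cost[n, m]

# A sequence matches a time-stretched copy of itself better than a
# different sequence, which is why DTW suits variable-speed speech.
ref = np.sin(2 * np.pi * np.linspace(0, 1, 50))[:, None]
stretched = np.sin(2 * np.pi * np.linspace(0, 1, 70))[:, None]
other = np.cos(2 * np.pi * np.linspace(0, 1, 70))[:, None]
print(dtw_distance(ref, stretched) < dtw_distance(ref, other))
```

For digit recognition, each test utterance's MFCC sequence is compared against one template per digit and the lowest-cost template wins.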
mfcc,python codes to extract MFCC and FBANK speech features for Kaldi
User: zitengwang