Giter Site home page Giter Site logo

speaker-identification's Introduction

Speaker Recognition

Speaker recognition is the task of recognizing a person automatically from his or her voice; we assume there is a set of known voices recorded prior to the recognition task. In addtion, we don't address the verification problem here.

For detailed discussion about the subject, please refer to the paper Automatic Speaker Recognition Using Gaussian Mixture Speaker Models.

The main technique we are using are MFCC and GMM.

MFCC

Mel-frequency cepstrum coefficients are common features that are used for voice/speech related tasks. It's a representation of the power spectrum, but based on a nonlinear mel scale.

There is an educational MFCC implementation (not focusing on , please check out the mfcc branch. The master branch uses BLAS to speed up feature extraction.

GMM

For text-independent speaker recognition, the sound source (human voice) can be modelled as a mixture models, each following the Gaussian distribution. Typically, 32-component GMM should be sufficent to distinguish a few number of speakers.

Code Structure

This repository mainly contains the C++ implementation. However, during the research, it was easier to use Matlab to do data and algorithm exploration. You can check out the matlab folder and specifically the script speaker_recognition.m. It currently uses data that's exported from GRT. It wouldn't be too hard to simply use audio files (you may check out the MOCHA-TIMIT dataset).

Dataset

  1. TIMIT

Doesn't seem to be free.

  1. MOCHA-TIMIT

Although small, two speakers but can be used for initial testing.

speaker-identification's People

Contributors

nebgnahz avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.