raymondhuang210129 / uiuc-audio-lab

Jupyter Notebook 54.93% HTML 45.07% Python 0.01%

uiuc-audio-lab's Introduction

UIUC CS498 Spring 2020: Audio Laboratory Assignments

Introduction

This course covers the basic ideas of audio processing and a wide range of techniques for creating sound effects.
The course and its assignments were designed by Professor Paris Smaragdis.

Contents

W1: Basic Sound Manipulation
- Generate tones and basic sounds such as chirp, ping-pong, etc.
- Input and output sounds from/to storage or sound devices
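
As an illustration of the W1 topics, here is a minimal sketch that generates a linear chirp and writes it as 16-bit mono WAV data. It is not taken from the course notebooks: the names `linear_chirp` and `write_wav`, the 16 kHz sample rate, and the sweep range are assumptions chosen for the example, and only the Python standard library is used.

```python
import io
import math
import struct
import wave

def linear_chirp(f0, f1, duration, sample_rate=16000):
    """Return samples in [-1, 1] of a sine sweep from f0 to f1 Hz."""
    n = int(duration * sample_rate)
    samples = []
    for i in range(n):
        t = i / sample_rate
        # Phase of a linear sweep: 2*pi*(f0*t + (f1 - f0)*t^2 / (2*duration))
        phase = 2 * math.pi * (f0 * t + (f1 - f0) * t * t / (2 * duration))
        samples.append(math.sin(phase))
    return samples

def write_wav(target, samples, sample_rate=16000):
    """Write mono 16-bit PCM to a path or a file-like object."""
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)            # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        pcm = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
        wav.writeframes(struct.pack("<%dh" % len(pcm), *pcm))

chirp = linear_chirp(200.0, 2000.0, duration=0.5)
buffer = io.BytesIO()                  # swap for a filename such as "chirp.wav"
write_wav(buffer, chirp)
```

Writing to an in-memory buffer keeps the sketch free of side effects; passing a file path instead stores the sound on disk.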
W2: Time and Frequency Domain
- Compute the Short-Time Fourier Transform (STFT) and generate spectrograms
- Block certain noises in the frequency domain and transform the result back to sound data in the time domain
W3: Fix Sounds with Filters
- Design lowpass, highpass, bandpass, and bandstop filters with different techniques and apply them to different types of noise
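
One possible design technique for W3 is the windowed-sinc method. The sketch below builds a lowpass FIR filter with a Hamming window and applies it by direct convolution. The function names, the 51-tap length, and the 1 kHz / 8 kHz figures are assumptions for illustration; the assignments may well use other designs or library routines.

```python
import math

def lowpass_fir(cutoff_hz, sample_rate, num_taps=51):
    """Design a linear-phase lowpass FIR with the windowed-sinc method."""
    fc = cutoff_hz / sample_rate           # normalized cutoff, cycles per sample
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        k = n - mid
        # Ideal (infinite) lowpass impulse response, sampled at offset k
        ideal = 2 * fc if k == 0 else math.sin(2 * math.pi * fc * k) / (math.pi * k)
        # Hamming window tapers the truncated response
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    total = sum(taps)
    return [t / total for t in taps]       # normalize to unity gain at DC

def fir_filter(taps, signal):
    """Direct-form convolution, output truncated to the input length."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(taps):
            if n - k >= 0:
                acc += h * signal[n - k]
        out.append(acc)
    return out

taps = lowpass_fir(cutoff_hz=1000.0, sample_rate=8000.0)
steady = fir_filter(taps, [1.0] * 100)     # a constant (0 Hz) input passes through
```

A highpass, bandpass, or bandstop filter can be derived from the same lowpass prototype, for example by spectral inversion or by combining two lowpass designs.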
W4: Reverberation and Room Simulation
- Generate reverb effects by designing comb filters, allpass filters, etc.
- Obtain the room's response via deconvolution and apply it to sounds
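
The comb and allpass filters mentioned for W4 can be sketched as simple difference equations. This is an assumed textbook (Schroeder-style) formulation, not the notebooks' code; `feedback_comb` and `allpass` are hypothetical names, and the delay and gain values are arbitrary.

```python
def feedback_comb(signal, delay, gain):
    """Feedback comb filter: y[n] = x[n] + gain * y[n - delay]."""
    out = []
    for n, x in enumerate(signal):
        fed_back = out[n - delay] if n >= delay else 0.0
        out.append(x + gain * fed_back)
    return out

def allpass(signal, delay, gain):
    """Schroeder allpass: y[n] = -gain * x[n] + x[n - delay] + gain * y[n - delay]."""
    out = []
    for n, x in enumerate(signal):
        x_delayed = signal[n - delay] if n >= delay else 0.0
        y_delayed = out[n - delay] if n >= delay else 0.0
        out.append(-gain * x + x_delayed + gain * y_delayed)
    return out

impulse = [1.0] + [0.0] * 9
echoes = feedback_comb(impulse, delay=3, gain=0.5)
# echoes == [1.0, 0.0, 0.0, 0.5, 0.0, 0.0, 0.25, 0.0, 0.0, 0.125]
```

Feeding an impulse through the comb filter shows the decaying echo train directly; a reverberator typically runs several such combs in parallel and follows them with allpass stages.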
W5: 3D Audio & Virtual Sound
- Simulate sound sources by convolving the original sound with ITD and ILD filters
- Simulate dynamic sources by convolving the sound with a different HRTF at each moment
W6: Microphone Arrays
- Generate steering vectors for the phase shifts of different inputs and identify the directions of sound sources
- Make the microphone array act as a beamforming microphone and transform the original input into localized sound
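
To make the W6 ideas concrete, here is a rough sketch of a steering vector and a delay-and-sum direction scan for a single frequency. It assumes a uniform linear array, a far-field source, and a sound speed of 343 m/s; none of these details come from the assignment, and all function names are hypothetical.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed

def steering_vector(angle_deg, freq_hz, num_mics, spacing_m):
    """Per-microphone phase shifts for a plane wave arriving from angle_deg."""
    delay = spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND
    return [cmath.exp(-2j * math.pi * freq_hz * m * delay) for m in range(num_mics)]

def beam_power(mic_spectrum, angle_deg, freq_hz, spacing_m):
    """Delay-and-sum output power when the array is steered to angle_deg."""
    a = steering_vector(angle_deg, freq_hz, len(mic_spectrum), spacing_m)
    out = sum(w.conjugate() * x for w, x in zip(a, mic_spectrum)) / len(a)
    return abs(out) ** 2

def estimate_direction(mic_spectrum, freq_hz, spacing_m):
    """Scan -90..90 degrees and return the angle with the largest beam power."""
    return max(range(-90, 91),
               key=lambda ang: beam_power(mic_spectrum, ang, freq_hz, spacing_m))

# Simulated one-bin observation: a 1 kHz source at 30 degrees, 8 mics, 5 cm apart
observed = steering_vector(30, 1000.0, num_mics=8, spacing_m=0.05)
direction = estimate_direction(observed, 1000.0, spacing_m=0.05)
```

Here `mic_spectrum` stands for one STFT bin taken across all microphones; steering the same weights at a fixed angle, rather than scanning, yields the beamformed (localized) output.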
W7: Denoising
- Sample the background noise from a recording and denoise with spectral subtraction and median filtering
- Dynamically sample the noise with a voice activity detector to get a better result at every moment
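
The core of spectral subtraction in W7 can be sketched on magnitude spectra alone. The example assumes the STFT magnitudes have already been computed as lists of per-bin values; `estimate_noise`, `spectral_subtract`, and the `over_subtraction` and `floor` parameters are illustrative names, not the notebooks' API.

```python
def estimate_noise(noise_frames):
    """Average magnitude per frequency bin over frames that contain only noise."""
    count = len(noise_frames)
    return [sum(frame[k] for frame in noise_frames) / count
            for k in range(len(noise_frames[0]))]

def spectral_subtract(frames, noise_mag, over_subtraction=1.0, floor=0.0):
    """Subtract the noise estimate from each bin and clamp the result at `floor`."""
    return [[max(mag - over_subtraction * n, floor) for mag, n in zip(frame, noise_mag)]
            for frame in frames]

noise_mag = estimate_noise([[1.0, 2.0, 1.0], [3.0, 2.0, 1.0]])   # [2.0, 2.0, 1.0]
cleaned = spectral_subtract([[5.0, 1.0, 4.0]], noise_mag)        # [[3.0, 0.0, 3.0]]
```

The cleaned magnitudes are normally recombined with the phase of the noisy signal before the inverse STFT. With a voice activity detector, `estimate_noise` would be re-run on whichever recent frames are flagged as non-speech.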
W8: Pitch Tracking and Modifications
- Track voice pitch using autocorrelation, ignoring un-pitched frames
- Track voice pitch by detecting zero crossings and tune the sound with pitch-synchronous overlap-add (PSOLA)
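
Autocorrelation pitch tracking for a single frame might look like the sketch below. The search range (50 to 500 Hz) and the voicing threshold of 0.3 are assumptions picked for illustration, and `autocorr_pitch` is a hypothetical name; frames whose best autocorrelation peak is weak relative to the frame energy are treated as un-pitched.

```python
import math

def autocorr_pitch(frame, sample_rate, fmin=50.0, fmax=500.0, voicing_threshold=0.3):
    """Return the estimated pitch in Hz, or None for an un-pitched frame."""
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return None                        # silent frame
    lag_min = max(1, int(sample_rate / fmax))
    lag_max = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_value = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation at this lag over the overlapping part of the frame
        value = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    if best_lag is None or best_value / energy < voicing_threshold:
        return None                        # peak too weak: treat as un-pitched
    return sample_rate / best_lag

sample_rate = 8000
tone = [math.sin(2 * math.pi * 200.0 * n / sample_rate) for n in range(400)]
pitch = autocorr_pitch(tone, sample_rate)  # a 200 Hz test tone
```

Running this per frame over a recording gives a pitch track with gaps at the un-pitched frames.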
W9: Spectral Factorizations
- Learn the spectral factors of a soundtrack and extract the sound emitted by each instrument
- Train factors for human speech and chimes, then separate a mixture of the two
W10: Audio Classification
- Train a speech/music classifier by implementing a covariance vector and a Gaussian PDF function
- Implement a music genre classifier with the sklearn API, applying an MFCC transformation to enhance the characteristics of each genre
W11: DTW and HMMs
- Apply DTW in a speech recognizer to recognize the user's inputs
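
A template-matching recognizer built on DTW can be sketched as follows. For brevity the sketch assumes one scalar feature per frame and an absolute-difference local cost; a real speech recognizer would more likely compare vectors such as MFCC frames. `dtw_distance`, `recognize`, and the toy templates are hypothetical.

```python
def dtw_distance(a, b):
    """Accumulated cost of the best monotonic alignment between sequences a and b."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

def recognize(query, templates):
    """Return the label of the template with the smallest DTW distance to the query."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))

templates = {"up": [1, 2, 3], "down": [3, 2, 1]}
label = recognize([1, 2, 2, 3, 3], templates)   # time-stretched version of "up"
```

Because DTW allows frames to repeat, the stretched query aligns with the "up" template at zero cost even though the two sequences differ in length.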
