Giter Site home page Giter Site logo

klean2050 / ic_augment Goto Github PK

View Code? Open in Web Editor NEW
3.0 2.0 0.0 77.96 MB

[EUSIPCO 2020] Official TensorFlow Implementation

License: MIT License

Jupyter Notebook 100.00%
instrument-classification augmentation audio-mixing ensemble-learning music eusipco2020

ic_augment's Introduction

Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music

Image

This is the official implementation code of the paper:

  • *A. Kratimenos, *K. Avramidis, C. Garoufis, A. Zlatintsi, & P. Maragos, "Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music," 2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam, 2021, pp. 156-160, doi: 10.23919/Eusipco47968.2020.9287745.

Abstract

Instrument classification is one of the fields in Music Information Retrieval (MIR) that has attracted a lot of research interest. However, the majority of that is dealing with monophonic music, while efforts on polyphonic material mainly focus on predominant instrument recognition or multi-instrument recognition for entire tracks. We present an approach for instrument classification in polyphonic music using monophonic training data that involves mixing-augmentation methods. Specifically, we experiment with pitch and tempo-based synchronization, as well as mixes of tracks with similar music genres. Further, a custom CNN model is proposed, that uses the augmented training data efficiently and a plethora of suitable evaluation metrics are discussed as well. The tempo-sync and genre techniques stand out, achieving an 81% label ranking average precision accuracy, detecting up to 9 instruments in over 2300 testing tracks.

Citation

If you find our paper useful in your research, please consider citing:

@inproceedings{kratimenos2020ic,
  title={Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music},
  author={Kratimenos, A. and Avramidis, K. and Garoufis, C. and Zlatintsi, A. and Maragos, P.},
  booktitle={2020 28th European Signal Processing Conference (EUSIPCO)},
  pages={156-160},
  year={2020},
  doi={10.23919/Eusipco47968.2020.9287745}
}

ic_augment's People

Contributors

agelosk avatar klean2050 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.