Giter Site home page Giter Site logo

victor369basu / audio-track-separation Goto Github PK

View Code? Open in Web Editor NEW
19.0 2.0 5.0 2.61 MB

In this Repository, We developed an audio track separator in tensorflow that successfully separates Vocals and Drums from an input audio song track.

Python 100.00%
tensorflow audio audio-processing visualization unet research-project librosa keras neural-network machine-learning

audio-track-separation's Introduction

Hi there ๐Ÿ‘‹

I am Victor Basu, I'm a passionate Data Scientist and Machine Learning Engineer with a strong background in turning data into actionable insights and building intelligent systems. My journey in the world of data and machine learning began with a curiosity to unravel the hidden patterns in data and use them to make informed decisions. I am a kaggle Notebooks Master, you could follow me on Kaggle at @basu369victor. I have a hand full of experience with the technologies required today at the industry level. Other than Data Science and Machine Learning I do take some interest in web-development stuffs. You could also follow me on LinkedIn at @Victor Basu

My contribution to Keras -

My work on Attention based Protein Structure Prediction

protein Structure Prediction

Highlight

Facial Emotion Recognition

HLD

In this project I have developed an end-to-end pipeline for real-time Facial emotion recognition application through full-stack development. The frontend is developed in react.js and the backend is developed in FastAPI. The emotion prediction model is built with Tensorflow Keras, and for real-time face detection with animation on the frontend, Tensorflow.js have been used.

GitHub project repository - Facial Emotion Recognition

High quality youtube video available at - https://youtu.be/aTe05n6T5Vo

Architecture to host QuickSight Dashboard for HuggingFace model monitoring deployed on SageMaker along with data EDA

architecture

This is a solution that demonstrates how to train and deploy a pre-trained Huggingface model on AWS SageMaker and publish an AWS QuickSight Dashboard that visualizes the model performance over the validation dataset and Exploratory Data Analysis for the pre-processed training dataset. With this as the architecture for the proposing solution, we try to solve the classification of medical transcripts through Machine Learning, which is basically solving a Bio-medical NLP problem. In this solution, we also discuss feature engineering and handling imbalanced datasets through class weights while training by writing a custom Huggingface trainer in PyTorch.

GitHub project repository - Host QuickSight Dashboard for HuggingFace model monitoring deployed on SageMaker along with data EDA Demo video - https://youtu.be/RhTSnn41cnM

Readme Card Readme Card Readme Card Readme Card Readme Card Readme Card

Top Langs

Victor's GitHub stats

audio-track-separation's People

Contributors

victor369basu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

audio-track-separation's Issues

About Separating Track

Hi,

How can I separate .wav file to vocal and instrumental(accompaniment) .wav files? I can't see it in output folder.

checkpoint model request

Hi, very excited about this tool that looks promising,

But I would like to make a request, would it be possible to release the pre-trained model? to test on our own songs,

We are already grateful.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.