xj-martin,github

alphazero_gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

asteroid

The PyTorch-based audio source separation toolkit for researchers

awesome-speech-enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

bird-recognition-review

A list of useful resources in the bird sound (song and calls) recognition, such as datasets, papers, links to open source projects and competitions

bird_audio_detection

This project analyzes and detects bird audio sounds. It used mel-spectrograms and apply CNN over it.

co-separation

Co-Separating Sounds of Visual Objects (ICCV 2019)

complex-mtassnet

Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.

conditioned-source-separation-lasaft

A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation"

conv-tasnet-1

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

conv-tasnet-2

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

dns-challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

dual-path-rnn-pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

external-attention-pytorch

Pytorch implementation of various Attention Mechanism

fcanet

FcaNet: Frequency Channel Attention Networks

mdx-net

KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021

minus-plus-network

this repo is the codebase for ICCV19 paper "Recursive Visual Sound Separation Using Minus-Plus Net"

mlkd2020-classification-and-identification-of-musical-emotions

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

music-demixing-challenge-starter-kit

Starter kit for getting started in the Music Demixing Challenge.

reinforcement-learning-with-pytorch

Reinforcement learning with PyTorch, inspired by MorvanZhou, change the framework from Tensorflow to PyTorch

semseg

Semantic Segmentation in Pytorch

sigsep-mus-db

Python parser and tools for MUSDB18 Music Separation Dataset

sound-of-pixels

Codebase for ECCV18 "The Sound of Pixels"

speech-emotion-recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别

xj-martin Goto Github PK

xj-martin's Projects

Recommend Projects

Recommend Topics

Recommend Org