dori2063 Goto Github PK
Name: Youngdo Ahn
Type: User
Company: GIST
Bio: πββοΈ
Location: Gwangju, Republic of Korea
Name: Youngdo Ahn
Type: User
Company: GIST
Bio: πββοΈ
Location: Gwangju, Republic of Korea
Source code for "On the Relationship between Self-Attention and Convolutional Layers"
TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition"
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Finding label errors in datasets and learning with noisy labels.
source code to ICLR'19, 'A Closer Look at Few-shot Classification'
Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation (ICLR 2020 spotlight)
The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"
Tensorflow code for CVPR 2017 paper: Learning a Deep Embedding Model for Zero-Shot Learning
Code for paper "DeepEMD: Few-Shot Image Classification with Differentiable Earth Mover's Distance and Structured Classifiers", CVPR2020
DomainBed is a suite to test domain generalization algorithms
Tools for testing emotion recognition methods.
End-to-End Speech Processing Toolkit
Separate block diagrams for training and test phases.
Implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation
Perform transfer learning for MIR using Jukebox!
NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).
Different implementations of "Weighted Prediction Error" for speech dereverberation
code for the ICML paper "SelectiveNet - A Deep Neural Network with an Integrated Reject Option"
speech emotion recognition, augmentation
An implementation of SkipVQVC with various settings.
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
PyTorch code for βTVLT: Textless Vision-Language Transformerβ (NeurIPS 2022)
Code release for Universal Domain Adaptation(CVPR 2019)
Baseline pipeline LiFE to reproduce the extracted linguistic features from the ComParE2020_USOMS-e challenge. We utilise and provide contextual word embeddings using a frozen (not fine-tuned) German Bidirectional Language Transformer (Bert).
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.