Huu Tuong Tu's Projects
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
PyTorch android examples of usage in applications
AudioLDM training, finetuning, evaluation and inference.
Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline
Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.
Materials for the Hugging Face Diffusion Models Course
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
Khung project sử dụng FastAPI
Build and train state-of-the-art natural language processing models using BERT
Python toolkit for quantitative finance
The official implementation of HierSpeech++
Config files for my GitHub profile.
Just test :D
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
JuliaLang version of "An Introduction to Statistical Learning: With Applications in R"
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Knowledge distillation in Mispronunciation detection and diagnosis
Comparison of L2 Korean pronunciation error patterns from five L1 backgrounds by using automatic phonetic transcription
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
This is Mispronunciation detection and diagnosis Score Metric