Huy Q Can's Projects
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
KTVXL
A comprehensive list of awesome document image rectification papers.
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;
A curated list of resources dedicated to table recognition
Assignment 1 For CIS 419 on Decision Tree Learning and Linear Regression
Operation system course
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Data science interview questions and answers
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
End-to-End Object Detection with Transformers
A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)
DocILE: Document Information Localization and Extraction Benchmark
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
The human parsing network used in ViTAA, which is specially trained for reid datasets.
Config files for my GitHub profile.