Name: Hangjie Yuan
Type: User
Company: ZJU, Alibaba DAMO, MMLab@NTU
Bio: An AI researcher and a realistic idealist.
Location: Hangzhou, Singapore
Blog: https://jacobyuan7.github.io/
Hangjie Yuan's Projects
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Official implementations for paper: Anydoor: zero-shot object-level image customization
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
[ICCV 2021] A new codebase containing various methods for Group Activity Recognition. Paper title: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.
[CVPR 2022 Oral]Official implementation of DN-DETR
Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation
A list of Human-Object Interaction Learning.
Config files for my GitHub profile.
Hangjie Yuan's homepage
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
[AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
README文件语法解读,即Github Flavored Markdown语法介绍
[NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Graph Generation.
[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
Finetune ModelScope's Text To Video model using Diffusers 🧨
Official repo for I2VGen-XL: High-Quality Image-to-Video Synthesis Via Cascaded Diffusion Models
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Personal homepage of Jiangning Zhang