Giter Site home page Giter Site logo

awesome_diffusions's Introduction

I only put the resources (e.g. papers, blogs, etc.) that I have read and found interesting in this list. So the update speed would be depending on my speed of reading stuffs.


Catalogue:


1. Vision: [Back to Top]

1.1. Text-to-Image Generation:

  • "AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities" Zhongzhi Chen, Guang Liu, Bo-Wen Zhang, Fulong Ye, Qinghong Yang, Ledell Wu; [arxiv][code]
  • "Hierarchical Text-Conditional Image Generation with CLIP Latents" Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen; [arxiv]
  • "High-Resolution Image Synthesis with Latent Diffusion Models" Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer; [arxiv]

1.2. Object Detection:

  • "DiffusionDet: Diffusion Model for Object Detection" Shoufa Chen, Peize Sun, Yibing Song, Ping Luo; [arxiv][code]

1.3. Image Generation:

  • "DENOISING DIFFUSION IMPLICIT MODELS" Jiaming Song, Chenlin Meng, Stefano Ermon; [arxiv][code]

  • "Adding Conditional Control to Text-to-Image Diffusion Models" Lvmin Zhang and Maneesh Agrawala; [arxiv][code]


2. Language: [Back to Top]

2.1. Text Generation:

  • "Diffusion-LM Improves Controllable Text Generation" Xiang Lisa Li, John Thickstun, Ishaan Gulrajani, Percy Liang, Tatsunori B. Hashimoto; [arxiv][code]
  • "DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models" Zhengfu He, Tianxiang Sun, Kuanning Wang, Xuanjing Huang, Xipeng Qiu; [arxiv][code].
  • "GENIE: Large Scale Pre-training for Generation with Diffusion Model" Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Weizhu Chen, Nan Duan; [arxiv]
  • "Difformer: Empowering Diffusion Model on Embedding Space for Text Generation" Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu; [arxiv]
  • "DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models" Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong; [arxiv][code]
  • "Latent Diffusion for Language Generation" Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Weinberger; [arxiv]

3. Vision and Language: [Back to Top]

3.1. Image Captioning:

  • "Exploring Discrete Diffusion Models for Image Captioning" Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu; [arxiv][code]

4. Other Topics: [Back to Top]

  • "CARD: Classification and Regression Diffusion Models" Xizewen Han, Huangjie Zheng, Mingyuan Zhou; [arxiv][code]

5. Blogs and Other Resources: [Back to Top]

  • "The Illustrated Stable Diffusion" Jay Alammar; [link]
  • "How diffusion models work: the math from scratch" Sergios Karagiannakos, Nikolas Adaloglou; [link]
  • Github Repo for minimal-text-diffusion

awesome_diffusions's People

Contributors

yxuansu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.