GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua
The University of Hong Kong, Meta
This repository contains:
- 🪐 A simple PyTorch implementation of Text-to-Image GenTron
- 🛸 A GenTron training script
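To sample images, run `sample.py`; for example, at 512×512 resolution with a fixed seed:
python sample.py --image_size 512 --seed 1
To choose the model variant and load a specific checkpoint:
python sample.py --model GenTron-T2I-XL/2 --image_size 256 --ckpt /path/to/model.pt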
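The flags used in the sampling commands above (`--model`, `--image_size`, `--seed`, `--ckpt`) could be parsed roughly as follows. This is a minimal sketch, not the repository's actual `sample.py`: the helper name `parse_sample_args` and every default value are assumptions made for illustration, and the real script may accept more options or different defaults.

```python
import argparse
import random


def parse_sample_args(argv=None):
    """Parse the sampling flags shown in the commands above (hypothetical helper)."""
    parser = argparse.ArgumentParser(description="Sketch of a GenTron-style sampling CLI")
    # All defaults below are assumptions for illustration only.
    parser.add_argument("--model", type=str, default="GenTron-T2I-XL/2")
    parser.add_argument("--image_size", type=int, default=256)
    parser.add_argument("--seed", type=int, default=0)
    parser.add_argument("--ckpt", type=str, default=None)
    return parser.parse_args(argv)


# Mirrors: python sample.py --image_size 512 --seed 1
args = parse_sample_args(["--image_size", "512", "--seed", "1"])

# Seeding makes runs repeatable; a real script would more likely seed PyTorch
# (e.g. torch.manual_seed(args.seed)) rather than Python's random module.
random.seed(args.seed)

print(args.model, args.image_size, args.seed, args.ckpt)
```

Passing an explicit list to `parse_sample_args` keeps the sketch easy to try from an interpreter; with no argument it reads `sys.argv`, as a command-line script would.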
To train with fp16 mixed precision through Accelerate:
accelerate launch --mixed_precision fp16 train.py --data_path /path/to/ImageNet/train