Giter Site home page Giter Site logo

min-max-in-dit's Introduction

Min-Max-Imagenet DiT

In a similar spirit to the Keller Jordan's Fastest CIFAR-10 training, I want to be the fastest diffusion trainer in the east. I'll keep the progress here. Currently very much WIP.

Featuring:

  • DeepSpeed training of Diffusion Transformer. Supports Zero-1,2,3.
  • CPU-offloaded, skipped EMA trick for Karras' Post-hoc EMA analysis, where you EMA once in every N steps instead. You have to adjust beta_1 and beta_2 so they are properly accounting for the fact you skipped last N-1 steps. Of course, saving codes are there.
  • Featuring Streaming Dataset, specially my quantized imagenet.int8 for insanely lightweight imagenet training.

Dataset

Since this dataset is so small, you don't need to setup massive remote data setup stuff, just point to the local_dir, set remote_dir to None.

Running

For single-node setup, just

run.sh

Whats the goal here?

My goal is to get FID score of 30 under 20 hours of training. I'll keep updating this README as I make progress.

min-max-in-dit's People

Contributors

cloneofsimo avatar

Stargazers

Eisneim Terry avatar Ollin Boer Bohan avatar Yuqian Hong avatar elucida avatar Mario Larcher avatar  avatar  avatar  avatar Yong-jin avatar kaan bursa avatar Ina299 avatar YongHyun Park avatar Sang avatar Hans Brouwer avatar George Grigorev avatar  avatar Yutong Lin avatar  avatar  avatar  avatar Ruixiang Zhang avatar Pasha S avatar Sofian Mejjoute avatar Sean avatar  avatar Keep Growing And Moving Forward avatar

Watchers

Ruixiang Zhang avatar Hans Brouwer avatar  avatar  avatar Keep Growing And Moving Forward avatar Sofian Mejjoute avatar

Forkers

hohotyh

min-max-in-dit's Issues

Benchmarks on speed improvements

Hey do you have some benchmarks on how much you improve for each component?

In particular I'm interested about the streaming library and the int8 imagenet.

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.