stylegan-keras-ece655

[Image: result 1076000]

[Image: result 1074000]

We implemented the model in TensorFlow Keras, based on the paper https://arxiv.org/pdf/1812.04948.pdf, and trained it on a Google Cloud AI Notebook with one NVIDIA Tesla V100 GPU. We used the CelebA-HQ dataset at an image resolution of 256x256. Training took about 3-4 days.

The learning rate for the generator and the discriminator was 0.0001. The styler (called the mapping network in the paper) used 0.01 times that rate, i.e. 0.000001.
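
As a rough illustration of the rates above, the following sketch derives the three learning rates from one base value. The names `BASE_LR`, `STYLER_LR_MULTIPLIER`, and `learning_rates` are hypothetical and not taken from the repository, and the README does not say which optimizer was used, so the Adam call mentioned in the comment is only an assumption.

```python
# Hypothetical names; only the numeric values come from the README.
BASE_LR = 1e-4                 # generator and discriminator learning rate
STYLER_LR_MULTIPLIER = 0.01    # styler (mapping network) uses 0.01x the base rate

learning_rates = {
    "generator": BASE_LR,
    "discriminator": BASE_LR,
    "styler": BASE_LR * STYLER_LR_MULTIPLIER,  # about 1e-6
}

# Assumption: in Keras, each rate could be given to its own optimizer, e.g.
#   tf.keras.optimizers.Adam(learning_rate=learning_rates["styler"])
# with each optimizer applied only to the variables of its sub-network.
for name, lr in learning_rates.items():
    print(f"{name}: {lr:.0e}")
```

Keeping the multiplier as a separate constant makes it easy to experiment with an even lower styler rate without touching the base rate.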

We saved generated results every 1000 global steps in the folder /image-bucket/generated_images. In each image, the third row mixes the styles of the first and second rows at different layers. The results appear to improve gradually in fine details. I might continue the training process when budget allows.
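
The mixing shown in the third row can be sketched as follows, assuming each generator layer receives its own style vector and the mix switches from one source to the other at a chosen layer. The function `mix_styles` and the `crossover` parameter are hypothetical names for illustration; the repository may implement the mix differently.

```python
def mix_styles(styles_a, styles_b, crossover):
    """Return per-layer styles that follow source A up to `crossover`
    (exclusive) and source B from `crossover` onward.

    styles_a, styles_b: lists with one style entry per generator layer.
    """
    if len(styles_a) != len(styles_b):
        raise ValueError("both sources need one style per generator layer")
    if not 0 <= crossover <= len(styles_a):
        raise ValueError("crossover must lie between 0 and the layer count")
    return styles_a[:crossover] + styles_b[crossover:]


# Placeholder strings stand in for real style vectors.
row1 = ["a0", "a1", "a2", "a3"]
row2 = ["b0", "b1", "b2", "b3"]
print(mix_styles(row1, row2, 2))
```

An early crossover hands most layers, including the coarse ones, to the second source, while a late crossover only swaps the styles of the fine-detail layers.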

My personal understanding of the structure:

  1. The Adaptive Instance Normalization (AdaIN) operation rescales each normalized feature map with a scale and bias derived from the styler outputs. In effect, it applies weights to the convolution kernels and selects which kernels to use in each generator layer/block.

  2. The learning rate of the styler needs to be lower so that the selection of convolution kernels stays random enough at the start and all kernels get fully trained. I also believe that models requiring more levels of detail might need an even lower learning rate for the styler and more training time to improve the finer details.
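
To make point 1 concrete, here is a minimal, framework-free sketch of AdaIN on a few feature-map channels. It is not the repository's Keras layer: the function name `adain`, the flat-list data layout, and the `eps` value are assumptions chosen for illustration. Each channel is normalized to zero mean and unit variance, then multiplied by a per-channel scale and shifted by a per-channel bias, both of which would come from the styler output.

```python
import math


def adain(channels, scales, biases, eps=1e-8):
    """Adaptive Instance Normalization over a list of channels.

    channels: list of channels, each a flat list of activations
    scales, biases: one style-derived scale and bias per channel
    """
    out = []
    for values, scale, bias in zip(channels, scales, biases):
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        std = math.sqrt(var + eps)
        # Normalize the channel, then apply the style's scale and bias.
        out.append([scale * (v - mean) / std + bias for v in values])
    return out


features = [[1.0, 2.0, 3.0], [4.0, 6.0, 8.0]]
# A scale of 0 removes a channel's spatial variation entirely, leaving only
# its bias; this is one way to read the "kernel selection" idea above.
styled = adain(features, scales=[2.0, 0.0], biases=[0.5, 1.0])
print(styled)
```

Because the normalization discards each channel's original mean and variance, the style alone decides how strongly each kernel's output contributes to the next layer.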
