Giter Site home page Giter Site logo

Comments (9)

HelenMao avatar HelenMao commented on May 24, 2024

Hi, thanks for your interest in our work.
In our paper, we use all the training samples and the generated samples in all classes to compute FID (not per-class FID and average the result).
We find most of the previous papers report the FID in this way.
The mean and the standard derivation are calculated based on five independent trials. Please see the appendix in our paper.

from msgan.

youngjung avatar youngjung commented on May 24, 2024

Wow super prompt reply! Thanks!

I misunderstood the sentence in the appendix 'We use all the training samples and the generated samples to compute FID.'. Now I see the point. 5K samples per class and 50K in total and all 50K samples are used to compute a single stats.

(What a waste of time writing per-class FID..... stupid me lol)

Further explanation on five trials is also valuable.
Could you explain whether it means

  • five different models from five training trials (probably this one is the right one but just for double check..)
  • or five testing for one fixed model (randomness from sampling z from Gaussian)
    ?

from msgan.

HelenMao avatar HelenMao commented on May 24, 2024

We just used one fixed model and generated five times of samples.
Hope it helps.

from msgan.

youngjung avatar youngjung commented on May 24, 2024

Thanks!

Finally I got FID of 28.60 from your pretrained model.

However, the model trained by myself reports 35.66.

Did you observe variation among multiple training trials?

I will re-run the training for several times anyway but any comments will be helpful since training for 200K iters takes about 10 hours on my machine.

Is weight_decay on optimizer is accidentally not mentioned in the paper or not erased from your code?
Or, do I have to train for more than 200K iters?

from msgan.

HelenMao avatar HelenMao commented on May 24, 2024

We just obtained the pre-trained model using the codes released and reported the five times of the results using that model.

from msgan.

youngjung avatar youngjung commented on May 24, 2024

Thanks it means that I can keep working on the same code!

I will train again several times and share my experience.

Thank you again :)

from msgan.

youngjung avatar youngjung commented on May 24, 2024

I ran the same code for four more trials and got FIDs below.

trial0: 29.93222174258068
trial1: 30.902965756971412
trial2: 29.98750439731532
trial3: 35.25630393016536

Probably measure of the provided pre-trained model can be reached in a few more trials.

from msgan.

HelenMao avatar HelenMao commented on May 24, 2024

Hi, I find I set the max_iter=200,000 in my code. In fact, I remembered I trained 200 epochs of the pre-trained model.

from msgan.

youngjung avatar youngjung commented on May 24, 2024

Oh, now I see that!

My 00199.pth are not actually from after 200 epoch hahahahaha.

Thanks I will try without if total_it >= max_it: block :)

from msgan.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.