Giter Site home page Giter Site logo

Comments (10)

MarekKowalski avatar MarekKowalski commented on May 10, 2024

Hi,

Those results are a bit surprising, as the error for the IBUG dataset you obtained seems to be much lower than any paper I have seen so far. Are you sure you are measuring this correctly?

If you want, you can place the models you have in some public data storage like Dropbox and send me the link, so I can re-evaluate it. This would help rule out the possibility of measurement error.

Best regards,

Marek

from deepalignmentnetwork.

mariolew avatar mariolew commented on May 10, 2024

@MarekKowalski Hi, in the training phase, did you hold the learning rate fixed to be 0.001? If I use learning rate 0.001, then the loss in stage 2 cannot drop.

from deepalignmentnetwork.

MarekKowalski avatar MarekKowalski commented on May 10, 2024

Yes, the learning rate was kept the same. When you are training the second stage are you updating the parameters for both stages or just stage 2?

In my experience, if you are updating parameters of both stages at the same time, the error takes much longer to drop.

from deepalignmentnetwork.

mariolew avatar mariolew commented on May 10, 2024

@MarekKowalski Actually, I only update parameters for stage 2, but I can see error drop only when I lower down the learning rate.

from deepalignmentnetwork.

MarekKowalski avatar MarekKowalski commented on May 10, 2024

That's surprising, for how long are you trying to see if the error starts droping? Also, is that in your implementation or in the one in this repo?

from deepalignmentnetwork.

mariolew avatar mariolew commented on May 10, 2024

@MarekKowalski I mean in my implementation, I only observe error drop with my learning rate lower down. And, someone has found almost the same problem in another tensorflow implementation. I've double checked the functionality of the self defined layer and found no problem, I don't know if the Adam is implemented differently in theano, or Batch Norm is implemented differently.

from deepalignmentnetwork.

MarekKowalski avatar MarekKowalski commented on May 10, 2024

I agree that it might be because TF has a different implementation of some part of the learning algorithm or the network.

from deepalignmentnetwork.

zjjMaiMai avatar zjjMaiMai commented on May 10, 2024

@MarekKowalski @mariolew Hi
In my implementation on tensorflow, I have not found this problem.
i use a private dataset , and epoch is [15,45]

image

from deepalignmentnetwork.

jnulzl avatar jnulzl commented on May 10, 2024

i want to know if both stages share parameters in the same layer?

from deepalignmentnetwork.

MarekKowalski avatar MarekKowalski commented on May 10, 2024

No the parameter values are not shared between the stages.

Marek

from deepalignmentnetwork.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.