Giter Site home page Giter Site logo

Comments (9)

xhluca avatar xhluca commented on May 28, 2024 1

Practically it does perform better than official. -_- .

I've only observed better performance in one case so I'm not sure it generalizes. In that case, the improved performance does indicate that extreme low survival rates (<0.3) might be a good regularization approach.

from keras-applications.

darcula1993 avatar darcula1993 commented on May 28, 2024

I check the drop rate per block and it looks fine:

block1a_ 1.0
block1b_ 0.9875
block1c_ 0.975
block2a_ 0.9625
block2b_ 0.95
block2c_ 0.9375
block2d_ 0.925
block2e_ 0.9125
block3a_ 0.9
block3b_ 0.8875
block3c_ 0.875
block3d_ 0.8625
block3e_ 0.85
block4a_ 0.8375
block4b_ 0.825
block4c_ 0.8125
block4d_ 0.8
block4e_ 0.7875
block4f_ 0.775
block4g_ 0.7625
block5a_ 0.75
block5b_ 0.7375
block5c_ 0.725
block5d_ 0.7124999999999999
block5e_ 0.7
block5f_ 0.6875
block5g_ 0.675
block6a_ 0.6625
block6b_ 0.6499999999999999
block6c_ 0.6375
block6d_ 0.625
block6e_ 0.6125
block6f_ 0.6
block6g_ 0.5874999999999999
block6h_ 0.575
block6i_ 0.5625
block7a_ 0.55
block7b_ 0.5375
block7c_ 0.5249999999999999

from keras-applications.

xhluca avatar xhluca commented on May 28, 2024

@darcula1993 I'm confused. Shouldn't the block rate be at ~0.8 for the final block since the drop_connect_rate is 0.2 by default?

from keras-applications.

xhluca avatar xhluca commented on May 28, 2024

So it turns out I pasted different values. However the problem remains as indicated.

from keras-applications.

darcula1993 avatar darcula1993 commented on May 28, 2024
        for j in range(round_repeats(args.pop('repeats'))):
            # The first block needs to take care of stride and filter size increase.
            if j > 0:
                args['strides'] = 1
                args['filters_in'] = args['filters_out']
            x = block(x, activation_fn, drop_connect_rate * b / blocks,
                      name='block{}{}_'.format(i + 1, chr(j + 97)), **args)
            b += 1

I check the code and seems that b can be greater than num of blocks,not sure why.

from keras-applications.

xhluca avatar xhluca commented on May 28, 2024

I've observed the same thing as well.

from keras-applications.

fmbahrt avatar fmbahrt commented on May 28, 2024

Qubvel's implementation does not calculate the total number of blocks correctly for configurations larger than B0.

from keras-applications.

innat avatar innat commented on May 28, 2024

Practically it does perform better than official. -_- .

from keras-applications.

innat avatar innat commented on May 28, 2024

Well, I'm not sure, maybe I need to look again properly. In fact, I spent almost a week assuming that there probably some problem with my data loader using the official efficient-net. But when I use non-official implementation, it was just fine.

from keras-applications.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.