Giter Site home page Giter Site logo

Comments (6)

GMvandeVen avatar GMvandeVen commented on May 29, 2024

Hey, I quickly checked but I couldn't find any mistakes and those accuracies saved into the pickle seem correct to me. I think the confusion is with how the "all_tasks" entry is organised, which isn't very intuitive. The first list of "all_tasks" actually contains the test accuracy for the first task measured after training on each of the tasks. The n-th entry of the m-th list is the test accuracy for task m after training on task n. Sorry for that weird structure. I hope this explains it, but if not please let me know and I'll have another look.

from continual-learning.

Johswald avatar Johswald commented on May 29, 2024

from continual-learning.

GMvandeVen avatar GMvandeVen commented on May 29, 2024

Yes, that's correct!

from continual-learning.

Johswald avatar Johswald commented on May 29, 2024

from continual-learning.

GMvandeVen avatar GMvandeVen commented on May 29, 2024

Yes, that looks about right to me. The regularisation of EWC indeed hinders the network to learn new tasks (because it penalizes new parameter changes), and this effect gets larger the more tasks have been learned. You can make the network more plastic by reducing lambda (or reducing gamma might also work), which will probably increase the accuracies for the later tasks but reduce those for the earlier tasks. What the optimal values of these EWC hyper parameters are heavily depends on both the type and number of tasks. (The optimal values for different task protocols can differ by several orders of magnitude, see for example Appendix D here: https://arxiv.org/abs/1904.07734.)

from continual-learning.

Johswald avatar Johswald commented on May 29, 2024

Thank you again!

from continual-learning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.