shahul-rahman / mabsearch-learning-the-learning-rate

MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent

Home Page: https://link.springer.com/article/10.1007/s40009-023-01292-1

License: MIT License

Languages: Jupyter Notebook 100.00%

Topics: global-minimum, global-optimization, global-optimization-algorithms, gradient-descent, learning-rate, metaheuristics, multi-armed-bandit, multiarm-bandit, multiarmed-bandits, optimization

MABSearch-Learning-the-learning-rate

MABSearch is a simple, easy-to-use global optimization algorithm based on gradient descent. It uses reinforcement learning (RL), in the form of a multi-armed bandit, to learn the optimal learning rate for the given objective function.
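
To make the idea concrete, here is a minimal sketch of a bandit choosing among candidate learning rates during gradient descent. It is not the code from the notebooks or the paper. The epsilon-greedy selection rule, the reward (decrease in the objective), the accept-only-if-not-worse step, and the function name `bandit_gradient_descent` are all assumptions made for illustration.

```python
import random


def bandit_gradient_descent(f, grad, x0, learning_rates,
                            steps=200, epsilon=0.1, seed=0):
    """Illustrative epsilon-greedy bandit over candidate learning rates.

    Hypothetical helper, not the authors' implementation.
    """
    rng = random.Random(seed)
    counts = [0] * len(learning_rates)    # number of pulls per arm
    values = [0.0] * len(learning_rates)  # running mean reward per arm
    x, fx = x0, f(x0)

    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best arm.
        if rng.random() < epsilon:
            arm = rng.randrange(len(learning_rates))
        else:
            arm = max(range(len(learning_rates)), key=lambda i: values[i])

        # Take a trial gradient step with the chosen learning rate.
        candidate = x - learning_rates[arm] * grad(x)
        f_candidate = f(candidate)

        # Assumed reward: how much the objective decreased.
        reward = fx - f_candidate
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

        # Assumed acceptance rule: keep the step only if it is not worse.
        if f_candidate <= fx:
            x, fx = candidate, f_candidate

    return x, fx, values


# Toy objective with its minimum at x = 3.
def objective(x):
    return (x - 3.0) ** 2


def objective_grad(x):
    return 2.0 * (x - 3.0)


x_best, f_best, arm_values = bandit_gradient_descent(
    objective, objective_grad, x0=0.0,
    learning_rates=[0.001, 0.01, 0.1, 1.5],
)
print(x_best, f_best, arm_values)
```

In this sketch, a learning rate that overshoots (such as 1.5 on this objective) earns a negative reward and is rarely chosen again, while rates that reduce the objective accumulate positive value estimates.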

Paper: MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent

Published in: National Academy Science Letters Journal, Springer Publication [SCI Indexed].

Link to paper: https://link.springer.com/article/10.1007/s40009-023-01292-1

PDF of the full paper available at: https://rdcu.be/ddJ8n

What is Optimization (Video Explanation): https://www.youtube.com/watch?v=Gu7si5T0z_w

How to Cite: Syed Shahul Hameed, A.S., Rajagopalan, N. MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent. Natl. Acad. Sci. Lett. (2023). https://doi.org/10.1007/s40009-023-01292-1

How to Use: There are two Jupyter notebooks in this repository. No special prerequisite packages are required. The notebooks can be downloaded and executed, or the code can simply be copied.

  1. An experiment-ready version titled "MABSearch (Experiment Ready Version).ipynb". This notebook contains all the GD code and the proposed MABSearch algorithm.
  2. An easier-to-understand version titled "MABSearch.ipynb", with comments explaining the proposed MABSearch algorithm alone.

For any suggestions or questions, email: [email protected]

Please cite the paper if you find it useful.
