pabloo22 / tabular_rl Goto Github PK
View Code? Open in Web Editor NEWA simple Reinforcement Learning framework to decouple agents from environments in discrete-state settings with implementations of double Q-learning and policy iteration through dynamic programming.
License: MIT License