Solomon Raj Panduga - [email protected]
- spanduga_assignment1_final.ipynb - Part 2 final submission notebook
- spanduga_assignment1_checkpoint.ipynb - Part 1 checkpoint submission notebook
- data_q_table_values.h5 - Trained Q-Table values
- data_total_rewards_epsilon.h5 - Total reward and epsilon decay values recorded during training
- images - Folder with images used to visualize the grid world:
  - Initial state
  - When the agent is navigating
  - When the agent collects the reward
  - When the agent is at the goal state