This was a course assignment in Reinforcement Learning where I trained an algorithm to play Snake by iteratively improving a policy that decides the snake's best course of actions.
The app notebook contains the projects actual solution while the presentation notebook contains a brief explanation of the game and demonstrates why we chose the particular epsilon and gamma values that we did.