SanketAgrawal / ReinforcementLearning Star 3 Chapter-wise implementation & analysis of all the algorithms in RL: An Introduction by Richard S. Sutton and Andrew G. Barto reinforcement-learning artificial-intelligence epsilon-greedy python-3 ucb k-armed-bandit gradient-bandit optimistic-inital-values Updated Jul 18, 2020 Jupyter Notebook
hritikb / Reinforcement-Learning-Algorithms Star 1 reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy Updated Jun 29, 2023 Jupyter Notebook