salimandre / Metaheuristic-TSP-LSCO Star 1 Code Issues Pull requests Discrete and continuous optimization problems solved iteratively and approximately by metaheuritic algorithms. optimization-algorithms metaheuristics 2-opt travelling-salesman-problem greedy-policy 2-swap large-scale-continuous-optimization Updated Jun 22, 2022 Python
hritikb / Reinforcement-Learning-Algorithms Star 1 Code Issues Pull requests reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy Updated Jun 29, 2023 Jupyter Notebook
RezaSaadatyar / Reinforcement-Learning Star 0 Code Issues Pull requests The repository contains codes for RL (e.g., Q-Learning, Monte Carlo, …) in the form of Python files. reinforcement-learning q-learning dynamic-programming multi-armed-bandit policy-iteration monte-carlo-methods greedy-policy e-greedy-policy upper-confidence-bounds-policy stochastic-gradient-ascent-policy iterative-policy-evaluation monte-carlo-exploring-starts state-action-reward-state-action first-visit-mc-prediction value-iteration- Updated Sep 12, 2023 Jupyter Notebook