Reinforcement-Learning-TicTacToe

An implementation of an algorithm that plays TicTacToe. The algorithm is based on reinforcement learning, using a Monte Carlo method.

Author: Oliver Zhang
Last Modified: 3/19/18

Goal: Learn how to play Tic Tac Toe. I'm using the TicTacToe environment implemented by nczempin: https://github.com/nczempin/gym-tic-tac-toe/blob/master/gym_tic_tac_toe/envs/tic_tac_toe_env.py#L1

Overall Design:

I use Monte Carlo learning to train a model that predicts the value of an action given a state. The observation is the board state. My code tries every possible move and picks the one that leads to the best resulting board state.
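The move selection described above can be sketched roughly as follows. This is a minimal illustration, not the code from ExperiencedTTTAI.py: the board encoding (a tuple of 9 cells), the helper names legal_moves, apply_move and pick_move, and the value callable standing in for the trained model are all assumptions made for the example.

```python
from typing import Callable, List, Tuple

# Assumed encoding: 9 cells, 0 = empty, 1 = our mark, -1 = opponent's mark.
Board = Tuple[int, ...]


def legal_moves(board: Board) -> List[int]:
    """Return the indices of all empty cells."""
    return [i for i, cell in enumerate(board) if cell == 0]


def apply_move(board: Board, cell: int, player: int) -> Board:
    """Return a new board with the player's mark placed on the given cell."""
    new_board = list(board)
    new_board[cell] = player
    return tuple(new_board)


def pick_move(board: Board, player: int, value: Callable[[Board], float]) -> int:
    """Try every legal move and return the one whose resulting board scores highest.

    `value` is a hypothetical stand-in for the trained model.
    """
    return max(
        legal_moves(board),
        key=lambda cell: value(apply_move(board, cell, player)),
    )


# Toy value function (illustration only): prefer boards where we hold the center.
def prefers_center(board: Board) -> float:
    return 1.0 if board[4] == 1 else 0.0


empty_board: Board = (0,) * 9
chosen = pick_move(empty_board, player=1, value=prefers_center)
```

With the toy value function, the chosen move is the center cell (index 4), since that is the only resulting board with a non-zero score.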

After each game, it learns from the win or loss and updates its estimate of which board states are actually the best.
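As a rough sketch of how learning from a finished game can work, the example below applies an every-visit Monte Carlo update with a constant step size: each board state seen during the game is nudged toward the final outcome. The outcome encoding (+1 win, -1 loss, 0 draw), the function name monte_carlo_update, the step_size parameter, and the dictionary used in place of a trained model are assumptions for illustration, not details taken from the repository.

```python
from collections import defaultdict
from typing import DefaultDict, Hashable, Iterable


def monte_carlo_update(
    values: DefaultDict[Hashable, float],
    visited_states: Iterable[Hashable],
    outcome: float,
    step_size: float = 0.1,
) -> None:
    """Move the value of every visited state toward the game's final outcome.

    The return is undiscounted: every state in the game receives the same target.
    """
    for state in visited_states:
        values[state] += step_size * (outcome - values[state])


# Hypothetical usage: two board states (any hashable key works) and a win.
values: DefaultDict[Hashable, float] = defaultdict(float)
episode = ["state_a", "state_b"]
monte_carlo_update(values, episode, outcome=1.0, step_size=0.5)
monte_carlo_update(values, episode, outcome=1.0, step_size=0.5)
```

Repeated wins pull the estimates toward +1, and repeated losses pull them toward -1, so states that tend to precede wins end up ranked higher.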

To learn reinforcement learning, I suggest David Silver's YouTube lectures: https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PL7-jPKtc4r78-wCZcQn5IqyuWhBZ8fOxT

How to Run:

Copy the files tic_tac_toe_env.py and ExperiencedTTTAI.py to your computer.
Add the TicTacToe environment to your gym installation. See here for more details: https://github.com/openai/gym/wiki/Environments
Modify the path variable to point to a folder for saving weights.
Run ExperiencedTTTAI.py with Python 3.

Options:

Set the 'debug' variable to True to print debugging information.
Set the 'display_img' variable to True to visualize what the program is doing.

Note: This version is pretty messy; I will be cleaning up the code in the future.
