Skip to content

Trains an agent with (stochastic) Policy Gradients on Pong

Notifications You must be signed in to change notification settings

VolkerFelix/pong

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Trains an agent with (stochastic) Policy Gradients on Pong

game

Based on this awesome blog post from the great Andrej Kaparthy: https://karpathy.github.io/2016/05/31/rl/

Install

Create a virtual env and activate it.
python -m venv venv
source venv/bin/activate
Install requirements.
pip install -r requirements.txt
Then install the gym.
pip install "gym[atari]"
Accept licences.
pip install "gym[accept-rom-license, atari]"

Run

python src/pong.py"

Architecture

model_arch Weights
W1: (200 x 6400)
W2: (1 x 200)

Math

Page1 Page2

References

Andrej Kaparthy blog
David Silver - Reinforcement Learning
Richard S. Sutton - Reinforcement Learning: An Introduction

About

Trains an agent with (stochastic) Policy Gradients on Pong

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published