Trains an agent with (stochastic) Policy Gradients on Pong

Based on this awesome blog post from the great Andrej Kaparthy: https://karpathy.github.io/2016/05/31/rl/

Install

Create a virtual env and activate it.
python -m venv venv
source venv/bin/activate
Install requirements.
pip install -r requirements.txt
Then install the gym.
pip install "gym[atari]"
Accept licences.
pip install "gym[accept-rom-license, atari]"

Run

python src/pong.py"

Architecture

Weights
W1: (200 x 6400)
W2: (1 x 200)

Math

References

Andrej Kaparthy blog
David Silver - Reinforcement Learning
Richard S. Sutton - Reinforcement Learning: An Introduction

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
architecture		architecture
math		math
media		media
model		model
src		src
.gitignore		.gitignore
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trains an agent with (stochastic) Policy Gradients on Pong

Install

Run

Architecture

Math

References

About

Releases

Packages

Languages

VolkerFelix/pong

Folders and files

Latest commit

History

Repository files navigation

Trains an agent with (stochastic) Policy Gradients on Pong

Install

Run

Architecture

Math

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages