Status: Maintenance (expect bug fixes and additional algorithms)
Weg's RL Baselines is a set of minimum viable implementations of reinforcement learning algorithms.
Most common implementations are overly complex or require intimate knowledge of a company's internal software pipeline or frameworks. This imposes a real barrier to education in the field of reinforcement learning, where the complexity of the algorithms is already a concern on its own.
Anything that would expand the files beyond minimum complexity is not included. Anything that would be considered boilerplate is not included. No model saving, loading, or testing. You can easily add those features yourself. Itertools and less common numpy and pytorch tensor operations and features are avoided to keep the baselines accessible to the average Python programmer.
These algorithms function as a starting point for your own implementations. While they are not distributed implementations, they could be used in production with minimal modification. They cover the most commonly used DRL algorithms, and with hyperparameter tuning can achieve state-of-the-art results in continuous control environments that do not have sparse rewards.
Baselines requires Python 3, PyTorch, and OpenAI Gym.
pip3 install gym
This may also be necessary:
pip3 install gym[box2d]
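If PyTorch is not already installed, it can usually be installed with pip as well (see pytorch.org for the exact command matching your platform and CUDA version):

pip3 install torch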
- Clone the repo and cd into it:
git clone https://github.com/wegfawefgawefg/wegs-drl-baselines.git
cd wegs-drl-baselines
Most of the algorithms in the wegs-drl-baselines repo are run by executing the corresponding algorithm file:
python3 ./dqn.py
Weg's DRL Baselines does not include model saving or loading, as that would go against the repo's goal of staying minimal. However, adding this functionality is trivial.
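For example, a minimal checkpointing sketch with PyTorch, assuming the agent keeps its network in an attribute such as `agent.network` (the attribute name is hypothetical; match it to whatever nn.Module the algorithm file you are running defines):

```python
import torch

# After training: save the network weights.
# agent.network is a hypothetical attribute name -- use the nn.Module
# defined in your chosen algorithm file.
torch.save(agent.network.state_dict(), "checkpoint.pt")

# Later: rebuild the network the same way the algorithm file does, then load the weights.
agent.network.load_state_dict(torch.load("checkpoint.pt"))
agent.network.eval()
```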
You can store rewards and losses in lists and plot them with matplotlib or TensorBoard, for example:
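A minimal logging sketch with matplotlib (`episode_rewards` is a hypothetical list that you would fill inside whichever training loop you are running):

```python
import matplotlib.pyplot as plt

episode_rewards = []  # append each episode's total reward here, inside the training loop

# After (or during) training, plot the learning curve.
plt.plot(episode_rewards)
plt.xlabel("episode")
plt.ylabel("total reward")
plt.title("learning curve")
plt.show()
```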
These are complete and functional, but they are not recommended for building high-quality RL bots.