#

ppo

Here are 624 public repositories matching this topic...

marioyc / learning-to-run

Learning to Run NIPS 2017 Competition

machine-learning reinforcement-learning tensorflow continuous-control trpo ppo

Updated Aug 18, 2017
Python

ChenglongChen / pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

reinforcement-learning deep-reinforcement-learning pytorch multi-agent dqn rl deep-q-network ddpg drl actor-critic deep-deterministic-policy-gradient proximal-policy-optimization ppo advantage-actor-critic a2c acktr madrl

Updated Nov 11, 2017
Python

pekaalto / sc2aibot

Implementing reinforcement-learning algorithms for pysc2 -environment

python reinforcement-learning tensorflow deepmind proximal-policy-optimization ppo starcraft2 a2c pysc2

Updated Dec 12, 2017
Python

mabirck / modular_DeepRL

Attempt to implement A2C and PPO algorithm with modular properties of Maxout and LWTA. # UNFINISHED AND FAILED

reinforcement-learning deep-learning deep-reinforcement-learning a3c maxout-networks ppo a2c lwta modular-training modular-networks

Updated Jan 11, 2018
Python

uidilr / ppo_tf

Implementation of proximal policy optimization(PPO) with tensorflow

machine-learning reinforcement-learning tensorflow deep-reinforcement-learning policy-gradient ppo

Updated Feb 10, 2018
Python

DartML / PPO-Stein-Control-Variate

Proximal Policy Optimization with Stein Control Variates:

reinforcement-learning ppo

Updated Feb 12, 2018
Python

menondj / RLearningUnity3D

Multiple Reinforcement learning techniques on 3x3 TicTacToe

machine-learning reinforcement-learning qlearning tensorflow tic-tac-toe unity3d artificial-intelligence hyperparameters tictactoe ppo ml-agents

Updated Feb 28, 2018
C#

araffin / pytorch_agents

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR). Python2 compatible (branch python2)

reinforcement-learning deep-learning python3 pytorch rl python2 ppo a2c acktr

Updated Mar 28, 2018
Python

avoroshilov / rl-selfplay

Simple reinforcement learning framework for selfplay experiments

reinforcement-learning policy-gradient actor-critic ppo self-play

Updated Apr 8, 2018
Python

lgvaz / rlbox

RLbox: Solving OpenAI Gym with TensorFlow

tensorflow deep-reinforcement-learning openai-gym dqn atari continuous-control mujoco deep-rl proximal-policy-optimization ppo

Updated Apr 19, 2018
Python

monoelh / deep-reinforcement-learning_DDQN_PPO_HER

MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.

game numpy deep-reinforcement-learning openai-gym deep-q-network ddqn prioritized-replay ppo advantage-actor-critic policy-network ddqn-framework mlp-framework hindsight-experience-replay

Updated May 24, 2018
Jupyter Notebook

huiwenzhang / rl-benchmark

simple and compact implementations of reinforcement learning benchmark algorithms

dqn reinforce actor-critic ppo

Updated Jun 9, 2018
Python

jimimvp / torch_rl

Reinforcement learning library for PyTorch.

reinforcement-learning artificial-intelligence neural-networks ddpg ddpg-algorithm ppo hindsight-experience-replay hindsight-policy-gradients torch-rl spiking-networks

Updated Jun 15, 2018
Python

cedrickchee / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

reinforcement-learning deep-learning algorithms openai-gym policy policy-gradient machine-learning-engineering trpo proximal-policy-optimization ppo self-play dota2-bot openai-five

Updated Jun 20, 2018
Python

zbgzbg2007 / Machine-Learning

Projects or classes about ML

machine-learning deep-learning deep-reinforcement-learning pytorch dqn a3c ppo atari-game-enduro

Updated Jun 25, 2018
Jupyter Notebook

chagmgang / pysc2_rl

deep-learning deep-q-learning proximal-policy-optimization ppo advantage-actor-critic a2c pysc2 pysc2-mini-games reinfrocement-learning

Updated Jul 14, 2018
Python

sangyongjeong111 / gail_ppo

wgan ppo gail cocob

Updated Jul 30, 2018
Python

TianhongDai / distributed-ppo

This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).

pytorch reinforcement-learning-algorithms multiprocess proximal-policy-optimization ppo

Updated Jul 30, 2018
Python

zxgineng / deeprl

小时候练手的rl项目

reinforcement-learning tensorflow dqn policy-gradient a3c ddpg double-dqn prioritized-replay dueling-dqn ppo

Updated Aug 6, 2018
Python

qqadssp / PPO-Pytorch

Minimal implementation of PPO, running in Mujoco env, using Gym-mujoco

reinforcement-learning pytorch policy-gradient ppo

Updated Aug 17, 2018
Python

Improve this page

Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."