off-policy

Here are 39 public repositories matching this topic...

lionelblonde / sam-pytorch-complete-history

PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Aug 9, 2021
Python

lionelblonde / liayn-pytorch-complete-history

PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Apr 19, 2022
Python

lionelblonde / sam-tf-complete-history

TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

tensorflow gan imitation-learning gail off-policy reinfrocement-learning

Updated Mar 8, 2019
Python

Puneet2000 / Agent-DOoM

An RL agent that learns to play Doom's Deadly Corridor scenario, based on DDQN and PER.

reinforcement-learning q-learning deep-q-learning dueling-network-architecture pytorch-implmention prioritized-experience-replay off-policy experience-replay fixed-q-targets

Updated Dec 21, 2018
Python

raja-grewal / rlmd

PROJECT MIGRATED TO CODEBERG - Reinforcement Learning in Multiplicative Domains

Updated Sep 26, 2023

lionelblonde / giwr-pytorch

PyTorch implementation of our work: "Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning"

reinforcement-learning offline pytorch imitation-learning off-policy

Updated May 27, 2024
Python

lionelblonde / giwr-pytorch-complete-history

PyTorch implementation of our work: "Where is the Grass Greener? Revisiting Generalized Policy Iteration for Offline Reinforcement Learning"

reinforcement-learning offline pytorch imitation-learning off-policy

Updated May 27, 2024
Python

mabirck / CS294-DeepRL

My coursework for the CS294 Deep Reinforcement Learning course, taught by Sergey Levine at UC Berkeley.

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning pytorch neural-networks policy-gradient reinforcement pytorch-tutorials cs294 on-policy off-policy

Updated Jan 15, 2018
Python

baturaysaglam / DASE

Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms

deep-reinforcement-learning actor-critic off-policy experience-replay multi-agent-reinforcement-learning

Updated Aug 11, 2022
Python

baturaysaglam / Q-Error-Exploration

An Optimistic Approach to the Q-Network Error in Actor-Critic Methods

deep-reinforcement-learning actor-critic off-policy experience-replay exploration-exploitation

Updated Jun 23, 2022
Python

SaminYeasar / PyTorch-implementation-DICE-algorithms

PyTorch-implementation-DICE-algorithms

pytorch rl imitation-learning off-policy algeadice valuedice

Updated Sep 24, 2020
Python

fardinabbasi / Tabulated_RL

Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili

q-learning mdp grid-world sarsa markov-decision-processes value-iteration tree-backup on-policy off-policy

Updated Mar 22, 2024

NUS-LID / RENAULT

Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning

deep-learning deep-reinforcement-learning ensemble-learning deep-q-learning multi-task-learning deep-rl off-policy auxiliary-tasks model-free-rl data-efficient-learning

Updated Jul 2, 2021
Python

Kalyani011 / RL-Q_Learning_Implementation

Temporal Difference Method - Q-Learning Implementation for FrozenLake Grid Problem

reinforcement-learning q-learning temporal-differencing-learning off-policy value-based

Updated Apr 5, 2022
Jupyter Notebook

SaminYeasar / off_policy_ac

Contains PyTorch implementations of off-policy actor-critic algorithms.

reinforcement-learning pytorch ddpg sac actor-critic mujoco off-policy td3

Updated Aug 5, 2021
Python

HYDesmondLiu / RUBICON

A novel method for combining an existing policy (rule-based control) with reinforcement learning.

machine-learning reinforcement-learning deep-learning optimization deep-reinforcement-learning reinforcement-learning-algorithms optimal-control climate-change energy-efficiency thermal-comfort deterministic-policy-gradients actor-critic-algorithm off-policy hvac-control rule-based-controller

Updated May 10, 2023
Python

DjAzDeck / SPG

Sample Policy Gradient

learning algorithm control optimization deep policy continuous action reinforcement deterministic actor-critic model-free off-policy

Updated Oct 31, 2021
Python

baturaysaglam / SWTD3

Stochastic Weighted Twin Delayed Deep Deterministic Policy Gradient (SWTD3)

deep-reinforcement-learning reinforcement-learning-algorithms actor-critic off-policy

Updated Aug 11, 2022
Python

TheUnsolvedDev / ReinforcementLearning

Repository containing basic algorithms implemented in Python.

algorithm reinforcement-learning monte-carlo policy-evaluation policy-iteration bandit-algorithms on-policy off-policy

Updated Dec 3, 2023
Jupyter Notebook

cbanerji / Sample_efficient_RL.

Collection of code from my research on model-free RL algorithms.

ddpg off-policy td3 soft-actor-critic model-free-rl sample-efficient-rl

Updated Oct 4, 2022
Python
