PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
-
Updated
Aug 9, 2021 - Python
PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"
TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
A RL agent that learns to play doom's deadly corridor based on DDQN and PER.
PROJECT MIGRATED TO CODEBERG - Reinforcement Learning in Multiplicative Domains
PyTorch implementation of our work: "Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning"
PyTorch implementation of our work: "Where is the Grass Greener? Revisiting Generalized Policy Iteration for Offline Reinforcement Learning"
My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods
PyTorch-implementation-DICE-algorithms
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Temporal Difference Method - Q-Learning Implementation for FrozenLake Grid Problem
Contains PyTorch Implementation of the following off policy actor critic algorithms
A novel method to incorporate existing policy (Rule-based control) with Reinforcement Learning.
Sample Policy Gradient
Stochastic Weighted Twin Delayed Deep Deterministic Policy Gradient (SWTD3)
Repository containing basic algorithm applied in python.
Collection of codes pertaining to my research in model-free RL algorithms.
Add a description, image, and links to the off-policy topic page so that developers can more easily learn about it.
To associate your repository with the off-policy topic, visit your repo's landing page and select "manage topics."