Skip to content
@LAMDA-RL

LAMDA-RL

We are a fork of reinforcement learning researchers from LAMDA Group @ Nanjing University.

LAMDA-RL Lab

LAMDA-RL Lab is at the forefront of advancing the field of reinforcement learning and its application to creating general decision-making intelligence, by pushing the boundaries of what's possible with RL techniques.

We focus on developing novel algorithms and architectures that enable RL systems to learn and make decisions in increasingly general and adaptable ways. Some key areas we are exploring include:

  • Imitation learning;
  • Offline reinforcement learning;
  • Model-based RL and world model learning;
  • Multi-agent and collaborative RL;
  • Planning and learning with large models.

Through both fundamental and application research, our aim is to create RL-based systems that exhibit truly intelligent and general decision-making capabilities. For more information about our lab and research, please refer to our website https://lamda-rl.nju.edu.cn/.

Pinned Loading

  1. OfflineRL-Lib OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 72 7

  2. ODIS ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 40 6

  3. PRDC PRDC Public

    Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 18 3

  4. ACT ACT Public

    Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)

    Python 13 3

  5. Pretrained_BWArea_2.7B_30G Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 9

  6. CPR CPR Public

    Forked from LyndonKong/CPR

    Python 3

Repositories

Showing 10 of 36 repositories
  • RIMRO Public

    A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.

    LAMDA-RL/RIMRO’s past year of commit activity
    Python 0 0 0 0 Updated Apr 3, 2025
  • CoLA Public
    LAMDA-RL/CoLA’s past year of commit activity
    Python 4 0 0 0 Updated Mar 26, 2025
  • GMAIL Public Forked from chaoningjing/GMAIL

    Author's official implementation of TPAMI paper "Generalizable Multi-modal Adversarial Imitation Learning for Non-stationary Dynamics"

    LAMDA-RL/GMAIL’s past year of commit activity
    Python 0 1 0 0 Updated Mar 14, 2025
  • OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    LAMDA-RL/OfflineRL-Lib’s past year of commit activity
    Python 72 MIT 7 1 2 Updated Mar 4, 2025
  • Q-Adapter Public Forked from mansicer/Q-Adapter

    Author's implementation of ICLR'25 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    LAMDA-RL/Q-Adapter’s past year of commit activity
    Python 1 Apache-2.0 1 0 0 Updated Feb 28, 2025
  • DORA Public Forked from Xinyuz26/DORA

    Code for ICML'24 paper "Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics"

    LAMDA-RL/DORA’s past year of commit activity
    Python 0 1 0 0 Updated Feb 19, 2025
  • ADMPO Public Forked from HxLyn3/ADMPO

    Any-step Dynamics Model for Policy Optimization

    LAMDA-RL/ADMPO’s past year of commit activity
    Python 4 MIT 5 0 0 Updated Feb 1, 2025
  • WiseRL Public Forked from typoverflow/WiseRL

    PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

    LAMDA-RL/WiseRL’s past year of commit activity
    Python 1 MIT 2 0 0 Updated Dec 6, 2024
  • PRDC Public Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    LAMDA-RL/PRDC’s past year of commit activity
    Python 18 6 0 0 Updated Nov 8, 2024
  • ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    LAMDA-RL/ODIS’s past year of commit activity
    Python 40 Apache-2.0 6 2 0 Updated Oct 31, 2024

Top languages

Loading…

Most used topics

Loading…