PhD in Reinforcement Learning, LLM Alignment, RLHF
- University of Cambridge
- https://holarissun.github.io/
- @HolarisSun
Pinned repositories
- Prompt-OIRL (Public): code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
- RewardModelingBeyondBradleyTerry (Public): official implementation of the ICLR 2025 paper "Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives"
- RewardShifting (Public): code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL"
- embedding-based-llm-alignment (Public): codebase for the paper "Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs"
- Accountable-Offline-RL (Public): code for the NeurIPS 2023 paper "Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples"