shengpu-tang

Follow

Shengpu Tang shengpu-tang

Follow

PhD Candidate, UM-CSE

31 followers · 39 following

University of Michigan
Ann Arbor, MI
shengpu-tang.me
@shengpu_tang

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Highlights

Pro

Organizations

Block or Report

Block or report shengpu-tang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Pinned

MLD3/CounterfactualAnnot-SemiOPE MLD3/CounterfactualAnnot-SemiOPE Public

[NeurIPS 2023] Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation. https://arxiv.org/abs/2310.17146

Jupyter Notebook 1 1
MLD3/OfflineRL_FactoredActions MLD3/OfflineRL_FactoredActions Public

[NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare.

Jupyter Notebook 7
MLD3/OfflineRL_ModelSelection MLD3/OfflineRL_ModelSelection Public

[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003

Jupyter Notebook 8 5
MLD3/RL-Set-Valued-Policy MLD3/RL-Set-Valued-Policy Public

[ICML 2020] Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies. https://arxiv.org/abs/2007.12678, https://icml.cc/virtual/2020/poster/5797

Jupyter Notebook 15 3
MLD3/FIDDLE MLD3/FIDDLE Public

FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algorithms. https://doi.org/10.1093/jamia/ocaa139

Jupyter Notebook 81 16
microsoft/rl-offline-simulation microsoft/rl-offline-simulation Public

Data-driven offline simulation for online reinforcement learning: benchmark and baselines

Python 25 6