Skip to content
View BillChan226's full-sized avatar
🐝
learning
🐝
learning

Highlights

  • Pro

Organizations

@AI-secure

Block or report BillChan226

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BillChan226/README.md

Hi there, I'm Zhaorun Personal Website 👋

Connect with me:

HZ HZ | GoogleScholar HZ | Twitter


🏖️ My Research Interests

  • Trustworthy deployment and safe interactions with large foundation models and agents from both a theoretical and empirical perspective.
  • enhancing LLM's trustworthiness via retrieval-augmented generation (RAG) and robustness certificates for hallucination, alignment, jailbreaks and privacy.

GitHub stats Language Stats

Pinned Loading

  1. SafeWatch Public

    [ICLR 2025] Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations"

    Python 28

  2. AI-secure/AgentPoison Public

    [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

    Python 102 13

  3. HALC Public

    [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

    Python 85 1

  4. MJ-Bench/MJ-Bench Public

    Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

    Jupyter Notebook 44 5

  5. AI-secure/MMDT Public

    Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models

    Jupyter Notebook 12 2

398 contributions in the last year

Contribution Graph
Day of Week March April May June July August September October November December January February March
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More

Activity overview

Contributed to AI-secure/AgentPoison, BillChan226/MJ-Bench, MJ-Bench/MJ-Bench.github.io and 16 other repositories
Loading A graph representing BillChan226's contributions from March 17, 2024 to March 20, 2025. The contributions are 97% commits, 3% issues, 0% pull requests, 0% code review.

Contribution activity

March 2025

Created 2 commits in 2 repositories
Created 1 repository
4 contributions in private repositories Mar 9 – Mar 11
Loading