Skip to content
View StefanHeng's full-sized avatar

Highlights

  • Pro

Block or report StefanHeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Python implementation of John Gruber’s Markdown with Extension support.

Python 3,961 873 Updated Apr 21, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,082 58 Updated Feb 25, 2025

This repo implements the Adam + SPD (selective projection decay) regularization.

Python 9 Updated Oct 20, 2024

The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.

Jupyter Notebook 43 2 Updated Aug 28, 2024

Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting

Jupyter Notebook 3 Updated Sep 14, 2024

A python package to simulate typographical errors.

Python 34 5 Updated Dec 12, 2023

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,985 116 Updated Jun 1, 2023

An extremely fast Python package and project manager, written in Rust.

Rust 51,303 1,446 Updated Apr 25, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,650 1,471 Updated Apr 24, 2025

Aligning pretrained language models with instruction data generated by themselves.

Python 4,353 504 Updated Mar 27, 2023

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 14,263 1,022 Updated Mar 17, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 12,599 406 Updated Apr 18, 2025

List of AI Residency Programs

3,156 271 Updated Apr 4, 2025

Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective

Python 29 3 Updated Jan 31, 2025
Jupyter Notebook 45 8 Updated Apr 25, 2025

Yet another alternative curriculum vitae/résumé class with LaTeX

TeX 1,380 349 Updated Dec 16, 2024

Skywork Reward Model Series

10 1 Updated Sep 6, 2024

Tomorrow Theme

CSS 13,854 3,147 Updated Jul 9, 2022

A Terminal theme that mimics the One Dark theme made by the Atom team.

1,062 189 Updated Aug 25, 2020

A Python module for creating Excel XLSX files.

Python 3,756 643 Updated Apr 24, 2025

RewardBench: the first evaluation tool for reward models.

Python 559 66 Updated Feb 27, 2025

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 51,826 1,827 Updated Mar 30, 2025

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

C++ 316 21 Updated Apr 11, 2025

Python composable command line interface toolkit

Python 16,287 1,427 Updated Apr 24, 2025

Simple cross-platform colored terminal text in Python

Python 3,658 261 Updated Mar 14, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,955 246 Updated Apr 14, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,338 411 Updated Sep 13, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,462 4,699 Updated Apr 12, 2025

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 935 83 Updated Oct 22, 2024
Next
Showing results