Skip to content
View jayfeather9's full-sized avatar

Block or report jayfeather9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GeoPort: Your Location, Anywhere! The iOS location simulator

HTML 815 59 Updated Jan 28, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,514 544 Updated Apr 27, 2025

An experimental patch for Cursor to force different machine ids.

Python 801 109 Updated Apr 18, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 46,648 6,600 Updated Apr 20, 2025

北京航空航天大学 BUAA LaTeX Beamer 非官方主题

TeX 29 2 Updated Mar 31, 2025

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,427 2,285 Updated Mar 13, 2025

Your filesystem as a dungeon!

Rust 1,610 35 Updated Jan 13, 2025

A large-scale simulation framework for LLM inference

Python 368 64 Updated Nov 19, 2024

Test the GPU bandwidth of collectives operators like all-reduce, all-gather, broadcast and all-to-all primitives on single-node multi-GPU (2, 4, 8 cards) and multi-node multi-GPU (16 cards) setups,…

Python 2 Updated Oct 21, 2024

Source codes for paper "MACRec: A Multi-Agent Collaboration Framework for Recommendation" at SIGIR 2024

Python 70 5 Updated Nov 14, 2024

HLS-based Graph Processing Framework on FPGAs

C++ 144 32 Updated Oct 11, 2022

Curated collection of papers in MoE model inference

152 6 Updated Feb 19, 2025

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 79 4 Updated Apr 14, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 41,240 5,868 Updated Apr 26, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 106,439 17,299 Updated Apr 26, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 55,030 6,537 Updated Mar 31, 2025

[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable

Python 154 8 Updated Sep 21, 2024

Code for paper: Long cOntext aliGnment via efficient preference Optimization

Jupyter Notebook 13 Updated Feb 17, 2025

Zero Bubble Pipeline Parallelism

Python 385 24 Updated Apr 7, 2025

An implementation of differential dataflow using timely dataflow on Rust.

Rust 2,684 188 Updated Apr 18, 2025

A modular implementation of timely dataflow in Rust

Rust 3,421 281 Updated Mar 28, 2025
HTML 196 33 Updated Jan 2, 2025

A lecture notes template in Typst.

Typst 58 1 Updated Jan 4, 2025

A simple note template in Typst.

Typst 41 7 Updated Apr 4, 2025

Low-bit LLM inference on CPU with lookup table

C++ 752 59 Updated Apr 22, 2025

如何将ChatGPT调教成一只猫娘

3,004 164 Updated Jul 18, 2023

Blazingly fast LLM inference.

Rust 5,455 393 Updated Apr 25, 2025
C++ 1 Updated Jul 3, 2024
Next
Showing results