Stars
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
A Datacenter Scale Distributed Inference Serving Framework
Cost-efficient and pluggable Infrastructure components for GenAI inference
Smart glasses OS, with dozens of built-in apps. Users get AI assistant, notifications, translation, screen mirror, captions, and more. Devs get to write 1 app that runs on any pair of smart glasses.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A language for constraint-guided and efficient LLM programming.
SGLang is a fast serving framework for large language models and vision language models.
An extremely fast Python package and project manager, written in Rust.
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
aider is AI pair programming in your terminal
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…
Copy a bunch of files into your clipboard to provide context for LLMs
Social networking technology created by Bluesky
It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
🗜️ Codebase-digest is your AI-friendly codebase packer and analyzer. Features 60+ coding prompts and generates structured overviews with metrics. Ideal for feeding projects to LLMs like GPT-4, Clau…
Whisper real-time streaming for long speech-to-text transcription and translation
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
A comprehensive repository of reasoning tasks for LLMs (and beyond)
Machine Learning Engineering Open Book
Tile primitives for speedy kernels
Flash Attention in raw CUDA C beating PyTorch