Skip to content
View tugot17's full-sized avatar

Block or report tugot17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FlashAttention (Metal Port)

Swift 465 23 Updated Sep 22, 2024

AMD ROCm™ Software - GitHub Home

Shell 5,145 420 Updated Mar 28, 2025

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Rust 3,969 163 Updated Mar 24, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,414 240 Updated Mar 31, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,351 312 Updated Mar 29, 2025

Smart glasses OS, with dozens of built-in apps. Users get AI assistant, notifications, translation, screen mirror, captions, and more. Devs get to write 1 app that runs on any pair of smart glases.

Java 403 43 Updated Mar 31, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 86,416 10,640 Updated Mar 31, 2025

A language for constraint-guided and efficient LLM programming.

Python 3,873 206 Updated Jun 3, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 12,686 1,394 Updated Mar 31, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 47,229 1,324 Updated Mar 31, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,186 142 Updated Mar 31, 2025

aider is AI pair programming in your terminal

Python 30,306 2,743 Updated Mar 31, 2025

A Lightweight Recommendation System

Python 8,726 671 Updated Nov 8, 2023

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 318 20 Updated Mar 10, 2025

This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…

309 26 Updated Feb 22, 2025

Copy a bunch of files into your clipboard to provide context for LLMs

Go 106 5 Updated Jan 24, 2025

Social networking technology created by Bluesky

TypeScript 8,344 660 Updated Mar 31, 2025

It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.

211 33 Updated Jun 4, 2024

A TUTORIAL ON POINTERS AND ARRAYS IN C

HTML 1,038 85 Updated Sep 2, 2023

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,870 282 Updated Dec 21, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,771 124 Updated Dec 6, 2024

🗜️ Codebase-digest is your AI-friendly codebase packer and analyzer. Features 60+ coding prompts and generates structured overviews with metrics. Ideal for feeding projects to LLMs like GPT-4, Clau…

Python 199 20 Updated Oct 21, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,670 326 Updated Jan 7, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 7,311 599 Updated Mar 31, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 426 48 Updated Sep 27, 2024

System 2 Reasoning Link Collection

818 70 Updated Mar 16, 2025

Machine Learning Engineering Open Book

Python 13,281 807 Updated Mar 30, 2025

Tile primitives for speedy kernels

Cuda 2,200 133 Updated Mar 31, 2025

Go ahead and axolotl questions

Python 8,976 985 Updated Mar 31, 2025

Flash Attention in raw Cuda C beating PyTorch

Cuda 20 2 Updated May 14, 2024
Next
Showing results