Skip to content
Change the repository type filter

All

    Repositories list

    • llama-recipes

      Public archive
      Examples and recipes for Llama 2 model
      Jupyter Notebook
      2.7k100Updated Jul 20, 2025Jul 20, 2025
    • Configuration for generating SDKs and Documentation.
      MDX
      5206Updated Oct 7, 2024Oct 7, 2024
    • Homebrew Tap of OctoML products and tools.
      Ruby
      Apache License 2.0
      0000Updated Sep 26, 2024Sep 26, 2024
    • EAGLE

      Public
      OctoML Implementation of EAGLE-1 and EAGLE-2
      Python
      Apache License 2.0
      281100Updated Sep 12, 2024Sep 12, 2024
    • A collection of reference solutions built on top of OctoAI SaaS
      Python
      MIT License
      0000Updated Sep 11, 2024Sep 11, 2024
    • Simple getting-started code examples for LLM applications powered by OctoAI
      Python
      MIT License
      214910Updated Sep 10, 2024Sep 10, 2024
    • mlc-llm

      Public
      Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
      Python
      Apache License 2.0
      2.1k5121Updated Sep 10, 2024Sep 10, 2024
    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      Apache License 2.0
      993200Updated Sep 9, 2024Sep 9, 2024
    • Jupyter Notebook
      0000Updated Sep 8, 2024Sep 8, 2024
    • Multicloud Asset Code Review Public Repo example.
      Python
      MIT License
      0001Updated Sep 5, 2024Sep 5, 2024
    • Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
      HTML
      Apache License 2.0
      1.2k001Updated Aug 21, 2024Aug 21, 2024
    • msi-fe

      Public
      Python
      0000Updated Aug 14, 2024Aug 14, 2024
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      MIT License
      3.3k000Updated Aug 12, 2024Aug 12, 2024
    • .github

      Public
      0101Updated Aug 2, 2024Aug 2, 2024
    • Python
      0000Updated Jul 31, 2024Jul 31, 2024
    • RULER

      Public
      This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
      Python
      Apache License 2.0
      127000Updated Jul 25, 2024Jul 25, 2024
    • TypeScript
      0000Updated Jun 21, 2024Jun 21, 2024
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      23k000Updated May 17, 2024May 17, 2024
    • Custom dyld version inherited from original Apple dyld implementation
      C++
      Other
      22300Updated Apr 27, 2024Apr 27, 2024
    • TypeScript
      0000Updated Mar 8, 2024Mar 8, 2024
    • A collection of OctoAI-based demos.
      TypeScript
      0511Updated Mar 5, 2024Mar 5, 2024
    • TFLint ruleset for terraform-provider-google
      Go
      Mozilla Public License 2.0
      22000Updated Feb 23, 2024Feb 23, 2024
    • Authentication server for Docker Registry 2
      Go
      Apache License 2.0
      312000Updated Feb 5, 2024Feb 5, 2024
    • go-jose

      Public
      An implementation of JOSE standards (JWE, JWS, JWT) in Go
      Go
      Apache License 2.0
      118000Updated Feb 5, 2024Feb 5, 2024
    • Pinecone + Vercel RAG application, showcasing a comparison between chat with no context and using a Pinecone index for context
      HTML
      26000Updated Jan 25, 2024Jan 25, 2024
    • A set of models you can build and deploy on octoai
      Python
      MIT License
      1000Updated Jan 19, 2024Jan 19, 2024
    • pre-commit hook which runs kustomize docker image (use with https://github.com/pre-commit/pre-commit)
      Dockerfile
      18200Updated Jan 4, 2024Jan 4, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optim…
      C++
      Apache License 2.0
      2.4k000Updated Jan 3, 2024Jan 3, 2024
    • go-oidc

      Public
      A Go OpenID Connect client.
      Go
      Apache License 2.0
      428000Updated Dec 27, 2023Dec 27, 2023
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      17k100Updated Dec 14, 2023Dec 14, 2023
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.