Skip to content
Change the repository type filter

All

    Repositories list

    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      9729.5k11324Updated Dec 13, 2025Dec 13, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      7736k426Updated Dec 8, 2025Dec 8, 2025
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      1k8.8k14729Updated Dec 5, 2025Dec 5, 2025
    • DeepSeek-Math-V2

      Public
      Python
      1151.5k160Updated Dec 1, 2025Dec 1, 2025
    • LPLB

      Public
      An early research stage expert-parallel load balancer for MoE models based on linear programming.
      Python
      2545800Updated Nov 19, 2025Nov 19, 2025
    • DeepSeek-V3.2-Exp

      Public
      Python
      1101.4k175Updated Nov 18, 2025Nov 18, 2025
    • awesome-deepseek-coder

      Public
      A curated list of open-source projects related to DeepSeek Coder
      20473700Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-Coder

      Public
      DeepSeek Coder: Let the Code Write Itself
      Python
      2.7k22k12225Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-Coder-V2

      Public
      DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      1k6.3k615Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-OCR

      Public
      Contexts Optical Compression
      Python
      1.9k21k23035Updated Oct 25, 2025Oct 25, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      91412k526Updated Sep 30, 2025Sep 30, 2025
    • awesome-deepseek-integration

      Public
      Integrate the DeepSeek API into popular softwares
      3.9k35k9745Updated Sep 25, 2025Sep 25, 2025
    • DeepSeek-V3

      Public
      Python
      16k101k2845Updated Aug 28, 2025Aug 28, 2025
    • DeepSeek-Prover-V2

      Public
      931.2k102Updated Jul 18, 2025Jul 18, 2025
    • DeepSeek-R1

      Public
      12k92k1227Updated Jun 27, 2025Jun 27, 2025
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      26071550Updated May 22, 2025May 22, 2025
    • open-infra-index

      Public
      Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      2887.9k00Updated May 15, 2025May 15, 2025
    • DreamCraft3D

      Public
      [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      3583k340Updated Apr 22, 2025Apr 22, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      1951.3k81Updated Mar 24, 2025Mar 24, 2025
    • profile-data

      Public
      Analyze computation-communication overlap in V3/R1.
      1441.1k110Updated Mar 21, 2025Mar 21, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
      Python
      3102.9k41Updated Mar 10, 2025Mar 10, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      4324.9k226Updated Mar 5, 2025Mar 5, 2025
    • DeepSeek-VL2

      Public
      DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      1.8k5.1k10117Updated Feb 26, 2025Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      2.2k18k15621Updated Feb 1, 2025Feb 1, 2025
    • DeepSeek-V2

      Public
      DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      5315k793Updated Sep 25, 2024Sep 25, 2024
    • DeepSeek-Prover-V1.5

      Public
      Python
      23154780Updated Aug 16, 2024Aug 16, 2024
    • DeepSeek-VL

      Public
      DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      5814k442Updated Apr 24, 2024Apr 24, 2024
    • DeepSeek-Math

      Public
      DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      5573.1k342Updated Apr 15, 2024Apr 15, 2024
    • DeepSeek-LLM

      Public
      DeepSeek LLM: Let there be answers
      Makefile
      1k6.7k402Updated Feb 4, 2024Feb 4, 2024
    • DeepSeek-MoE

      Public
      DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      2951.9k174Updated Jan 16, 2024Jan 16, 2024