    Repositories list

    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      1.6k1700 Updated Mar 30, 2025
    • csm

      Public
      A Conversational Speech Generation Model
      Python
      Apache License 2.0
      1.2k13k403 Updated Mar 27, 2025
    • Python
      MIT License
      261500 Updated Mar 17, 2025
    • Faster Whisper with additional features
      Python
      MIT License
      1.3k4301 Updated Mar 10, 2025
    • wavtools

      Public
      Record and stream WAV audio data in the browser across all platforms
      JavaScript
      MIT License
      263000 Updated Jan 28, 2025
    • moshi

      Public
      Python
      Apache License 2.0
      6711900 Updated Jan 8, 2025
    • whisperX

      Public
      WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
      Python
      BSD 2-Clause "Simplified" License
      1.6k5100 Updated Oct 25, 2024
    • Silero VAD: pre-trained enterprise-grade Voice Activity Detector
      Python
      MIT License
      545400 Updated Jun 27, 2024
    • gpt-fast

      Public
      Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
      Python
      BSD 3-Clause "New" or "Revised" License
      5511200 Updated Apr 30, 2024