Roadmap

spetrel edited this page Jan 22, 2025 · 1 revision
  • Faster decoding with EAGLE-2.
  • Optimize decoding attention speed on A100/A800 GPUs.
  • Support more multimodal models.