-
-
Notifications
You must be signed in to change notification settings - Fork 134
Issues: PygmalionAI/aphrodite-engine
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug]: FP8 KV Cache FLASHINFER AssertionError
bug
Something isn't working
#967
opened Dec 23, 2024 by
ScOut3R
[Bug]: Docker instance doesn't download model (affects VLLM as well)
bug
Something isn't working
#911
opened Dec 17, 2024 by
selalipop
[Misc]: should we be using flashinfer for CUDA 12.1 or 12.4?
#886
opened Dec 12, 2024 by
BlairSadewitz
[Bug]: Error at Custom KoboldAI Endpoint! The custom endpoint failed to respond correctly. You may wish to try a different URL or API type.
bug
Something isn't working
#883
opened Dec 12, 2024 by
baditaflorin
[Bug]: ModuleNotFoundError: No module named 'ray'
bug
Something isn't working
#854
opened Dec 2, 2024 by
gizbo
[Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request
bug
Something isn't working
#853
opened Dec 2, 2024 by
Nero10578
[Bug]: loading a GPTQ-INT4 model on windows with a P40
bug
Something isn't working
#847
opened Nov 27, 2024 by
sorasoras
[Feature]: pass-through parameter from request to model.forward (already implemented)
#836
opened Nov 25, 2024 by
qpwo
[Tracker]: Passing all unit tests
help wanted
Extra attention is needed
#820
opened Nov 17, 2024 by
AlpinDale
100+
[Installation]: Cannot find CUDA_TOOLKIT_ROOT_DIR while trying to build for ROCm
#815
opened Nov 14, 2024 by
RuntimeRacer
[Bug]: 0.6.3.post1 regression: RuntimeError during mem profiling on Mistral Large AWQ with Something isn't working
-q awq_marlin
bug
#809
opened Nov 5, 2024 by
khanonnie
[Bug]: .\gguf_to_torch.py broken along with direct load GGUF
bug
Something isn't working
#804
opened Nov 3, 2024 by
sorasoras
[Bug]: unable to load 14B Qwen2.5 GGUF with newest version (0.6.2.post1)
bug
Something isn't working
#789
opened Oct 23, 2024 by
NeoChen1024
[Bug]: Several errors when deploying GGUF models
bug
Something isn't working
#786
opened Oct 21, 2024 by
musoles
[Bug]: Impossible dependency requirement with GGUF
bug
Something isn't working
#783
opened Oct 19, 2024 by
musoles
[Bug]: Metrics incorrect when having zero throughput
bug
Something isn't working
#782
opened Oct 18, 2024 by
mrseeker
[Bug]: Llama3 VocabParallelEmbedding error when loading
bug
Something isn't working
#781
opened Oct 18, 2024 by
gelim
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.