-
Notifications
You must be signed in to change notification settings - Fork 96
Pull requests: vllm-project/compressed-tensors
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[KV cache] Delegate structural reads to the wrapped Cache in QuantizedKVCache
#747
opened Jun 23, 2026 by
jethac
Loading…
Add
force_local_cache context manager for independent per-rank model loading
#742
opened Jun 22, 2026 by
yiliu30
Contributor
Loading…
[Bugfix] Fix cast_to_fp4 per-rank torch.compile recompilation (#734)
#741
opened Jun 20, 2026 by
Yatimai
Contributor
Loading…
Add XPU support to Buildkite pipeline and scripts
#736
opened Jun 16, 2026 by
chensuyue
Contributor
Loading…
Add GemmaConverter for gemma-QAT checkpoints
documentation
Improvements or additions to documentation
#727
opened Jun 6, 2026 by
mgoin
Member
Loading…
Add NVFP4 tensor-block quantization strategy
documentation
Improvements or additions to documentation
#721
opened Jun 2, 2026 by
GillchLee
Loading…
Add example for converting GSQ/Humming format checkpoints
documentation
Improvements or additions to documentation
#716
opened May 27, 2026 by
mgoin
Member
Loading…
Update documentation link for compression formats
#713
opened May 21, 2026 by
dsikka
Collaborator
Loading…
Treat imatrix weight observer as calibration data dependent
needs-rebase
#710
opened May 19, 2026 by
dshane1903
Loading…
[Offload] [1/2] Disambiguate synchronous
__setitem__/offload from update_offload
#709
opened May 18, 2026 by
kylesayrs
Collaborator
Loading…
[Offload] [2/2] Better abstractions for updating parameters
#703
opened May 10, 2026 by
kylesayrs
Collaborator
Loading…
[Offload] use weakref.finalize to handle shared tensor deletion
#698
opened May 1, 2026 by
dichn
Loading…
[match] optimized match_named_modules
#697
opened Apr 29, 2026 by
brian-dellabetta
Collaborator
Loading…
[Bugfix] Do not throw "unexpected non-targeted tensor" validation error for ignored tensors
#692
opened Apr 28, 2026 by
kylesayrs
Collaborator
Loading…
[Converters] Support multiple converters
documentation
Improvements or additions to documentation
needs-rebase
#690
opened Apr 27, 2026 by
kylesayrs
Collaborator
Loading…
feat: validate shared memory segment limits before CPU offloading
#688
opened Apr 27, 2026 by
kylesayrs
Collaborator
Loading…
[offload] DiskCache.clean_offload_dir
needs-rebase
#678
opened Apr 10, 2026 by
brian-dellabetta
Collaborator
•
Draft
[QuantizationMetadata] update for missing qparams and module attributes
needs-rebase
#669
opened Apr 8, 2026 by
brian-dellabetta
Collaborator
Loading…
[Misc] [Offload] Update typehints, add docstrings
quality-failed
#622
opened Mar 7, 2026 by
kylesayrs
Collaborator
Loading…
[Offload] Add stats tracker for debugging/ performance tuning model device movement
documentation
Improvements or additions to documentation
needs-rebase
#615
opened Mar 4, 2026 by
kylesayrs
Collaborator
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-19.