Conversation

@zkl-ai (Contributor) commented Dec 14, 2025

Summary

  • Switch the quantize_and_pack_int4.ipynb example to the latest compressed-tensors workflow (see the sketch after this list):
    • Use ModelCompressor.compress_model(model) for in-memory compression
    • Remove the outdated compress_quantized_weights usage
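
For readers following along, here is a minimal sketch of what the switch looks like, not the notebook's exact code. It assumes `model` is a Hugging Face model whose weights already carry a compressed-tensors int4 quantization config; `from_pretrained_model` and `update_config` are taken from the compressed-tensors API as I understand it, and the save directory name is illustrative only.

```python
# Minimal sketch of the updated flow (assumptions noted in comments).
from compressed_tensors.compressors import ModelCompressor

# Assumption: `model` already has an int4 quantization config applied
# (e.g. via compressed_tensors.quantization.apply_quantization_config),
# and from_pretrained_model infers the compression format from it.
compressor = ModelCompressor.from_pretrained_model(model)

# New in-memory path: compresses and packs the quantized weights on the
# module itself, replacing the removed compress_quantized_weights step.
compressor.compress_model(model)

# The compressed model is then saved with the usual save flow; the
# compression metadata is written into the checkpoint's config.
model.save_pretrained("model-int4-packed")
compressor.update_config("model-int4-packed")
```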

Related Issue

…te save flow and docs; Fixes #2105

Signed-off-by: zkl-ai <[email protected]>
@zkl-ai force-pushed the feat/update-int4-notebook-compress-model-2105 branch from 03ec34b to 18697a2 on December 15, 2025 10:58
@kylesayrs (Collaborator) left a comment:

Really nice, thank you!

@fynnsu merged commit 797d301 into vllm-project:main on Dec 15, 2025; 3 checks passed.