## Background ## * Some users want to quantize models and encode using the compressed-tensors format, but do not necessarily want to use compressed-tensors primitives ## Proposed Changes ## * Add documentation for encoding a model using compressed tensors format using, for example, QAT
Background
Proposed Changes