LLM-FlexQuant a python library that lets you quantize any layer in your llm pip install llm-flexquant *Currently adding more models and debugging issues. Please open an issue to request open source models or to report bugs