Add QLoRA and FP8 to finetuning tutorial (part 2) #2542
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2542
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 5 Pending as of commit e6c8194 with merge base 2e2ce0b.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
self.weight.requires_grad_(False)
if self.bias is not None:
    self.bias.requires_grad_(False)
nf4_weight = to_nf4(self.weight, **quantization_kwargs)
Can we extend this to support any quantization? I just added an AOBaseTensorConfig that might be able to help: #2463
Ok, I'll add a note saying it's possible to extend this to other quantization schemes, but I want to keep this example NF4 since that's what's used in the QLoRA paper.
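As a side note for readers: here's a minimal sketch of how a frozen NF4 weight like the one in the snippet above typically pairs with trainable LoRA adapters. This is illustrative only, not the tutorial's code; the class name, rank/alpha defaults, and the use of torchao's `to_nf4`/`linear_nf4` helpers are assumptions for the example.

```python
import torch
import torch.nn as nn
from torchao.dtypes.nf4tensor import to_nf4, linear_nf4


class QLoRALinearSketch(nn.Module):
    """Illustrative only: frozen NF4 base weight plus trainable LoRA adapters."""

    def __init__(self, in_features: int, out_features: int, rank: int = 8, alpha: int = 16):
        super().__init__()
        base = nn.Linear(in_features, out_features, bias=False)
        base.weight.requires_grad_(False)  # freeze the base weight, as in the snippet above
        # Quantize the frozen weight to NF4 and keep it as a non-trainable parameter
        self.weight = nn.Parameter(to_nf4(base.weight), requires_grad=False)
        # Trainable low-rank adapters stay in the compute dtype
        self.lora_a = nn.Linear(in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)  # standard LoRA init: B starts at zero
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # linear_nf4 dequantizes the NF4 weight on the fly for the matmul
        return linear_nf4(x, self.weight) + self.scaling * self.lora_b(self.lora_a(x))
```

Only the LoRA adapter weights receive gradients; the NF4 base weight stays frozen and quantized, which is where QLoRA's memory savings come from.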
.. code::

    tune run lora_finetune_single_device --config llama3_2/3B_qlora_single_device.yaml
What about integration with the Hugging Face PEFT library?
Also, isn't tune deprecated?
Wow, I didn't realize we had an integration with PEFT; I don't think this was documented in any of our docs? Will add a note here.
For torchtune, I don't think there's a mature replacement from PyTorch yet, so I feel it's OK. Also, that's where most of our fine-tuning integrations live today.
Yeah: https://huggingface.co/docs/peft/en/developer_guides/quantization#torchao-pytorch-architecture-optimization. I'll test this path a bit soon as well.
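For anyone who wants to try that path outside torchtune, here's a rough sketch of what the PEFT + torchao flow looks like based on that doc page. Treat the checkpoint name, quantization settings, and target modules as placeholders, and double-check the current transformers/peft APIs before relying on this.

```python
import torch
from transformers import AutoModelForCausalLM, TorchAoConfig
from peft import LoraConfig, get_peft_model

# Quantize the base model's weights with torchao at load time (int4 weight-only as an example)
quant_config = TorchAoConfig("int4_weight_only", group_size=128)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-3B",  # placeholder checkpoint
    torch_dtype=torch.bfloat16,
    quantization_config=quant_config,
)

# Attach trainable LoRA adapters on top of the frozen, quantized base model
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```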
looks great, thanks!
Force-pushed from 6ccc6b1 to d04acdd (Compare)
This is part 2 of the end-to-end fine-tuning tutorial. Part 1 already covered QAT; this commit adds QLoRA and FP8. To preview, visit https://docs-preview.pytorch.org/pytorch/ao/2542/finetuning.html
Force-pushed from d04acdd to e6c8194 (Compare)