docs/fine-tuning-guide.md
7 additions & 14 deletions
@@ -19,7 +19,7 @@ A Demo Project:
 
 *\[2 mins]: [Define task, goals, and schema](fine-tuning-guide.md#step-1-define-your-task-and-goals)
 *\[9 mins]: [Synthetic data generation](synthetic-data-generation.md): create 920 high-quality examples for training
-*\[5 mins]: Dispatch 9 fine tuning jobs: [Fireworks](fine-tuning-guide.md#step-4-dispatch-training-jobs) (Llama 3.2 1b/3b/11b, Llama 3.1 8b/70b, Mixtral 8x7b), [OpenAI](fine-tuning-guide.md#step-4-dispatch-training-jobs) (GPT 4o, 4o-Mini), and [Unsloth](fine-tuning-guide.md#step-6-optional-training-on-your-own-infrastructure) (Llama 3.2 1b/3b). Note: since this guide was written we've added fine-tuning on Together.ai, allowing tuning additional models like Qwen 2.5 14B/72B.
+*\[5 mins]: Dispatch 9 fine tuning jobs: [Fireworks](fine-tuning-guide.md#step-4-dispatch-training-jobs) (Llama 3.2 1b/3b/11b, Llama 3.1 8b/70b, Mixtral 8x7b), [OpenAI](fine-tuning-guide.md#step-4-dispatch-training-jobs) (GPT 4o, 4o-Mini), and [Unsloth](fine-tuning-guide.md#step-6-optional-training-on-your-own-infrastructure) (Llama 3.2 1b/3b). Note: since this guide was written we've added over 60 new models for fine tuning!
 *\[2 mins]: [Deploy your new models and test they work](fine-tuning-guide.md#step-5-deploy-and-run-your-models)
 
 Analysis:
@@ -57,20 +57,13 @@ Synthetic Data Generation
 
 ### Step 3: Select Models to Fine Tune
 
-Kiln supports a wide range of models from our UI, including:
+Kiln supports over 60 fine-tuneable models using three different service-based tuning providers:
 
-* OpenAI:
-* GPT 4o
-* GPT 4o-Mini
-* Meta:
-* Llama 3.1 8b/70b
-* Llama 3.2 1b/3b
-* Together AI
-* Llama 3.1 8b/70b
-* Llama 3.2 1b/3b
-* Qwen2.5 14b/72b
+* OpenAI: GPT 4o and 4o-mini
+* Fireworks.ai: over 60 open weight models including Qwen 2.5, Llama 2/3.x, Deepseek V3/R1, QwQ, and more. See the [full list here](models-and-ai-providers.md#additional-fine-tuneable-models).
-For this demo we used all the models available at the time of writing.
+For this experiment we chose 9 models to experiment with.
 
 ### Step 4: Dispatch Training Jobs
 
@@ -110,7 +103,7 @@ If a Fireworks fine tune gives you the error \`Model not found, inaccessible, an
 
 Kiln can also export your dataset to common formats for fine tuning on your own infrastructure. Simply select one of the "Download" options when creating your fine tune, and use the exported JSONL file to train with your own tools.
 
-We currently recommend [Unsloth](https://github.com/unslothai/unsloth) and Axolotl. These platforms let you train almost any open model, including Gemma, Mistral, Llama, Qwen, Smol, and many more.
+We currently recommend [Unsloth](https://github.com/unslothai/unsloth) and Axolotl. These platforms let you train almost any open model, including Gemma, Mistral, Llama, Qwen, Smol, and [many more](https://docs.unsloth.ai/get-started/all-our-models).
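
Before handing an exported JSONL file to your own training tools, it can help to check that it parses and to hold out a validation slice. The following is a minimal sketch using only the Python standard library. The helper names `load_jsonl` and `split_train_val` are hypothetical and not part of Kiln, Unsloth, or Axolotl, and the sketch makes no assumption about the fields inside each record, since those depend on the "Download" format you selected.

```python
import json
from pathlib import Path


def load_jsonl(path):
    """Parse a JSONL file into a list of objects, skipping blank lines.

    Hypothetical helper: raises ValueError naming the first malformed line.
    """
    records = []
    with Path(path).open(encoding="utf-8") as f:
        for line_number, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as err:
                raise ValueError(
                    f"{path}:{line_number}: invalid JSON ({err})"
                ) from err
    return records


def split_train_val(records, val_fraction=0.1):
    """Hold out the last val_fraction of records as a validation set."""
    n_val = int(len(records) * val_fraction)
    if n_val == 0:
        # Too few records to hold any out; train on everything.
        return records, []
    return records[:-n_val], records[-n_val:]
```

For example, `train, val = split_train_val(load_jsonl("my_export.jsonl"))` (with `my_export.jsonl` standing in for whatever file you downloaded) gives two lists you can pass on to your own training script. The split takes the tail of the file without shuffling, so shuffle first if the export order is meaningful.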