
Commit 0b1f316

kavanasecw-tan authored and committed
Add 'Considerations for Fine-Tuning Training'
1 parent 044aa1f commit 0b1f316

File tree

1 file changed: +9 -0 lines changed


docs/guide/training-techniques/fine_tuning.md

Lines changed: 9 additions & 0 deletions
@@ -74,3 +74,12 @@ model:
```

See [Dataset Statistics](../configuration/data.md#dataset-statistics) for more details on configuring dataset statistics.
## Considerations for Fine-Tuning Training
There are a number of considerations and changes you may want to make to the training setup and hyperparameters when fine-tuning, rather than training from scratch. This is an active area of research within the field and the `NequIP` user base.
Key differences from training from scratch are:
- **Decrease the learning rate**: It is typically best to fine-tune a pre-trained model with a learning rate lower than the optimal one for from-scratch training (see the first sketch after this list).
- **Update energy shifts**: As discussed above, you will likely want to update the model's atomic energy shifts to match the settings (and thus the absolute energies) of your data, to ensure smooth fine-tuning (see the second sketch after this list).
- **Fixed model hyperparameters**: When fine-tuning, the architecture of the pre-trained model (number of layers, _l_-max, radial cutoff, etc.; e.g. models provided on [nequip.net](https://www.nequip.net/)) cannot be modified. When comparing the performance of fine-tuning and from-scratch training, it is advised to use the same model hyperparameters for an appropriate comparison.
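
The following is a minimal sketch of the first point, assuming a Hydra-style `NequIP` config where the optimizer is configured under `training_module`; the key paths and values here are illustrative assumptions, so check the configuration docs for your `NequIP` version:

```yaml
# Illustrative sketch only: key paths may differ between NequIP versions.
training_module:
  optimizer:
    _target_: torch.optim.Adam
    lr: 1.0e-4  # lowered relative to the from-scratch learning rate
```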
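
For the second point, a sketch of overriding the per-type energy shifts in the model block with values appropriate to the fine-tuning data; the `per_type_energy_shifts` key and the numbers are placeholders, and the correct values should come from your own dataset statistics (see the Dataset Statistics link above):

```yaml
# Illustrative sketch only: replace the placeholder values with shifts
# computed from YOUR dataset (e.g. isolated-atom reference energies at
# your level of theory), and confirm the key name for your version.
model:
  per_type_energy_shifts:
    C: -1029.8  # placeholder per-atom energy shift for carbon (eV)
    H: -13.6    # placeholder per-atom energy shift for hydrogen (eV)
```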
