(Mis)Fitting Scaling Laws: A Survey of Scaling Laws

This repository contains the data and code to reproduce the figures and tables in the paper "(Mis)Fitting Scaling Laws: A Survey of Scaling Law Fitting Techniques in Deep Learning", by Margaret Li, Sneha Kudugunta and Luke Zettlemoyer.

Training models

Model training code was hard-forked from open_lm and modified.

We've compiled all of the commands needed to download and preprocess data and to train models in one script:

bash scaling_laws/open_lm/misfitting/run_train_scale.sh

If you need to make modifications, refer to the commands within the script. Some strings in this repo must be replaced with values specific to your environment, e.g., wandb username, slurm account and partition, or conda environment, if relevant. These can be found by grepping for ## TODO: FILL IN.
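As a convenience, the search for placeholders can also be done from Python. The following is a minimal sketch, not part of this repository: the function name `find_placeholders` is hypothetical, and the only thing taken from the README is the marker string itself.

```python
from pathlib import Path

# Marker string quoted in this README; everything else here is illustrative.
PLACEHOLDER = "## TODO: FILL IN"


def find_placeholders(root, marker=PLACEHOLDER):
    """Return (path, line_number, line) for every line under root containing marker."""
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Skip directories and anything inside a .git folder.
        if not path.is_file() or ".git" in path.parts:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for number, line in enumerate(text.splitlines(), start=1):
            if marker in line:
                hits.append((str(path), number, line.strip()))
    return hits
```

Calling `find_placeholders(".")` from the repository root returns one tuple per line that still needs an environment-specific value, which can be printed or checked before launching a run.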

Configurations for all of the model architectures we use are listed in scaling_laws/open_lm/model_configs and named with the convention misfitting_{size}. Additional hyperparameters and training settings may be found in open_lm/constants/slurm_constants.py.

Scaling law fitting and plotting

All of the code used to fit scaling laws and generate the plots in the paper is in paper_analysis_and_plots.py. Parts of this code were adapted from the code released by Besiroglu et al. and by Porian et al.
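For readers new to the topic, here is a minimal sketch of what fitting a scaling law can look like. It is not taken from paper_analysis_and_plots.py. It assumes the simplest power-law form, loss = A * N^(-alpha) with no irreducible-loss term, and fits it by ordinary least squares in log-log space. The function name `fit_power_law` and the synthetic data points are hypothetical.

```python
import math


def fit_power_law(sizes, losses):
    """Fit loss = A * size**(-alpha) by least squares on (log size, log loss).

    Returns (A, alpha). Assumes all sizes and losses are positive.
    """
    xs = [math.log(n) for n in sizes]
    ys = [math.log(loss) for loss in losses]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals -alpha
    intercept = y_mean - slope * x_mean  # equals log(A)
    return math.exp(intercept), -slope


# Synthetic, noise-free points generated from A = 400, alpha = 0.3 (made-up values).
sizes = [1e7, 1e8, 1e9, 1e10]
losses = [400.0 * n ** -0.3 for n in sizes]
A, alpha = fit_power_law(sizes, losses)
print(f"A = {A:.1f}, alpha = {alpha:.3f}")
```

On noise-free data like this, the fit recovers the generating values up to floating-point error. Log-space linear regression is used here only for brevity; see paper_analysis_and_plots.py for the functional forms and fitting procedures actually used in the paper.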

Data

All of the data used in the analyses in our paper can be found in the data/ folder. This includes data taken from Besiroglu et al. and Porian et al., which we reproduce here for your convenience. Please be sure to cite them as well if you use their data.

Models

Models are available here, but the set is still being updated and a few models are currently missing.

Citation

@inproceedings{
li2025misfitting,
title={(Mis)Fitting Scaling Laws: A Survey of Scaling Law Fitting Techniques in Deep Learning},
author={Margaret Li and Sneha Kudugunta and Luke Zettlemoyer},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=xI71dsS3o4}
}
