[Proposal] Support early stopping in huggingface LLM wrapper #429

chanind · 2025-02-16T03:56:19Z

Proposal

We currently have a TODO in load_model.py#L98 to support the stop_at_layer param in HookedProxyLM for huggingface models. Adding support for this will save compute when we're just trying to extract LLM activations at a specific layer, since there's no need to extract activations at later layers.

An example of how to do this is in baukit/nethook.py . We just need to throw an exception after we process the layer we care about to stop processing, and then catch that exception before returning to the user.

Checklist

I have checked that there is no similar issue in the repo (required)

The text was updated successfully, but these errors were encountered:

chanind linked a pull request Feb 17, 2025 that will close this issue

perf: support early-stopping in HF models #430

Open

12 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Proposal] Support early stopping in huggingface LLM wrapper #429

[Proposal] Support early stopping in huggingface LLM wrapper #429

chanind commented Feb 16, 2025

[Proposal] Support early stopping in huggingface LLM wrapper #429

[Proposal] Support early stopping in huggingface LLM wrapper #429

Comments

chanind commented Feb 16, 2025

Proposal

Checklist