HF tokenizer_call_args #793

alexandrasouly-aisi · 2024-11-01T12:33:44Z

This PR contains:

[x ] New features
Changes to dev-tools e.g. CI config / github tooling
Docs
Bug fixes
Code refactor

I am trying to add strongreject to inspect evals, and some parts use a HF model with a small context length, so they use the tokeniser to truncate the input. I would like the tokenizer call to be more flexible take custom arguments.

What is the current behavior? (You can also link to an open issue here)

HF tokenizer is called with default parameters tokenizer(response, max_length=max_response_length, truncation=True)

What is the new behavior?

The API would take a tokenizer_call_args dict to be passed onto the tokenizer call.

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

No

Other information:

jjallaire-aisi · 2024-11-01T12:41:04Z

src/inspect_ai/model/_providers/hf.py

@@ -71,6 +71,9 @@ def collect_model_arg(name: str) -> Any | None:
        tokenizer_path = collect_model_arg("tokenizer_path")
        self.batch_size = collect_model_arg("batch_size")
        self.chat_template = collect_model_arg("chat_template")
+        self.tokenizer_call_args = collect_model_arg("tokenizer_call_args")


How about just tokenizer_args?

@alexandrasouly-aisi Are you okay w/ the change to tokenizer_args?

tokenizer_call_args

c38c4bb

alexandrasouly-aisi requested a review from jjallaire-aisi November 1, 2024 12:34

jjallaire-aisi reviewed Nov 1, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HF tokenizer_call_args #793

HF tokenizer_call_args #793

alexandrasouly-aisi commented Nov 1, 2024

jjallaire-aisi Nov 1, 2024

jjallaire Nov 7, 2024

HF tokenizer_call_args #793

Are you sure you want to change the base?

HF tokenizer_call_args #793

Conversation

alexandrasouly-aisi commented Nov 1, 2024

This PR contains:

What is the current behavior? (You can also link to an open issue here)

What is the new behavior?

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

Other information:

jjallaire-aisi Nov 1, 2024

Choose a reason for hiding this comment

jjallaire Nov 7, 2024

Choose a reason for hiding this comment