add snippets tests & response_format support #1524


Draft · wants to merge 2 commits into base: main

Conversation

TGlide
Member

@TGlide TGlide commented Jun 11, 2025

No description provided.

@Wauplin
Contributor

Wauplin commented Jun 18, 2025

Hey @TGlide let me know if you want any guidance about inference snippets or a review :)

@TGlide
Member Author

TGlide commented Jul 11, 2025

Hi @Wauplin, sorry for the delay! Yes, I would love a review, especially to see whether I've gone about this the right way.

Contributor

@Wauplin Wauplin left a comment


Hey @TGlide, understood! I've had a first look at it; let me know if you have extra questions :)

Contributor


Sorry about all the work done on this module, but tests are actually defined in the tasks-gen package (see ./tasks-gen/scripts/generate-snippets-fixtures.ts), and fixtures are auto-generated and committed in ./packages/tasks-gen/snippets-fixtures. This way we can easily spot in a PR what has changed and whether the snippets look good.
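A minimal sketch of the fixture-based workflow described above (function and path names here are illustrative assumptions, not the actual tasks-gen API): generate a snippet, and compare it against a committed fixture file so any drift shows up as a diff in the PR.

```typescript
import * as fs from "fs";
import * as os from "os";
import * as path from "path";

// Stand-in for the real snippet generator (hypothetical).
function generateSnippet(task: string, language: string): string {
	return `# ${language} snippet for ${task}\n`;
}

function checkFixture(fixturesDir: string, task: string, language: string): boolean {
	const generated = generateSnippet(task, language);
	const fixturePath = path.join(fixturesDir, task, `${language}.snippet`);
	if (!fs.existsSync(fixturePath)) {
		// First run: write the generated snippet as the fixture (to be committed).
		fs.mkdirSync(path.dirname(fixturePath), { recursive: true });
		fs.writeFileSync(fixturePath, generated);
		return true;
	}
	// Subsequent runs: any change to the generator shows up as a fixture diff.
	return fs.readFileSync(fixturePath, "utf-8") === generated;
}
```

The point of committing the fixtures, as the comment notes, is that reviewers can eyeball the generated snippets directly in the PR diff instead of reading generator code.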

@@ -3,6 +3,9 @@
*
* Using src/scripts/inference-codegen
*/

import type { ChatCompletionInputGrammarType } from "../chat-completion/inference.js";
Contributor


This is not needed in the ASR types, I believe.

temperature?: GenerationParameters["temperature"];
max_tokens?: GenerationParameters["max_new_tokens"];
top_p?: GenerationParameters["top_p"];
response_format?: Record<string, unknown>;
Contributor


I think there is some confusion between the Chat Completion API, which accepts a response_format, and the Text Generation API, which accepts a grammar input.
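The distinction raised here can be sketched as two different request shapes (the field names below are illustrative assumptions based on this comment, not the repo's actual type definitions):

```typescript
// Chat Completion API: structured output is requested via `response_format`.
interface ChatCompletionRequest {
	messages: Array<{ role: string; content: string }>;
	response_format?: { type: string; value: unknown };
}

// Text Generation API: the equivalent knob is the `grammar` parameter.
interface TextGenerationRequest {
	inputs: string;
	parameters?: {
		grammar?: { type: string; value: unknown };
	};
}

// A snippet generator therefore has to route the user's schema to the
// right field depending on which API the snippet targets.
function structuredOutputField(api: "chat-completion" | "text-generation"): string {
	return api === "chat-completion" ? "response_format" : "grammar";
}
```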

@@ -408,6 +436,46 @@ function formatBody(obj: object, format: "curl" | "json" | "python" | "ts"): str
}
}

function formatPythonValue(obj: unknown, depth?: number): string {
Contributor


I'm starting to really think we should first generate the snippets and then format them. There seem to be a few solutions, although not very popular (blackjs, prettier/plugin-python). For now, let's keep it like this.
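For context, the kind of work a formatPythonValue helper has to do is rendering a JS value as a Python literal: mapping true/false/null to True/False/None and indenting nested structures. A minimal sketch (an assumption for illustration, not the PR's actual implementation):

```typescript
// Render a JS value as a Python literal with 4-space indentation.
function formatPythonValue(obj: unknown, depth = 0): string {
	const indent = "    ".repeat(depth + 1);
	const closing = "    ".repeat(depth);
	if (obj === null || obj === undefined) return "None";
	if (typeof obj === "boolean") return obj ? "True" : "False";
	if (typeof obj === "number") return String(obj);
	if (typeof obj === "string") return JSON.stringify(obj);
	if (Array.isArray(obj)) {
		const items = obj.map((v) => indent + formatPythonValue(v, depth + 1));
		return `[\n${items.join(",\n")}\n${closing}]`;
	}
	const entries = Object.entries(obj as Record<string, unknown>).map(
		([k, v]) => `${indent}${JSON.stringify(k)}: ${formatPythonValue(v, depth + 1)}`
	);
	return `{\n${entries.join(",\n")}\n${closing}}`;
}
```

Generating first and then running a real Python formatter, as suggested above, would avoid hand-maintaining this kind of indentation logic.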

Contributor


Out of curiosity, was this code auto-generated or written manually? (If auto-generated, better to mention it in the docstring.)

Contributor


(Note to self: maybe a single-file Python formatter would be enough given our small requirements. Here's an example.)

2 participants