Return partial results if task errors #123

tomhonour · 2025-03-14T11:01:30Z

Hi, thanks for evalite - it's fantastic!

I just ran a long eval and forgot to include error handling in the task. The task ran successfully for most of the inputs, but errored on some. This meant that no table was printed.

As for a solution, I think tasks that error should be excluded from the partial table so that they don't unfairly lower the overall score.

You can test it like this:

import { evalite } from "evalite";

evalite("My Eval", {
	data: async () => {
		return [
			{ input: "hello", expected: "hello, world" },
			// This input makes the task throw.
			{ input: "fail", expected: "hello, world" },
		];
	},
	task: async (input) => {
		if (input === "fail") {
			throw new Error("Simulated task failure");
		}
		return input + ", world";
	},
	scorers: [],
});
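
To make the proposal above concrete, here is a minimal sketch of the requested behaviour, written independently of evalite's internals. The names `runWithPartialResults`, `averageScore`, and the `Row` type are hypothetical and are not part of evalite's API. It assumes each input can be run on its own, that a thrown error should be recorded against that input rather than abort the run, and that the overall score should average over successful rows only.

```typescript
// Hypothetical types, not part of evalite: one row per input, either a result or an error.
type OkRow<I, O> = { input: I; status: "ok"; output: O };
type ErrorRow<I> = { input: I; status: "error"; error: unknown };
type Row<I, O> = OkRow<I, O> | ErrorRow<I>;

// Run the task for every input. A failure is captured in its own row,
// so the remaining inputs still produce results.
async function runWithPartialResults<I, O>(
	inputs: I[],
	task: (input: I) => Promise<O>,
): Promise<Row<I, O>[]> {
	return Promise.all(
		inputs.map(async (input): Promise<Row<I, O>> => {
			try {
				return { input, status: "ok", output: await task(input) };
			} catch (error) {
				return { input, status: "error", error };
			}
		}),
	);
}

// Average a score over successful rows only, so errored rows do not lower it.
// Returns null when every row errored and there is nothing to average.
function averageScore<I, O>(
	rows: Row<I, O>[],
	score: (output: O) => number,
): number | null {
	let total = 0;
	let count = 0;
	for (const row of rows) {
		if (row.status === "ok") {
			total += score(row.output);
			count += 1;
		}
	}
	return count === 0 ? null : total / count;
}
```

With the two inputs from the repro, this would give one `ok` row and one `error` row, and the average would be computed from the `ok` row alone. Whether errored rows should still be listed in the table (marked as failed) or dropped entirely is a design choice this sketch leaves open.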

mattpocock · 2025-03-14T20:27:03Z

Gotcha, makes total sense.
