
improve performance, alternative approach (+149% throughput and more) #38

Merged · 2 commits merged into chalk:main on Mar 24, 2023

Conversation


@AlCalzone AlCalzone commented Mar 20, 2023

This PR is an alternative to #37

The idea is to first analyze the input string and turn it into an array of ANSI codes and characters. Slicing is then done by operating on that array. This effectively rewrites the entire library, but yields higher performance: +149% throughput, and more when the tokenization is reused to slice the same string multiple times. For my performance-testing methodology, see #37; some of the improvements here are also taken from there.
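To illustrate the tokenize-then-slice idea, here is a minimal sketch. This is not the actual slice-ansi implementation: the token shape, the regex, and the function names are illustrative, and real ANSI handling covers more sequence types than plain SGR codes.

```javascript
// Split a string into tokens: either a full ANSI SGR escape sequence
// (e.g. "\u001B[31m") or a single visible character.
function tokenize(input) {
	const tokens = [];
	const ansiRegex = /\u001B\[[0-9;]*m/y; // sticky: matches only at lastIndex
	let index = 0;
	while (index < input.length) {
		ansiRegex.lastIndex = index;
		const match = ansiRegex.exec(input);
		if (match) {
			tokens.push({type: 'ansi', code: match[0]});
			index += match[0].length;
		} else {
			tokens.push({type: 'char', value: input[index]});
			index += 1;
		}
	}
	return tokens;
}

// Slice by counting only visible characters. ANSI codes seen before the
// slice ends are kept so the styling active at the slice start survives.
function sliceTokens(tokens, start, end) {
	let visible = 0;
	let result = '';
	for (const token of tokens) {
		if (token.type === 'ansi') {
			if (visible < end) result += token.code;
		} else {
			if (visible >= start && visible < end) result += token.value;
			visible += 1;
		}
	}
	return result;
}
```

A real implementation also has to re-emit the appropriate closing codes at the end of the slice, which this sketch omits for brevity.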

I had to change two tests, though. One unnecessarily used two separate foreground colors, where the second immediately overwrote the first. In the other, the expected string had the end codes in the same order as the start codes, while all other tests had them in the opposite order. Printing the strings showed no visible difference either way.

  original:
    20 822 ops/s, ±1.70%
  
  (snip)

  other PR:
    41 278 ops/s, ±4.02%

  this PR:
    51 791 ops/s, ±2.93%

If the result of the tokenization is memoized (export the tokenize function, and add a variant of sliceAnsi that operates on a token array instead of a string), repeated slices of the same input (as in the test case) are much faster (+300% throughput):

  this PR, with reusing work:
    84 850 ops/s, ±3.21%
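The reuse described above can be sketched as a cache keyed by the input string. This is a hypothetical helper for illustration, not part of slice-ansi's actual exports; the stand-in tokenizer just splits characters, since the caching pattern is the point.

```javascript
// Memoize the tokenization step: repeated slices of the same string
// skip the parsing work entirely after the first call.
const tokenCache = new Map();

function tokenizeOnce(input, tokenize) {
	let tokens = tokenCache.get(input);
	if (tokens === undefined) {
		tokens = tokenize(input); // parse only on the first call
		tokenCache.set(input, tokens);
	}
	return tokens;
}

// Stand-in tokenizer; the real one would separate ANSI codes from characters.
const splitChars = string => [...string];

const tokens = tokenizeOnce('\u001B[31mred\u001B[39m', splitChars);
```

A production version would likely bound the cache size (or use a WeakRef-based scheme) so long-running processes don't accumulate token arrays indefinitely.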

FYI, I have TypeScript code for this change. Let me know if that is preferred.

closes: #37

@AlCalzone AlCalzone changed the title perf: tokenize input and operate on analyzed array perf: tokenize input and operate on analyzed array (+149% throughput and more) Mar 20, 2023
- t.is(sliceAnsi('\u001B[1m\u001B[48;2;255;255;255m\u001B[38;2;255;0;0municorn\u001B[39m\u001B[49m\u001B[22m', 0, 3), '\u001B[1m\u001B[48;2;255;255;255m\u001B[38;2;255;0;0muni\u001B[22m\u001B[49m\u001B[39m');
+ t.is(sliceAnsi('\u001B[1m\u001B[48;2;255;255;255m\u001B[38;2;255;0;0municorn\u001B[39m\u001B[49m\u001B[22m', 0, 3), '\u001B[1m\u001B[48;2;255;255;255m\u001B[38;2;255;0;0muni\u001B[39m\u001B[49m\u001B[22m');
@AlCalzone (author):

After this change, the start codes get undone in the same order as they appear in the input string.
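Why the order of the reset codes doesn't matter here can be seen with a tiny state machine over the attributes involved. This is an illustrative sketch, not a terminal-accurate model: each reset code clears an independent attribute, so applying them in either order ends in the same state.

```javascript
// SGR codes used: 1 = bold on, 22 = bold off, 38;… = set foreground,
// 39 = default foreground, 48;… = set background, 49 = default background.
function applyCodes(codes) {
	const state = {bold: false, fg: 'default', bg: 'default'};
	for (const code of codes) {
		if (code === '\u001B[1m') state.bold = true;
		else if (code === '\u001B[22m') state.bold = false;
		else if (code === '\u001B[39m') state.fg = 'default';
		else if (code === '\u001B[49m') state.bg = 'default';
		else if (code.startsWith('\u001B[38;')) state.fg = code;
		else if (code.startsWith('\u001B[48;')) state.bg = code;
	}
	return state;
}

const start = ['\u001B[1m', '\u001B[48;2;255;255;255m', '\u001B[38;2;255;0;0m'];
const resetsOld = ['\u001B[22m', '\u001B[49m', '\u001B[39m']; // old test order
const resetsNew = ['\u001B[39m', '\u001B[49m', '\u001B[22m']; // new test order
```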

- t.is(JSON.stringify(sliceAnsi('\u001B[31m' + output, 0, 4)), JSON.stringify(`\u001B[31m${chalk.black.bgYellow(' RUN')}`));
+ t.is(JSON.stringify(sliceAnsi('\u001B[31m' + output, 0, 4)), JSON.stringify(chalk.black.bgYellow(' RUN')));
@AlCalzone (author):

The ANSI code for yellow is unnecessary in the output, because it is immediately overwritten with black.

@AlCalzone AlCalzone changed the title perf: tokenize input and operate on analyzed array (+149% throughput and more) improve performance, alternative approach (+149% throughput and more) Mar 20, 2023
@sindresorhus (Member):

I like this approach. Tokenizing it first makes a lot of sense.

(5 resolved review threads on index.js)
@sindresorhus sindresorhus merged commit 29f76a4 into chalk:main Mar 24, 2023
2 checks passed
@sindresorhus (Member):

Thanks :)

@sindresorhus (Member):

https://github.com/chalk/slice-ansi/releases/tag/v6.0.0
