Move tfjs to first, average, worst scoring. #23

kmiller68 · 2024-12-21T16:47:19Z

Since this test is relatively slow per iteration, I reduced the iteration counts to:
non-simd: 15 iterations with worst 2.
simd: 80 iterations with worst 4 (default).

Also, I moved all the benchmark files to a tfjs directory to help organize the wasm directory a bit.
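
For readers unfamiliar with the "first, average, worst" terminology, the following is a minimal sketch of how per-iteration timings could be reduced to those three numbers. The function name `summarizeIterations`, the sample timings, and the exact formula (the first iteration reported separately, the worst N taken from the remaining iterations) are assumptions for illustration, not taken from the benchmark harness.

```javascript
// Hypothetical helper, for illustration only: reduces a list of
// per-iteration times (in ms) to first / worst / average values.
// Assumption: "worst" is the mean of the `worstCaseCount` slowest
// iterations after the first one; "average" is the mean of all
// iterations after the first one.
function summarizeIterations(timesMs, worstCaseCount) {
  if (timesMs.length < worstCaseCount + 1) {
    throw new RangeError("need at least worstCaseCount + 1 iterations");
  }
  const mean = (xs) => xs.reduce((sum, x) => sum + x, 0) / xs.length;

  const [first, ...rest] = timesMs;
  // Sort a copy so the slowest iterations come first.
  const slowestFirst = [...rest].sort((a, b) => b - a);

  return {
    first,
    worst: mean(slowestFirst.slice(0, worstCaseCount)),
    average: mean(rest),
  };
}

// Made-up timings: a slow first iteration, then mostly steady-state runs.
const summary = summarizeIterations([120, 40, 42, 38, 90, 41], 2);
console.log(summary);
```

Under this reading, lowering the iteration count shortens the run, while the "worst" count controls how many outliers feed into the worst-case number, which is why the two are adjusted together above.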

danleh · 2025-01-15T17:19:52Z

Overall this looks good to me, with some high-level comments though:

1. Unlike the other Emscripten workloads, this doesn't contain a build script. How are the Wasm and JS files generated? Did you compile them at some point, or do they come from NPM packages? Ideally, we should be able to recompile with an up-to-date toolchain, but alternatively can we at least download them from somewhere, in case the upstream packages get updated? in-depth.html mentions several repositories for Tensorflow.js, the Wasm backend, and models, but I am not sure how this is set up or copied together.
2. Even with the lowered iteration counts, the SIMD test in particular is pretty slow (on the order of 30 to 40s). Could we reduce the workload per iteration or just lower iterations further, say to 30?
3. Looking at the profile/flamegraph, a substantial number of samples are spent on parsing the .js files and an even larger number on base64ToArrayBuffer, which is used when reading the model weights. Other folks have reported JetStream 3 initialization taking much longer than before. Could this be because of the large .js files and in particular the model weights of this benchmark? Should we maybe choose just one or two models/inference tasks, not four?
4. Given that the non-SIMD variant takes roughly twice the runtime and SIMD is widely supported by now, should we only run the SIMD variant?
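
To make the base64ToArrayBuffer cost mentioned above concrete, here is a minimal sketch of a common way such a helper is written. This is an assumption for illustration, not necessarily the implementation used in this benchmark; it also assumes a global `atob`, as found in browsers and recent Node.js versions.

```javascript
// Illustrative sketch only, not the benchmark's actual code.
// Assumes a global `atob` is available.
function base64ToArrayBuffer(base64) {
  // Decode to a "binary string" with one character per byte.
  const binary = atob(base64);
  const bytes = new Uint8Array(binary.length);
  // Copy byte by byte; for multi-megabyte model weights this loop
  // and the intermediate string both scale with the payload size.
  for (let i = 0; i < binary.length; i++) {
    bytes[i] = binary.charCodeAt(i);
  }
  return bytes.buffer;
}

// Tiny usage example: "AQID" is the base64 encoding of bytes 1, 2, 3.
const decoded = new Uint8Array(base64ToArrayBuffer("AQID"));
console.log(Array.from(decoded));
```

With a helper of this shape, decoding time grows with the size of the embedded weights, so shipping fewer models would reduce it proportionally.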

kmiller68 · 2025-01-16T13:53:50Z

Re: your comments:

1. Yeah, setting up a build script for this is on my todo list. First I have to figure out how the test was originally built, though.
2. Interesting, for me the simd version is ~14s (jsc) / ~16s (v8) on my M4 MBP. The non-simd is ~8s/~9s respectively. What CPU are you using? That still might be a bit long, so I'm happy to trim the simd iteration count, e.g. by half.
3. Let's add that to the doc and talk about it at the next sync.
4. Yeah, I brought this up at the last sync. I'm happy to trim the non-SIMD variant, although I think we should keep it until we have enough other wasm tests, at least given the existing plan to remove the smaller wasm benchmarks.

This makes the runtime for the benchmark ~7s/9s/11s for jsc/v8/sm respectively on my M4 Max MBP.

danleh · 2025-01-16T14:21:37Z

1. Perfect, I'll take another look with a build script then.
2. I'm running on a server x86 CPU with a very high core count but pretty low frequency/boost; that could explain it. Changing iterations to 40 for SIMD SGTM.
3. Sure, I added it to the doc/agenda for next time.
4. Sure, let's keep the non-SIMD for now.

kmiller68 · 2025-01-16T14:22:58Z

Ok sounds like we have a plan. Gonna merge for now.

Change tfjs-wasm-simd iteration count to 40 (worst case count to 3)

176a7f7

This makes the runtime for the benchmark ~7s/9s/11s for jsc/v8/sm respectively on my M4 Max MBP.

kmiller68 merged commit 3f50521 into main Jan 16, 2025

kmiller68 added a commit that referenced this pull request Jan 21, 2025

Revert "Merge pull request #23 from WebKit/tfjs-to-js-scoring"

b23b9b9

This reverts commit 3f50521, reversing changes made to c1a122d.

kmiller68 added a commit that referenced this pull request Jan 21, 2025

Merge pull request #31 from WebKit/revert-tfjs-to-js-scoring

86e3148

Revert "Merge pull request #23 from WebKit/tfjs-to-js-scoring"

kmiller68 deleted the tfjs-to-js-scoring branch February 7, 2025 10:33
