
[v2] Add support for chunking performance tests, and architectural changes #9485


Open
wants to merge 25 commits into base: v2

Conversation

@aemous (Contributor) commented May 9, 2025

Context:

The main purpose of this PR is to add 'chunking' support to the performance tests, enabling the test cases to be sharded across parallel workers in a distributed workflow. The secondary purpose is to refactor the architecture of the performance-testing code so that more complex performance tests are easier to integrate. The architectural decisions were guided by implementing more complex RPCv2 CBOR performance test cases and observing points of friction.

Description of changes:

  • Adds support for evenly chunking the performance test cases and running only a specified chunk (a sketch of one possible splitting strategy follows this list).
  • Implements a BaseBenchmarkSuite class that defines the abstract functions which must be implemented to define new performance tests. The default performance test cases were abstracted out into a JSONStubbedBenchmarkSuite implementation.
  • Modifies JSONStubbedBenchmarkSuite:
    • Changed the dimensions format to require name and value keys, reflecting the CloudWatch Metrics model.
    • Added support for writing binary content via a new mode key on the file_literals definition.
  • Abstracts the code for serializing benchmark results into its own BenchmarkResultsSerializer class, making it easier to override for one-off performance testing when a different output format is expected.
  • Architectural changes to performance testing:
    • Changed Summarizer to return metric results in a new Metric data model, which includes additional data such as description and unit (see the Metric sketch after this list).
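
For reference, here is a minimal sketch of the even-chunking idea; the helper name and the round-robin split are illustrative assumptions, not necessarily this PR's implementation:

```python
def get_chunk(cases, num_chunks, chunk_id):
    """Return the chunk_id-th of num_chunks near-even slices of cases.

    Hypothetical helper: a round-robin split keeps chunk sizes within
    one of each other even when len(cases) is not divisible by
    num_chunks, so parallel workers get balanced shards.
    """
    if not 0 <= chunk_id < num_chunks:
        raise ValueError('chunk_id must be in [0, num_chunks)')
    return cases[chunk_id::num_chunks]
```

For example, 10 cases split with num_chunks=4 yield chunks of sizes 3, 3, 2, and 2, and every case lands in exactly one chunk.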
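
Likewise, a hedged sketch of what the Metric model and the name/value dimension format could look like; any field or value below beyond name, value, unit, and description is an illustrative assumption:

```python
from dataclasses import dataclass, field

@dataclass
class Metric:
    # Hypothetical shape mirroring the CloudWatch Metrics model the
    # description references; not necessarily the PR's exact fields.
    name: str
    value: float
    unit: str
    description: str = ''
    # Dimensions as explicit name/value pairs, per the new format.
    dimensions: list = field(default_factory=list)

# Illustrative usage with a made-up metric and dimension:
metric = Metric(
    name='FirstByteLatency',
    value=12.5,
    unit='Milliseconds',
    description='Time until the first byte of output is observed.',
    dimensions=[{'name': 'Command', 'value': 's3.cp'}],
)
```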

Description of tests:

  • Ran the default JSON stubbed tests via `./scripts/performance/run-benchmarks --num-iterations 20` and observed the expected results.
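
A chunked run might look like the following; the exact flag spellings are inferred from the parsed_args.num_chunks / parsed_args.chunk_id names visible in the review below and should be treated as assumptions:

```
# Run chunk 0 of 4 shards (flag names assumed, not confirmed):
./scripts/performance/run-benchmarks --num-iterations 20 --num-chunks 4 --chunk-id 0
```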

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@aemous aemous marked this pull request as ready for review June 4, 2025 20:37
@aemous aemous requested a review from a team June 4, 2025 20:38
@ashovlin (Member) left a comment


When I saw the new /tests directory, I thought those would be unit/integration/etc. tests for the performance framework itself. I'd rather keep these at the top level if it's still just implementation.

Comment on lines +260 to +262
- `mode` (string) **(optional)**: The write mode to use for writing the file contents.
  - Default: `w`
Member

Is anything actually using this? It looks hardcoded below in `_create_file_with_size` and `begin_iteration`.

Contributor Author

Ah, it's intended to be used in the JSONStubbedBenchmarkSuite. In particular, it was useful for one-off performance testing of the CBOR protocol for an operation with binary input. Will update in the next revision.
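
As a hedged illustration of how a `mode` key could be honored when writing file literals (the helper name and the call below are assumptions for this thread, not the PR's actual code):

```python
def write_file_literal(path, content, mode='w'):
    # Binary modes ('wb') require bytes; encode str content so one
    # helper can serve both text and binary file_literals entries.
    if 'b' in mode and isinstance(content, str):
        content = content.encode('utf-8')
    with open(path, mode) as f:
        f.write(content)

# e.g. a binary payload from a hypothetical file_literals entry:
write_file_literal('payload.bin', b'\x00\x01\x02', mode='wb')
```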

Comment on lines +77 to +83
if (
sum(
x is not None
for x in [parsed_args.num_chunks, parsed_args.chunk_id]
)
== 1
):
Member
Is this pattern common? For readability, summing bools seems odd.
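
For comparison, a more explicit spelling of "exactly one of the two was set" is an inequality on two booleans (boolean XOR); a minimal sketch, with an illustrative error message:

```python
has_num_chunks = parsed_args.num_chunks is not None
has_chunk_id = parsed_args.chunk_id is not None
# True when exactly one flag was supplied -- the same half-specified
# case the sum(...) == 1 check above detects.
if has_num_chunks != has_chunk_id:
    raise ValueError('num-chunks and chunk-id must be specified together.')
```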

@@ -0,0 +1,24 @@
class BaseBenchmarkSuite:
Member

Do we need a base class if there's only one implementation of it?

If keeping, could you add class-level comments on when someone would use this?

Contributor Author

I like having this base class because it forces us to implement our only test suite against a contract that we know extends to other use cases. I previously extended this base class to implement CBOR benchmarks for a couple of other services, and those tests fit nicely against the functions on this base class.

Even though we won't ship those CBOR tests, they served to show locally that the function contract on this class is extensible and reusable.

I will add those class-level comments in the next iteration.
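
For readers outside this thread, a minimal sketch of the kind of contract being discussed; the diff shows a plain class, and the abc usage and method names here (apart from begin_iteration, which appears earlier in this review) are hypothetical:

```python
from abc import ABC, abstractmethod

class BaseBenchmarkSuite(ABC):
    """Contract every benchmark suite implements (hypothetical sketch).

    Subclasses -- e.g. a JSON-stubbed suite or a one-off CBOR suite --
    supply per-iteration setup/teardown around the timed work.
    """

    @abstractmethod
    def begin_iteration(self, benchmark_case):
        """Prepare the environment before timing one iteration."""

    @abstractmethod
    def end_iteration(self, benchmark_case):
        """Clean up after one timed iteration."""
```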
