The benchmark-cli's performance regression detection workflow fails if the measured regression exceeds a threshold value. Currently, the threshold is set to 1 (i.e., a 100% relative regression), meaning the workflow only fails when the new commit is at least twice as slow as the original one.
We need a better strategy for calculating the performance gap between the original commit and the new commit, so that we can derive an accurate and well-justified threshold value.
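For reference, a minimal sketch of what the relative-regression check implied above looks like (the method and parameter names here are hypothetical, not the CLI's actual implementation):

```java
// Sketch of a relative-regression check. `baseline` and `candidate` are
// assumed to be mean per-operation times (e.g., nanoseconds) for the two
// commits. With threshold = 1.0, this only fails when the candidate is at
// least twice as slow as the baseline, matching the current behavior.
static boolean isRegression(double baseline, double candidate, double threshold) {
    double relativeRegression = (candidate - baseline) / baseline;
    return relativeRegression > threshold;
}
```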
In the short term, we should increase the warm-up and measurement iteration counts to make sure the results are stable.
In the long term:
We should compare the new commit not only to the most recent commit but also to an average of results from the past months. A gradual slowdown spread across many commits can stay below the per-commit threshold, so comparing only adjacent commits lets such regressions go unnoticed; see the sketch below.
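A sketch of that dual comparison (the names are hypothetical, and `historicalScores` would come from wherever past benchmark results end up being stored):

```java
import java.util.List;

// Hypothetical sketch: flag a regression if the candidate is slower than
// either the previous commit or the historical mean by more than the threshold.
static boolean isRegression(double candidate, double previous,
                            List<Double> historicalScores, double threshold) {
    double historicalMean = historicalScores.stream()
            .mapToDouble(Double::doubleValue)
            .average()
            .orElse(previous); // fall back to the previous commit if no history exists
    boolean slowerThanPrevious = (candidate - previous) / previous > threshold;
    boolean slowerThanHistory  = (candidate - historicalMean) / historicalMean > threshold;
    return slowerThanPrevious || slowerThanHistory;
}
```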
We also need to reduce noise introduced by the machine itself. Currently, we only run performance regression detection on Ubuntu, but we plan to add other operating systems in the future. To get more consistent benchmark results on Linux, we can control factors such as CPU affinity - https://easyperf.net/blog/2019/08/02/Perf-measurement-environment-on-Linux
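For example, on Linux the benchmark JVM could be pinned to a fixed set of cores via `taskset`. A minimal sketch of a launcher (the jar name and CLI arguments are placeholders, not a confirmed invocation):

```java
import java.io.IOException;

// Hypothetical sketch: pin the benchmark process to cores 2 and 3 with
// taskset so the scheduler cannot migrate it across cores mid-run (Linux only).
public class PinnedBenchmarkLauncher {
    public static void main(String[] args) throws IOException, InterruptedException {
        Process benchmark = new ProcessBuilder(
                "taskset", "-c", "2,3",
                "java", "-jar", "ion-java-benchmark-cli.jar", "read", "input.10n")
                .inheritIO()
                .start();
        System.exit(benchmark.waitFor());
    }
}
```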
We should create more specialized input files that target specific data models. For example, if a new commit introduces a change that slows down string serialization, the regression can be hard to spot in a benchmark over mixed data. By having input files that consist of numerous Ion string values, we can magnify this regression and detect it easily.
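As a sketch, such a targeted input file could be generated with ion-java (the value count and output path are arbitrary choices for illustration):

```java
import com.amazon.ion.IonWriter;
import com.amazon.ion.system.IonBinaryWriterBuilder;
import java.io.FileOutputStream;
import java.io.IOException;

// Sketch: generate a binary Ion file containing only string values, so that
// any change to string serialization/deserialization dominates the benchmark.
public class StringCorpusGenerator {
    public static void main(String[] args) throws IOException {
        try (FileOutputStream out = new FileOutputStream("strings.10n");
             IonWriter writer = IonBinaryWriterBuilder.standard().build(out)) {
            for (int i = 0; i < 1_000_000; i++) {
                writer.writeString("benchmark string value " + i);
            }
        }
    }
}
```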
We can also mimic JMH (the Java Microbenchmark Harness) and implement another warm-up mode based on "clock time" instead of "execution count", which ensures that the benchmark-cli warms up for a specific duration. For example, instead of executing the benchmark code 20 times, we would warm it up for 20 minutes.
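A time-based warm-up loop could be as simple as the following sketch, where `runOnce` stands in for whatever benchmark body the CLI executes:

```java
import java.util.concurrent.TimeUnit;

// Sketch of a clock-time warm-up mode: keep executing the benchmark body
// until the requested wall-clock duration has elapsed, instead of counting
// a fixed number of iterations.
static void warmUpFor(long duration, TimeUnit unit, Runnable runOnce) {
    long deadline = System.nanoTime() + unit.toNanos(duration);
    while (System.nanoTime() < deadline) {
        runOnce.run(); // hypothetical benchmark body
    }
}
```

Invoked as, e.g., `warmUpFor(20, TimeUnit.MINUTES, benchmarkBody)`, this would replace a fixed iteration count with a fixed duration.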
We can also try running each command for both the previous and the current commit interleaved in a single session and check whether that yields less variance (currently we run all commands for each commit separately).
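A sketch of the interleaving idea (`previousCommit` and `currentCommit` are hypothetical runnables that would invoke the CLI built from the respective commits):

```java
// Sketch: alternate between the two commits on every iteration so that slow
// drift in machine state (thermal throttling, background load) affects both
// measurements roughly equally, rather than biasing whichever commit ran last.
static void runInterleaved(Runnable previousCommit, Runnable currentCommit, int iterations) {
    for (int i = 0; i < iterations; i++) {
        previousCommit.run(); // benchmark built from the previous commit
        currentCommit.run();  // benchmark built from the current commit
    }
}
```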
Related issue - amazon-ion/ion-java-benchmark-cli#53