
Create script for automated regression testing #13

Open
wants to merge 16 commits into develop

Conversation

@ilectra ilectra (Collaborator) commented Nov 27, 2024

Fixes #8.
This is still WIP; I need to

  • check that all files are copied properly to the build directory and that paths are correct
  • possibly convert to pytest

But you can already use it from inside the build directory where the test sits, after copying over the file with the reference values.
@edbennett are the values we're checking the right ones? Or do we need the plaquette value to a gazillion decimals?

@ilectra ilectra requested review from edbennett and qiUip November 27, 2024 12:13
@ilectra ilectra self-assigned this Nov 27, 2024
@qiUip qiUip (Collaborator) left a comment


In terms of functionality, this works great! A few suggestions in the review.

I also noticed that the results change based on compilation options: e.g., when compiling for sp2n (--enable-Sp --enable-Nc=4) and running Test_hmc_IwasakiGauge, I got different results than for the same test compiled for su3 (--enable-Nc=3). I'm not sure these values will make much sense without strictly fixed compile and run parameters.

@edbennett (Collaborator)

Something I should have considered earlier: since this uses a Metropolis algorithm, checking the final value of the plaquette is only valid if the version being compared against accepts the change. (In the case of a rejected change, the plaquette output is the input value, and it only depends on the code we are touching if the code changes in such a way that the change is accepted instead, which is unlikely.)

Provided we consistently stick with the same test case (and have verified that it accepts), then I think this is fine, but we should be careful about generalising this.
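For illustration, a minimal sketch of how the script could guard against this, assuming Grid-style log lines for the Metropolis decision and the plaquette (the exact log patterns and the helper names here are assumptions, not the actual script):

```python
import re

def trajectory_accepted(log_text: str) -> bool:
    # Assumption: the HMC log states the Metropolis decision with an
    # "ACCEPTED"/"REJECTED" token somewhere in the output.
    return bool(re.search(r"\bACCEPTED\b", log_text))

def parse_final_plaquette(log_text: str) -> float:
    # Assumption: the log reports lines like "Plaquette: <value>"; the last
    # occurrence is taken as the final value.
    return float(re.findall(r"Plaquette:\s*([0-9.eE+-]+)", log_text)[-1])

def check_final_plaquette(log_text: str, expected: float, tol: float = 1e-9) -> None:
    # If the trajectory was rejected, the final plaquette is just the input
    # value and tells us nothing about the code under test.
    assert trajectory_accepted(log_text), "trajectory rejected; plaquette not comparable"
    assert abs(parse_final_plaquette(log_text) - expected) < tol
```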

@ilectra (Collaborator, Author) commented Dec 20, 2024

  • To do: add more parameters/references to check for, e.g. nthreads, CPU vs GPU.
  • Read references from the command line (if possible)? I don't think this is a good idea; it should be easy enough to add a line to the ...._expected.txt file instead (see the sketch after this list).
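For what it's worth, a minimal sketch of how extra parameters could be keyed in such a file, assuming one whitespace-separated record per line (the field layout is an assumption, not the actual schema of the ...._expected.txt files):

```python
from pathlib import Path

def load_expected(path: str) -> dict:
    # Hypothetical layout: a key (e.g. observable name, nthreads, device)
    # followed by the reference values, whitespace-separated, one record per
    # line -- so adding a parameter means adding a field or a line.
    refs = {}
    for line in Path(path).read_text().splitlines():
        fields = line.split()
        if fields:
            refs[tuple(fields[:3])] = fields[3:]
    return refs
```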

Some improvements in error checking and reporting.
Future test reference-value files added; they will have to be linked over explicitly here as well, since autotools does not do wildcards.
@ilectra (Collaborator, Author) commented Jan 30, 2025

To tell whether a run used the GPU or the CPU, look for nvidia::memalloc or unified memory in the output.
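A one-function sketch of that heuristic, assuming those strings appear verbatim in the run output:

```python
import re

def ran_on_gpu(log_text: str) -> bool:
    # Heuristic from the comment above: GPU runs mention NVIDIA memory
    # allocation or unified memory in their output.
    return bool(re.search(r"nvidia::memalloc|unified\s+memory", log_text, re.IGNORECASE))
```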

@ilectra (Collaborator, Author) commented Jan 31, 2025

I think this is ready for a final review. Please have a look and add any test-case values that you find useful to the Test_hmc_Sp_WilsonFundFermionGauge_expected.txt file. @qiUip @edbennett @asifsamiarain

@@ -0,0 +1 @@
8.8.8.8 1.1.1.1 0.269298793 633bf471 3a22ad20
@ilectra (Collaborator, Author) commented on the diff

Those two expected-values files have to be adjusted for the extra test parameters and line numbers. @qiUip

@ilectra (Collaborator, Author) commented Feb 7, 2025

Add usage note to script

@edbennett (Collaborator)

As you noted, a note on usage would be useful. You might want to run ruff format or similar to tidy up some very minor things. Other than that, looks good to me!

Add some more test parameters and expected values - in progress...
Modify saveInterval in hmc tests to produce outputs.
Fix number of threads to 1 if not specified.
@ilectra (Collaborator, Author) commented Mar 3, 2025

The lat.checkpoint file checksum is machine-dependent, and therefore not appropriate for regression tests (unless we also keep note of the machine where the reference values were generated). A better reference value is LINK_TRACE, found on line 9 of the lat.checkpoint header.
Do we still want to keep the checksum for more detailed comparisons? In that case, the user would have to generate the checksum before the code changes they want to check for exact bit-wise identity, and then compare it with the one after. A command-line option for pytest, e.g. --checksum, could take no argument to just report the checksum after the run, or take a value to compare with as an argument. Either way, it'll be a two-step process: the user runs pytest with the old version of the code to generate the checksum value, then re-compiles with the new version and runs pytest again to do the comparison. Does this seem worth the effort, @edbennett?
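For illustration, a minimal conftest.py sketch of how such an option could look, using pytest's standard pytest_addoption hook; the semantics follow the description above, but the sentinel handling, the helper name read_link_trace, and the "KEY = value" header layout are assumptions:

```python
# conftest.py -- hypothetical sketch, not the actual test script.
import pytest

REPORT_ONLY = "report-only"  # sentinel: bare --checksum means "report, don't compare"

def pytest_addoption(parser):
    parser.addoption(
        "--checksum",
        nargs="?",            # optional value: bare flag vs. flag with a reference
        const=REPORT_ONLY,
        default=None,
        help="Report the lat.checkpoint checksum after the run, or compare "
             "it against the given reference value.",
    )

@pytest.fixture
def checksum_option(request):
    return request.config.getoption("--checksum")

def read_link_trace(header_lines):
    # Per the comment above, LINK_TRACE sits on line 9 of the lat.checkpoint
    # header; the "KEY = value" layout is an assumption about the format.
    key, _, value = header_lines[8].partition("=")
    assert key.strip() == "LINK_TRACE"
    return float(value)
```

A test could then skip when the option is absent, print the checksum when the flag is bare, and assert equality when a reference value was supplied, which keeps the two-step workflow described above.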

@edbennett (Collaborator)

Having a tool that makes it easier to do a checksum comparison before and after a change, so you can verify that a trajectory is bitwise compatible when run on the same machine, would be useful. Whether it's worth the effort of course depends on how much effort it would be. I'd say it's worth a single-digit number of hours of effort, but not that much more than that.

Successfully merging this pull request may close: Make automated test from example (#8).