Attempt to fix non-deterministic tests #160

Pennycook · 2025-01-24T12:03:37Z

Related issues

unittest workflow fails non-deterministically #159

Proposed changes

Fix a weird-looking assignment to self.tmp inside of the cbi-cov tests.
Remove ResourceWarning filters from all tests, so we can't ignore them.
Add explicit calls to cleanup() when we're finished with a TemporaryDirectory.
Use setUpClass and tearDownClass in cases where all tests can share the same resources.
Use assertCountEqual instead of assertEqual when comparing coverage.json files.

Even the old tests worked deterministically on my laptop, so I have no idea if this will actually fix anything. But the changes make sense to me, so 🤞.

EDIT: I suspect the assertCountEqual change here is the real fix, but I think we should merge the TemporaryDirectory changes as well. Even if they weren't the cause of the non-determinism in #159, the new versions of the tests are cleaner.

There were two unusual things about this test: - We assigned a local variable to "self.tmp"; and - We ignored a ResourceWarning. One or both of these things could be the source of the non-deterministic behavior we've been seeing. Signed-off-by: John Pennycook <[email protected]>

Based on our experience with the cbi-cov tests, these could be hiding bugs. Signed-off-by: John Pennycook <[email protected]>

Multiple tests were issuing ResourceWarnings due to relying on implicit cleanup of a TemporaryDirectory. I've rewritten these tests to implicitly call cleanup(), and in cases where the TemporaryDirectory can be shared by all tests in the class I've switched over to setUpClass and tearDownClass. Even though we haven't seen any of these tests fail, it's possible that they might share the same issues as the cbi-cov tests. Signed-off-by: John Pennycook <[email protected]>

Pennycook · 2025-01-24T12:04:43Z

...they're still failing, so something else is going on here.

Since the ordering of files on the file system is not guaranteed, the order of the files in a coverage report is not guaranteed either. Signed-off-by: John Pennycook <[email protected]>

Pennycook · 2025-01-27T13:35:58Z

@laserkelvin - Since this started working after the assertCountEqual fix, I think it's ready to review now.

laserkelvin

For the purpose of fixing CI, it seems to be working but I've made a note about refactoring to use context managers for the temporary directories to ensure that it's actually cleaned up within a defined scope, as opposed to relying on tearDown methods.

I think that would improve maintainability, although I'm not 100% sure it would be functionally different to now.

laserkelvin · 2025-01-31T15:47:48Z

tests/cli/test_cbicov.py

@@ -66,8 +64,8 @@ def test_compute(self):
        """Check that coverage is computed correctly."""
        sys.stdout = io.StringIO()
        # Create a temporary codebase to work on.
-        self.tmp = tempfile.TemporaryDirectory()
-        p = Path(self.tmp.name)
+        tmp = tempfile.TemporaryDirectory()


So my comment - I don't know if you can be bothered to refactor it - would be to use TemporaryDirectory in a context so it does clean up after itself.

I guess you call it manually, but I don't know if there are other missed behaviors omitted if you don't rely on the context's __exit__

So, I did try using the context manager and honestly I couldn't really wrap my head around how it was supposed to work.

If you just want a temporary file to dump stuff in and then it goes away, the context manager approach is straightforward. But if you want to write to a file and then read from it later, you end up with this weird nesting of context managers and have to use a bunch of non-standard options. Here's an example from the documentation:

with tempfile.NamedTemporaryFile(delete_on_close=False) as fp: fp.write(b'Hello world!') fp.close() # the file is closed, but not removed # open the file again by using its name with open(fp.name, mode='rb') as f: f.read()

I think we'd also have to rewrite things to create and destroy the temporary files for every test, rather than re-using the same temporary files across all tests... That probably isn't that important, but it might make the tests run a little longer.

laserkelvin · 2025-01-31T15:50:04Z

tests/code-base/test_code_base.py

@@ -32,6 +31,11 @@ def setUp(self):
        open(p2 / "quux.h", mode="w").close()
        open(p2 / "README.md", mode="w").close()

+    @classmethod
+    def tearDownClass(self):
+        self.tmp1.cleanup()


Yeah this kind of thing is what makes it a bit clunky to me - I know that tearDown methods are meant to be called under all circumstances, pass or fail, but it makes it hard to validate behavior

I don't disagree, and I'm not opposed to finding a better way to rewrite these tests. But I'd like to defer it until a future PR, since you're not opposed -- I want to make sure that we actually did fix the issue, before spending additional time refactoring and/or rewriting tests.

Pennycook added 3 commits January 24, 2025 11:46

Remove ResourceWarning filters from all tests

1a7122b

Based on our experience with the cbi-cov tests, these could be hiding bugs. Signed-off-by: John Pennycook <[email protected]>

Pennycook added the bug Something isn't working label Jan 24, 2025

Pennycook requested a review from laserkelvin January 24, 2025 12:03

Use assertCountEqual instead of assertEqual

9f4d627

Since the ordering of files on the file system is not guaranteed, the order of the files in a coverage report is not guaranteed either. Signed-off-by: John Pennycook <[email protected]>

Pennycook linked an issue Jan 29, 2025 that may be closed by this pull request

unittest workflow fails non-deterministically #159

Closed

Pennycook removed a link to an issue Jan 29, 2025

unittest workflow fails non-deterministically #159

Closed

laserkelvin approved these changes Jan 31, 2025

View reviewed changes

Pennycook merged commit 8ffa6db into intel:main Jan 31, 2025
3 checks passed

Pennycook deleted the deterministic-tests branch January 31, 2025 16:06

Pennycook mentioned this pull request Feb 6, 2025

unittest workflow fails non-deterministically #159

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempt to fix non-deterministic tests #160

Attempt to fix non-deterministic tests #160

Pennycook commented Jan 24, 2025 •

edited

Loading

Pennycook commented Jan 24, 2025

Pennycook commented Jan 27, 2025

laserkelvin left a comment

laserkelvin Jan 31, 2025

Pennycook Jan 31, 2025

laserkelvin Jan 31, 2025

Pennycook Jan 31, 2025

Attempt to fix non-deterministic tests #160

Attempt to fix non-deterministic tests #160

Conversation

Pennycook commented Jan 24, 2025 • edited Loading

Related issues

Proposed changes

Pennycook commented Jan 24, 2025

Pennycook commented Jan 27, 2025

laserkelvin left a comment

Choose a reason for hiding this comment

laserkelvin Jan 31, 2025

Choose a reason for hiding this comment

Pennycook Jan 31, 2025

Choose a reason for hiding this comment

laserkelvin Jan 31, 2025

Choose a reason for hiding this comment

Pennycook Jan 31, 2025

Choose a reason for hiding this comment

Pennycook commented Jan 24, 2025 •

edited

Loading