Improvements to thread safety detection and implement disallowing whole modules #82

lysnikolaou · 2025-06-25T16:47:07Z

This includes a few things:

- General improvements to thread safety detection, like:
  - Call generic_visit in all paths and bail out early when already thread unsafe
  - Do not recurse when target is not a function
  - Fix message printed in error report when unsafety is detected because of __thread_safe__
- Add ability to blocklist whole modules and do that for ctypes and unittest.mock
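
To illustrate the first bullet, here is a minimal sketch of a visitor that calls generic_visit on every path that does not flag anything, and stops visiting once it has been marked thread unsafe. The class name `UnsafeCallVisitor`, the helper `is_thread_unsafe`, and the `unsafe_names` parameter are hypothetical and are not taken from this PR's code.

```python
import ast


class UnsafeCallVisitor(ast.NodeVisitor):
    """Hypothetical visitor: flags calls to any name in `unsafe_names`."""

    def __init__(self, unsafe_names):
        self.unsafe_names = set(unsafe_names)
        self.thread_unsafe = False
        self.reason = None

    def visit(self, node):
        # Bail out early: once something is flagged, skip the rest of the tree.
        if self.thread_unsafe:
            return
        super().visit(node)

    def visit_Call(self, node):
        if isinstance(node.func, ast.Name) and node.func.id in self.unsafe_names:
            self.thread_unsafe = True
            self.reason = f"calls {node.func.id}"
            return
        # Without this, calls nested in the arguments would never be visited.
        self.generic_visit(node)


def is_thread_unsafe(source, unsafe_names):
    """Parse `source` and report (flag, reason) from the visitor."""
    visitor = UnsafeCallVisitor(unsafe_names)
    visitor.visit(ast.parse(source))
    return visitor.thread_unsafe, visitor.reason
```

In this sketch, a flagged call nested inside another call (for example `print(warn(1))` with `warn` in `unsafe_names`) is only found because `visit_Call` falls through to `generic_visit`.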

It's easier to review this commit by commit.

Also, here are the test results for SciPy and scikit-learn when it comes to collection performance:

SciPy:

| | Collection time | Collected items to run in parallel |
| --- | --- | --- |
| main | 33.92s | 53101 |
| HEAD | 38.19s | 53089 |

scikit-learn:

| | Collection time | Collected items to run in parallel |
| --- | --- | --- |
| main | 11.40s | 23049 |
| HEAD | 10.46s | 22833 |

EDIT: Changed number of tests discovered to run in parallel under HEAD in scikit-learn.

ngoldbaum · 2025-06-25T16:48:27Z

Can you update the readme based on the new functionality?

I'll try to take a look at the code ASAP.

ngoldbaum · 2025-06-25T16:49:41Z

Do you have any idea why there are so many more scikit-learn tests getting collected? What's special about the 12 tests in SciPy that don't get collected anymore?

ngoldbaum · 2025-06-26T16:12:54Z

All the code changes look reasonable to me and the new tests look like they're catching tricky corner cases along with just detecting that a module is imported.

Maybe add a test to make sure that tests still run in parallel if they don't use any functionality from a blacklisted module?

lysnikolaou · 2025-06-26T16:14:47Z

> Can you update the readme based on the new functionality?

README updated.

> Do you have any idea why there are so many more scikit-learn tests getting collected? What's special about the 12 tests in SciPy that don't get collected anymore?

The scikit-learn number was due to a bug after all. inspect.isfunction returns False for methods, so the thread safety inspection was stopping there.
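
A small standalone check of the behavior described above, assuming nothing about the plugin's internals: `Helper` and `plain` are made-up names used only to compare inspect.isfunction with callable on a plain function and on a bound method.

```python
import inspect


class Helper:
    def method(self):
        return 1


def plain():
    return 1


# A bound method is a types.MethodType object, not a function object.
bound = Helper().method

# Each entry is (inspect.isfunction(obj), callable(obj)).
checks = {
    "plain function": (inspect.isfunction(plain), callable(plain)),
    "bound method": (inspect.isfunction(bound), callable(bound)),
}

for label, (is_function, is_callable) in checks.items():
    print(f"{label}: isfunction={is_function}, callable={is_callable}")
```

A check based only on inspect.isfunction therefore skips bound methods, while a callable check lets them through, which matches the later commit that switches from isfunction to callable.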

> What's special about the 12 tests in SciPy that don't get collected anymore?

The SciPy and scikit-learn tests that now get skipped were false positives. The ones I looked at had calls to functions that were imported in a different way than what we had blocklisted (e.g. from unittest.mock import patch) and/or tricky edge cases with methods that weren't analyzed before.
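
As a rough sketch of why the import spelling matters, the snippet below normalizes `import x`, `from x import y`, and `from pkg import mod` to module names and compares them against a blocklist. `BLOCKLISTED_MODULES` and `blocklisted_imports` are hypothetical names, and this only inspects import statements in a source string. How the plugin actually resolves and tracks usage of these modules is not shown here.

```python
import ast

# Assumed blocklist, using the two modules named in the PR description.
BLOCKLISTED_MODULES = {"ctypes", "unittest.mock"}


def _is_blocklisted(module_name):
    # Match the module itself or any of its submodules, e.g. "ctypes.util".
    return any(
        module_name == blocked or module_name.startswith(blocked + ".")
        for blocked in BLOCKLISTED_MODULES
    )


def blocklisted_imports(source):
    """Return the blocklisted module names imported anywhere in `source`."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # "import ctypes" or "import ctypes.util"
            found.update(a.name for a in node.names if _is_blocklisted(a.name))
        elif isinstance(node, ast.ImportFrom) and node.module:
            if _is_blocklisted(node.module):
                # "from unittest.mock import patch"
                found.add(node.module)
                continue
            for alias in node.names:
                # "from unittest import mock" names the module in the alias.
                full_name = f"{node.module}.{alias.name}"
                if _is_blocklisted(full_name):
                    found.add(full_name)
    return found
```

Matching only on `import unittest.mock` would miss both `from unittest.mock import patch` and `from unittest import mock`, which is the kind of gap described above.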

ngoldbaum · 2025-06-26T16:17:05Z

Thanks for looking at that! Making this stuff more robust will pay ecosystem-wide dividends.

Does adding the test I suggested in my other comment make sense?

lysnikolaou · 2025-06-26T16:27:11Z

> Maybe add a test to make sure that tests still run in parallel if they don't use any functionality from a blacklisted module?

Isn't this covered by all of the other tests (in files other than tests/test_thread_unsafe_detection.py) that do not exercise this path?

ngoldbaum · 2025-06-26T16:30:06Z

> Isn't this covered from all of the other tests (in files other than tests/test_thread_unsafe_detection.py) that do not exercise this path?

I don't think so. Or at least it's not obvious to me.

I think you have to add a new test to the test file created by test_thread_unsafe_ctypes that asserts num_parallel_threads is > 1 and doesn't actually use the ctypes module.

lysnikolaou · 2025-06-26T16:35:33Z

> I think you have to add a new test to the test file created by test_thread_unsafe_ctypes that asserts num_parallel_threads is > 1 and doesn't actually use the ctypes module.

Done!

ngoldbaum · 2025-06-26T21:51:27Z

I spent some time profiling this and I didn't see any obvious way to speed things up - now that the visitor is doing more stuff, it takes more time. We also spend a decent amount of time in ast.parse and inspect.getsource - if there were faster alternatives that would also help, but I don't think there are any right now.
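
For anyone who wants to reproduce a rough version of this measurement, here is a minimal timing sketch. `time_source_parsing` is a hypothetical helper, not part of the plugin, and it times a standard-library function rather than real test items, so the numbers only indicate the relative cost of the two calls.

```python
import ast
import inspect
import textwrap
import time


def time_source_parsing(func, repeats=100):
    """Return total seconds spent in (inspect.getsource, ast.parse)."""
    getsource_total = 0.0
    parse_total = 0.0
    for _ in range(repeats):
        start = time.perf_counter()
        source = inspect.getsource(func)
        middle = time.perf_counter()
        # Dedent so that indented sources (e.g. methods) still parse;
        # the dedent cost is counted as part of the parse time.
        ast.parse(textwrap.dedent(source))
        end = time.perf_counter()
        getsource_total += middle - start
        parse_total += end - middle
    return getsource_total, parse_total


if __name__ == "__main__":
    getsource_s, parse_s = time_source_parsing(textwrap.dedent)
    print(f"inspect.getsource: {getsource_s:.4f}s, ast.parse: {parse_s:.4f}s")
```

Note that inspect.getsource reads files through linecache, so repeated calls on the same module are faster than the first one. A profiler such as cProfile over a real collection run gives a more representative picture.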

ngoldbaum · 2025-06-26T21:51:51Z

@rgommers what do you think about the marginally increased collection time for SciPy?

lysnikolaou added 5 commits June 23, 2025 17:15

Call generic_visit in all paths and bail out early in all visit methods

a0787bd

Add isfunction check when recursively checking for thread safety

6cbbdd3

Fix message when thread unsafety is detected because of __thread_safe__

074f14f

Always return early when thread unsafety is detected

a124c7f

Add ability to blocklist whole modules and block ctypes

953e9d3

lysnikolaou added 2 commits June 26, 2025 16:35

Change isfunction to callable check to do check for methods too

2b0b765

Update README

56eee11

Add test that test item actually runs in parallel when not using ctypes

dcc2bf5
