[python] Add hypothesis tests #3560

Open · wants to merge 33 commits into main
Conversation

@bkmartinjr (Member) commented on Jan 14, 2025

Add an initial set of tests based on the Hypothesis property-based testing framework. Included are:

  • API tests for IntIndexer and fastercsx
  • State machines covering basic operations of DataFrame, SparseNDArray, and DenseNDArray
  • Integration into pytest and the testing CI

See the README included in this PR for other important information about these new tests; a generic sketch of both test styles appears below.
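For readers new to Hypothesis, here is a minimal, hypothetical sketch of the two test styles this PR introduces: a `@given` property test and a rule-based state machine. The toy key-value store and all names below are illustrative assumptions, not the PR's actual tests:

```python
from hypothesis import given, strategies as st
from hypothesis.stateful import RuleBasedStateMachine, invariant, rule


# Property-based API test: Hypothesis generates many random inputs and
# shrinks any failing case to a minimal counterexample.
@given(st.lists(st.integers()))
def test_sorted_is_idempotent(xs):
    assert sorted(sorted(xs)) == sorted(xs)


# Rule-based state machine: Hypothesis drives random sequences of rules
# against the system under test and checks invariants after every step.
class KVStateMachine(RuleBasedStateMachine):
    def __init__(self):
        super().__init__()
        self.store = {}  # stand-in for the system under test
        self.model = {}  # reference model of expected state

    @rule(key=st.text(), value=st.integers())
    def put(self, key, value):
        self.store[key] = value
        self.model[key] = value

    @invariant()
    def store_matches_model(self):
        assert self.store == self.model


# Exposing .TestCase lets pytest collect and run the state machine.
TestKVStateMachine = KVStateMachine.TestCase
```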

codecov bot commented on Jan 15, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.31%. Comparing base (60cd908) to head (5ad9e59).

Additional details and impacted files
```diff
@@            Coverage Diff             @@
##             main    #3560      +/-   ##
==========================================
+ Coverage   86.25%   86.31%   +0.06%
==========================================
  Files          55       55
  Lines        6381     6381
==========================================
+ Hits         5504     5508       +4
+ Misses        877      873       -4
```
| Flag | Coverage Δ |
|---|---|
| python | 86.31% <ø> (+0.06%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Components | Coverage Δ |
|---|---|
| python_api | 86.31% <ø> (+0.06%) ⬆️ |
| libtiledbsoma | ∅ <ø> (∅) |

@bkmartinjr bkmartinjr marked this pull request as ready for review February 1, 2025 02:03
@ryan-williams ryan-williams mentioned this pull request Feb 4, 2025
@ryan-williams (Member) commented:
This is very cool, thanks @bkmartinjr. I've proposed #3663 on top of this, with some suggested changes, all pretty minor.

The only thing I want to flag is the added nondeterminism in CI runs (which I know is a common convention with frameworks like this). The current setup is, roughly:

  1. Run "a million" cases locally/once (to be sure the code is correct; --hypothesis-profile=expensive).
  2. Future CI jobs each run a random "thousand" cases (to balance [catching regressions] vs. [wasting CI time/compute]).

Would it be better to seed part 2, so that CI runs the same "thousand" cases every time (which are known to work)? I'm guessing the answer is "no," but wanted to mention this tradeoff and make sure we're on the same page. If [ongoing fuzzing of future, unrelated PRs] might turn up novel bugs, then we should just 10x or 100x part 1 above, to catch them now (is the argument).
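For context, Hypothesis selects per-run budgets via settings profiles, chosen at the command line with `--hypothesis-profile=<name>`. A rough sketch of what the two modes above (plus the seeded variant being discussed) could look like; the profile names and the `ci` counts are assumptions, not the PR's actual configuration:

```python
from hypothesis import settings

# Thorough one-off local run, selected with --hypothesis-profile=expensive
settings.register_profile("expensive", max_examples=1_000_000)

# Default CI run: a fresh random "thousand" cases on each job
settings.register_profile("ci", max_examples=1_000)

# The seeded alternative: derandomize=True makes Hypothesis choose the
# same examples on every run, trading novelty for reproducibility.
settings.register_profile("ci-seeded", max_examples=1_000, derandomize=True)
```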

@bkmartinjr (Member, Author) commented on Feb 4, 2025

> Would it be better to seed part 2, so that CI runs the same "thousand" cases every time (which are known to work)? I'm guessing the answer is "no," but wanted to mention this tradeoff and make sure we're on the same page. If [ongoing fuzzing of future, unrelated PRs] might turn up novel bugs, then we should just 10x or 100x part 1 above, to catch them now (is the argument).

We are on the same page: the whole point of this type of testing is to mix it up and test new inputs. Because you can't afford exhaustive testing in CI, you test a random sample each run and rely on the failure logging to tell you how to recreate any issue (which works most, though not all, of the time). I advocate keeping it this way, as over the long haul you end up running far more unique tests.

Info on how to reproduce failures is here.
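As a hedged illustration of that mechanism: when a property fails, Hypothesis's output includes a decorator you can paste onto the test to replay the exact failing case, either `@seed(...)` or `@reproduce_failure(...)` with an encoded blob. The seed value and test below are placeholders, not real failure output:

```python
from hypothesis import given, seed, strategies as st

# Pasting the seed from a CI failure's log makes the run deterministic
# (placeholder seed; the real value comes from Hypothesis's output).
@seed(123456789)
@given(st.integers())
def test_some_property(x):
    assert isinstance(x, int)
```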

The other alternative we could consider is manually running these tests on a regular basis. But history has shown that this approach tends to fail, because it puts a human in the loop.

Long term, I think the best approach is CI, coupled with self-hosted GHA runners. With this strategy, you can run an "expensive" test on a schedule (say every Sunday night), and do a much, much more exhaustive search.


late edit: this may be worth raising with the whole team, as everybody is impacted by stochastic tests

@bkmartinjr (Member, Author) commented on Feb 4, 2025

@ryan-williams - also, a commit to main in the past few days now causes these tests to fail. I'll need to debug that before landing this PR (it may be a test bug or a real bug; I won't know until I dig into it).

Update: root cause is a regression on main, filed as sc-62887.

@johnkerl (Member) left a comment

🚢

Thanks @bkmartinjr !!

ryan-williams and others added 2 commits February 4, 2025 08:37
* factor README link

* use `mode: OpenMode` instead of `mode: str`

* add missing `self` params

* OpenMode

* rm unused `SOMAArrayStateMachine._reopen`

* `s/work-arounds/workarounds/`

* add/tweak type annotations

* `get_entries`: return `set[str]`

* parameterize `Ledger` with a `LedgerEntryType`

this allows the type system to understand that e.g. the return value of a `.read()` can be a `PyDictLedgerEntry`, which can then have `to_dict()` invoked (see the sketch after this list)

* rm unused `concurrency` fixture

* rm unused imports

* avoid `st` shadowing
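To illustrate the `Ledger` typing pattern described above, here is a minimal hypothetical sketch; only the names `Ledger`, `LedgerEntryType`, `PyDictLedgerEntry`, `read`, and `to_dict` come from the commit message, and all bodies are invented for illustration:

```python
from typing import Generic, TypeVar


class LedgerEntry:
    """Base class for ledger entries (hypothetical)."""


class PyDictLedgerEntry(LedgerEntry):
    def to_dict(self) -> dict:
        return {}  # placeholder body


LedgerEntryType = TypeVar("LedgerEntryType", bound=LedgerEntry)


class Ledger(Generic[LedgerEntryType]):
    """A ledger parameterized by the concrete type of its entries."""

    def __init__(self, entries: list[LedgerEntryType]) -> None:
        self.entries = entries

    def read(self) -> LedgerEntryType:
        return self.entries[-1]


# The type checker now infers read() -> PyDictLedgerEntry here, so
# calling .to_dict() needs no cast:
ledger = Ledger([PyDictLedgerEntry()])
ledger.read().to_dict()
```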