BUG: correct the test loop in test_arpack.eval_evec #14798

dimpase · 2021-10-03T10:02:51Z

In scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py, eval_evec
one tests for correctness eigenpairs, in a loop,
which is meant to allow for an error (the
k-tuple of eigenpairs found differs from what's asked by which
argument) in Krylov subspaces method. Cf. comments there:

# on rare occasions, ARPACK routines return results that are proper
# eigenvalues and -vectors, but not necessarily the ones requested in
# the parameter which. This is inherent to the Krylov methods, and
# should not be treated as a failure. If such a rare situation
# occurs, the calculation is tried again (but at most a few times).

One should 1st check whether one got correct eigenvalues, and try again
if they are not the wanted ones.
The current implementation does it backwards: it checks whether
the eigenpair works, without 1st checking for eigenvalues.

This commit corrects this programming error.

Reference issue

Requested by @rgommers in review of gh-14786

Additional information

#14786 (comment)

dimpase · 2021-10-03T11:18:23Z

ok, a test in GitHub Actions fails in the changed function on Linux Python 3.8:

_____________________________ test_hermitian_modes _____________________________
[gw1] linux -- Python 3.8.0 /usr/bin/python3.8-dbg
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:421: in test_hermitian_modes
    eval_evec(symmetric, D, typ, k, which,
        D          = <gen-hermitian-Mc>
        k          = 2
        mattype    = <built-in function asarray>
        params     = <scipy.sparse.linalg.eigen.arpack.tests.test_arpack.SymmetricParams object at 0x7fd8a0dcf500>
        sigma      = None
        symmetric  = True
        typ        = 'F'
        which      = 'LA'
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:271: in eval_evec
    assert_allclose(LHS, RHS, rtol=rtol, atol=atol, err_msg=err)
E   AssertionError: 
E   Not equal to tolerance rtol=0.000357628, atol=0.000357628
E   error for eigsh:general, typ=F, which=LA, sigma=None, mattype=asarray, OPpart=None, mode=normal
E   Mismatched elements: 1 / 12 (8.33%)
E   Max absolute difference: 0.00344393
E   Max relative difference: 0.00106749
...

and a different failure on Nightly CPython

_____________________________ test_hermitian_modes _____________________________
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:421: in test_hermitian_modes
    eval_evec(symmetric, D, typ, k, which,
        D          = <gen-hermitian-Mc>
        k          = 2
        mattype    = <class 'scipy.sparse.csr.csr_matrix'>
        params     = <scipy.sparse.linalg.eigen.arpack.tests.test_arpack.SymmetricParams object at 0x7f0e211f24a0>
        sigma      = None
        symmetric  = True
        typ        = 'D'
        which      = 'LM'
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:271: in eval_evec
    assert_allclose(LHS, RHS, rtol=rtol, atol=atol, err_msg=err)
E   AssertionError: 
E   Not equal to tolerance rtol=4.44089e-13, atol=4.44089e-13
E   error for eigsh:general, typ=D, which=LM, sigma=None, mattype=csr_matrix, OPpart=None, mode=normal
E   Mismatched elements: 1 / 12 (8.33%)
E   Max absolute difference: 3.6483776e-12
E   Max relative difference: 9.24609212e-13
...

and on macOS - a similar to Nightly Python failure (same test data):

_____________________________ test_hermitian_modes _____________________________
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:421: in test_hermitian_modes
    eval_evec(symmetric, D, typ, k, which,
        D          = <gen-hermitian-Mc>
        k          = 2
        mattype    = <class 'scipy.sparse.csr.csr_matrix'>
        params     = <scipy.sparse.linalg.eigen.arpack.tests.test_arpack.SymmetricParams object at 0x131cf32e0>
        sigma      = None
        symmetric  = True
        typ        = 'D'
        which      = 'LM'
scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py:271: in eval_evec
    assert_allclose(LHS, RHS, rtol=rtol, atol=atol, err_msg=err)
E   AssertionError: 
E   Not equal to tolerance rtol=4.44089e-13, atol=4.44089e-13
E   error for eigsh:general, typ=D, which=LM, sigma=None, mattype=csr_matrix, OPpart=None, mode=normal
E   Mismatched elements: 1 / 12 (8.33%)
E   Max absolute difference: 4.38825759e-12
E   Max relative difference: 1.14883516e-12
...

rgommers · 2021-10-03T15:31:47Z

The new failures are not from reversing the checks right, but from this piece of code have the wrong indentation for assert_allclose(...) so it's not run at all for general is True?

        LHS = np.dot(a, evec)
        if general:
            RHS = eigenvalues * np.dot(b, evec)
        else:
            RHS = eigenvalues * evec

            assert_allclose(LHS, RHS, rtol=rtol, atol=atol, err_msg=err)

rgommers · 2021-10-03T15:34:23Z

Reversing the checks makes sense to me, and the last two failures seem minor. The first one is a large change, increasing atol by an order of magnitude (0.000357 to 0.00344). Maybe that is what it is, but it would be good to bump the tolerance only for the subset of test parameters where that is really necessary.

dimpase · 2021-10-03T17:25:32Z

On Sun, Oct 3, 2021 at 4:31 PM Ralf Gommers ***@***.***> wrote: The new failures are not from reversing the checks right, but from this piece of code have the wrong indentation for assert_allclose(...) so it's not run at all for general is True?

Good catch, thanks! I was puzzled by these tests passing at all. Needless to say, I corrected the indentation, without noticing it was wrong before.

…

LHS = np.dot(a, evec) if general: RHS = eigenvalues * np.dot(b, evec) else: RHS = eigenvalues * evec assert_allclose(LHS, RHS, rtol=rtol, atol=atol, err_msg=err) — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#14798 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAJXYHATGTWMYVIDY3Y4VYLUFBZO3ANCNFSM5FHNQDAA> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

dimpase · 2021-10-03T19:22:31Z

Reversing the checks makes sense to me, and the last two failures seem minor. The first one is a large change, increasing atol by an order of magnitude (0.000357 to 0.00344).

It's a bit suspect to me that it depends on the Python version. 3.8 is getting old, perhaps time to move on 3.9?
(or perhaps it's cause it's a "dbg" build, whatever it is)

rgommers · 2021-10-04T10:39:58Z

It's a bit suspect to me that it depends on the Python version. 3.8 is getting old, perhaps time to move on 3.9?
(or perhaps it's cause it's a "dbg" build, whatever it is)

We can't drop 3.8, we're only just dropping 3.7 now - 3.8 will be around for another ~1.5 years. This is also not likely to be dependent on Python version I'd say.

A dbg build is a debug build of Python, so debug symbols are included. Such builds are typically also shipped in Linux distros as separate packages (typically named python-dbg).

dimpase · 2021-10-06T10:20:42Z

Is there documentation on how to fix a particular test?
We have to fix 2 instances, which differ from one another in 3 places, marked by * below:

    eval_evec(symmetric, D, typ, k, which,
        D          = <gen-hermitian-Mc>
        k          = 2
*       mattype    = <built-in function asarray>
        params     = <scipy.sparse.linalg.eigen.arpack.tests.test_arpack.SymmetricParams object at 0x7fd8a0dcf500>
        sigma      = None
        symmetric  = True
 *      typ        = 'F'
 *      which      = 'LA'

and

    eval_evec(symmetric, D, typ, k, which,
        D          = <gen-hermitian-Mc>
        k          = 2
*       mattype    = <class 'scipy.sparse.csr.csr_matrix'>
        params     = <scipy.sparse.linalg.eigen.arpack.tests.test_arpack.SymmetricParams object at 0x131cf32e0>
        sigma      = None
        symmetric  = True
 *      typ        = 'D'
 *      which      = 'LM'

rgommers · 2021-10-06T10:45:37Z

There aren't any docs, this is too detailed to have docs. I'd say bump the test tolerance unconditionally to make the 2 tests with a small mismatch pass, and then special-case the last test based on the parameters you point out above.

dimpase · 2021-10-07T09:07:45Z

I actually find $\ell_\infty$ norm used by assert_allclose not very suitable here, it seems to me that $\ell_n$ norm, with n=1 or 2, is more suitable for checking vectors for near equality in the context of numeric linear algebra.

rgommers · 2021-10-07T09:19:03Z

I actually find $\ell_\infty$ norm used by assert_allclose not very suitable here, it seems to me that $\ell_n$ norm, with n=1 or 2, is more suitable for checking vectors for near equality in the context of numeric linear algebra.

Agreed. I don't think that has been discussed before, at least not recently. Worth a separate follow-up.

dimpase · 2021-10-08T11:38:17Z

I am running into some sort of Heisenbug here. Namely, the 1st error in #14798 (comment) consistently reproduces on a Linux system, by running

python3 runtests.py -j4 -v -t scipy.sparse.linalg.eigen.arpack.tests.test_arpack

and get one failing test.

Then I apply the following change, special-casing this error, and changing tolerances for it:

--- a/scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py
+++ b/scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py
@@ -28,7 +28,7 @@ from scipy._lib._gcutils import assert_deallocated, IS_PYPY
 _ndigits = {'f': 3, 'd': 11, 'F': 3, 'D': 11}
 
 
-def _get_test_tolerance(type_char, mattype=None):
+def _get_test_tolerance(type_char, mattype=None, D_type=None, which=None):
     """
     Return tolerance values suitable for a given test:
 
@@ -67,6 +67,14 @@ def _get_test_tolerance(type_char, mattype=None):
         # sparse in single precision: worse errors
         rtol *= 5
 
+
+    if mattype is np.asarray and type_char == 'F' and which == 'LA' \
+          and D_type.name == "gen-hermitian-Mc":
+        # missing case from PR 14798
+        tol = 30 * np.finfo(np.float32).eps
+        rtol *= 5
+
+
     return tol, rtol, atol
 
 
@@ -223,8 +231,7 @@ def eval_evec(symmetric, d, typ, k, which, v0=None, sigma=None,
         kwargs['OPpart'] = OPpart
 
     # compute suitable tolerances
-    kwargs['tol'], rtol, atol = _get_test_tolerance(typ, mattype)
-
+    kwargs['tol'], rtol, atol = _get_test_tolerance(typ, mattype, d, which)
     # on rare occasions, ARPACK routines return results that are proper
     # eigenvalues and -vectors, but not necessarily the ones requested in
     # the parameter which. This is inherent to the Krylov methods, and

With it, on the same system I consistently reproduce 2nd error from #14798 (comment), instead of the 1st error.

This feels like something something floating point interrupts touched by NumPy...

dimpase · 2021-10-08T12:25:37Z

OK, I've bumped whatever was needed

dimpase · 2021-10-08T15:29:57Z

I think I've fixed everything I could (some test pipelines crash for unrelated reasons).

rgommers · 2021-10-08T19:55:08Z

@dimpase the change to the PROPACK submodule was a mistake I assume?

dimpase · 2021-10-08T20:00:23Z

the change to the PROPACK submodule was a mistake I assume?

indeed, sorry, I seldom work with submodules - and this change does not show up in "normal" git.
Will be fixed.

dimpase · 2021-10-08T20:12:28Z

somehow scipy/PROPACK#1 got mixed in.

dimpase · 2021-10-08T20:21:55Z

it's a pure evil - the submodule update got mixed into d6f6eec - nothing and nobody told me this happened...

rgommers · 2021-10-08T20:58:21Z

it's a pure evil - the submodule update got mixed into d6f6eec - nothing and nobody told me this happened...

Yes this happens often unfortunately, if you change branches and then forget to re-run git submodule update.

charris · 2021-10-09T01:10:16Z

it's a pure evil

git status is your boon companion during this quest :) Also, don't write commit messages in line, but rather in the editor where you can check what is being committed.

dimpase · 2021-10-09T07:37:06Z

macOS tests fail while building quadpack extension, something unrelated:

/usr/local/bin/gfortran -Wall -g -Wall -g -undefined dynamic_lookup -bundle build/temp.macosx-10.14-x86_64-3.9/scipy/integrate/_quadpackmodule.o -L/usr/local/lib -L/usr/local/gfortran/lib/gcc/x86_64-apple-darwin13/4.9.0 -L/usr/local/gfortran/lib/gcc/x86_64-apple-darwin13/4.9.0/../../.. -L/usr/local/gfortran/lib/gcc/x86_64-apple-darwin13/4.9.0/../../.. -L/Users/runner/hostedtoolcache/Python/3.9.7/x64/lib -Lbuild/temp.macosx-10.14-x86_64-3.9 -Wl,-rpath,/usr/local/lib -lquadpack -lmach -lopenblas -lopenblas -lgfortran -o build/lib.macosx-10.14-x86_64-3.9/scipy/integrate/_quadpack.cpython-39-darwin.so
ld: library not found for -lSystem
collect2: error: ld returned 1 exit status

The rest passes.

dimpase · 2021-10-09T07:42:41Z

git status is your boon companion during this quest :)

I tend to rely on git diff, which ignores submodules :-(

scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py

In scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py, eval_evec one tests for correctness eigenpairs, in a loop, which is meant to allow for an error (the `k`-tuple of eigenpairs found differs from what's asked by `which` argument) in Krylov subspaces method. Cf. comments there: # on rare occasions, ARPACK routines return results that are proper # eigenvalues and -vectors, but not necessarily the ones requested in # the parameter which. This is inherent to the Krylov methods, and # should not be treated as a failure. If such a rare situation # occurs, the calculation is tried again (but at most a few times). One should 1st check whether one got correct eigenvalues, and try again if they are not the wanted ones. The current implementation does it backwards: it checks whether the eigenpair works, without 1st checking for eigenvalues. This commit corrects this programming error.

dimpase · 2021-10-10T20:19:10Z

the failing check is due to the rebase over the current master, unrelated to the PR.

rgommers

Okay, this should be good to go now - let's give it a try. Thanks @dimpase!

dimpase · 2021-10-12T16:35:51Z

Agreed. I don't think that has been discussed before, at least not recently. Worth a separate follow-up.

implemented in #14846

github-actions bot added the scipy.sparse.linalg label Oct 3, 2021

dimpase changed the title ~~correct the test loop in test_arpack.eval_evec~~ BUG: correct the test loop in test_arpack.eval_evec Oct 3, 2021

dimpase mentioned this pull request Oct 3, 2021

ENH: update to arpack-ng 3.8.0 release #14786

Closed

rgommers added the maintenance Items related to regular maintenance tasks label Oct 3, 2021

This was referenced Oct 4, 2021

CI, MAINT: pin Cython for azure pre-rel #14801

Merged

CI: Azure Main coverage job failure #14802

Closed

dimpase force-pushed the test_arpack_fix branch from f100ce5 to e33ed07 Compare October 8, 2021 12:24

dimpase mentioned this pull request Oct 8, 2021

BUG: vectors/matrices produced by eigenvalues/vectors routines should not be tested using ell_infinity norm #14823

Closed

dimpase mentioned this pull request Oct 9, 2021

macOS CI failing with ld: library not found for -lSystem #14829

Closed

rgommers reviewed Oct 10, 2021

View reviewed changes

scipy/sparse/linalg/eigen/arpack/tests/test_arpack.py Outdated Show resolved Hide resolved

dimpase added 9 commits October 10, 2021 18:58

fix 1st error

402b8e7

more "gen-hermitian-Mc" test toleranced bumped

ed3b492

also need type 'F' bump for Hermitean on Python 3.8-dbg

6d2d761

make a linter happy

8497eaf

make more linters happy

8b33b8b

fix the branch of PROPACK

9f4ffb1

linters can't decide on E129 vs E127/E128

9d53abc

more care to "gen-hermitian-Mc" test tol bumps

998400e

dimpase force-pushed the test_arpack_fix branch from bb7423d to 998400e Compare October 10, 2021 19:20

rgommers approved these changes Oct 10, 2021

View reviewed changes

rgommers merged commit cf3b30c into scipy:master Oct 10, 2021

rgommers added this to the 1.8.0 milestone Oct 10, 2021

dimpase deleted the test_arpack_fix branch October 12, 2021 17:12

dimpase mentioned this pull request Oct 15, 2021

BUG: correctly check error of ARPACK eigenpairs #14846

Closed

Uh oh!

BUG: correct the test loop in test_arpack.eval_evec #14798

BUG: correct the test loop in test_arpack.eval_evec #14798

Uh oh!

Conversation

dimpase commented Oct 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference issue

Additional information

Uh oh!

dimpase commented Oct 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rgommers commented Oct 3, 2021

Uh oh!

rgommers commented Oct 3, 2021

Uh oh!

dimpase commented Oct 3, 2021 via email

Uh oh!

dimpase commented Oct 3, 2021

Uh oh!

rgommers commented Oct 4, 2021

Uh oh!

dimpase commented Oct 6, 2021

Uh oh!

rgommers commented Oct 6, 2021

Uh oh!

dimpase commented Oct 7, 2021

Uh oh!

rgommers commented Oct 7, 2021

Uh oh!

dimpase commented Oct 8, 2021

Uh oh!

dimpase commented Oct 8, 2021

Uh oh!

dimpase commented Oct 8, 2021

Uh oh!

rgommers commented Oct 8, 2021

Uh oh!

dimpase commented Oct 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dimpase commented Oct 8, 2021

Uh oh!

dimpase commented Oct 8, 2021

Uh oh!

rgommers commented Oct 8, 2021

Uh oh!

charris commented Oct 9, 2021

Uh oh!

dimpase commented Oct 9, 2021

Uh oh!

dimpase commented Oct 9, 2021

Uh oh!

Uh oh!

dimpase commented Oct 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rgommers left a comment

Choose a reason for hiding this comment

Uh oh!

dimpase commented Oct 12, 2021

Uh oh!

Uh oh!

dimpase commented Oct 3, 2021 •

edited

Loading

dimpase commented Oct 3, 2021 •

edited

Loading

dimpase commented Oct 8, 2021 •

edited

Loading

dimpase commented Oct 10, 2021 •

edited

Loading