Bug fixes across Lee and swap to survival function for #243 #245

ljwolf · 2023-05-11T16:29:03Z

This addresses a few bugs filed on the Lee statistic, as well as addressing the precision concerns raised in #243 with using 1-stats.norm.cdf().

ljwolf · 2023-05-11T16:29:50Z

Once merged, it would be good to make a bug fix release addressing this, given the lee standardisation issue can have an impact for odd weights specifications.

codecov · 2023-05-11T16:38:29Z

Codecov Report

Merging #245 (fe407b2) into main (165e139) will increase coverage by 1.5%.
The diff coverage is 72.7%.

@@           Coverage Diff           @@
##            main    #245     +/-   ##
=======================================
+ Coverage   71.5%   73.0%   +1.5%     
=======================================
  Files         24      24             
  Lines       3246    3246             
  Branches     519     519             
=======================================
+ Hits        2320    2369     +49     
+ Misses       763     709     -54     
- Partials     163     168      +5

Impacted Files	Coverage Δ
esda/lee.py	`72.8% <ø> (+53.3%)`	⬆️
esda/geary.py	`92.5% <33.3%> (ø)`
esda/moran.py	`74.5% <83.3%> (ø)`
esda/getisord.py	`66.4% <100.0%> (ø)`

ljwolf · 2023-05-24T17:13:39Z

test failures arise in #244, so they are unrelated to these changes.

ci/310-DEV.yaml

ljwolf · 2023-05-24T19:17:39Z

So the testing failures in the join counts is, I think, due to the changes upstream in libpysal having to do with adjacency lists.

ljwolf · 2023-05-24T19:30:33Z

OK, my understanding is the following.

With pysal/libpysal@main, the following is broken:

from libpysal.weights.util import lat2W

w = lat2W(3,3)
w.neighbors # this is correct
{0: [3, 1],
 3: [0, 6, 4],
 1: [0, 4, 2],
 4: [1, 3, 7, 5],
 2: [1, 5],
 5: [2, 4, 8],
 6: [3, 7],
 7: [4, 6, 8],
 8: [5, 7]}
w.to_adjlist().head() # this is not
   focal  neighbor  weight
0      0         3     1.0
1      0         4     1.0
2      3         0     1.0
3      3         1     1.0
4      3         2     1.0

The issue arises because, at line 440 of weights.py, we use self.neigbors.keys(). Since ids are sorted by default and dicts now retain their insertion order, the two are not the same:

w.id_order
[0, 1, 2, 3, 4, 5, 6, 7, 8]
w.neighbors.keys()
dict_keys([0, 3, 1, 4, 2, 5, 6, 7, 8])

The ordering that is needed is w.id_order, which matches the order in the sparse array we use for the edge tuples. With this libpysal fix, the test failures disappear.

martinfleis

Code-wise looks good. If someone close to the actual stats can have a look as well, it'd be good.

sjsrey · 2023-05-26T21:34:27Z

Looks good!

ljwolf added bug enhancement labels May 11, 2023

ljwolf force-pushed the lee_bugs branch from 0735d09 to 7139ac2 Compare May 24, 2023 17:19

martinfleis reviewed May 24, 2023

View reviewed changes

ci/310-DEV.yaml Outdated Show resolved Hide resolved

ljwolf mentioned this pull request May 24, 2023

fix indexing issue with adjlist construction pysal/libpysal#528

Merged

ljwolf force-pushed the lee_bugs branch from f9e88df to 4c991f1 Compare May 24, 2023 19:47

ljwolf added 7 commits May 24, 2023 14:47

resolve esda#207 by keeping summation dimensions

201595d

lag before subtract in lee statistic (pysal#213)

cf86698

swap from 1-cdf to survival function (pysal#243)

5f30ae7

update testing matrix

b202e7d

pip requires a colon to trigger

b5bb9ca

add 311 numba and dev

c5c8c15

add rtree to testing matrix

385fb6e

ljwolf force-pushed the lee_bugs branch from 3e06bb2 to 385fb6e Compare May 24, 2023 21:49

This was linked to issues May 24, 2023

Question about reference distribution calculation in lee.py #213

Closed

More float precision on p values #243

Closed

Potential wrong row-standardization in lee.py #207

Closed

ljwolf added 2 commits May 24, 2023 15:04

add test_lee.py from pysal#89

81805a9

use reshape instead of keepdims, which does not exist in matrix

fe407b2

ljwolf requested review from sjsrey, weikang9009 and jeffcsauer May 24, 2023 22:31

martinfleis approved these changes May 24, 2023

View reviewed changes

sjsrey approved these changes May 26, 2023

View reviewed changes

sjsrey merged commit 7f3b9cf into pysal:main May 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fixes across Lee and swap to survival function for #243 #245

Bug fixes across Lee and swap to survival function for #243 #245

ljwolf commented May 11, 2023

ljwolf commented May 11, 2023

codecov bot commented May 11, 2023 •

edited

Loading

ljwolf commented May 24, 2023 •

edited

Loading

ljwolf commented May 24, 2023

ljwolf commented May 24, 2023

martinfleis left a comment

sjsrey commented May 26, 2023

Bug fixes across Lee and swap to survival function for #243 #245

Bug fixes across Lee and swap to survival function for #243 #245

Conversation

ljwolf commented May 11, 2023

ljwolf commented May 11, 2023

codecov bot commented May 11, 2023 • edited Loading

Codecov Report

ljwolf commented May 24, 2023 • edited Loading

ljwolf commented May 24, 2023

ljwolf commented May 24, 2023

martinfleis left a comment

Choose a reason for hiding this comment

sjsrey commented May 26, 2023

codecov bot commented May 11, 2023 •

edited

Loading

ljwolf commented May 24, 2023 •

edited

Loading