loco test bug fix #423

AdamChit · 2019-10-16T16:45:18Z

Related issues
The test refactor made in #412 causes one of the test cases to fail.

Specifically, this test assumed that at least 1 of the 4 features would be selected: https://github.com/salesforce/TransmogrifAI/blob/master/core/src/test/scala/com/salesforce/op/stages/impl/insights/RecordInsightsLOCOTest.scala#L236

Describe the proposed solution
The test should allow the case where none of the features are picked.

Additional context
example of the test failing:

codecov · 2019-10-16T16:59:29Z

Codecov Report

Merging #423 (b4a2116) into master (ac83ad7) will decrease coverage by 60.42%.
The diff coverage is n/a.

@@             Coverage Diff             @@
##           master     #423       +/-   ##
===========================================
- Coverage   86.94%   26.52%   -60.43%     
===========================================
  Files         340      337        -3     
  Lines       11388    11131      -257     
  Branches      363      593      +230     
===========================================
- Hits         9901     2952     -6949     
- Misses       1487     8179     +6692

Impacted Files	Coverage Δ
...main/scala/com/salesforce/op/dsl/RichFeature.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...main/scala/com/salesforce/op/filters/Summary.scala	`0.00% <0.00%> (-100.00%)`	⬇️
.../scala/com/salesforce/op/cli/gen/ProblemKind.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...n/scala/com/salesforce/op/dsl/RichSetFeature.scala	`0.00% <0.00%> (-100.00%)`	⬇️
.../scala/com/salesforce/op/dsl/RichListFeature.scala	`0.00% <0.00%> (-100.00%)`	⬇️
.../scala/com/salesforce/op/stages/impl/package.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...cala/com/salesforce/op/cli/gen/FileInProject.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...scala/org/apache/spark/util/SparkThreadUtils.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...cala/com/salesforce/op/OpWorkflowModelWriter.scala	`0.00% <0.00%> (-100.00%)`	⬇️
...cala/com/salesforce/op/evaluators/Evaluators.scala	`0.00% <0.00%> (-100.00%)`	⬇️
... and 201 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update ac83ad7...b6829fe.

tovbinm

Why would it not pick any features?

leahmcguire · 2019-10-17T20:07:39Z

core/src/test/scala/com/salesforce/op/stages/impl/insights/RecordInsightsLOCOTest.scala

-      it("should pick between 1 and 4 of the features") {
-        all(parsed.map(_.size)) should (be >= 1 and be <= 4)
+      it("should pick between 0 and 4 features") {
+        all(parsed.map(_.size)) should (be >= 0 and be <= 4)


this seems like it is not checking very much... can you do a check for the conditions described above and then do the size check?
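
To illustrate the concern, here is a minimal sketch in plain Scala (no ScalaTest or Spark) of what the old and new bounds accept. The names `oldBound` and `newBound` and the sample sizes are hypothetical, not taken from the test file. Because a collection's size is never negative, the `>= 0` half of the new assertion can never fail, so only the upper bound is really being checked.

```scala
object SizeBoundSketch {
  // Old assertion, restated as a predicate: every row yields 1 to 4 insights.
  def oldBound(sizes: Seq[Int]): Boolean = sizes.forall(s => s >= 1 && s <= 4)

  // New assertion: every row yields 0 to 4 insights.
  // The lower bound is vacuous, since a size cannot be negative.
  def newBound(sizes: Seq[Int]): Boolean = sizes.forall(s => s >= 0 && s <= 4)

  def main(args: Array[String]): Unit = {
    // Hypothetical run in which no feature is picked for any row.
    val noInsightsAnywhere = Seq(0, 0, 0)
    println(oldBound(noInsightsAnywhere)) // false
    println(newBound(noInsightsAnywhere)) // true
  }
}
```

Under the new bound, a run in which no row gets any insight still passes, which is presumably why a condition check before the size check is being requested.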

AdamChit · 2019-10-19T00:39:34Z

> Why would it not pick any features?

So in the test the labels are generated based on the PickList feature:

        p.value match {
          case Some("A") | Some("B") | Some("C") => RealNN(1.0)
          case _ => RealNN(0.0)
        }
      )
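
A standalone sketch of the same labeling rule may make the point easier to see. It uses `Option[String]` as a hypothetical stand-in for the `PickList` value and a plain `Double` in place of `RealNN`; it is not the test's actual code.

```scala
object LabelRuleSketch {
  // Hypothetical stand-in: Some(text) for a populated pickList, None for an empty one.
  def label(pickList: Option[String]): Double = pickList match {
    case Some("A") | Some("B") | Some("C") => 1.0
    case _ => 0.0 // covers both empty values and values outside {A, B, C}
  }

  def main(args: Array[String]): Unit = {
    val samples: Seq[Option[String]] = Seq(Some("A"), Some("D"), None)
    samples.foreach(v => println(s"$v -> ${label(v)}"))
  }
}
```

An empty value and a value such as "D" both map to 0.0, so swapping one for the other leaves the label unchanged.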

And all the features have a chance of being empty:

      val countryData: Seq[Country] = RandomText.countries.withProbabilityOfEmpty(0.3).take(numRows).toList
      val pickListData: Seq[PickList] = RandomText.pickLists(domain = List("A", "B", "C", "D", "E", "F", "G"))
        .withProbabilityOfEmpty(0.1).limit(numRows)
      val currencyData: Seq[Currency] = RandomReal.logNormal[Currency](mean = 10.0, sigma = 1.0)
        .withProbabilityOfEmpty(0.3).limit(numRows)

One case that could happen:
1) All the features are set to empty
2) LOCO changes the pickList from empty to non-empty, and because the new value is not in the set {A, B, C} it is NOT a strong predictor
=> label wouldn't change => pickList would not have a strong insight => no feature is selected

Note: the chance of this happening is very low, which explains why it doesn't always appear in the CI build.
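
For a rough sense of scale, the emptiness probabilities quoted above can be multiplied out. This is a back-of-the-envelope sketch that assumes the three quoted generators are independent. The fourth feature's generator is not shown in this thread, so it is left out, and the result is the chance for a single row, not the chance of the test failing.

```scala
object EmptyRowProbability {
  // Probabilities of emptiness copied from the generators quoted in this comment.
  val pEmptyCountry: Double = 0.3
  val pEmptyPickList: Double = 0.1
  val pEmptyCurrency: Double = 0.3

  // Chance that all three quoted features are empty in one row (independence assumed).
  val pAllEmpty: Double = pEmptyCountry * pEmptyPickList * pEmptyCurrency

  def main(args: Array[String]): Unit =
    println(f"per-row probability of all three being empty: $pAllEmpty%.4f")
}
```

That works out to roughly 0.9% per row for these three columns. An all-empty row is only one ingredient of the scenario described here, so this number should not be read as a failure rate for the test.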

tovbinm · 2019-10-19T17:19:31Z

@AdamChit are you still working on adding a more robust test?

tovbinm · 2020-01-09T18:54:59Z

any updates? @AdamChit

AdamChit and others added 11 commits September 27, 2019 16:11

Changes to FunSpec and refactored test

a17c13c

refactored test to make it more readable and changed to Funspec

b0cdfae

Merge branch 'master' into LocoTestRefactor

ebedf1d

Changes to FunSpec and refactored test

96ec077

refactored test to make it more readable and changed to Funspec

b3ca610

Merge branch 'LocoTestRefactor' of https://github.com/AdamChit/TransmogrifAI into LocoTestRefactor merge master

fb28e65

revert formatting to follow scala style

3095e19

more descriptive title for section

973aa0d

threshold was too large and would fail on some runs of the test

e31bd22

fork sync

acd97a6

none of the features could be selected

236bb7c

AdamChit requested review from gerashegalov, Jauntbox, leahmcguire, tovbinm and wsuchy as code owners October 16, 2019 16:45

salesforce-cla bot added the cla:signed label Oct 16, 2019

AdamChit requested a review from crupley October 16, 2019 16:46

tovbinm reviewed Oct 16, 2019

leahmcguire reviewed Oct 17, 2019

AdamChit changed the title ~~Achit/loco test bug fix~~ loco test bug fix Oct 19, 2019

Merge branch 'master' into achit/LOCO-Test-Bug-Fix

3a9369f

AdamChit added the work in progress label Oct 22, 2019

tovbinm added 2 commits November 14, 2019 11:30

Merge branch 'master' into achit/LOCO-Test-Bug-Fix

92c1e43

Merge branch 'master' into achit/LOCO-Test-Bug-Fix

b4a2116

Merge branch 'master' into achit/LOCO-Test-Bug-Fix

b6829fe

nicodv mentioned this pull request Jan 13, 2021

Add multiclassification topk and confmatrix metrics to model insights serialization format #537

Merged
