Catch sqlite exceptions when filter entities #6601

grzesiek2010 · 2025-02-11T23:57:27Z

Closes #6600
Closes #6597

Why is this the best possible solution? Were any other approaches considered?

For: #6600
When a filter applied to entities contains an expression referencing another question, we cannot build an optimized SQLite query. Instead, we need to fall back to Javarosa, which has all the necessary knowledge to evaluate such an expression. The optimized way of filtering can be used only if it uses entity properties available in the entity database.
For: #6597
At the moment of building an SQLite query to optimize filtering, both #6600 and #6597 appear to be the same issue (a missing column). However, when we fall back to Javarosa, #6600 is evaluated with the referenced question, while #6597 is still treated as a missing property.

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

For: #6600
We need to test filtering with expressions that reference other questions in a form, not just expressions that use entity properties. The form attached to the issue is an example of it.
For: #6597
We need to test filtering with expressions that simply use non-existing properties.

Do we need any specific form for testing your changes? If so, please attach one.

The form is attached to the issue.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

No.

Before submitting this PR, please make sure you have:

added or modified tests for any new or changed behavior
run ./gradlew connectedAndroidTest (or ./gradlew testLab) and confirmed all checks still pass
added a comment above any new strings describing it for translators
added any new strings with date formatting to DateFormatsTest
verified that any code or assets from external sources are properly credited in comments and/or in the about file.
verified that any new UI elements use theme colors. UI Components Style guidelines

seadowg

Would it be possible to solve this at the DatabaseEntitiesRepository level instead by adding tests for these cases to/modifying #query?

lognaturel · 2025-02-14T00:27:37Z

...es/src/test/java/org/odk/collect/entities/javarosa/filter/LocalEntitiesFilterStrategyTest.kt

+
+        val choices = scenario.choicesOf("/data/question").map { it.value }
+        assertThat(choices.isEmpty(), equalTo(true))
+        assertThat(fallthroughFilterStrategy.fellThrough, equalTo(true))


works correctly in the optimized way -- what makes it optimized? This looks like it fell through which I thought would be non-optimized.

This test looks identical to https://github.com/getodk/collect/pull/6601/files#diff-4a25b4c0ba0780186c2edabe26d8518d4e3e6a022a482669373e2a71ceac6dfbR585 to me

In this test not_existing_property is the reference in the secondary instance and 'value' is a static value.

In the test above, undefined is the reference in the secondary instance and /data/ref_question is a static value that came from the form.

It's confusing because the order is different and the intended meaning is different! But I believe that from the system's standpoint they should be in the same category. If there's code somewhere that treats them as different we should dig into it, I think.

works correctly in the optimized way -- what makes it optimized? This looks like it fell through which I thought would be non-optimized.

I made a mistake in the title it's not optimized of course. I will fix that.

If you set not_existing_property='value', it will always return an empty list of entities (because not_existing_property does not exist).
However, if you use undefined=/data/ref_question or 'value'=/data/ref_question (because we focus too much on the undefined case), the list won't be empty - as long as the expected value matches the one entered in the question.
Wouldn't it be enough to treat these two cases as distinct and have separate tests for each?

If you set not_existing_property='value', it will always return an empty list of entities (because not_existing_property does not exist).

And with not_existing_property='', you should always get all Entities. That's the same as the undefined=/data/ref_question case when /data/ref_question has a blank value.

I sort of see the distinction that you're making, thanks for explaining it again, that was helpful. It doesn't hurt to have a few extra tests and I don't think there's a fundamental misunderstanding here. 👍

'value'=/data/ref_question

This one does definitely feel different from the other two to me. This should be identified as not an expression that can use the optimized filter strategy and bypass before it even gets there (maybe good to double check this?). The other two should be identified as candidates for the filter strategy and then be kicked back to the next one once the column is determined to be unknown.

grzesiek2010 · 2025-02-14T01:46:42Z

Would it be possible to solve this at the DatabaseEntitiesRepository level instead by adding tests for these cases to/modifying #query?

We have to catch queries that contain non-existent columns and fall back to Javarosa so no there is nothing we can do just in the DatabaseEntitiesRepository

lognaturel · 2025-02-14T05:28:23Z

there is nothing we can do just in the DatabaseEntitiesRepository

An alternative would be to "look before you leap" and fall back if there are any unknown properties in the expression, I think. I believe that's what @seadowg said the code did before.

lognaturel · 2025-02-14T06:23:18Z

I think one step even further would be to detect unknown columns in subexpressions, treat those as ‘’, and then evaluate the sub expression. That should always be possible for the shape of subexpression that makes it to the entities filter strategy.

For example, /data/ref=undefined would be detected as such an expression, it would be evaluated as /data/ref=‘’ and then the result (true or false) would be used in the SQL. That would avoid the need to fall back to another filter strategy. Not a must but hopefully the example further illustrates why I think of all the cases with unknown properties as being in the same category.

grzesiek2010 added 2 commits February 12, 2025 00:55

Added failing tests

1810c8d

Catch sqlite exceptions when filter entities

565f8c2

grzesiek2010 marked this pull request as ready for review February 13, 2025 05:02

grzesiek2010 added the high priority Should be looked at before other PRs/issues label Feb 13, 2025

seadowg self-requested a review February 13, 2025 11:18

seadowg requested changes Feb 13, 2025

View reviewed changes

lognaturel reviewed Feb 14, 2025

View reviewed changes

Added a test to cover non existing properties

ca26c40

grzesiek2010 force-pushed the COLLECT-6600 branch from 1f8153a to ca26c40 Compare February 14, 2025 01:47

seadowg mentioned this pull request Feb 14, 2025

Crash when filtering entities using an expression that references another question with an undefined/null value #6600

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Catch sqlite exceptions when filter entities #6601

Catch sqlite exceptions when filter entities #6601

grzesiek2010 commented Feb 11, 2025 •

edited

Loading

seadowg left a comment

lognaturel Feb 14, 2025

grzesiek2010 Feb 14, 2025 •

edited

Loading

lognaturel Feb 14, 2025 •

edited

Loading

grzesiek2010 commented Feb 14, 2025

lognaturel commented Feb 14, 2025

lognaturel commented Feb 14, 2025 •

edited

Loading

Catch sqlite exceptions when filter entities #6601

Are you sure you want to change the base?

Catch sqlite exceptions when filter entities #6601

Conversation

grzesiek2010 commented Feb 11, 2025 • edited Loading

Why is this the best possible solution? Were any other approaches considered?

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

Do we need any specific form for testing your changes? If so, please attach one.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

Before submitting this PR, please make sure you have:

seadowg left a comment

Choose a reason for hiding this comment

lognaturel Feb 14, 2025

Choose a reason for hiding this comment

grzesiek2010 Feb 14, 2025 • edited Loading

Choose a reason for hiding this comment

lognaturel Feb 14, 2025 • edited Loading

Choose a reason for hiding this comment

grzesiek2010 commented Feb 14, 2025

lognaturel commented Feb 14, 2025

lognaturel commented Feb 14, 2025 • edited Loading

grzesiek2010 commented Feb 11, 2025 •

edited

Loading

grzesiek2010 Feb 14, 2025 •

edited

Loading

lognaturel Feb 14, 2025 •

edited

Loading

lognaturel commented Feb 14, 2025 •

edited

Loading