New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: error from hybrid search's bm25 search by field #1692
base: main
Are you sure you want to change the base?
Conversation
@JoanFM @AnneYang720 Commit 3a6a293 is causing the the following tests in the test_subindex.py file to fail:
is the implementation of the subindex in weaviate somehow dependent on the ID field being a 'string' datatype? |
Maybe implicitly there is. How does it fail? |
@hsm207 check this comment: TODO: add more types and figure out how to handle text vs string type on the top of the file, it may be related to this? |
it's always related to the length of the documents returned. E.g. in:
the value of
before it was:
Does
Can you confirm that nested documents don't somehow override |
Maybe @JohannesMessner knows better |
Check this: def _filter_by_parent_id(self, id: str) -> Optional[List[str]]:
results = (
self._client.query.get(self._db_config.index_name, ['docarrayid'])
.with_where(
{'path': ['parent_id'], 'operator': 'Equal', 'valueString': f'{id}'}
)
.do()
)
ids = [
res['docarrayid']
for res in results['data']['Get'][self._db_config.index_name]
]
return ids I guess the fact that the ID column is not a |
If there are nested Documents it performs the same |
Codecov ReportPatch coverage:
Additional details and impacted files@@ Coverage Diff @@
## main #1692 +/- ##
==========================================
- Coverage 83.74% 81.37% -2.37%
==========================================
Files 134 134
Lines 8845 8848 +3
==========================================
- Hits 7407 7200 -207
- Misses 1438 1648 +210
Flags with carried forward coverage won't be shown. Click here to find out more.
☔ View full report in Codecov by Sentry. |
I do not know, @JohannesMessner may be better to judge. However, I do not like at all that test folder, since each test cannot be run individually, as they all depend on the same fixture. I did a refactoring of that test for HNSWLib, I will see if I can do it for this one |
Hey @hsm207, what is the ststus of this issue? |
hey @JoanFM I haven't been able to delve into the nested document feature yet due to current commitments on other projects. I've moved this to my backlog since I don't see an opportunity to work on this until at least mid next month. |
fixes #1612