SOLR tuning #37

Open
5 of 6 tasks
nickdos opened this issue Oct 18, 2022 · 2 comments
Labels
back-end SOLR, ES, Node

Comments

@nickdos
Collaborator

nickdos commented Oct 18, 2022

  • Add weighting to scientificName and vernacularName to boost matching results.
  • Accession number searches do not return the exact match as the top hit. E.g. GCF_002099425.1 gives 1,833,302 results (the query is split into 3 terms, which are then ORed together), but adding quotes returns 1 result. Tune the query parsing to handle this better (not splitting on _ might help).
  • Grid view (taxon grouping): boost taxa with more DR coverage (i.e. a taxon that has 3 DRs ranks higher than a taxon with 3 sequences from 1 DR).
  • Boost whole genome data over other types (like BOLD, SNP, eDNA), and complete over partial in the assembly level field.
  • Boost docs with "taxa found in Australia" (not in index yet).
  • Boost native species (not in index yet).
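
One possible client-side workaround for the accession number item, until the query parsing itself is tuned, is to quote accession-like tokens before the query is sent to SOLR. The sketch below is only an illustration: the regular expression is an assumed pattern based on the single example GCF_002099425.1, and quote_accessions is a hypothetical helper, not something in the current codebase.

```python
import re

# Assumed pattern for accessions such as GCF_002099425.1:
# 2-3 uppercase letters, an underscore, digits, and an optional ".version".
ACCESSION_RE = re.compile(r"^[A-Z]{2,3}_\d+(?:\.\d+)?$")


def quote_accessions(query: str) -> str:
    """Wrap accession-like tokens in double quotes; leave other tokens alone."""
    tokens = query.split()
    quoted = [f'"{t}"' if ACCESSION_RE.match(t) else t for t in tokens]
    return " ".join(quoted)


if __name__ == "__main__":
    print(quote_accessions("GCF_002099425.1"))
    print(quote_accessions("genome GCF_002099425.1"))
```

Tokens that are already quoted do not match the pattern and pass through unchanged. Changing the field's tokenizer so that it does not split on _ would address the same problem on the index side.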
nickdos added the back-end SOLR, ES, Node label Oct 25, 2022
@nickdos
Collaborator Author

nickdos commented Nov 2, 2022

Field boosts are set with qf, a space-separated list of field^weight pairs:

q=marketing&qf=field1^10 field2^8 field3^7 field4^6

The edismax parser can also boost on specific field values using bq:

q=video&defType=edismax&qf=features^20.0+text^0.3&bq=cat:electronics^5.0
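
As a rough sketch of how the two parameters above could be assembled in code, the helper below builds the request parameters for an edismax query. scientificName and vernacularName come from the task list in this issue; the weights, the text field, the dataType field and its value, and the function name build_edismax_params are all assumptions made for illustration.

```python
from urllib.parse import urlencode


def build_edismax_params(user_query, field_boosts, value_boosts=None):
    """Return Solr request parameters for an edismax query.

    field_boosts: {field name: weight}, joined into a space-separated qf.
    value_boosts: list of boost-query strings, each sent as its own bq.
    """
    params = [
        ("q", user_query),
        ("defType", "edismax"),
        ("qf", " ".join(f"{field}^{weight}" for field, weight in field_boosts.items())),
    ]
    for bq in value_boosts or []:
        params.append(("bq", bq))
    return params


if __name__ == "__main__":
    params = build_edismax_params(
        "koala",
        # Weights are illustrative; "text" is an assumed catch-all field.
        {"scientificName": 10, "vernacularName": 8, "text": 1},
        # Hypothetical field and value for the "whole genome" boost.
        ['dataType:"whole genome"^5.0'],
    )
    # urlencode percent-encodes "^", spaces and quotes for use in a URL.
    print(urlencode(params))
```

Returning a list of pairs rather than a dict allows bq to be repeated, which is how several value boosts would be sent in one request.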

nickdos added a commit that referenced this issue Nov 7, 2022
nickdos added a commit that referenced this issue Nov 7, 2022
Fixed text searches
nickdos added a commit that referenced this issue Nov 8, 2022
nickdos added a commit that referenced this issue Nov 8, 2022
nickdos added a commit that referenced this issue Nov 8, 2022
@nickdos
Collaborator Author

nickdos commented Nov 8, 2022

Waiting for the index to contain the data needed for the remaining items.
