- Get indexes endpoint:
GET /indexes
(#181). Use this endpoint to inspect existing Marqo indexes. Read about usage here. - Non-tensor fields(#161). During the indexing phase, mark fields as non-tensor to prevent tensors being created for them. This helps speed up indexing and reduce storage for fields where keyword search is good enough. For example: email, name and categorical fields. These fields can still be used for filtering. Read about usage here.
- Configurable preloaded models(#155). Specify which machine learning model to load as Marqo starts. This prevents a delay during initial search and index commands after Marqo starts. Read about usage here.
- New example and article: use Marqo to provide context for up-to-date GPT3 news summary generation (#171, #174). Special thanks to @iain-mackie for this example.
- Updated developer guide (#164)
- Updated requirements which prevented Marqo being built as an arm64 image (#173)
- Backend updated to use marqo-os:0.0.3 (#183)
- Default request timeout has been increased from 2 to 75 seconds (#184)
- For work on the GPT3 news summary generation example: @iain-mackie
- For contributing the non-tensor fields feature: @jeadie
- Thank you to our users who raise issues and give us valuable feeback
- Thank you to our 1.4k+ star gazers and 50+ forkers!
- 429 (too many request errors) are propagated from Marqo-os to the user properly (#150)
- Health check endpoint:
GET /health
. An endpoint that can be used to inspect the status of Marqo and Marqo's backend (Marqo-os) (#128). Read about usage here. - Marqo can be launched with environment variables that define limits around maximum number of fields per index, maximum document size and the maximum number of documents that can be retrieved (#135). Read about usage here.
- README translations:
- Chinese 🇨🇳 (by @wanliAlex, #133)
- Polish 🇵🇱 (by @MichalLuck, #136)
- Ukrainian 🇺🇦 (by @dmyzlata, #138)
- French 🇫🇷 (by @rym-oualha, #147)
- The home
/
json response has been updated. If you have logic that reads the endpoint root, please update it (#128). - The Python client's
add_documents()
andupdate_documents()
batch_size
parameter has been replaced byserver_batch_size
andclient_batch_size
parameters (py-marqo#27), (py-marqo#28)
- Each text field just creates a top level Marqo-os text field, without any keywords (#135)
- Very large fields get their tensor_facet keywords ignored, rather than Marqo-OS preventing the doc being indexed (#135)
- Tensor facets can no longer have _id as a filtering field (#135)
- FastAPI runs with better concurrency (#128)
- Get documents by IDs and lexical search and no longer returns vectors if expose_facets isn't specified
- Fixed batching bug in Python client (py-marqo#28)
- If a large request to add_documents or update_documents results in a document adding fields such that the index field limit is exceeded, the entire operation will fail (without resilience). Mitigate this sending
add_documents
andupdate_documents
requests with smaller batches of documents. - For optimal indexing of large volumes of images, we recommend that the images are hosted on the same region and cloud provider as Marqo.
- For their translation work: @rym-oualha, @dmyzlata, @wanliAlex, @dmyzlata, @MichalLuck
- For raising issues and helping with READMEs: @kdewald, @llermaly, @namit343
- Thank you to our 900+ star gazers and 30+ forkers
Added Open CLIP models and added features to the get document endpoint.
- Added Open CLIP models (#116). Read about usage here
- Added the ability to get multiple documents by ID (#122). Read about usage here
- Added the ability to get document tensor facets through the get document endpoint (#122). Read about usage here
Adding the attributesToRetrieve to the search endpoint and added the update documents endpoints