Serving a conversational LangChain chain using Databricks Model Serving #11480
Unanswered
ishaan-mehta asked this question in Q&A
I have managed to log my conversational chain using the model-as-code logging feature released in v2.11.2. Once logged, I can also deploy it to a Databricks Model Serving endpoint.
However, the issue arises when I query the endpoint: it fails in `APIRequest.call_api()`. That function checks whether the `lc_model` is one of the supported `Runnable` types or a `Retriever`; if it is neither, it tries to call the `lc_model` directly (instead of using `invoke()`). Since my chain (a `RunnableWithMessageHistory`) is neither one of the specified `Runnable` types nor a `Retriever`, `call_api()` attempts to call it directly, but the chain is not a callable, so it throws an exception.

Is there a way to monkey patch the version of `mlflow` running on the Databricks Model Serving endpoint so that the chain can be called using `invoke()` like the other `Runnable` types?

cc: @BenWilson2 @daniellok-db (since you seem to be involved in the recent LangChain-related model serving efforts 🙂)
Thank you!
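For anyone hitting the same thing, one possible workaround (a minimal sketch; `InvokeCallable` and `FakeChain` below are hypothetical names, not part of mlflow or LangChain) is to wrap the chain in a thin shim whose `__call__` delegates to `invoke()`, so that the direct-call fallback in `call_api()` would succeed without patching mlflow itself:

```python
class InvokeCallable:
    """Sketch of a shim that makes an object with an invoke() method
    directly callable, so code that calls the model like a function
    (as call_api()'s fallback does) still works.

    Illustrative workaround only; not mlflow's or LangChain's API.
    """

    def __init__(self, runnable):
        self._runnable = runnable

    def __call__(self, *args, **kwargs):
        # Route direct calls through invoke(), mirroring what call_api()
        # does for the supported Runnable types.
        return self._runnable.invoke(*args, **kwargs)

    def __getattr__(self, name):
        # Delegate everything else (invoke, stream, batch, ...) unchanged.
        return getattr(self._runnable, name)


class FakeChain:
    """Stand-in for a chain that supports invoke() but is not callable,
    like a RunnableWithMessageHistory in the scenario above."""

    def invoke(self, inputs):
        return {"output": "echo: " + inputs["input"]}


chain = InvokeCallable(FakeChain())
print(chain({"input": "hi"}))  # direct call now routes through invoke()
```

Whether this survives mlflow's `isinstance` checks at logging time would need testing; the alternative the question asks about, monkey patching `call_api()` on the serving endpoint, would require shipping the patch in the model's code dependencies.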