
Meta Models Error with Tooling Chat History #39509

Open
Luke-Dornburgh opened this issue Jan 31, 2025 · 5 comments
Assignees
@dargilco

Labels

  • AI Model Inference: Issues related to the client library for Azure AI Model Inference (\sdk\ai\azure-ai-inference)
  • bug: This issue requires a change to an existing behavior in the product in order to be resolved.
  • Client: This issue points to a problem in the data-plane of the library.
  • customer-reported: Issues that are reported by GitHub users external to the Azure organization.
  • needs-team-attention: Workflow: This issue needs attention from Azure service team or SDK team.
  • Service Attention: Workflow: This issue is responsible by Azure service team.

Comments

@Luke-Dornburgh

  • Package Name: azure-ai-inference
  • Package Version: 1.0.0b7
  • Operating System: Windows
  • Python Version: 3.11.2

Describe the bug
I am using this SDK in a scenario where the client processing the requests can switch from an OpenAI model to a Meta model.

Working Scenario:

  1. Create Client using OpenAI model
  2. Ask questions without any tools involved
  3. Get response
  4. Save all messages to history list
  5. Create new Client using a Meta model
  6. Ask the model to summarize the chat history
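A minimal sketch of this working flow, assuming two serverless endpoints and the synchronous client (endpoint URLs, keys, and the exact questions are placeholders; ChatCompletionsClient, AzureKeyCredential, complete(), and as_dict() are the real SDK names):

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

history = []

# Steps 1-4: chat with an OpenAI model (no tools) and save all messages.
openai_client = ChatCompletionsClient(
    endpoint="https://<your-gpt-4o-endpoint>",    # placeholder
    credential=AzureKeyCredential("<your-key>"),  # placeholder
)
history.append({"role": "user", "content": "Explain inflation in a sentence"})
response = openai_client.complete(messages=history)
history.append(response.choices[0].message.as_dict())

# Steps 5-6: create a new client against a Meta model and summarize the history.
meta_client = ChatCompletionsClient(
    endpoint="https://<your-llama-endpoint>",     # placeholder
    credential=AzureKeyCredential("<your-key>"),  # placeholder
)
history.append({"role": "user", "content": "Summarize our conversation so far"})
summary = meta_client.complete(messages=history)
print(summary.choices[0].message.content)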

Failed Scenario:

  1. Create a client using OpenAI model (gpt-4o)
  2. Ask a question that invokes a tool suggestion
  3. Execute the tool
  4. Update the chat history so that it is formatted like this:
[
  {'role': 'user', 'content': 'Next flight from Seattle to Miami'},
  {'role': 'assistant', 'tool_calls': [{'function': {'arguments': '{"origin_city":"Seattle","destination_city":"Miami"}', 'name': 'get_flight_info'}, 'id': 'call_C7lglfMNVEAjFQU98SYtUF0y', 'type': 'function'}]},
  {'role': 'tool', 'content': '{"airline": "Delta", "flight_number": "DL123", "flight_date": "May 7th, 2024", "flight_time": "10:00AM"}', 'tool_call_id': 'call_C7lglfMNVEAjFQU98SYtUF0y'}
]
  5. Re-invoke the client with the history from step 4
  6. Get the response and save it to the chat history
  7. Create a new client using a Meta model
  8. Ask that client another question along with the previous history. This question does not invoke a tool or do anything special. Below is the EXACT chat history that is being passed to client.complete(messages=messages):
[
  {'role': 'user', 'content': 'Next flight from Seattle to Miami'},
  {'role': 'assistant', 'tool_calls': [{'function': {'arguments': '{"origin_city":"Seattle","destination_city":"Miami"}', 'name': 'get_flight_info'}, 'id': 'call_C7lglfMNVEAjFQU98SYtUF0y', 'type': 'function'}]},
  {'role': 'tool', 'content': '{"airline": "Delta", "flight_number": "DL123", "flight_date": "May 7th, 2024", "flight_time": "10:00AM"}', 'tool_call_id': 'call_C7lglfMNVEAjFQU98SYtUF0y'},
  {'role': 'assistant', 'content': 'The next flight from Seattle to Miami is operated by Delta, flight number DL123. It is scheduled for May 7th, 2024, at 10:00 AM.'},
  {'role': 'user', 'content': 'Explain inflation in a sentence'}
]
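For completeness, a sketch of the failing call itself, where messages is exactly the five-entry list above (endpoint and key are placeholders; no tools are bound to this client):

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

meta_client = ChatCompletionsClient(
    endpoint="https://<your-meta-llama-endpoint>",  # placeholder
    credential=AzureKeyCredential("<your-key>"),    # placeholder
)

# messages is the five-entry dict list shown above. Against a Meta model
# this raises azure.core.exceptions.HttpResponseError (Bad Request).
response = meta_client.complete(messages=messages)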

Error output:

ERROR - Error during standard chat call: (Bad Request) {"object":"error","message":"1 validation error for ValidatorIterator\n0.function.arguments\n  Input should be a valid string [type=string_type, input_value={'origin_city': 'Seattle'...tination_city': 'Miami'}, input_type=dict]\n    For further information visit https://errors.pydantic.dev/2.9/v/string_type","type":"BadRequestError","param":null,"code":400}
Code: Bad Request
Message: {"object":"error","message":"1 validation error for ValidatorIterator\n0.function.arguments\n  Input should be a valid string [type=string_type, input_value={'origin_city': 'Seattle'...tination_city': 'Miami'}, input_type=dict]\n    For further information visit https://errors.pydantic.dev/2.9/v/string_type","type":"BadRequestError","param":null,"code":400}
2025-01-31 18:07:32 - response_generation.call_llm - ERROR - Traceback (most recent call last):
  File "/app/******/call_llm.py", line 163, in call_llm
    response = await client.complete(
               ^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/azure/ai/inference/aio/_patch.py", line 670, in complete
    raise HttpResponseError(response=response)
azure.core.exceptions.HttpResponseError: (Bad Request) {"object":"error","message":"1 validation error for ValidatorIterator\n0.function.arguments\n  Input should be a valid string [type=string_type, input_value={'origin_city': 'Seattle'...tination_city': 'Miami'}, input_type=dict]\n    For further information visit https://errors.pydantic.dev/2.9/v/string_type","type":"BadRequestError","param":null,"code":400}
Code: Bad Request
Message: {"object":"error","message":"1 validation error for ValidatorIterator\n0.function.arguments\n  Input should be a valid string [type=string_type, input_value={'origin_city': 'Seattle'...tination_city': 'Miami'}, input_type=dict]\n    For further information visit https://errors.pydantic.dev/2.9/v/string_type","type":"BadRequestError","param":null,"code":400}

I have checked the type of the arguments value and it is str. I have also tried explicitly casting it to str, to no avail.
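For anyone debugging this, a sketch of that type check plus a hypothetical normalize_history workaround (not SDK code) that re-serializes any dict-valued function.arguments with json.dumps before the call:

import json

def normalize_history(messages):
    # Hypothetical helper: ensure every tool call's function.arguments is a
    # JSON string rather than a dict before the history is sent to the service.
    for msg in messages:
        for call in msg.get("tool_calls") or []:
            args = call["function"]["arguments"]
            if isinstance(args, dict):
                call["function"]["arguments"] = json.dumps(args)
    return messages

# Sanity check on the history shown above; per the report this prints
# <class 'str'>, so the dict the service complains about appears downstream:
# print(type(messages[1]["tool_calls"][0]["function"]["arguments"]))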

To Reproduce
Steps to reproduce the behavior:

  1. Follow the failed scenario steps above

Expected behavior
If I create the new client with an OpenAI model instead, the output is a standard AssistantMessage that answers the question asked, with no errors. I expect the same from a Meta model.

Additional context
In the call to the Meta-based client, I am not binding any tools to it, as that is not supported.

@github-actions bot added the customer-reported and needs-triage labels Jan 31, 2025
@xiangyan99 added the bug, Service Attention, and AI Model Inference labels and removed the question and needs-triage labels Jan 31, 2025
@github-actions bot added the needs-team-attention label Jan 31, 2025

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @dargilco @jhakulin @trangevi.

@dargilco self-assigned this Feb 1, 2025
@dargilco (Member) commented Feb 3, 2025

Hi @Luke-Dornburgh, thank you for opening this separate GitHub issue and providing all the details. I appreciate it!

I modified our chat completions with tools sample: https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/ai/azure-ai-inference/samples/sample_chat_completions_with_tools.py such that the first complete call went to a gpt-4o model and the second complete call went to a Llama-3.3-70B-Instruct model, without changing any code related to how the messages variable is constructed leading up to the second complete call. That sample worked fine, so I was not able to reproduce your issue. Let me try some other Meta models.
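Roughly, the modification looks like this (a sketch, not the sample verbatim; endpoints and keys are placeholders, and flight_info_tool is a stand-in for the tool definition built in the sample):

from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import ChatCompletionsToolDefinition, FunctionDefinition
from azure.core.credentials import AzureKeyCredential

flight_info_tool = ChatCompletionsToolDefinition(
    function=FunctionDefinition(
        name="get_flight_info",
        description="Returns information about the next flight between two cities.",
        parameters={
            "type": "object",
            "properties": {
                "origin_city": {"type": "string"},
                "destination_city": {"type": "string"},
            },
            "required": ["origin_city", "destination_city"],
        },
    )
)

gpt4o_client = ChatCompletionsClient(
    endpoint="https://<gpt-4o-endpoint>",         # placeholder
    credential=AzureKeyCredential("<key-1>"),     # placeholder
)
llama_client = ChatCompletionsClient(
    endpoint="https://<llama-3.3-70b-endpoint>",  # placeholder
    credential=AzureKeyCredential("<key-2>"),     # placeholder
)

messages = [{"role": "user", "content": "Next flight from Seattle to Miami"}]

# First complete call goes to gpt-4o.
response = gpt4o_client.complete(messages=messages, tools=[flight_info_tool])
# The sample then appends the assistant tool_calls message and the tool
# result to messages, unchanged; the second complete call goes to Llama.
response = llama_client.complete(messages=messages, tools=[flight_info_tool])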

@dargilco (Member) commented Feb 3, 2025

When using the model Meta-Llama-3.1-70B-Instruct for the second complete call, it looks like it did not error out, but it gave an answer indicating it was not able to ingest the tool response.

@Luke-Dornburgh (Author) commented

@dargilco thanks for looking into this. If you have the message history output from the first scenario you tested, I would love to see it, or even the sample code itself.

One note: I am using .as_dict() on each of our messages BEFORE adding them to our chat_history list, and the list is then fed back to the new client as input. I am not sure if this is a cause at all; it does not cause any issues when re-using GPT-based clients, as outlined above.
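In code, that pattern is (a sketch; client construction and the question are placeholders, and as_dict() is the real SDK method on message models):

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

client = ChatCompletionsClient(
    endpoint="https://<endpoint>",            # placeholder
    credential=AzureKeyCredential("<key>"),   # placeholder
)

chat_history = [{"role": "user", "content": "Next flight from Seattle to Miami"}]
response = client.complete(messages=chat_history)

# The SDK message object is converted to a plain dict BEFORE being stored;
# the all-dict history is later fed to a different client via
# next_client.complete(messages=chat_history).
chat_history.append(response.choices[0].message.as_dict())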

@dargilco (Member) commented Feb 3, 2025

@Luke-Dornburgh it seems like different Llama models behave differently, and I believe this will all be resolved once tool support is enabled on all of them. Use of .as_dict() should not make a difference. You should be able to specify tool calls in chat history and mix calls to different models when the models support tool calling. I'll report back once I hear from the relevant service team. For the time being, can you use models other than Meta models that have tools support?

@kristapratico added the Client label Feb 4, 2025