GPT-4o Car Vision Sample

Inspired by this example from Denise Schlesinger, This Jupyter notebook demonstrates the use of AzureOpenAI's GPT-4o model to generate a comprehensive natural language description of a car.

The demo covers the following:

Multi-modal capabilities (text and vision for now, speech is coming to GPT-4o soon)
Computer vision - extracting the features of a car from an uploaded photo
Output using JSON Mode for consistency and accuracy
Natural language text summarisation using different prompts
(Optional) RAG based on the response from the DVLA VES API (third-party API, free API key available from GOV.UK on request)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

GPT-4o Car Vision Sample

Files

README.md

Latest commit

History

README.md

File metadata and controls

GPT-4o Car Vision Sample