Add OpenAI Priority Load Balancer for Azure OpenAI #1626

simonkurtz-MSFT · 2024-05-17T00:18:38Z

This PR is only intended to showcase how the OpenAI Priority Load Balancer integrates.

This PR should NOT be merged.

simonkurtz-MSFT · 2024-05-17T00:20:12Z

This is how the OpenAI Priority Load Balancer integrates. Never mind the hard-coded backend and the location of the backends list in this PR. I don't intend to ask for a merge, but this was the best way to give you an idea of the setup.

If you have two AOAI instances with the same model, you can plug them both in and should see load-balancing.


simonkurtz-MSFT · 2024-05-17T15:57:39Z

I brought up two AOAI instances and related assets and configured both instances as backends in app.py. Then I started to have a conversation.

Both backends are responding. Note that the distribution is not uniform, because the available backends are selected at random (this is necessary for multi-process workloads).

At no point did the conversation break down or show any kind of error in the chat bot.
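To illustrate why the split between backends is not uniform, here is a minimal sketch of priority-based selection with a random pick among the available backends in the best priority tier. This is not the library's actual code: the `Backend` dataclass, the `pick_backend` function, and the `example.com` hostnames are hypothetical names made up for this illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class Backend:
    host: str               # hypothetical AOAI hostname
    priority: int           # lower number means more preferred
    available: bool = True  # set to False while a backend is throttling


def pick_backend(backends, rng=random):
    """Return a random available backend from the lowest-numbered priority tier."""
    candidates = [b for b in backends if b.available]
    if not candidates:
        raise RuntimeError("no backend is currently available")
    best = min(b.priority for b in candidates)
    tier = [b for b in candidates if b.priority == best]
    # A random choice avoids shared state between processes, but it means
    # a short conversation will rarely split evenly across backends.
    return rng.choice(tier)


if __name__ == "__main__":
    backends = [
        Backend("aoai-one.example.com", 1),
        Backend("aoai-two.example.com", 1),
    ]
    counts = {b.host: 0 for b in backends}
    for _ in range(20):
        counts[pick_backend(backends).host] += 1
    print(counts)  # the two counts usually differ from run to run
```

Because each pick is independent, no counter has to be shared across worker processes, which is presumably the trade-off behind the randomization mentioned above.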

simonkurtz-MSFT and others added 2 commits May 16, 2024 20:17

Add openai-priority-loadbalancer

b087a8e

Merge branch 'Azure-Samples:main' into main

0f5f800

simonkurtz-MSFT added 3 commits May 17, 2024 11:44

Add second working backend

791e51a

Merge branch 'main' of github.com:simonkurtz-MSFT/azure-search-openai-demo

8e1ffb2

Lock openai_priority_loadbalancer to 1.0.6

77d4516

simonkurtz-MSFT added 3 commits May 17, 2024 11:59

Clean up

6442893

Update openai-priority-loadbalancer to 1.0.8

4239b03

Update openai-priority-loadbalancer to 1.0.9

f385736
