[LocalNet] Add infrastructure to run LLM inference #508

okdas · 2024-04-27T00:10:42Z

Summary

Adds infrastructure to run and develop against LLM on LocalNet.

Issue

[Infra] Add support for an LLM service in LocalNet/DevNet infra #130

Type of change

Select one or more:

Testing

Documentation changes (only if making doc changes)

make docusaurus_start; only needed if you make doc changes

Local Testing (only if making code changes)

Unit Tests: make go_develop_and_test
LocalNet E2E Tests: make test_e2e
See quickstart guide for instructions

PR Testing (only if making code changes)

DevNet E2E Tests: Add the devnet-test-e2e label to the PR.
- THIS IS VERY EXPENSIVE, so only do it after all the reviews are complete.
- Optionally run make trigger_ci if you want to re-trigger tests without any code changes
- If tests fail, try re-running failed tests only using the GitHub UI as shown here

Sanity Checklist

I have tested my changes using the available tooling
I have commented my code
I have performed a self-review of my own code; both comments & source code
I create and reference any new tickets, if applicable
I have left TODOs throughout the codebase, if applicable

okdas · 2024-04-27T00:20:25Z

Note: this functionality is behind the gate and is turned off by default to avoid downloading and serving an LLM to preserve resources. Turn on ollama in localnet_config.yaml when needed.

The infrastructure by itself works. Can run the request with curl:

kubectl exec "$(tilt get kd validator -ojsonpath='{.status.pods[0].name}')" -- \
curl -X POST http://ollama:11434/v1/chat/completions -H "Content-Type: application/json" \
    -d '{
        "model": "qwen:0.5b",
        "messages": [
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'

However, it doesn't seem like we support anything but json-rpc at the moment:

poktroll/pkg/partials/partial.go

Line 50 in aba098d

    
           // TODO_BLOCKER(@h5law): This function currently only supports JSON-RPC and must

I get the following error:

{"level":"error","error":"got: {\n        \"model\": \"qwen:0.5b\",\n        \"messages\": [\n            {\n                \"role\": \"system\",\n                \"content\": \"You are a helpful assistant.\"\n            },\n            {\n                \"role\": \"user\",\n                \"content\": \"Hello!\"\n            }\n        ]\n    }: unrecognised request format in partial payload","service_id":"ollama","message":"failed getting error reply"}
{"level":"error","error":"got: {\n        \"model\": \"qwen:0.5b\",\n        \"messages\": [\n            {\n                \"role\": \"system\",\n                \"content\": \"You are a helpful assistant.\"\n            },\n            {\n                \"role\": \"user\",\n                \"content\": \"Hello!\"\n            }\n        ]\n    }: unrecognised request format in partial payload","message":"failed getting request type"}

I suggest we merge this as is to unblock work on other than json-rpc request types.

Btw, I picked qwen:0.5b as it was one of the smallest recent LLMs. We don't get hardware optimizations in that environment, so it makes sense to use the smallest possible. We can go crazy on DevNet, though.

Olshansk · 2024-04-27T16:33:44Z

Great find @okdas.

@red-0ne We'll have to prioritize adding support for gRPC, REST and all the other stuff shortly so we're not limited to just json-rpc.

red-0ne

We should also have suppliers to stake for that service if we want the RelayMiners to run with these configs

Olshansk · 2024-04-28T14:55:36Z

@red-0ne With @okdas OOO for the next week, can you update the branch so we can merge it in please?

It'll help unlock development on non json-rpc.

red-0ne · 2024-04-29T05:34:41Z

@Olshansk , I added ollama services to supplier_stake_configs with a small change to the config parser to support lower/upper case rpc type values.

Tiltfile

github-actions · 2024-04-29T20:07:46Z

The CI will now also run the e2e tests on devnet, which increases the time it takes to complete all CI checks. If you just created a pull request, you might need to push another commit to produce a container image DevNet can utilize to spin up infrastructure. You can use make trigger_ci to push an empty commit.

Olshansk · 2024-04-29T20:11:51Z

@red-0ne I added this TODO in the code: # TODO(#511): Add support for REST and enabled this.

Assuming E2E tests pass, let's merge it in assuming there are no further changes you deem necessary.

Olshansk · 2024-05-03T22:22:35Z

@red-0ne - @okdas helped me figure out the issue with E2E bugs, which I resolved in [1]. Are you okay with approving this so we can merge it in and iterate on REST later?

[1] pokt-network/protocol-infra#18

…testutils * pokt/main: [LocalNet] Add infrastructure to run LLM inference (#508)

…cept * pokt/main: [LocalNet] Add infrastructure to run LLM inference (#508)

* pokt/main: [Code Health] chore: cleanup localnet testutils (#515) Zero retryLimit Support in ReplayClient (#442) [LocalNet] Add infrastructure to run LLM inference (#508) [LocalNet] Documentation for MVT/LocalNet (#488) [GATEWAY] Makefile target added to send relays to grove gateway (#487) Update README [CI] Add GATEWAY_URL envar for e2e tests (#506) [Tooling] Add gateway stake/unstake/ logs (#503)

Adds infrastructure to run and develop against LLM on LocalNet. --- Co-authored-by: Redouane Lakrache <[email protected]> Co-authored-by: Daniel Olshansky <[email protected]>

add infrastructure to run llm on localnet

177e63e

okdas changed the title ~~[LocalNet] Add infrastructure to run llm inference~~ [LocalNet] Add infrastructure to run LLM inference Apr 27, 2024

okdas self-assigned this Apr 27, 2024

okdas added infra Infra or tooling related improvements, additions or fixes tooling Tooling - CLI, scripts, helpers, off-chain, etc... labels Apr 27, 2024

okdas added this to the Shannon Private TestNet milestone Apr 27, 2024

Merge branch 'main' into dk-ollama

59f3ec4

okdas marked this pull request as ready for review April 27, 2024 00:21

okdas requested a review from Olshansk April 27, 2024 00:22

okdas mentioned this pull request Apr 27, 2024

[Infra] Add support for an LLM service in LocalNet/DevNet infra #130

Closed

7 tasks

Olshansk requested a review from red-0ne April 27, 2024 16:33

red-0ne requested changes Apr 27, 2024

View reviewed changes

red-0ne added 2 commits April 29, 2024 06:52

add ollama to supplier stake config

4c60ad9

fix: Support uppercase service ids in configs

451ea9d

Olshansk mentioned this pull request Apr 29, 2024

[SDK] Add REST Support #511

Closed

8 tasks

Olshansk assigned red-0ne Apr 29, 2024

Olshansk reviewed Apr 29, 2024

View reviewed changes

Tiltfile Show resolved Hide resolved

Olshansk approved these changes Apr 29, 2024

View reviewed changes

Olshansk added the devnet-test-e2e label Apr 29, 2024

github-actions bot added devnet push-image CI related - pushes images to ghcr.io labels Apr 29, 2024

Olshansk added 2 commits April 29, 2024 13:07

Empty commit

10cf5ef

Merge branch 'main' into dk-ollama

61804ea

Olshansk force-pushed the dk-ollama branch from 664ab9b to 61804ea Compare April 29, 2024 20:08

Added TODO

d951aba

Olshansk added 2 commits May 2, 2024 12:04

Merge branch 'main' into dk-ollama

68b63d6

Empty commit

ae1bcc1

red-0ne approved these changes May 3, 2024

View reviewed changes

Olshansk merged commit 3dee9c1 into main May 3, 2024
9 checks passed

bryanchriswhite added a commit that referenced this pull request May 6, 2024

Merge remote-tracking branch 'pokt/main' into chore/cleanup-localnet-…

ee1d9b9

…testutils * pokt/main: [LocalNet] Add infrastructure to run LLM inference (#508)

bryanchriswhite added a commit that referenced this pull request May 6, 2024

Merge remote-tracking branch 'pokt/main' into issues/322/proof-of-con…

07840c8

…cept * pokt/main: [LocalNet] Add infrastructure to run LLM inference (#508)

bryanchriswhite removed push-image CI related - pushes images to ghcr.io devnet-test-e2e labels May 16, 2024

github-actions bot removed the devnet label May 16, 2024

Olshansk deleted the dk-ollama branch May 29, 2024 16:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LocalNet] Add infrastructure to run LLM inference #508

[LocalNet] Add infrastructure to run LLM inference #508

okdas commented Apr 27, 2024

okdas commented Apr 27, 2024 •

edited

Loading

Olshansk commented Apr 27, 2024

red-0ne left a comment

Olshansk commented Apr 28, 2024

red-0ne commented Apr 29, 2024

github-actions bot commented Apr 29, 2024

Olshansk commented Apr 29, 2024

Olshansk commented May 3, 2024

[LocalNet] Add infrastructure to run LLM inference #508

[LocalNet] Add infrastructure to run LLM inference #508

Conversation

okdas commented Apr 27, 2024

Summary

Issue

Type of change

Testing

Sanity Checklist

okdas commented Apr 27, 2024 • edited Loading

Olshansk commented Apr 27, 2024

red-0ne left a comment

Choose a reason for hiding this comment

Olshansk commented Apr 28, 2024

red-0ne commented Apr 29, 2024

github-actions bot commented Apr 29, 2024

Olshansk commented Apr 29, 2024

Olshansk commented May 3, 2024

okdas commented Apr 27, 2024 •

edited

Loading