
[Infra] Add support for an LLM service in LocalNet/DevNet infra #130

Closed
1 of 7 tasks
Olshansk opened this issue Nov 2, 2023 · 5 comments

Labels
infra Infra or tooling related improvements, additions or fixes

Comments

Olshansk (Member) commented Nov 2, 2023

Objective

Increase & diversify the types of services supported in the LocalNet / DevNet infrastructure.

Origin Document

The Shannon upgrade has several goals, one of which is to support any type of RPC service (i.e. be general-purpose).

Goals

  • Enable other types of RPCs in our network in the early development/testing stages
  • Cater to the AI market in addition to the Web3 market

Deliverables

  • Identify a small, cheap, and lightweight LLM (e.g. Llama 2 7B) that can be part of our LocalNet / DevNet infrastructure
  • Deploy the LLM service on a separate node, similar to anvil, within our infrastructure
  • Make sure that model inference can be triggered via a JSON-RPC call directly to the supplier node (see the sketch after this list)
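
As a rough illustration of the last deliverable from a client's perspective, here is a minimal sketch. The endpoint URL, the "completion" method name, and the parameter shape are hypothetical placeholders, not the actual supplier API; the real payload format is whatever the supplier/relayminer ends up exposing.

```python
# Minimal sketch: trigger LLM inference through a supplier node via JSON-RPC.
# ASSUMPTIONS: the URL, the "completion" method name, and the params shape
# are illustrative placeholders, not the real supplier/relayminer interface.
import json
import urllib.request

SUPPLIER_URL = "http://localhost:8545/relay/llm"  # hypothetical endpoint

payload = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "completion",                         # illustrative method name
    "params": {"prompt": "Hello, world", "max_tokens": 32},
}

req = urllib.request.Request(
    SUPPLIER_URL,
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read()))
```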

Non-goals / Non-deliverables

  • Guaranteeing correctness or efficiency of the LLM models
  • Optimizing the model service's performance

General deliverables

  • Comments: Add/update TODOs and comments alongside the source code so it is easier to follow.
  • Testing: Add new tests (unit and/or E2E) to the test suite.
  • Makefile: Add new targets to the Makefile to make the new functionality easier to use.
  • Documentation: Update architectural or development READMEs; use mermaid diagrams where appropriate.

Creator: @Olshansk
Co-Owners: @okdas

Olshansk added the infra Infra or tooling related improvements, additions or fixes label Nov 2, 2023
Olshansk added this to the Shannon TestNet milestone Nov 2, 2023
Olshansk (Member, Author) commented Nov 2, 2023

@okdas I haven't attached an iteration to this since it's not urgent, but feel free to pick it up when you have time :)

Olshansk mentioned this issue Nov 15, 2023 (10 tasks)
okdas (Member) commented Dec 1, 2023

Made some progress on this. The plan is to:

  1. Use a small model like neural-chat-7b-v3-1.Q4_K_M.gguf.
  2. Sync the model into the container via https://github.com/tilt-dev/tilt-extensions/tree/master/file_sync_only.
  3. Disable this by default, as the file is still fairly large and the process consumes a decent amount of RAM.
  4. Run a llama.cpp server to serve the model.
  5. Provide a curl example (a sketch of the request is included at the end of this comment).
  6. Optional: Add configuration for the endpoint in relayminer.
  7. Optional: Add an E2E test.

^ this should probably be moved to the deliverables.. :)
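
For reference, a minimal sketch of what the curl example in step 5 could boil down to, expressed in Python. It assumes a local llama.cpp example server listening on port 8080 with its /completion endpoint; the port, prompt, and field names should be checked against the actual server configuration and docs.

```python
# Minimal sketch of querying a local llama.cpp server (step 5 above).
# ASSUMPTIONS: the server listens on localhost:8080 and exposes the
# /completion endpoint of llama.cpp's example server; verify the port
# and JSON schema against the server's own documentation.
import json
import urllib.request

req = urllib.request.Request(
    "http://localhost:8080/completion",
    data=json.dumps({"prompt": "What is Pocket Network?", "n_predict": 64}).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read()).get("content"))
```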

Olshansk (Member, Author) commented Dec 3, 2023

^ this should probably be moved to the deliverables.. :)

What's stopping you?

okdas (Member) commented Apr 27, 2024

Check out the current limitations - something we need to address: #508 (comment)

Olshansk mentioned this issue Apr 29, 2024 (8 tasks)
Olshansk (Member, Author) commented May 6, 2024

The goal here was to add support for LLMs in our infra. It did not account for SDK changes and REST support, so I'm classifying this as done.

cc @okdas @red-0ne

Olshansk closed this as completed May 6, 2024
Projects
Status: ✅ Done
Development

No branches or pull requests

2 participants