Add FIFO and Llama Services #1339

AutonomicPerfectionist · 2023-09-06T19:25:23Z

This PR adds a circular fifo service and a Llama service. The FIFO can be used to store short-term conversational memory for LLM assistants like ChatGPT or Llama-based local models.

The Llama service is an experimental service I added just for proof of concept testing. It uses the new java-llama-cpp library to call into a llama.cpp compiled library to perform inference. Yes, the dependency uses JNA; I just quickly whipped it up so I can do some tests with the FIFO service. Eventually, it will be replaced with a Python-based service.

Currently, the llama service requires java 13 or higher, and the user must supply their own libllama.so library file

AutonomicPerfectionist added 11 commits October 22, 2023 16:39

Add AutoEjectFIFO service

d142e38

Add Llama service and update pom

19e76c0

Add reset() to Llama and basic thread detection

efaaa96

Update java-llama-cpp to 1.1.1

b5d7838

Regenerate pom

0991c75

Add physical and logical cores to the platform description

2d22fcd

Use only physical core count for number of llama inference threads

969eea7

Add Whisper service and its deps, and regen pom

fafa981

Implement basic Whisper transcription

11191b8

Update java-llama-cpp to 1.1.4 for bundled lib

96f39a0

Update java-llama to 2.0

1af1922

AutonomicPerfectionist force-pushed the ap-fifo-and-llama branch from 5370fa6 to 1af1922 Compare October 22, 2023 23:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add FIFO and Llama Services #1339

Add FIFO and Llama Services #1339

AutonomicPerfectionist commented Sep 6, 2023

Add FIFO and Llama Services #1339

Are you sure you want to change the base?

Add FIFO and Llama Services #1339

Conversation

AutonomicPerfectionist commented Sep 6, 2023