LactChain: Language Action Chain Reinforcement Learning

This repo is a template for building a Reinforcement Learning (RL) system. The system is meant to be general-purpose, with multiple possible applications.

TODOs:

Add things needed to enforce structure of any subsequent code

Make a generic base class for the actor (policy) network
Make a generic base class for the critic (value) network
Make a generic base class for the reward function
Finish designing the generic lactchain base class. The flow is state-->action-->state, but what exactly is an action? Does an action take in a fluid prompt? A prompt menu? Something else?
Write unit tests
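
One way the base classes listed above could fit together is sketched below. It is an illustration under assumptions, not the repo's design: the names State, Action, Actor, Critic, RewardFunction, LactChain, and rollout, and all of their signatures, are hypothetical. An action is modeled here as a plain prompt string, which is only one possible answer to the open question about what an action is.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    """Hypothetical state: a block of text the chain operates on."""
    textblock: str


@dataclass(frozen=True)
class Action:
    """Hypothetical action: a prompt chosen by the policy (an assumption)."""
    prompt: str


class Actor(ABC):
    """Policy: picks an action for a given state."""
    @abstractmethod
    def select_action(self, state: State) -> Action: ...


class Critic(ABC):
    """Value function: scores how good a state is."""
    @abstractmethod
    def value(self, state: State) -> float: ...


class RewardFunction(ABC):
    """Reward for one state --> action --> state transition."""
    @abstractmethod
    def __call__(self, state: State, action: Action, next_state: State) -> float: ...


class LactChain(ABC):
    """Language action chain: applies an action to a state to get the next state."""
    @abstractmethod
    def step(self, state: State, action: Action) -> State: ...


# Toy subclasses, only to show that the interfaces compose.
class FixedPromptActor(Actor):
    def select_action(self, state: State) -> Action:
        return Action(prompt="summarize")


class AppendChain(LactChain):
    def step(self, state: State, action: Action) -> State:
        return State(textblock=f"{state.textblock} [{action.prompt}]")


class LengthReward(RewardFunction):
    def __call__(self, state: State, action: Action, next_state: State) -> float:
        return float(len(next_state.textblock) - len(state.textblock))


def rollout(actor: Actor, chain: LactChain, reward_fn: RewardFunction,
            state: State, steps: int):
    """Run the state --> action --> state loop and sum the rewards."""
    total = 0.0
    for _ in range(steps):
        action = actor.select_action(state)
        next_state = chain.step(state, action)
        total += reward_fn(state, action, next_state)
        state = next_state
    return state, total
```

Because the base classes are abstract, instantiating Actor, Critic, RewardFunction, or LactChain directly raises TypeError, which is one way to enforce structure on later code. A real chain would call a language model inside step instead of appending text.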

Build out specific use cases

Draw schematic of simple use case
Add plausibly useful language action chains using lactchain class
Add a code extractor and other functions to the state class
Add other extractors to lactchains where certain things (like code) need to be pulled from GPT-4 responses
Define example format for textblock in state class
Define Policy and Value Function networks
Define the Actor-Critic update steps (temporal-difference (TD) learning?)
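
Two of the items above can be sketched in a few lines, under stated assumptions. The function names extract_code_blocks and td_error are hypothetical, and the extractor assumes that code in a model response is wrapped in triple-backtick fences, which is common but not guaranteed. The second function is the standard one-step TD error, which Actor-Critic methods commonly use as the learning signal; whether this repo will use it is still an open TODO.

```python
from __future__ import annotations

import re

# Matches a fenced block: three backticks, an optional language tag,
# a newline, the body (lazily), then three closing backticks.
_CODE_BLOCK = re.compile(r"`{3}[ \t]*([\w+-]*)[ \t]*\n(.*?)`{3}", re.DOTALL)


def extract_code_blocks(response: str) -> list[tuple[str, str]]:
    """Return (language, code) pairs for each fenced block in a response.

    The language is an empty string when the fence has no tag.
    """
    return [(lang, body) for lang, body in _CODE_BLOCK.findall(response)]


def td_error(reward: float, value: float, next_value: float,
             gamma: float = 0.99, done: bool = False) -> float:
    """One-step TD error: reward + gamma * V(next_state) - V(state).

    When the episode has ended (done=True) there is no next state,
    so the bootstrap term is dropped.
    """
    bootstrap = 0.0 if done else gamma * next_value
    return reward + bootstrap - value
```

In an Actor-Critic setup, a positive TD error suggests the action turned out better than the critic expected, so the actor would make it more likely and the critic would raise its estimate for that state. The default gamma of 0.99 is an arbitrary illustrative choice.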



chian/LactChain
