chian/LactChain

LactChain: Language Action Chain Reinforcement Learning

This repo is a template for building a Reinforcement Learning (RL) system. The system is meant to be general-purpose, with multiple possible applications.

TODOs:

Add what is needed to enforce the structure of any subsequent code

  1. Make generic baseclass for actor (policy) network
  2. Make generic baseclass for critic (value) network
  3. Make generic baseclass for reward function
  4. Finish designing the generic lactchain baseclass. The flow is state --> action --> state, but what counts as an action? Does an action take in a fluid prompt? A prompt menu? Something else?
  5. Write unit tests
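
One possible shape for the base classes in items 1 to 4 is sketched below. This is only a sketch under assumptions: the class names (`Actor`, `Critic`, `RewardFunction`, `LactChain`), the method names (`act`, `value`, `reward`, `step`), and the toy string-based subclasses are all hypothetical and are not taken from this repo. The open question of what an action is stays open here by leaving the action type generic.

```python
from abc import ABC, abstractmethod
from typing import Generic, TypeVar

S = TypeVar("S")  # state type (hypothetical: left generic on purpose)
A = TypeVar("A")  # action type (hypothetical: prompt, menu choice, ...)


class Actor(ABC, Generic[S, A]):
    """Policy: chooses an action for a given state."""

    @abstractmethod
    def act(self, state: S) -> A: ...


class Critic(ABC, Generic[S]):
    """Value function: scores how good a state is."""

    @abstractmethod
    def value(self, state: S) -> float: ...


class RewardFunction(ABC, Generic[S, A]):
    """Assigns a scalar reward to one state --> action --> state transition."""

    @abstractmethod
    def reward(self, state: S, action: A, next_state: S) -> float: ...


class LactChain(ABC, Generic[S, A]):
    """Language action chain: applies an action to a state, giving a new state."""

    @abstractmethod
    def step(self, state: S, action: A) -> S: ...


# Toy subclasses (illustration only): states and actions are plain strings.
class FixedActor(Actor[str, str]):
    def act(self, state: str) -> str:
        return "summarize"


class LengthCritic(Critic[str]):
    def value(self, state: str) -> float:
        return -float(len(state))  # shorter text is scored higher


class AppendChain(LactChain[str, str]):
    def step(self, state: str, action: str) -> str:
        return f"{state} | {action}"


class ShorterIsBetter(RewardFunction[str, str]):
    def reward(self, state: str, action: str, next_state: str) -> float:
        return float(len(state) - len(next_state))


state = "draft"
action = FixedActor().act(state)
next_state = AppendChain().step(state, action)
step_reward = ShorterIsBetter().reward(state, action, next_state)
```

Because every base class declares an abstract method, instantiating one directly raises `TypeError`, which is one way to enforce structure on subsequent code.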

Build out specific use cases

  1. Draw schematic of simple use case
  2. Add plausibly useful language action chains using lactchain class
  3. Add code extractor and other functions in state class
  4. Add other extractors to lactchains for pulling specific content (like code) out of GPT-4 responses
  5. Define example format for textblock in state class
  6. Define Policy and Value Function networks
  7. Define Actor-Critic teaching moments (possibly temporal-difference (TD) learning)
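
Two of the items above can be illustrated with small helpers: a code extractor (items 3 and 4) and a one-step TD error (item 7). This is a minimal sketch under assumptions: the function names `extract_code_blocks` and `td_error` are hypothetical, the extractor assumes model responses wrap code in Markdown-style triple-backtick fences, and the one-step TD error is a common choice for actor-critic updates rather than something this repo has settled on.

```python
import re
from typing import List, Optional

FENCE = "`" * 3  # Markdown code fence, built this way to avoid a literal fence here

# Captures an optional language tag and the fenced body (non-greedy).
_BLOCK = re.compile(FENCE + r"[ \t]*([\w+-]*)[ \t]*\n(.*?)" + FENCE, re.DOTALL)


def extract_code_blocks(text: str, language: Optional[str] = None) -> List[str]:
    """Return the bodies of fenced code blocks, optionally filtered by language tag."""
    blocks = []
    for lang, body in _BLOCK.findall(text):
        if language is None or lang.lower() == language.lower():
            blocks.append(body.strip("\n"))
    return blocks


def td_error(reward: float, value: float, next_value: float,
             gamma: float = 0.99, done: bool = False) -> float:
    """One-step TD error: r + gamma * V(s') - V(s), with V(s') dropped at episode end."""
    target = reward + (0.0 if done else gamma * next_value)
    return target - value


response = "Here you go:\n" + FENCE + "python\nprint('hi')\n" + FENCE + "\nDone."
code = extract_code_blocks(response, language="python")
delta = td_error(reward=1.0, value=0.5, next_value=2.0, gamma=0.5)
```

In an actor-critic setup, the TD error typically serves both as the critic's regression error and as the signal that scales the actor's update.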

About

Language Action Chains for RL
