Agent Intentions

Introduction / Summary

This Github is the companion to the paper Evaluating Language Model Language Traits

Repository Structure

`/LGBT`

`/COHERENCE`

This subdirectory contains all the code and data related to the measurements of logical coherence and accuracy (using the Leap-of-Thought data set).

`/HHH`

This subdirectory contains all code and data related to the generation of the Helpful Harmless (HHH) dataset and subsequent testing.

`/UII`

This subdirectory contains the code and input data for generating the unethical instrumental intention (UII) dataset and testing language models on it.

Others

all_plots.ipynb: Contains code that generates the distribution plots fig 1, 3, 4 in the paper.
tqa.py: Contains code that generates the distribution plots fig 5, 6 in the paper.
requirements.txt: Lists dependencies required, run pip install -r requirements.txt to install all the packages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Intentions

Introduction / Summary

Repository Structure

`/LGBT`

`/COHERENCE`

`/HHH`

`/UII`

Others

About

Releases

Packages

Contributors 4

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
COHERENCE		COHERENCE
HHH		HHH
LGBT		LGBT
UII		UII
.DS_Store		.DS_Store
Agent_Intentions-5.pdf		Agent_Intentions-5.pdf
LICENSE		LICENSE
README.md		README.md
all_plots.ipynb		all_plots.ipynb
requirements.txt		requirements.txt
tqa.py		tqa.py

License

graceebc9/agent_intentions

Folders and files

Latest commit

History

Repository files navigation

Agent Intentions

Introduction / Summary

Repository Structure

/LGBT

/COHERENCE

/HHH

/UII

Others

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

`/LGBT`

`/COHERENCE`

`/HHH`

`/UII`

Packages