A flexible framework for evaluating constrained generation models, built for the GenLM ecosystem. This library provides standardized interfaces and benchmarks for assessing model performance across various constrained generation tasks.
- Getting Started: Visit our documentation for installation and usage guides.
- API Reference: Browse the API documentation for detailed information about the library's components.
- Cookbook: Check out our examples and tutorials for:
    - Using built-in domains (Pattern Matching, Text-to-SQL, Molecular Synthesis)
    - Creating custom evaluation domains
- Datasets: Specify and iterate over the instances of a constrained generation task.
- Evaluators: Score the model's outputs on each instance.
- Model Adapters: Wrap models to provide a unified generation interface for evaluation.
- Runners: Orchestrate the evaluation loop, with caching of model outputs.
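To make the roles of these components concrete, below is a minimal, self-contained sketch of how they fit together. The class and function names are illustrative assumptions, not the library's actual API; see the API Reference for the real interfaces.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Instance:
    """A single task instance: a prompt plus a reference used for scoring."""
    prompt: str
    reference: str


def toy_dataset() -> Iterable[Instance]:
    """Dataset role: specifies and iterates over task instances (hypothetical example)."""
    yield Instance(prompt="Write 'hello' in uppercase.", reference="HELLO")


class EchoAdapter:
    """Model adapter role: wraps a model behind a uniform generate() interface.

    A real adapter would call a constrained generation model here.
    """
    def generate(self, prompt: str) -> str:
        return "HELLO"


def exact_match_evaluator(output: str, instance: Instance) -> float:
    """Evaluator role: scores the model's output against the reference."""
    return float(output.strip() == instance.reference)


def run(dataset, adapter, evaluator: Callable[[str, Instance], float]) -> float:
    """Runner role: orchestrates generation and scoring (output caching omitted)."""
    scores = []
    for instance in dataset:
        output = adapter.generate(instance.prompt)
        scores.append(evaluator(output, instance))
    return sum(scores) / len(scores)


if __name__ == "__main__":
    print("mean score:", run(toy_dataset(), EchoAdapter(), exact_match_evaluator))
```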
Note: This library is still under active development.
To install from source:

```bash
git clone https://github.com/genlm/genlm-eval.git
cd genlm-eval
pip install -e .
```
For domain-specific dependencies, refer to the cookbook in the documentation.