ruleminer

Python package to discover association rules in Pandas DataFrames.

Features

Here is what the package does:

Generate human-readable validation rules using rule templates containing regular expressions and a Pandas DataFrame dataset
- available functions: min, max, abs, quantile, sum, substr, split, count, sumif and countif
- including parameters for metric filters and rule precisions (including XBRL tolerances)
Evaluate rules and calculate association rules metrics
- available metrics: abs support, abs exceptions, confidence, support, added value, casual confidence, casual support, conviction, lift and rule power factor

Here are some examples of rule templates with regexes with which you can generate validation rules:

The first template generates (with the dataset described in the Usage section) rules like

These generated validation rules can then be used to validate new datasets.

Name		Name	Last commit message	Last commit date
Latest commit History 182 Commits
.github		.github
docs		docs
notebooks		notebooks
ruleminer		ruleminer
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
tox.ini		tox.ini