Connectome is a framework for datasets management with strong emphasis on simplicity, composability and reusability.
- Self-consistency: connectome encourages data transformations that keep entries' fields consistent
- Caching: transformations' caching works out of the box and supports both caching to RAM and to Disk
- Automatic cache invalidation: connectome tracks all the changes made to a dataset and automatically invalidates the cache when something changes, making sure that your cache is always consistent with the data
- Invertible transformations: write consistent pre- and post- processing to build production-ready pipelines
The simplest way is to get it from PyPi:
pip install connectome
Or if you want to try the latest version from GitHub:
git clone https://github.com/neuro-ml/connectome.git
cd connectome
pip install -e .
# or let pip handle the cloning:
pip install git+https://github.com/neuro-ml/connectome.git
The docs are located here
Also, you can check out our Intro to connectome
series of
tutorials here
Some parts of our automatic cache invalidation machinery vere heavily inspired by the cloudpickle project.