Skip to content

NaturalHistoryMuseum/data-importer

Repository files navigation

Data Importer

Tests Coveralls Python version Docs

Overview

This repository contains the pipeline which moves data from the Museum's EMu instance into the Data Portal.

Data is exported from EMu as texexport reports 5 nights a week and then this pipelines moves it into the Data Portal's databases.

The data is versioned using the Splitgill library and stored in MongoDB and Elasticsearch.

This is the 3rd iteration of this importer to date. The code history in this repository only contains the first iteration of the importer as the second iteration was developed in a private repository. To make the code available for the current version, the private parts of the repository have been moved to a different private repo, allowing this repository to be public.

Installation

The data importer can be installed from GitHub:

pip install git+git://github.com/NaturalHistoryMuseum/data-importer.git#egg=data-importer

Tests

Tests are run through docker-compose so that MongoDB and Elasticsearch are available for real testing.

To run the tests:

docker compose run --build test

About

Import EMu data into the NHM Data Portal

Resources

Stars

Watchers

Forks

Packages

No packages published

Contributors 5

Languages