ETL to index PDS data to pdssp
graph TD
A[PDS ODE Web Service - collection] --> |JSON| D(Extraction)
B[PDS ODE Web Service - records] --> |JSON| E(Extraction)
C[PDS ODE Web Site] --> |REFERENCE_CATALOG, MISSION_CATALOG,<br>PERSONNEL_CATALOG, INSTRUMENT_CATALOG,<br>INSTRUMENT_HOST_CATALOG,DATA_SET_CATALOG,<br>VOL_DESC, DATA_SET_MAP_PROJECTION_CATALOG| F(Extraction)
E(Extraction) --> |Files| H[Storage File System]
F(Extraction) --> |Files| M[Storage File System]
D(Extraction) --> |JSON PdsRegistryModel| I[HDF5]
I[HDF5] --> |PdsRegistryModel| N[Transform]
M[Storage File System] --> |PdsRecordsModel, DataSetMapProjectionModel,<br>MissionModel, ReferencesModel,<br>PersonnelsModel, VolumeModel,<br>InstrumentModel, InstrumentHostModel,<br>DataSetModel| L[Transform]
H[Storage File System] --> |PdsRecordModel| N[Transform]
I[HDF5] --> |PdsRegistryModel| L[Transform]
N[Transform] --> |STAC Item, STAC Collection, STAC Catalog| O[STAC repository]
L[Transform] --> |STAC Collection, STAC Catalog| O[STAC repository]
To install pds-crawler, run this command in your terminal:
$ pip install git+https://github.com/pdssp/pds_crawler.git
The sources for pds-crawler can be downloaded from the Github repo.
You can either clone the public repository:
$ git clone git://github.com/pdssp/pds_crawler
Or download the tarball:
$ curl -OJL https://github.com/pdssp/pds_crawler/tarball/master
Once you have a copy of the source, you can install it with:
$ make # install in the system root
$ make user # or Install for non-root usage
$ git clone https://github.com/pdssp/pds_crawler
$ cd pds_crawler
$ make prepare-dev
$ source .pds_crawler
$ make install-dev
To get more information about the preconfigured tasks:
$ make help
To use pds-crawler in a project:
import pds_crawler
$make tests
👤 Jean-Christophe Malapert
Contributions, issues and feature requests are welcome!<br />Feel free to check [issues page](https://github.com/pdssp/pds_crawler/issues). You can also take a look at the [contributing guide](https://github.com/pdssp/pds_crawler/blob/master/CONTRIBUTING.rst)
This project is [GNU Lesser General Public License v3](https://github.com/pdssp/pds_crawler/blob/master/LICENSE) licensed.