Personio dlt-dbt Package

Overview

The Personio dlt-dbt package offers data models to help you transform and analyze Personio data. It's designed to integrate seamlessly with the dlt Personio pipeline, which extracts and loads Personio data into your data warehouse.

Who is this for?

This package is perfect for dbt users who want to integrate Personio data into their analytics workflows without building models from scratch.

Key Features

Staging Models: Clean and prepare raw Personio data for downstream analysis.
Mart Models: Pre-built dimension and fact tables for key Personio entities such as employees, absences, and projects.
Incremental Loading: Supports incremental data processing to optimize performance.
Easy Integration: Designed to work out-of-the-box with the dlt Personio pipeline.
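
The incremental behaviour is not described in detail here. The sketch below illustrates only the general load-ID pattern that such packages commonly follow: find the loads that finished but have not been transformed yet, process those, then mark them as processed. The function name and the sample load IDs are hypothetical and are not taken from the package.

```python
def active_load_ids(completed_loads, processed_loads):
    """Return load IDs that finished loading but were not yet transformed."""
    processed = set(processed_loads)
    # Keep the original order so loads are handled oldest-first.
    return [load_id for load_id in completed_loads if load_id not in processed]


# Hypothetical load IDs, standing in for what the loader records per run.
completed = ["1700000001.1", "1700000002.2", "1700000003.3"]
processed = ["1700000001.1"]

pending = active_load_ids(completed, processed)
print(pending)

# After a successful transformation run, the pending IDs are marked as
# processed, so the next run has nothing to do until new data arrives.
processed.extend(pending)
print(active_load_ids(completed, processed))
```

The point of the pattern is that each run touches only rows from new loads instead of rebuilding every table.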

Setup Instructions

Prerequisites

dbt Core installed in your environment.
Access to a supported data warehouse: BigQuery, Snowflake, Redshift, Athena, or PostgreSQL.
The dlt Personio pipeline is set up and running.
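
If you want to check these prerequisites from a script, a minimal sketch could look like the following. It assumes the `dbt` and `dlt` command-line tools are expected on your PATH and that the warehouses above map to the dlt destination names shown; the helper names are hypothetical.

```python
import shutil

# Warehouses from the prerequisites, written as dlt destination names
# (assumed mapping; "postgresql" is accepted as an alias for "postgres").
SUPPORTED_DESTINATIONS = {
    "bigquery", "snowflake", "redshift", "athena", "postgres", "postgresql",
}


def is_supported_destination(name):
    """Return True if the warehouse name is one of the supported ones."""
    return name.strip().lower() in SUPPORTED_DESTINATIONS


def missing_tools(tools=("dbt", "dlt")):
    """Return the command-line tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


print(is_supported_destination("BigQuery"))
print(missing_tools())
```

An empty list from `missing_tools()` only means the executables were found; it does not verify versions or warehouse credentials.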

Step 1: Set Up the dlt Personio Pipeline

Install dlt:
```
pip install dlt
```
Configure the Pipeline: Follow the dlt Personio pipeline documentation to set up your pipeline. Ensure you have your Personio credentials and destination credentials configured.
Run the Pipeline: Extract and load data from Personio into your data warehouse by running the pipeline.
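
The steps above can be sketched roughly as follows. This is an illustration under assumptions, not the repository's own pipeline script: the pipeline name, the dataset name, and both helper functions are hypothetical, and the `source` argument stands for whatever Personio source object the dlt Personio pipeline documentation tells you to create.

```python
def pipeline_settings(destination, dataset_name="personio_data"):
    """Collect the keyword arguments passed to dlt.pipeline()."""
    return {
        "pipeline_name": "personio",   # hypothetical pipeline name
        "destination": destination,    # e.g. "bigquery" or "postgres"
        "dataset_name": dataset_name,  # hypothetical dataset/schema name
    }


def run_personio_pipeline(source, destination):
    """Load the given Personio source into the chosen destination."""
    # Imported lazily so pipeline_settings() works without dlt installed.
    import dlt

    pipeline = dlt.pipeline(**pipeline_settings(destination))
    # run() extracts, normalizes, and loads the data, returning load info.
    return pipeline.run(source)


print(pipeline_settings("bigquery"))
```

Calling `run_personio_pipeline(...)` requires dlt to be installed and both your Personio and destination credentials to be configured.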

Step 2: Install and Configure the dbt Project

Install the Personio dbt package into your dbt environment.
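Configure your 'dbt_project.yml' file for your project. Note that in dbt the warehouse connection details themselves normally live in the profile that 'dbt_project.yml' references (usually 'profiles.yml').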
Ensure the data from your dlt Personio pipeline is available in your warehouse.
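
One common way to install a dbt package is to list it in a `packages.yml` file and run `dbt deps`. The sketch below only renders such a file as text. The repository URL is an assumption derived from the repository name dlt-hub/dlt-dbt-personio, `main` is a hypothetical revision, and this README does not confirm that the package is meant to be installed this way.

```python
def render_packages_yml(git_url, revision):
    """Render a minimal packages.yml entry for a git-hosted dbt package."""
    lines = [
        "packages:",
        f'  - git: "{git_url}"',
        f'    revision: "{revision}"',
    ]
    return "\n".join(lines) + "\n"


# Assumed URL and hypothetical revision; pin a tag or commit you have verified.
content = render_packages_yml(
    "https://github.com/dlt-hub/dlt-dbt-personio", "main"
)
print(content)
```

After saving the rendered text as `packages.yml` next to your `dbt_project.yml`, `dbt deps` would download the package.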

The models in the dbt package are organized as follows:

dbt_personio/
├── models/
│   ├── marts/
│   │   ├── dim__dlt_loads.sql
│   │   ├── dim_absence_types.sql
│   │   ├── dim_absences.sql
│   │   ├── dim_document_categories.sql
│   │   ├── dim_employees__absence_entitlement.sql
│   │   ├── dim_employees_absences_balance.sql
│   │   ├── dim_employees.sql
│   │   ├── dim_projects.sql
│   ├── staging/
│   │   ├── stg__dlt_loads.sql
│   │   ├── stg_absence_types.sql
│   │   ├── stg_absences.sql
│   │   ├── stg_document_categories.sql
│   │   ├── stg_employees__absence_entitlement.sql
│   │   ├── stg_employees_absences_balance.sql
│   │   ├── stg_employees.sql
│   │   ├── stg_projects.sql
│   ├── dlt_active_load_ids.sql
│   ├── dlt_processed_load_ids.sql
│   ├── sources.yml

Step 3: Run dbt

Execute the dbt models to transform the raw Personio data into useful tables:

dbt build
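
If you prefer to trigger the build from a script, a minimal sketch using Python's standard library could look like this. The helper names are hypothetical; `--select` and `--project-dir` are standard dbt command-line options, and `dim_employees` is one of the mart models listed above.

```python
import subprocess


def dbt_build_command(select=None, project_dir=None):
    """Build the argument list for a `dbt build` invocation."""
    command = ["dbt", "build"]
    if select:
        # Restrict the build to the selected model(s).
        command += ["--select", select]
    if project_dir:
        # Point dbt at a project outside the current directory.
        command += ["--project-dir", project_dir]
    return command


def run_dbt_build(select=None, project_dir=None):
    """Run dbt and raise CalledProcessError if it exits with an error."""
    return subprocess.run(dbt_build_command(select, project_dir), check=True)


print(dbt_build_command(select="dim_employees"))
```

Calling `run_dbt_build()` with no arguments is equivalent to running `dbt build` in the current directory.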

Customization

While this package provides a solid foundation, you can customize it to suit your specific needs:

Modify the models to align with your business logic.
Add relationships between tables by modifying your dlt pipeline schema.

The dimensional modelling part of the package was created with a declarative code generator and has the limitations inherent to modelling raw data directly. We advise that you review the raw data tables and adjust the modelled layer as needed.

Schema diagram

The dbt models above give you a basic template for a data model, which you can customize further to fit your requirements.

The schema of the Personio data, as modelled with the dlt-dbt-generator:

⚠️ Note:

This is a starting template for your data model, not a final product. Customize the data model to fit your needs.

Here's the link to the DB diagram: link.

Optional: Advanced Usage (Generator and Licensing)

This package was created using the dlt-dbt-generator by dlt-plus. For more information about dlt-plus, refer to the dlt-plus documentation. To learn more about the dlt-dbt-generator, consult the dlt-dbt-generator documentation.
