airflow-dag

Here are 21 public repositories matching this topic...

This project implements an ETL pipeline with Apache Airflow that extracts data from Reddit, transforms it, and loads it into an AWS S3 bucket. Airflow orchestrates the workflow so that each step of the ETL process runs reliably and repeatably.

  • Updated Oct 30, 2024
  • Python
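
As a rough illustration of what the transform step in such a pipeline might look like, here is a minimal sketch in plain Python. It is not taken from the repository: the function names `transform_posts` and `to_csv` are hypothetical, and the input fields (`id`, `title`, `score`, `created_utc`) are assumed for the example. Extraction from Reddit, the upload to S3, and the Airflow DAG wiring are deliberately left out.

```python
import csv
import io
from datetime import datetime, timezone

# Assumed column layout for the output file (hypothetical, not from the repository).
FIELDS = ["id", "title", "score", "created_utc"]


def transform_posts(posts):
    """Keep a few fields per post and convert the epoch timestamp to ISO 8601 (UTC)."""
    rows = []
    for post in posts:
        created = datetime.fromtimestamp(post["created_utc"], tz=timezone.utc)
        rows.append({
            "id": post["id"],
            "title": post["title"].strip(),   # drop stray whitespace
            "score": int(post["score"]),      # normalize to an integer
            "created_utc": created.isoformat(),
        })
    return rows


def to_csv(rows):
    """Serialize the transformed rows to CSV text, ready to be written to storage."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

In an Airflow DAG, functions like these would typically be called from separate tasks so that a failed load can be retried without repeating the extraction.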

The script uses Apache Airflow to automate the collection and insertion of KPIs on transaction time and storage usage in a data warehouse. It calculates the time elapsed since the last transaction and the percentage of storage in use, and periodically records both values in dedicated tables.

  • Updated Sep 21, 2024
  • Python
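
The two KPIs described above can be sketched as small pure functions. This is only an illustration under stated assumptions: the names `seconds_since_last_transaction` and `storage_usage_percent` are hypothetical, the units (seconds, bytes) are assumed, and the warehouse queries and table inserts that the actual script performs are not shown.

```python
from datetime import datetime, timezone


def seconds_since_last_transaction(last_transaction_at, now=None):
    """Return the time elapsed since the last transaction, in seconds.

    Both datetimes are expected to be timezone-aware; `now` defaults to the
    current UTC time and can be overridden for testing.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return (now - last_transaction_at).total_seconds()


def storage_usage_percent(used_bytes, total_bytes):
    """Return storage usage as a percentage of total capacity, rounded to 2 decimals."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return round(100.0 * used_bytes / total_bytes, 2)
```

A scheduled Airflow task could call these with values read from the warehouse and insert the results into the KPI tables on each run.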
