Python Data Engineering Projects
Discover data engineering projects built with Python. Browse workflows, pipelines, and integrations from the community.
11 projects found
1. Drift Detective
Drift Detective is a Python library for tracking schema evolution using versioned JSON snapshots
2. AIRFLow Medical Data Pipeline
Enterprise-grade ETL pipeline transforming medical XML data into actionable business intelligence
3. Airflow Python React Widgets
Python React Experiment
4. Cricket Analytics Data Pipeline
CAP is an end-to-end cricket analytics platform built on Cricsheet ball-by-ball data
5. Reddit ETL Pipeline in Docker
Reddit data engineering ETL pipeline using Spark, Airflow, and MinIO in Docker, organized as a medallion architecture
6. Baskpipe
Fully AWS-native data pipelines for processing basketball (NBA) data.
7. Github Stars Monitor
Never miss a new top-starred repository
8. Daggie The Airflow DAG Quality Auditor
A friendly (and sometimes strict!) animated DAG auditor for Apache Airflow 3.1+
9. Dbt power tools AI based Documentation
A powerful CLI tool that generates LLM-powered documentation for dbt models and columns
10. AIRFLOW YAHOO ETL
Scalable Yahoo API ETL pipeline using Airflow
11. Airflow Bulk Pause Unpause Plugin
Bulk manage Airflow DAG states effortlessly — pause or unpause in one action.