Python Data Engineering Projects

Discover data engineering projects built with Python. Browse workflows, pipelines, and integrations from the community.

11 projects found

1. Drift Detective

Drift Detective is a Python library for tracking schema evolution using versioned JSON snapshots

by varga.dani6

2. AIRFLow Medical Data Pipeline

Enterprise-grade ETL pipeline transforming medical XML data into actionable business intelligence

by imen.bnamar

3. Airflow Python React Widgets

Python React Experiment

by Rahul Rajasekharan

4. Cricket Analytics Data Pipeline

CAP is an end-to-end cricket analytics platform built on Cricsheet ball-by-ball data

by Rahul Rajasekharan

5. Reddit ETL Pipeline in Docker

Reddit data engineering ETL pipeline using Spark, Airflow, and MinIO in Docker, organized as a medallion architecture

by Abdullah

6. Baskpipe

Fully AWS-native data pipelines for processing basketball (NBA) data.

by dominik.zsajovic

7. Github Stars Monitor

Never miss a new top starred repository

by maxime.lemaitre

8. Daggie The Airflow DAG Quality Auditor

A friendly (and sometimes strict!) animated DAG auditor for Apache Airflow 3.1+

by Rahul Rajasekharan

9. Dbt power tools AI based Documentation

A powerful CLI tool that generates LLM-powered documentation for dbt models and columns

by Rahul Rajasekharan

10. AIRFLOW YAHOO ETL

Scalable Yahoo API ETL pipeline using Airflow

by ravitejach888

11. Airflow Bulk Pause Unpause Plugin

Bulk manage Airflow DAG states effortlessly — pause or unpause in one action.

by Rahul Rajasekharan