Apache Spark Data Engineering Projects
Discover data engineering projects built with Apache Spark. Browse workflows, pipelines, and integrations from the community.
7 projects found
1. Real-Time-Sales-Streaming-Pipeline
Modern Lakehouse Architecture with Kafka + Spark Structured Streaming + Delta Lake
by imen.bnamar
2. Bluesky NBA Real-Time Sentiment Analysis
A real-time data streaming pipeline that captures live posts from Bluesky regarding the NBA, perform
by imen.bnamar
3. Yelp Batch ETL Pipeline
A batch ETL pipeline that processes raw Yelp business data to generate analytics and insights
by darracq.aurelien
4. Reddit ETL Pipeline in Docker
Reddit Data Engineering ETL Pipeline: Spark, Airflow, and MinIO in Docker with a Medallion Architecture
by Abdullah
5. F1 Insights Real Time Replay
What if your dashboards were as real-time as Max Verstappen?
by hiteshkhk0105
6. Automated News Intelligence Pipeline
An end-to-end automated pipeline for collecting, processing, and analyzing news articles with machin
by charbeldaher34
7. AIRFLOW YAHOO ETL
SCALABLE_YAHOO_API_ETL_PIPELINE_USING_AIRFLOW
by ravitejach888