Discover data and AI engineering projects built with Apache Spark. Browse workflows, pipelines, applications, and integrations from the community.
Teams use Apache Spark, an open-source engine for large-scale data processing, when building, orchestrating, testing, and operating data and AI systems in production. This page collects real projects that use Apache Spark, so readers can see how it is applied in practice across pipelines, applications, tooling, and platform work. With 9 published projects currently listed, the page is most useful for comparing implementations side by side rather than just browsing a tag.
Useful Apache Spark projects usually explain the problem they solve, the surrounding stack, and how Apache Spark fits into the broader architecture. Strong examples also show operational details such as deployment approach, testing, observability, data quality controls, and documentation. That context makes the page more valuable to readers evaluating tools and to agents looking for grounded examples.