Core libraries for Apache Spark, a unified analytics engine for large-scale data processing.

Artifacts using Spark Project Core (2,818)
Sort by:Popular

Spark SQL is Apache Spark's module for working with structured data based on DataFrames.
Last Release on Feb 2, 2026
The machine learning library for Apache Spark, providing scalable algorithms and tools for ML pipelines.
Last Release on Feb 2, 2026
Hive Query Language
Last Release on Nov 23, 2025
Spark Project Streaming
Last Release on Feb 2, 2026
Spark Project Hive
Last Release on Feb 2, 2026
Spark Project Catalyst
Last Release on Feb 2, 2026
Hive Common
Last Release on Nov 23, 2025
Spark Avro
Last Release on Feb 2, 2026
Kafka 0.10+ Source For Structured Streaming
Last Release on Feb 2, 2026
Spark Project GraphX
Last Release on Feb 2, 2026