Distributed Computing
Core libraries for Apache Spark, a unified analytics engine for large-scale data processing.
Last Release on Feb 2, 2026
3. Flink : Core (679 usages)
org.apache.flink » flink-core Apache
Flink : Core
Last Release on May 12, 2026
Legacy core libraries for Apache Hadoop, including HDFS and MapReduce functionality.
Last Release on Jul 24, 2013
Relocated → org.apache.hadoop » hadoop-client
5. Scalding Core (55 usages)
com.twitter » scalding-core Apache
scalding-core
Last Release on Dec 22, 2017
Java-based middleware for in-memory processing of big data in a distributed environment.
Last Release on Aug 10, 2016
10. Apache Crunch Core (7 usages)
org.apache.crunch » crunch Apache
