10

8

6

4

2


10.0

9.9

9.9

9.9

9.9

9.8

9.7

7.8

9.3

5.4

8.6

7.7

17 Big Data packages and projects

  • Apache Spark

    10.0 9.9 Scala
    Big data platform
  • Deeplearning4J

    9.9 9.9 L1 Java
    Deep Learning for Java, Scala & Clojure on Hadoop & Spark - From Skymind
  • Kafka

    9.9 9.8 L2 Java
    Kafka is a message broker project and aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
  • Scalding

    9.7 7.8 Scala
    A Scala binding for the Cascading abstraction of Hadoop MapReduce.
  • Summingbird

    9.3 5.4 Scala
    An implementation of the “lambda architecture” as a software abstraction
  • Reactive-kafka

    8.6 7.7 Scala
    Reactive Streams API for Apache Kafka.
  • BIDMach

    8.5 9.0 Jupyter Notebook
    CPU and GPU machine learning library, using JNI for GPU computation.
  • Gearpump

    8.2 1.1 Scala
    Lightweight real-time big data streaming engine
  • Sparkta

    7.7 8.9 Scala
    Real Time Aggregation based on Spark Streaming.
  • Scio

    7.6 9.5 Scala
    A Scala API for Apache Beam and Google Cloud Dataflow
  • Scoobi

    7.2 0.0 Scala
    Write type-safe Hadoop programs in idiomatic Scala way
  • Hail

    5.6 9.6 Scala
    Scalable genetic data analysis
  • Scoozie

    4.7 0.0 Scala
    Scala DSL on top of Oozie XML
  • Scrunch

    4.6 5.9 L3 Java
    A Scala wrapper for Apache Crunch which provides a framework for writing, testing, and running MapReduce pipelines.
  • spark-deployer

    3.5 3.9 Scala
    A sbt plugin which helps deploying Apache Spark stand-alone cluster and submitting job on cloud system like AWS EC2.
  • Shadoop

    1.7 0.0 Scala
    A Scala DSL for Hadoop MapReduce.
  • GridScale

    1.3 7.1 Java
    A Scala API for computing clusters and grids.

Add another 'Big Data' Package