10

8

6

4

2


9.9

9.9

10.0

10.0

8.4

9.6

10.0

9.9

9.3

9.1

9.9

8.3

26 Big Data packages and projects

  • Flink

    9.9 9.9 L2 Java
    Apache Flink
  • Apache Spark

    10.0 10.0 Scala
    Apache Spark - A unified analytics engine for large-scale data processing
  • SaaSHub helps you find the best software and product alternatives
    Promo www.saashub.com
    SaaSHub Logo
  • Hail

    8.4 9.6 Python
    Cloud-native genomic dataframes and batch computing
  • Kafka

    10.0 9.9 L2 Java
    Apache Kafka - A distributed event streaming platform
  • Scio

    9.3 9.1 Scala
    A Scala API for Apache Beam and Google Cloud Dataflow.
  • Deeplearning4J

    9.9 8.3 L1 Java
    Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
  • metorikku

    7.4 2.4 Scala
    DISCONTINUED. A simplified, lightweight ETL Framework based on Apache Spark
  • Jupyter Scala

    8.8 8.9 Scala
    A Scala kernel for Jupyter
  • Reactive-kafka

    8.9 7.2 Scala
    Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
  • GridScale

    2.2 8.0 Scala
    Scala library for accessing various file, batch systems, job schedulers and grid middlewares.
  • Sparkplug

    1.9 0.0 Scala
    Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
  • Spark Tools

    1.1 0.0 Scala
    Executable Apache Spark Tools: Format Converter & SQL Processor
  • Spark Utils

    2.0 4.6 Scala
    Basic framework utilities to quickly start writing production ready Apache Spark applications
  • Scalding

    9.5 2.5 Scala
    A Scala API for Cascading
  • Scrunch

    5.1 1.4 L3 Java
    DISCONTINUED. Mirror of Apache Crunch (Incubating)
  • BIDMach

    8.2 0.0 Scala
    CPU and GPU-accelerated Machine Learning Library
  • Scoozie

    4.7 0.0 Scala
    DISCONTINUED. Scala DSL on top of Oozie XML [GET https://api.github.com/repos/klout/scoozie: 404 - Not Found // See: https://docs.github.com/rest/repos/repos#get-a-repository]
  • Scoobi

    6.7 0.0 Scala
    A Scala productivity framework for Hadoop.
  • Gearpump

    8.0 0.0 Scala
    Lightweight real-time big data streaming engine over Akka
  • Vegas

    7.5 0.0 Scala
    The missing MatPlotLib for Scala + Spark
  • raster-frames

    2.1 0.0 Scala
    DISCONTINUED. Spark DataFrames for earth observation data
  • Schemer

    3.6 0.0 Scala
    Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
  • Summingbird

    9.3 1.7 Scala
    DISCONTINUED. Streaming MapReduce with Scalding and Storm
  • Sparkta

    8.0 0.0 Scala
    DISCONTINUED. Real Time Analytics and Data Pipelines based on Spark Streaming
  • spark-deployer

    3.3 0.0 Scala
    Deploy Spark cluster in an easy way.
  • Shadoop

    1.2 0.0 Scala
    A wrapper for Hadoop in Scala

Add another 'Big Data' Package