Selected Tags

Click on a tag to remove it

More Tags

Click on a tag to add it and filter down

Big Data packages

Showing projects tagged as Big Data

  • Kafka

    10.0 9.8 L2 Java
    Kafka is a message broker project and aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
  • Apache Spark

    10.0 9.9 Scala
    Big data platform
  • Deeplearning4J

    9.9 8.9 L1 Java
    Deep Learning for Java, Scala & Clojure on Hadoop & Spark - From Skymind
  • Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.
    Promoted scoutapm.com
  • Flink

    9.9 10.0 L2 Java
    Processing framework with powerful stream- and batch-processing capabilities.
  • Scalding

    9.6 2.7 Scala
    A Scala binding for the Cascading abstraction of Hadoop MapReduce.
  • Summingbird

    9.3 0.0 Scala
    An implementation of the “lambda architecture” as a software abstraction
  • Scio

    9.3 9.5 Scala
    A Scala API for Apache Beam and Google Cloud Dataflow
  • Reactive-kafka

    8.9 8.6 Scala
    Reactive Streams API for Apache Kafka.
  • Jupyter Scala

    8.7 8.5 Scala
    Lightweight Scala kernel for Jupyter / IPython 3
  • BIDMach

    8.4 1.0 Jupyter Notebook
    CPU and GPU machine learning library, using JNI for GPU computation.
  • Gearpump

    8.2 0.0 Scala
    Lightweight real-time big data streaming engine
  • Sparkta

    8.1 0.0 Scala
    Real Time Aggregation based on Spark Streaming.
  • Hail

    8.0 9.7 Scala
    Scalable genetic data analysis
  • Vegas

    7.7 0.0 Scala
    The missing MatPlotLib for Scala + Spark
  • Scoobi

    7.1 0.0 Scala
    Write type-safe Hadoop programs in idiomatic Scala way
  • metorikku

    6.4 8.5 Scala
    A simplified, lightweight ELT Framework based on Apache Spark
  • DynaML

    5.2 7.4 Scala
    Scala Library/REPL for Machine Learning Research
  • Scrunch

    5.1 2.6 L3 Java
    A Scala wrapper for Apache Crunch which provides a framework for writing, testing, and running MapReduce pipelines.
  • Scoozie

    4.7 0.0 Scala
    Scala DSL on top of Oozie XML
  • spark-deployer

    3.5 0.0 Scala
    A sbt plugin which helps deploying Apache Spark stand-alone cluster and submitting job on cloud system like AWS EC2.
  • Schemer

    3.4 0.0 Scala
    Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
  • raster-frames

    2.1 0.0 Scala
    Spark DataFrames for earth observation data
  • GridScale

    2.0 5.2 Java
    A Scala API for computing clusters and grids.
  • Shadoop

    1.9 0.0 Scala
    A Scala DSL for Hadoop MapReduce.
  • Sparkplug

    1.9 4.1 Scala
    Spark package to "plug" holes in data using SQL based rules
  • Spark Utils

    1.6 3.4 Scala
    Basic framework utilities to quickly start writing production ready Apache Spark applications
  • Spark Tools

    1.0 3.4 Scala
    Executable Apache Spark Tools: Format Converter & SQL Processor