Popularity

9.5

Stable

Activity

0.0

Stable

Stars 3,147

Watchers 190

Forks 654

Last Commit 12 months ago

Code Quality Rank: L1

Programming language: JavaScript

License: Apache License 2.0

Tags: Science And Data Analysis

Latest version: v0.8.3

Spark Notebook alternatives and similar packages

Based on the "Science and Data Analysis" category.

MLLib

10.0 10.0 Spark Notebook VS MLLib

Apache Spark - A unified analytics engine for large-scale data processing
PredictionIO

9.9 0.0 Spark Notebook VS PredictionIO

DISCONTINUED. PredictionIO, a machine learning server for developers and ML engineers.

Zeppelin

9.8 8.7 L2 Spark Notebook VS Zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Smile

9.7 9.8 L2 Spark Notebook VS Smile

Statistical Machine Intelligence & Learning Engine
BigDL

9.7 9.9 Spark Notebook VS BigDL

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
Breeze

9.5 5.1 Spark Notebook VS Breeze

Breeze is a numerical processing library for Scala.
Algebird

9.3 7.6 Spark Notebook VS Algebird

Abstract Algebra for Scala
Spire

8.9 5.6 Spark Notebook VS Spire

Powerful new number types and numeric abstractions for Scala.
Figaro

8.0 0.0 Spark Notebook VS Figaro

Figaro Programming Language and Core Libraries
Tensorflow_scala

8.0 0.0 Spark Notebook VS Tensorflow_scala

TensorFlow API for the Scala Programming Language
Squants

7.8 2.9 Spark Notebook VS Squants

The Scala API for Quantities, Units of Measure and Dimensional Analysis
FACTORIE

7.5 0.0 Spark Notebook VS FACTORIE

FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
Saddle

6.9 0.0 Spark Notebook VS Saddle

DISCONTINUED. A minimalist port of Pandas to Scala
ND4S

6.1 0.0 Spark Notebook VS ND4S

DISCONTINUED. ND4S: N-Dimensional Arrays for Scala. Scientific Computing a la Numpy. Based on ND4J.
Chalk

5.7 0.0 Spark Notebook VS Chalk

DISCONTINUED. Chalk is a natural language processing library.
Compute.scala

4.8 0.0 Spark Notebook VS Compute.scala

Scientific computing with N-dimensional arrays
Libra

4.5 0.0 Spark Notebook VS Libra

A dimensional analysis library based on dependent types
Numsca

4.4 2.7 Spark Notebook VS Numsca

numsca is numpy for scala
OpenMOLE

4.4 9.4 Spark Notebook VS OpenMOLE

Workflow engine for exploration of simulation models using high throughput computing
Clustering4Ever

4.0 0.0 Spark Notebook VS Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Optimus * 96

4.0 0.0 Spark Notebook VS Optimus * 96

Optimus is a mathematical programming library for Scala.
rscala

3.6 6.1 Spark Notebook VS rscala

The Scala interpreter is embedded in R and callbacks to R from the embedded interpreter are supported. Conversely, the R interpreter is embedded in Scala.
LoMRF

3.2 0.0 Spark Notebook VS LoMRF

LoMRF is an open-source implementation of Markov Logic Networks
Tyche

2.9 0.0 Spark Notebook VS Tyche

Statistics utilities for the JVM - in Scala!
MGO

2.8 5.6 Spark Notebook VS MGO

Purely functional genetic algorithms for multi-objective optimisation
Rings

2.7 3.5 Spark Notebook VS Rings

Rings: efficient JVM library for polynomial rings
Synapses

2.5 0.0 Spark Notebook VS Synapses

A group of neural-network libraries for functional and mainstream languages
Axle

2.4 5.5 Spark Notebook VS Axle

Axle Domain Specific Language for Scientific Cloud Computing and Visualization
SwiftLearner

2.2 0.0 Spark Notebook VS SwiftLearner

SwiftLearner: Scala machine learning library
Persist-Units

0.9 0.0 Spark Notebook VS Persist-Units

Scala Units of Measure Types
OscaR

0.2 - Spark Notebook VS OscaR

a Scala toolkit for solving Operations Research problems

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

Spark Notebook

The Spark Notebook is the open source notebook aimed at enterprise environments, providing Data Scientists and Data Engineers with an interactive web-based editor that can combine Scala code, SQL queries, Markup and JavaScript in a collaborative manner to explore, analyse and learn from massive data sets.

[notebook intro](./docs/images/geo-airports.png)

The Spark Notebook allows performing reproducible analysis with Scala, Apache Spark and the Big Data ecosystem.

Features Highlights

[Apache Spark](./docs/images/spark-logo-192x100px.png)

Apache Spark is available out of the box, and is simply accessed by the variable sparkContext or sc.
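
As a minimal sketch of what this looks like in practice, the following notebook cell uses the pre-bound `sc` variable to run a small job. It assumes it is executed inside a running Spark Notebook, where `sc` already exists; the value names (`numbers`, `evens`, `evenCount`, `evenSum`) are illustrative only.

```scala
// Notebook cell: `sc` is assumed to be pre-bound by the Spark Notebook,
// so it is deliberately not defined in this snippet.
val numbers = sc.parallelize(1 to 100)     // distribute a local range as an RDD
val evens = numbers.filter(_ % 2 == 0)     // lazy transformation, nothing runs yet
val evenCount = evens.count()              // action: triggers the job, returns a Long
val evenSum = evens.reduce(_ + _)          // action: adds up the even values
```

Outside the notebook (for example in a standalone application) the same calls work, but the context has to be created explicitly first.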

Multiple Spark Context Support

One of the most useful features of the Spark Notebook is the isolation of running notebooks. Each started notebook spawns a new JVM with its own SparkSession instance. This allows maximal flexibility for:

managing dependencies without clashes
accessing different clusters
tuning each notebook differently
external scheduling (on the roadmap)

Metadata-driven configuration

Because every notebook has its own Spark context, each one can be configured independently through [metadata-driven](./docs/metadata.md) configuration.

Scala

The Spark Notebook exclusively supports the Scala programming language, the Unpredicted Lingua Franca for Data Science, and extensively exploits the JVM ecosystem of libraries to drive a smooth evolution of data-driven software from exploration to production.

The Spark Notebook is available for *NIX and Windows systems in easy-to-use ZIP/TAR, Docker and DEB packages.

Reactive

All components in the Spark Notebook are dynamic and reactive.

The Spark Notebook comes with dynamic charts, and most (if not all) components can be listened to and can react to events. This is very helpful in many cases, for example:

data entering the system live at runtime
visual plots of events
multiple interconnected visual components

Dynamic and reactive components mean that you don't have to write the HTML, JS, and server code just for basic use cases.

Quick Start

Go to [Quick Start](./docs/quick_start.md) for our 5-minute guide to get up and running with the Spark Notebook.

Come on over to Gitter to discuss things, to get some help, or to start contributing!

Learn more

- [Explore the Spark Notebook](./docs/exploring_notebook.md)
- [HTML Widgets](./docs/widgets_html.md)
- [Visualization Widgets](./docs/widgets_viz.md)
- [Notebook Browser](./docs/notebook_browser.md)
- Configuration
  - [Notebook Configuration and Metadata](./docs/metadata.md)
  - [Using Cluster Configurations](./docs/using_cluster_tab.md)
  - [Versioned notebook storage with Git(hub)](./modules/git-notebook-provider/README.md)
- [Running on Clusters and Clouds](./docs/clusters_clouds.md)
- [Community](./docs/community.md)
- Advanced Topics
  - [Using Releases](./docs/using_releases.md)
  - [Building from Sources](./docs/build_from_source.md)
  - [Creating Specific Distributions](./docs/build_specific_distros.md)
  - [Creating your own custom visualizations](./docs/custom_charts.md)
  - [User Authentication](./docs/authentication.md)
    - Supports: Basic, Form & Kerberos auth, and many more via pac4j (OAuth, OpenID, ...)
    - Passing the logged-in user to secure Hadoop+YARN clusters via [proxy-user impersonation](./docs/proxyuser_impersonation.md)
- Advanced: How to develop/improve spark-notebook
  - [Overview of Project structure](./docs/code_structure.md)

Testimonials

Skymind - Deeplearning4j

Spark Notebook gives us a clean, useful way to mix code and prose when we demo and explain our tech to customers. The Spark ecosystem needed this.

Vinted.com

It allows our analysts and developers (15+ users) to run ad-hoc queries, to perform complex data analysis and data visualisations, prototype machine learning pipelines. In addition, we use it to power our BI dashboards.

Adopters

| Name | URL | Description |
|------|-----|-------------|
| Kensu | website | Lifting Data Science to the Enterprise level |
| Agile Lab | website | The only Italian Spark Certified systems integrator |
| CloudPhysics | website | Data-Driven Insights for Smarter IT |
| Aliyun | product | Spark runtime environment on ECS and management tool of Spark Cluster running on Aliyun ECS |
| EMBL European Bioinformatics Institute | website | EMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry. |
| Metail | website | The best body shape and garment fit company in the world. To create and empower everyone’s online body identity. |
| kt NexR | website | kt NexR has been one of the leading Big Data companies in Korea since 2007. |
| Skymind | website | At Skymind, we’re tackling some of the most advanced problems in data analysis and machine intelligence. We offer state-of-the-art, flexible, scalable deep learning for industry. |
| Amino | website | A new way to get the facts about your health care choices. |
| Vinted | website | Online marketplace and a social network focused on young women’s lifestyle. |
| Vingle | website | Vingle is the community where you can meet someone like you. |
| 47 Degrees | website | 47 Degrees is a global consulting firm and certified Typesafe & Databricks Partner specializing in Scala & Spark. |
| Barclays | website | Barclays is a British multinational banking and financial services company headquartered in London. |
| Swisscom | website | Swisscom is the leading mobile service provider in Switzerland. |
| Knoldus | website | Knoldus is a global consulting firm and certified "Select" Lightbend & Databricks Partner specializing in the Scala & Spark ecosystem. |