Popularity

9.7

Stable

Activity

9.8

Stars 5,924

Watchers 271

Forks 1,118

Last Commit about 4 hours ago

Code Quality Rank: L2

Programming language: Java

License: GNU General Public License v3.0 or later

Tags: Science And Data Analysis

Latest version: v2.6.0

Smile alternatives and similar packages

Based on the "Science and Data Analysis" category.

MLLib

10.0 10.0 Smile VS MLLib

Apache Spark - A unified analytics engine for large-scale data processing
PredictionIO

9.9 0.0 Smile VS PredictionIO

DISCONTINUED. PredictionIO, a machine learning server for developers and ML engineers.

Zeppelin

9.8 8.7 L2 Smile VS Zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
BigDL

9.7 9.9 Smile VS BigDL

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
Breeze

9.5 5.1 Smile VS Breeze

Breeze is a numerical processing library for Scala.
Spark Notebook

9.5 0.0 L1 Smile VS Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.
Algebird

9.3 7.6 Smile VS Algebird

Abstract Algebra for Scala
Spire

8.9 5.6 Smile VS Spire

Powerful new number types and numeric abstractions for Scala.
Figaro

8.0 0.0 Smile VS Figaro

Figaro Programming Language and Core Libraries
Tensorflow_scala

8.0 0.0 Smile VS Tensorflow_scala

TensorFlow API for the Scala Programming Language
Squants

7.8 2.9 Smile VS Squants

The Scala API for Quantities, Units of Measure and Dimensional Analysis
FACTORIE

7.5 0.0 Smile VS FACTORIE

FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
Saddle

6.9 0.0 Smile VS Saddle

DISCONTINUED. A minimalist port of Pandas to Scala
ND4S

6.1 0.0 Smile VS ND4S

DISCONTINUED. ND4S: N-Dimensional Arrays for Scala. Scientific Computing a la Numpy. Based on ND4J.
Chalk

5.7 0.0 Smile VS Chalk

DISCONTINUED. Chalk is a natural language processing library.
Compute.scala

4.8 0.0 Smile VS Compute.scala

Scientific computing with N-dimensional arrays
Libra

4.5 0.0 Smile VS Libra

A dimensional analysis library based on dependent types
Numsca

4.4 2.7 Smile VS Numsca

numsca is numpy for scala
OpenMOLE

4.4 9.4 Smile VS OpenMOLE

Workflow engine for exploration of simulation models using high throughput computing
Clustering4Ever

4.0 0.0 Smile VS Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Optimus * 96

4.0 0.0 Smile VS Optimus * 96

Optimus is a mathematical programming library for Scala.
rscala

3.6 6.1 Smile VS rscala

The Scala interpreter is embedded in R and callbacks to R from the embedded interpreter are supported. Conversely, the R interpreter is embedded in Scala.
LoMRF

3.2 0.0 Smile VS LoMRF

LoMRF is an open-source implementation of Markov Logic Networks
Tyche

2.9 0.0 Smile VS Tyche

Statistics utilities for the JVM - in Scala!
MGO

2.8 5.6 Smile VS MGO

Purely functional genetic algorithms for multi-objective optimisation
Rings

2.7 3.5 Smile VS Rings

Rings: efficient JVM library for polynomial rings
Synapses

2.5 0.0 Smile VS Synapses

A group of neural-network libraries for functional and mainstream languages
Axle

2.4 5.5 Smile VS Axle

Axle Domain Specific Language for Scientific Cloud Computing and Visualization
SwiftLearner

2.2 0.0 Smile VS SwiftLearner

SwiftLearner: Scala machine learning library
Persist-Units

0.9 0.0 Smile VS Persist-Units

Scala Units of Measure Types
OscaR

0.2 - Smile VS OscaR

a Scala toolkit for solving Operations Research problems

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

Smile

Smile (Statistical Machine Intelligence and Learning Engine) is a fast and comprehensive machine learning, NLP, linear algebra, graph, interpolation, and visualization system in Java and Scala. With advanced data structures and algorithms, Smile delivers state-of-the-art performance. Smile is well documented; see the project website for programming guides and more information.

Smile covers every aspect of machine learning, including classification, regression, clustering, association rule mining, feature selection, manifold learning, multidimensional scaling, genetic algorithms, missing value imputation, and efficient nearest neighbor search.

Smile implements the following major machine learning algorithms:

Classification: Support Vector Machines, Decision Trees, AdaBoost, Gradient Boosting, Random Forest, Logistic Regression, Neural Networks, RBF Networks, Maximum Entropy Classifier, KNN, Naïve Bayesian, Fisher/Linear/Quadratic/Regularized Discriminant Analysis.
Regression: Support Vector Regression, Gaussian Process, Regression Trees, Gradient Boosting, Random Forest, RBF Networks, OLS, LASSO, ElasticNet, Ridge Regression.
Feature Selection: Genetic Algorithm based Feature Selection, Ensemble Learning based Feature Selection, TreeSHAP, Signal Noise ratio, Sum Squares ratio.
Clustering: BIRCH, CLARANS, DBSCAN, DENCLUE, Deterministic Annealing, K-Means, X-Means, G-Means, Neural Gas, Growing Neural Gas, Hierarchical Clustering, Sequential Information Bottleneck, Self-Organizing Maps, Spectral Clustering, Minimum Entropy Clustering.
Association Rule & Frequent Itemset Mining: FP-growth mining algorithm.
Manifold Learning: IsoMap, LLE, Laplacian Eigenmap, t-SNE, UMAP, PCA, Kernel PCA, Probabilistic PCA, GHA, Random Projection, ICA.
Multi-Dimensional Scaling: Classical MDS, Isotonic MDS, Sammon Mapping.
Nearest Neighbor Search: BK-Tree, Cover Tree, KD-Tree, SimHash, LSH.
Sequence Learning: Hidden Markov Model, Conditional Random Field.
Natural Language Processing: Sentence Splitter and Tokenizer, Bigram Statistical Test, Phrase Extractor, Keyword Extractor, Stemmer, POS Tagging, Relevance Ranking.

The libraries are available from the Maven Central repository. Add the following to your project's pom.xml file:

    <dependency>
      <groupId>com.github.haifengl</groupId>
      <artifactId>smile-core</artifactId>
      <version>2.6.0</version>
    </dependency>

For NLP, use the artifactId smile-nlp.

For the Scala API, use

    libraryDependencies += "com.github.haifengl" %% "smile-scala" % "2.6.0"

For the Kotlin API, add the following to the dependencies section of your Gradle build script:

    implementation("com.github.haifengl:smile-kotlin:2.6.0")

For the Clojure API, add the following dependency to your project or build file:

    [org.clojars.haifengl/smile "2.6.0"]

Some algorithms rely on BLAS and LAPACK (for example, manifold learning, some clustering algorithms, Gaussian Process regression, and MLP). To use these algorithms, include OpenBLAS for optimized matrix computation:

    libraryDependencies ++= Seq(
      "org.bytedeco" % "javacpp"   % "1.5.4"        classifier "macosx-x86_64" classifier "windows-x86_64" classifier "linux-x86_64" classifier "linux-arm64" classifier "linux-ppc64le" classifier "android-arm64" classifier "ios-arm64",
      "org.bytedeco" % "openblas"  % "0.3.10-1.5.4" classifier "macosx-x86_64" classifier "windows-x86_64" classifier "linux-x86_64" classifier "linux-arm64" classifier "linux-ppc64le" classifier "android-arm64" classifier "ios-arm64",
      "org.bytedeco" % "arpack-ng" % "3.7.0-1.5.4"  classifier "macosx-x86_64" classifier "windows-x86_64" classifier "linux-x86_64" classifier "linux-arm64" classifier "linux-ppc64le"
    )

In this example, we include all supported 64-bit platforms and filter out 32-bit platforms. Include only the platforms you need to save space.

If you prefer other BLAS implementations, you can use any library found on the "java.library.path" or on the class path, by specifying it with the "org.bytedeco.openblas.load" system property. For example, to use the BLAS library from the Accelerate framework on Mac OS X, we can pass options such as -Djava.library.path=/usr/lib/ -Dorg.bytedeco.openblas.load=blas.

For a default installation of MKL, that would be -Dorg.bytedeco.openblas.load=mkl_rt. Alternatively, you can simply include the smile-mkl module in your project, which bundles the MKL binaries. With the smile-mkl module on the class path, Smile will automatically switch to MKL.

    libraryDependencies += "com.github.haifengl" %% "smile-mkl" % "2.6.0"
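When switching BLAS implementations through system properties, it can help to confirm what the JVM actually received. The following is a minimal sketch using only the JDK; the class name BlasPropertyCheck and its describe helper are hypothetical, and the code only reads the two properties named above without loading any native library.

```java
/** Prints the two system properties that the text above uses to select a BLAS library. */
final class BlasPropertyCheck {

    /** Formats the value of a system property, or notes that it is unset. */
    static String describe(String key) {
        String value = System.getProperty(key);
        return key + " = " + (value == null ? "(not set)" : value);
    }

    public static void main(String[] args) {
        // Both keys are taken from the text above; nothing is loaded here.
        System.out.println(describe("java.library.path"));
        System.out.println(describe("org.bytedeco.openblas.load"));
    }
}
```

Running it with the same -D options you pass to your application shows whether they reached the JVM.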

Shell

Smile comes with interactive shells for Java, Scala and Kotlin. Download pre-packaged Smile from the releases page. In the home directory of Smile, type

    ./bin/smile

to enter the Scala shell. You can run any valid Scala expression in the shell. In the simplest case, you can use it as a calculator. In addition, all high-level Smile operators are predefined in the shell. By default, the shell uses up to 75% of memory. If you need more memory to handle large data, use the option -J-Xmx or -XX:MaxRAMPercentage. For example,

    ./bin/smile -J-Xmx30G

You can also modify the configuration file ./conf/smile.ini for the memory and other JVM settings.
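To check that a memory setting took effect, you can ask the JVM for its maximum heap size. This is a small sketch using the standard Runtime API; the class name HeapCheck and the formatGiB helper are hypothetical, and the reported figure may differ slightly from the value passed to -Xmx.

```java
import java.util.Locale;

/** Reports the maximum heap size the current JVM is allowed to use. */
final class HeapCheck {

    /** Converts a byte count to a human-readable GiB string with two decimals. */
    static String formatGiB(long bytes) {
        return String.format(Locale.ROOT, "%.2f GiB", bytes / (1024.0 * 1024.0 * 1024.0));
    }

    public static void main(String[] args) {
        // maxMemory() returns the most memory the JVM will attempt to use for the heap.
        long maxHeapBytes = Runtime.getRuntime().maxMemory();
        System.out.println("Max heap: " + formatGiB(maxHeapBytes));
    }
}
```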

To use Java's JShell, type

    ./bin/jshell.sh

which has Smile's jars in the classpath. Similarly, run

    ./bin/kotlin.sh

to enter the Kotlin REPL.

Model Serialization

Most models implement the Java Serializable interface (all classifiers do), so you can use them in Spark. For reading and writing models from non-Java code, we suggest XStream for serializing trained models. XStream is a simple library that serializes objects to XML and back again. It is easy to use and requires no mappings (in fact, no modifications to the objects at all). Protostuff is a nice alternative that supports forward and backward compatibility (schema evolution) and validation. Beyond XML, Protostuff supports many other formats such as JSON, YAML, and protobuf.
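As an illustration of the Serializable route, the sketch below round-trips an object through standard Java serialization using only the JDK. ThresholdModel is a hypothetical stand-in for a trained model, not a Smile class; any model that implements Serializable could presumably be passed to the same two helpers.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

/** Round-trips a Serializable object through a byte array. */
final class SerializationSketch {

    /** Hypothetical stand-in for a trained model; not part of Smile. */
    static final class ThresholdModel implements Serializable {
        private static final long serialVersionUID = 1L;
        private final double threshold;

        ThresholdModel(double threshold) {
            this.threshold = threshold;
        }

        /** Returns 1 if x is at or above the threshold, otherwise 0. */
        int predict(double x) {
            return x >= threshold ? 1 : 0;
        }
    }

    /** Writes the object with standard Java serialization and returns the bytes. */
    static byte[] toBytes(Serializable model) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(buffer)) {
            out.writeObject(model);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    /** Reads back an object previously written by toBytes. */
    static Object fromBytes(byte[] bytes) {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        ThresholdModel model = new ThresholdModel(0.5);
        ThresholdModel restored = (ThresholdModel) fromBytes(toBytes(model));
        System.out.println(restored.predict(0.7)); // prints 1
        System.out.println(restored.predict(0.2)); // prints 0
    }
}
```

In practice the bytes would go to a file or be shipped to worker nodes rather than kept in memory.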

Visualization

Smile provides a Swing-based data visualization library, SmilePlot, which offers scatter plot, line plot, staircase plot, bar plot, box plot, histogram, 3D histogram, dendrogram, heatmap, hexmap, QQ plot, contour plot, surface, and wireframe.

To use SmilePlot, add the following to your dependencies:

    <dependency>
      <groupId>com.github.haifengl</groupId>
      <artifactId>smile-plot</artifactId>
      <version>2.6.0</version>
    </dependency>

Smile also supports declarative data visualization. With the smile.plot.vega package, we can create a specification that describes visualizations as mappings from data to properties of graphical marks (e.g., points or bars). The specification is based on Vega-Lite. The Vega-Lite compiler automatically produces visualization components, including axes, legends, and scales. It then determines the properties of these components based on a set of carefully designed rules.
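To make the idea of a declarative specification concrete, the sketch below assembles a minimal Vega-Lite scatter plot specification as a JSON string. It deliberately does not use the smile.plot.vega API; the class VegaLiteSpecSketch and its scatterSpec method are hypothetical and rely only on the JDK. The spec maps the data fields x and y to the position of point marks.

```java
/** Builds a minimal Vega-Lite scatter plot specification as JSON text. */
final class VegaLiteSpecSketch {

    /** Returns a spec that maps each {x, y} pair to the position of a point mark. */
    static String scatterSpec(double[][] points) {
        StringBuilder values = new StringBuilder();
        for (int i = 0; i < points.length; i++) {
            if (i > 0) {
                values.append(", ");
            }
            values.append("{\"x\": ").append(points[i][0])
                  .append(", \"y\": ").append(points[i][1]).append("}");
        }
        // Only data, mark, and encoding are given; axes and scales are left to Vega-Lite.
        return "{\n"
             + "  \"data\": {\"values\": [" + values + "]},\n"
             + "  \"mark\": \"point\",\n"
             + "  \"encoding\": {\n"
             + "    \"x\": {\"field\": \"x\", \"type\": \"quantitative\"},\n"
             + "    \"y\": {\"field\": \"y\", \"type\": \"quantitative\"}\n"
             + "  }\n"
             + "}";
    }

    public static void main(String[] args) {
        double[][] points = {{1.0, 2.0}, {2.0, 3.5}, {3.0, 1.5}};
        System.out.println(scatterSpec(points));
    }
}
```

The resulting text can be pasted into any Vega-Lite renderer; the axes, scales, and legends are not stated in the spec and are derived by the compiler.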

Gallery

Kernel PCA, IsoMap, Multi-Dimensional Scaling, SOM, Neural Network, SVM, Agglomerative Clustering, X-Means, DBSCAN, Neural Gas, Wavelet, Exponential Family Mixture
