Popularity

4.0

Declining

Activity

0.0

Stable

Stars 115

Watchers 16

Forks 29

Last Commit almost 3 years ago

Programming language: Scala

License: Apache License 2.0

Tags: Database

Latest version: v2.4.0

Scruid alternatives and similar packages

Based on the "Database" category.
Alternatively, view Scruid alternatives based on common mentions on social networks and blogs.

Slick

9.4 8.7 Scruid VS Slick

Slick (Scala Language Integrated Connection Kit) is a modern database query and access library for Scala
doobie

9.2 8.8 Scruid VS doobie

Functional JDBC layer for Scala.

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

Promo www.influxdata.com

Elastic4s

9.2 8.6 Scruid VS Elastic4s

Elasticsearch Scala Client - Reactive, Non Blocking, Type Safe, HTTP Client
Quill

9.1 9.1 Scruid VS Quill

Compile-time Language Integrated Queries for Scala
PostgreSQL and MySQL async

8.9 0.0 Scruid VS PostgreSQL and MySQL async

DISCONTINUED. Async database drivers to talk to PostgreSQL and MySQL in Scala.
ScalikeJDBC

8.5 8.9 Scruid VS ScalikeJDBC

A tidy SQL-based DB access library for Scala developers. This library naturally wraps JDBC APIs and provides you easy-to-use APIs.
Phantom

8.4 0.0 Scruid VS Phantom

Schema safe, type-safe, reactive Scala driver for Cassandra/Datastax Enterprise
scala-redis

8.4 0.0 Scruid VS scala-redis

A scala library for connecting to a redis server, or a cluster of redis nodes using consistent hashing on the client side.
ReactiveMongo

8.2 7.2 Scruid VS ReactiveMongo

:leaves: Non-blocking, Reactive MongoDB Driver for Scala
Slick-pg

7.8 8.3 Scruid VS Slick-pg

Slick extensions for PostgreSQL
rediscala

7.7 0.0 Scruid VS rediscala

Non-blocking, Reactive Redis driver for Scala (with Sentinel support)
Casbah

7.3 2.6 Scruid VS Casbah

DISCONTINUED. Casbah is now officially end-of-life (EOL).
Squeryl

7.2 8.1 Scruid VS Squeryl

A Scala DSL for talking with databases with minimum verbosity and maximum type safety
Salat

7.0 0.0 Scruid VS Salat

Salat is a simple serialization library for case classes.
mongo-scala-driver

6.6 1.7 Scruid VS mongo-scala-driver

DISCONTINUED. A modern idiomatic MongoDB Scala Driver.
gremlin-scala

6.6 0.0 Scruid VS gremlin-scala

Scala wrapper for Apache TinkerPop 3 Graph DSL
Scanamo

6.3 7.4 Scruid VS Scanamo

Simpler DynamoDB access for Scala
#<Sawyer::Resource:0x00007f161059a678>

5.9 8.3 Scruid VS #<Sawyer::Resource:0x00007f161059a678>

Strong type constraints for Scala
Scala ActiveRecord

5.8 0.0 Scruid VS Scala ActiveRecord

ActiveRecord-like ORM library for Scala
Activate

5.7 0.0 Scruid VS Activate

Abandoned: Pluggable persistence in Scala
Anorm

5.5 7.6 Scruid VS Anorm

The Anorm database library
Sorm

5.2 0.0 Scruid VS Sorm

DISCONTINUED. A functional boilerplate-free Scala ORM
SwayDB

5.2 3.7 Scruid VS SwayDB

Persistent and in-memory key-value storage engine for JVM that scales on a single machine.
Pulsar4s

5.0 6.5 Scruid VS Pulsar4s

Idiomatic, typesafe, and reactive Scala client for Apache Pulsar
Relate

4.9 6.1 Scruid VS Relate

Performant database access in Scala
scredis

4.7 0.0 Scruid VS scredis

Non-blocking, ultra-fast Scala Redis client built on top of Akka IO, used in production at Livestream
Scala-Forklift

4.6 0.0 Scruid VS Scala-Forklift

Type-safe data migration tool for Slick, Git and beyond.
AnormCypher

4.3 0.0 Scruid VS AnormCypher

Neo4j Scala library based on Anorm in the Play Framework
neotypes

4.3 9.0 Scruid VS neotypes

Scala lightweight, type-safe, asynchronous driver for neo4j
Troy

4.0 0.0 Scruid VS Troy

Type-safe and Schema-safe Scala wrapper for Cassandra driver
Clickhouse-scala-client

3.8 8.3 Scruid VS Clickhouse-scala-client

Clickhouse Scala Client with Reactive Streams support
rethink-scala

3.7 0.0 Scruid VS rethink-scala

Scala Driver for RethinkDB
Shade

3.5 0.0 Scruid VS Shade

Memcached client for Scala
scala-sql

3.4 2.4 Scruid VS scala-sql

scala SQL api
longevity

3.2 0.0 Scruid VS longevity

A Persistence Framework for Scala and NoSQL
Tepkin

3.1 0.0 Scruid VS Tepkin

DISCONTINUED. Reactive MongoDB Driver for Scala built on top of Akka IO and Akka Streams.
Couchbase

3.1 9.6 Scruid VS Couchbase

The Couchbase Monorepo for JVM Clients: Java, Scala, io-core…
laserdisc

3.0 8.2 Scruid VS laserdisc

A Future-free Fs2 native pure FP Redis client
Morpheus

2.9 0.0 Scruid VS Morpheus

Reactive type-safe Scala driver for SQL databases
CouchDB-Scala

2.8 0.0 Scruid VS CouchDB-Scala

A purely functional Scala client for CouchDB
ReactiveCouchbase

2.7 0.0 Scruid VS ReactiveCouchbase

Play 2 plugin for ReactiveCouchbase
ScalaRelational

2.7 0.0 Scruid VS ScalaRelational

Type-Safe framework for defining, modifying, and querying SQL databases
lucene4s

2.7 0.0 Scruid VS lucene4s

Light-weight convenience wrapper around Lucene to simplify complex tasks and add Scala sugar.
ReactiveNeo

2.6 0.0 Scruid VS ReactiveNeo

[DISCONTINUED] Reactive type-safe Scala driver for Neo4J
Memcontinuationed

2.3 0.0 Scruid VS Memcontinuationed

Memcached client for Scala
d4s

2.2 8.7 Scruid VS d4s

DISCONTINUED. Dynamo DB Database Done Scala-way
scala-migrations

2.0 0.0 Scruid VS scala-migrations

Database migrations written in Scala
etcd4s

1.7 0.0 Scruid VS etcd4s

Scala etcd client implementing V3 APIs
neo4akka

1.6 0.0 Scruid VS neo4akka

Neo4j Scala client using Akka-Http
GCP Datastore Akka Persistence Plugin

1.5 0.0 Scruid VS GCP Datastore Akka Persistence Plugin

akka-persistence-gcp-datastore is a journal and snapshot store plugin for akka-persistence using google cloud firestore in datastore mode.

Do you think we are missing an alternative of Scruid or a related project?

Add another 'Database' Package

Popular Comparisons

README

[Scruid](logo/logo-with-tagline.svg)

Scruid (Scala+Druid) is an open source library that allows you to compose Druid queries easily in Scala. The library will take care of the translation of the query into json, parse the result in the case class that you define.

Currently, the API is under heavy development, so changes might occur.

Release Notes

Please view the Releases page on GitHub.

Installation

The binaries are hosted on Maven Central. We publish builds for Scala 2.11, 2.12 and 2.13.

libraryDependencies += "com.ing.wbaa.druid" %% "scruid" % "2.5.0"

Example queries:

Scruid provides query constructors for TopNQuery, GroupByQuery, TimeSeriesQuery, ScanQuery and SearchQuery (see below for details). You can call the execute method on a query to send the query to Druid. This will return a Future[DruidResponse]. This response contains the Circe JSON data without having it parsed to a specific case class yet. To interpret this JSON data you can run two methods on a DruidResponse:

.list[T](implicit decoder: Decoder[T]): List[T] : This decodes the JSON to a list with items of type T.
.series[T](implicit decoder: Decoder[T]): Map[ZonedDateTime, T] : This decodes the JSON to a timeseries map with the timestamp as key and T as value.

Below the example queries supported by Scruid. For more information about how to query Druid, and what query to pick, please refer to the Druid documentation

TopN query

case class TopCountry(count: Int, countryName: String = null)

val response = TopNQuery(
  dimension = Dimension(
    dimension = "countryName"
  ),
  threshold = 5,
  metric = "count",
  aggregations = List(
    CountAggregation(name = "count")
  ),
  intervals = List("2011-06-01/2017-06-01")
).execute

val result: Future[Map[ZonedDateTime, List[TopCountry]]] = response.map(_.series[List[TopCountry]])

GroupBy query

case class GroupByIsAnonymous(isAnonymous: Boolean, count: Int)

val response = GroupByQuery(
  aggregations = List(
    CountAggregation(name = "count")
  ),
  dimensions = List("isAnonymous"),
  intervals = List("2011-06-01/2017-06-01")
).execute()

val result: Future[List[GroupByIsAnonymous]] = response.map(_.list[GroupByIsAnonymous])

The returned Future[DruidResponse] will contain json data where isAnonymouse is either true or false. Please keep in mind that Druid is only able to handle strings, and recently also numerics. So Druid will be returning a string, and the conversion from a string to a boolean is done by the json parser.

TimeSeries query

case class TimeseriesCount(count: Int)

val response = TimeSeriesQuery(
  aggregations = List(
    CountAggregation(name = "count")
  ),
  granularity = GranularityType.Hour,
  intervals = List("2011-06-01/2017-06-01")
).execute

val series: Future[Map[ZonedDateTime, TimeseriesCount]] = response.map(_.series[TimeseriesCount])

Scan query

case class ScanResult(channel: Option[String], cityName: Option[String], countryIsoCode: Option[String], user: Option[String])

val response = ScanQuery(
    granularity = GranularityType.Hour
    intervals = List("2011-06-01/2017-06-01")
    dimensions = List("channel", "cityName", "countryIsoCode", "user"),
    limit = 100
).execute() 

val result: Future[List[ScanResult]] = response.map(_.list[ScanResult])

Search query

Search query is a bit different, since it does not take type parameters as its results are of type com.ing.wbaa.druid.DruidSearchResult

val response = SearchQuery(
    granularity = GranularityType.Hour,
    intervals = List("2011-06-01/2017-06-01"),
    query = ContainsInsensitive("GR"),
    searchDimensions = List("countryIsoCode")
).execute()

val result = Future[List[DruidSearchResult]] = response.map(_.list)

Query context

Queries can be configured using Druid query context, such as timeout, queryId and groupByStrategy. All types of query contain the argument context which associates query parameter with their corresponding values. The parameter names can also be accessed by com.ing.wbaa.druid.definitions.QueryContext object. Consider, for example, a timeseries query with custom query id and priority:

TimeSeriesQuery(
  aggregations = List(
    CountAggregation(name = "count")
  ),
  granularity = GranularityType.Hour,
  intervals = List("2011-06-01/2017-06-01"),
  context = Map(
    QueryContext.QueryId -> "some_custom_id",
    QueryContext.Priority -> 1
  )
)

Druid query language (DQL)

Scruid also provides a rich Scala API for building queries using the fluent pattern.

case class GroupByIsAnonymous(isAnonymous: String, country: String, count: Int)

val query: GroupByQuery = DQL
    .granularity(GranularityType.Day)
    .interval("2011-06-01/2017-06-01")
    .agg(count as "count")
    .where(d"countryName".isNotNull)
    .groupBy(d"isAnonymous", d"countryName".extract(UpperExtractionFn()) as "country")
    .having(d"count" > 100 and d"count" < 200)
    .limit(10, d"count".desc(DimensionOrderType.Numeric))
    .build()

val response: Future[List[GroupByIsAnonymous]] = query.execute().map(_.list[GroupByIsAnonymous])

For details and examples see the [DQL documentation](docs/dql.md).

Print native Druid JSON representation

For all types of queries you can call the function toDebugString, in order to get the corresponding native Druid JSON query representation.

For example the following:

import com.ing.wbaa.druid.dql.DSL._

val query: TopNQuery = DQL
    .from("wikipedia")
    .agg(count as "count")
    .interval("2011-06-01/2017-06-01")
    .topN(dimension = d"countryName", metric = "count", threshold = 5)
    .build()

println(query.toDebugString)

will print to the standard output:

{
  "dimension" : {
    "dimension" : "countryName",
    "outputName" : "countryName",
    "outputType" : null,
    "type" : "default"
  },
  "threshold" : 5,
  "metric" : "count",
  "aggregations" : [
    {
      "name" : "count",
      "type" : "count"
    }
  ],
  "intervals" : [
    "2011-06-01/2017-06-01"
  ],
  "granularity" : "all",
  "filter" : null,
  "postAggregations" : [
  ],
  "context" : {

  },
  "queryType" : "topN",
  "dataSource" : "wikipedia"
}

Handling large payloads with Akka Streams

For queries with large payload of results (e.g., half a million of records), Scruid can transform the corresponding response into an Akka Stream Source. The results can be processed, filtered and transformed using Flows and/or output to Sinks, as a continuous stream, without collecting the entire payload first. To process the results with Akka Stream, you can call one of the following methods:

.stream: gives a Source of DruidResult.
.streamAs[T](implicit decoder: Decoder[T]): gives a Source where each JSON record is being decoded to the type of T.
.streamSeriesAs[T](implicit decoder: Decoder[T]): gives a Source where each JSON record is being decoded to the type of T and it is accompanied by its corresponding timestamp.

All the methods above can be applied to any timeseries, group-by or top-N query created either directly by using query constructors or by DQL.

Druid SQL support

Instead of using the Druid native API, Scruid also supports Druid queries via SQL.

import com.ing.wbaa.druid.SQL._

val query = dsql"""SELECT COUNT(*) as "count" FROM wikipedia WHERE "__time" >= TIMESTAMP '2015-09-12 00:00:00'"""

val response = query.execute()

For details see the [SQL documentation](docs/sql.md).

Example

implicit val mat = DruidClient.materializer

case class TimeseriesCount(count: Int)

val query = TimeSeriesQuery(
  aggregations = List(
    CountAggregation(name = "count")
  ),
  granularity = GranularityType.Hour,
  intervals = List("2011-06-01/2017-06-01")
)

// Decode each record into the type of `TimeseriesCount` and sum all `count` results
val result: Future[Int] = query
        .streamAs[TimeseriesCount]
        .map(_.count)
        .runWith(Sink.fold(0)(_ + _))

Configuration

The configuration is done by Typesafe config. The configuration can be overridden by using environment variables, e.g. DRUID_HOSTS (DRUID_HOST and DRUID_PORT are still supported for backward compatibility) and DRUID_DATASOURCE. Or by placing an application.conf in your own project and this will override the reference.conf of the scruid library.

druid = {
  host = "localhost"
  host = ${?DRUID_HOST}
  port = 8082
  port = ${?DRUID_PORT}
  hosts = ${druid.host}":"${druid.port}
  hosts = ${?DRUID_HOSTS}
  secure = false
  secure = ${?DRUID_USE_SECURE_CONNECTION}
  url = "/druid/v2/"
  url = ${?DRUID_URL}
  health-endpoint = "/status/health"
  health-endpoint = ${?DRUID_HEALTH_ENDPOINT}
  client-backend = "com.ing.wbaa.druid.client.DruidHttpClient"
  client-backend = ${?DRUID_CLIENT_BACKEND}

  scan-query-legacy-mode = false
  scan-query-legacy-mode = ${?DRUID_SCAN_QUERY_LEGACY_MODE}

  datasource = "wikipedia"
  datasource = ${?DRUID_DATASOURCE}

  response-parsing-timeout = 5 seconds
  response-parsing-timeout = ${?DRUID_RESPONSE_PARSING_TIMEOUT}

  zone-id = "UTC"
}

Alternatively it can be programmatically overridden by defining an implicit instance of com.ing.wbaa.druid.DruidConfig:

import java.time.ZonedDateTime
import com.ing.wbaa.druid._
import com.ing.wbaa.druid.definitions._
import scala.concurrent.duration._


implicit val druidConf = DruidConfig(
  hosts = Seq("localhost:8082"),
  datasource = "wikipedia",
  responseParsingTimeout = 10.seconds
)

case class TimeseriesCount(count: Int)

val response = TimeSeriesQuery(
  aggregations = List(
    CountAggregation(name = "count")
  ),
  granularity = GranularityType.Week,
  intervals = List("2011-06-01/2017-06-01")
).execute

val series: Map[ZonedDateTime, TimeseriesCount] = response.series[TimeseriesCount]

All parameters of DruidConfig are optional, and in case that some parameter is missing then the default behaviour is to use the value that is defined in the configuration file.

Druid Clients

Scruid provides two client implementations, one for simple requests over a single Druid query host (default) and an advanced one with a queue, cached pool connections and, a load balancer when multiple Druid query hosts are provided. Depending on your use case, it is also possible to create a custom client. For details regarding clients, their configuration, as well the creation of a custom one see the [Scruid Clients](docs/scruid_clients.md) documentation.

Authentication

The Advanced client can be configured to authenticate with the Druid cluster. See the [Scruid Clients](docs/scruid_clients.md) document for more information.

Tests

The test suite relies on a docker-compose with supporting services. The dockerfiles for the images it uses are in the docker/ subdirectory. Dependency versions of the dockerized resources are defined in ./env.

To run the tests, please make sure that you have the Druid instance running:

./services.sh start

This command will build the local images as needed. You can manually build these using the ./services.sh build_images command or the Makefile in ./docker.

Scruid

Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes.