Run Analytical Queries Through ScalarDB Analytics
This guide explains how to develop ScalarDB Analytics applications. For details on the architecture and design, see ScalarDB Analytics Design.
ScalarDB Analytics currently uses Spark as its execution engine. It provides a Spark custom catalog plugin that exposes ScalarDB-managed and non-ScalarDB-managed data sources as Spark tables in a unified view, so you can run arbitrary Spark SQL queries across them.
Preparation
This section describes the prerequisites, setting up ScalarDB Analytics in the Spark configuration, and adding the ScalarDB Analytics dependency.
Prerequisites
- ScalarDB Analytics server: A running instance that manages catalog information and connects to your data sources. The server must have at least one data source registered. To register data sources, see Create a ScalarDB Analytics Catalog.
- Apache Spark: A compatible version of Apache Spark. For supported versions, see Spark. If you don't have Spark installed yet, please download the Spark distribution from Apache's website.
Set up ScalarDB Analytics in the Spark configuration
ScalarDB Analytics requires specific Spark configurations to integrate with the ScalarDB Analytics server.
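As a rough illustration of what such configuration looks like, the following minimal sketch builds the properties that register a custom catalog plugin using Spark's general `spark.sql.catalog.<name>` convention. Spark passes any property under `spark.sql.catalog.<name>.` to the plugin as an option. The helper `catalog_spark_conf`, the catalog name `analytics_catalog`, the class name `com.example.PlaceholderCatalog`, and the `server.host` option are all hypothetical placeholders, not the actual ScalarDB Analytics values. Use the class name and option keys given in the ScalarDB Analytics documentation for your version.

```python
from typing import Dict, Mapping, Optional


def catalog_spark_conf(
    catalog_name: str,
    catalog_class: str,
    options: Optional[Mapping[str, object]] = None,
) -> Dict[str, str]:
    """Build Spark properties that register a custom catalog plugin.

    Spark maps `spark.sql.catalog.<name>` to the plugin class and hands
    every `spark.sql.catalog.<name>.<key>` property to that plugin.
    """
    if not catalog_name:
        raise ValueError("catalog_name must not be empty")

    prefix = f"spark.sql.catalog.{catalog_name}"
    conf = {prefix: catalog_class}
    for key, value in (options or {}).items():
        # Spark properties are strings, so stringify every option value.
        conf[f"{prefix}.{key}"] = str(value)
    return conf


if __name__ == "__main__":
    conf = catalog_spark_conf(
        "analytics_catalog",               # hypothetical catalog name
        "com.example.PlaceholderCatalog",  # placeholder, NOT the real plugin class
        {"server.host": "localhost"},      # hypothetical option key
    )
    # Print the properties in spark-defaults.conf style (key, space, value).
    for key, value in sorted(conf.items()):
        print(f"{key} {value}")
```

The resulting key-value pairs can be placed in `spark-defaults.conf`, passed with `--conf` on the `spark-submit` command line, or applied one by one with `SparkSession.builder.config(key, value)`.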