Integration with Apache Spark

This chapter presents options for putting a SQL interface on the Apache Cassandra NoSQL database using Spark CLIs. With distributed, non-relational databases such as Apache Cassandra, ad hoc analytical queries that require data aggregation are hard to run. Such queries need both a relational interface and an aggregation capability, and several out-of-the-box options provide them, Spark among them. The chapter gives an overview of the Spark architecture, covers its installation and configuration, and walks through the CLIs that can perform Create, Read, Update, and Delete (CRUD) operations using relational queries. Additionally, there is a web UI for multiple components that are integrated ...

