O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Cassandra Administration

Video Description

Configure and administer Cassandra for optimum database performance and a completely operational cluster

About This Video

  • Quickly grasp the concepts behind Cassandra and understand its internal working mechanism

  • Build a single node cluster into a running multi-node cluster from scratch

  • Learn through clearly demonstrated steps how to administrate and manage day-to-day operations with Cassandra, including data migration, backup, restore, monitoring, and performance tuning

  • In Detail

    Cassandra is the flagship product among the NoSQL databases with its distributed, fault-tolerant, extremely fast, linearly scalable, and low-cost features. These make it a high-calibre core component of the rapidly expanding cloud computing and mission critical data systems. This course shows you how to plan and prepare the right installation platform for a cluster and effectively administer Cassandra.

    This quick-start video course covers administration tasks performed in real-world systems. It starts by providing you with a solid grounding in Cassandra architecture basics and then proceeds to build a single node cluster from scratch using step-by-step demonstrations. It then moves on progressively to show you how to build a completely operating multi node cluster.

    This video course provides you with the necessary know-how to effectively administrate and perform operations in Cassandra. You will also get to know, from an architecture perspective, how Cassandra differs from traditional databases and other popular NoSQL databases.

    The course starts off by showing you the steps to build your own cluster, and introduces you to the different use cases for Cassandra. Along the way, you will encounter caveats based on scenarios that may not be a good fit. After learning how to install Cassandra, you will learn how to use different monitoring tools and methods to import and export large amount of data. Next, primary and secondary indexes are explained to help you select the appropriate indexing that enables fast and efficient data lookup to respond to search and retrieval requests. Finally, you will use various tools to facilitate performance tuning, and conduct a stress test on a cluster, and interpret the performance results.

    The course starts off by showing you the steps to build your own cluster, and introduces you to the different use cases for Cassandra. Along the way, you will encounter caveats based on scenarios that may not be a good fit. After learning how to install Cassandra, you will learn how to use different monitoring tools and methods to import and export large amount of data. Next, primary and secondary indexes are explained to help you select the appropriate indexing that enables fast and efficient data lookup to respond to search and retrieval requests. Finally, you will use various tools to facilitate performance tuning, and conduct a stress test on a cluster, and interpret the performance results.

    Table of Contents

    1. Chapter 1: Exploring Cassandra
      1. Apache Cassandra Overview 00:03:45
      2. Features and Benefits 00:04:58
      3. When Not to Use Cassandra 00:02:41
      4. Other Similar NoSQL Databases 00:04:48
      5. Differences from Relational Databases 00:04:29
    2. Chapter 2: Basic Theory and Mechanism
      1. Understanding the CAP Theorem and Eventual Consistency 00:03:33
      2. Creating the Data Model 00:03:23
      3. Write and Read Paths 00:03:50
      4. Understanding the Cluster Architecture 00:03:51
      5. Detecting Seed, Gossip, and Failure 00:03:16
    3. Chapter 3: Prerequisites and Preinstallation Preparation
      1. Preinstallation Planning 00:03:37
      2. Single Node Preparation 00:03:53
      3. Building from the Source Tarball 00:03:29
      4. Installing and Configuring Cassandra 00:04:37
      5. Building a Cluster of Multiple Nodes 00:04:09
    4. Chapter 4: Monitoring Cassandra
      1. Logging and Log Rotation 00:03:28
      2. The CLI Monitoring Tool -- nodetool 00:04:27
      3. The GUI Monitoring Tool -- JConsole 00:05:00
      4. Garbage Collection 00:03:53
      5. System-Level Monitoring 00:04:31
    5. Chapter 5: Data Migration and Indexing
      1. Loading Bulk Data 00:04:52
      2. JSON Format Import and Export 00:04:03
      3. Using the CQL Command -- COPY 00:03:43
      4. Using Primary Indexes 00:05:02
      5. Using Secondary Indexes 00:04:54
    6. Chapter 6: Backup and Restore
      1. Understanding Backup and Recovery Concepts 00:03:31
      2. Taking a Snapshot 00:04:50
      3. Incremental Backup 00:04:30
      4. Using the Commitlog Feature 00:03:34
      5. Using Restore Methods 00:03:56
    7. Chapter 7: Performance Tuning
      1. Storage Strategies and OS Tuning 00:05:38
      2. JVM Tuning 00:04:44
      3. Caching Strategies 00:05:12
      4. Compaction and Compression 00:06:13
      5. Stress Testing 00:04:55
    8. Chapter 8: Conclusion
      1. Repairing Nodes 00:05:40
      2. Scaling In and Out 00:05:32
      3. Supplement 1 -- Cloning Virtual Machine 00:04:44
      4. Supplement 2 -- Installing and Configuring ClusterSSH 00:04:02
      5. Closing Comments 00:02:27