Cassandra Training

Introduction to Cassandra

Learn about Apache Cassandra, a distributed NoSQL database designed to handle large amounts of data across many commodity servers. This module covers Cassandra's architecture, features, and use cases, providing a foundational understanding of why and how Cassandra is used.

Cassandra Installation and Setup

Discover the steps to install and configure Cassandra on different operating systems. This section includes installation prerequisites, configuration settings, and initial setup to get Cassandra up and running.

Cassandra Data Model

Explore Cassandra's data model, including concepts like keyspaces, tables, rows, and columns. Learn about data modeling techniques, schema design, and how to efficiently structure data for optimal performance.

Querying with CQL (Cassandra Query Language)

Understand how to use Cassandra Query Language (CQL) for querying and manipulating data. This section covers basic and advanced CQL operations, including SELECT, INSERT, UPDATE, DELETE, and working with data types.

Cassandra Architecture and Internals

Dive into Cassandra's architecture and internals, including its distributed nature, data replication, and consistency model. Learn about the components like nodes, clusters, and the role of the commit log and SSTables.

Data Replication and Consistency

Explore Cassandra's data replication and consistency mechanisms. Learn about replication strategies, consistency levels, and how to configure and manage data replication to ensure high availability and reliability.

Performance Tuning and Optimization

Learn techniques for tuning and optimizing Cassandra performance. This section covers best practices for configuration, indexing, caching, and monitoring to enhance the performance and scalability of your Cassandra deployment.

Backup and Recovery

Understand how to perform backups and recoveries in Cassandra. Learn about backup strategies, tools, and procedures to ensure data integrity and recovery in case of failures or data loss.

Security and Access Control

Discover security features and access control mechanisms in Cassandra. Learn how to configure user roles, permissions, and encryption to protect your data and secure your Cassandra environment.

Real-World Case Studies and Applications

Review case studies and real-world applications of Cassandra. Learn from practical examples of how Cassandra is used in different industries to solve data management challenges and achieve business goals.

Advanced Topics and Emerging Trends

Explore advanced topics and emerging trends in Cassandra. This section covers new features, integrations, and developments in the Cassandra ecosystem, as well as future directions and innovations in distributed databases.

Best Practices and Troubleshooting

Discover best practices for working with Cassandra and learn how to troubleshoot common issues. Explore tips for maintaining a healthy Cassandra cluster and resolving performance or operational problems.

Cassandra Syllabus

1. Introduction to Big Data and Cassandra

  • Introduction to Big Data and Problems Caused by It
  • 5V – Volume, Variety, Velocity, Veracity, and Value
  • Traditional Database Management Systems
  • Limitations of RDBMS
  • NoSQL Databases
  • Common Characteristics of NoSQL Databases
  • CAP Theorem
  • How Cassandra Solves the Limitations?
  • History of Cassandra
  • Features of Cassandra

Hands-On:

  • Edureka VM Tour

2. Cassandra Data Model

  • Introduction to Database Models
  • Understanding the Analogy between RDBMS and Cassandra Data Model
  • Database Elements: Cluster, Keyspace, Column Family/Table, Column
  • Column Family Options
  • Columns
  • Wide Rows, Skinny Rows
  • Static and Dynamic Tables

Hands-On:

  • Creating Keyspace
  • Creating Tables

3. Cassandra Architecture

  • Cassandra as a Distributed Database
  • Key Cassandra Elements:
    • Memtable
    • Commit Log
    • SSTables
  • Replication Factor
  • Data Replication in Cassandra
  • Gossip Protocol – Detecting Failures
  • Gossip: Uses
  • Snitch: Uses
  • Data Distribution
  • Staged Event-Driven Architecture (SEDA)
  • Managers and Services
  • Virtual Nodes: Write Path and Read Path
  • Consistency Level
  • Repair
  • Incremental Repair

4. Deep Dive into Cassandra Database

  • Replication Factor
  • Replication Strategy
  • Defining Columns and Data Types
  • Defining a Partition Key
  • Recognizing a Partition Key
  • Specifying a Descending Clustering Order
  • Updating Data
  • Tombstones
  • Deleting Data
  • Using TTL
  • Updating TTL

Hands-On/Demo:

  • Create Keyspace in Cassandra
  • Check Created Keyspace in System_Schema.Keyspaces
  • Update Replication Factor of Previously Created Keyspace
  • Drop Previously Created Keyspace
  • Create a Table Using cqlsh
  • Create a Table Using UUID & TIMEUUID
  • Create a Table Using Collection & UDT Column
  • Create Secondary Index on a Table
  • Insert Data into Table
  • Insert Data into Table with UUID & TIMEUUID Columns
  • Insert Data Using COPY Command
  • Deleting Data from Table

5. Node Operations in a Cluster

  • Cassandra Nodes
  • Specifying Seed Nodes
  • Bootstrapping a Node
  • Adding a Node (Commissioning) in Cluster
  • Removing (Decommissioning) a Node
  • Removing a Dead Node
  • Repair
  • Read Repair
  • What’s New in Incremental Repair
  • Run a Repair Operation
  • Cassandra and Spark Implementation

Hands-On:

  • Commissioning a Node
  • Decommissioning a Node
  • Nodetool Commands

6. Managing and Monitoring the Cluster

  • Cassandra Monitoring Tools
  • Logging
  • Tailing
  • Using Nodetool Utility
  • Using JConsole
  • Learning about OpsCenter
  • Runtime Analysis Tools

Hands-On:

  • JMX and JConsole
  • OpsCenter

7. Backup & Restore and Performance Tuning

  • Creating a Snapshot
  • Restoring from a Snapshot
  • RAM and CPU Recommendations
  • Hardware Choices
  • Selecting Storage
  • Types of Storage to Avoid
  • Cluster Connectivity, Security, and Factors Affecting Distributed System Performance
  • End-to-End Performance Tuning of Cassandra Clusters against Very Large Data Sets
  • Load Balance and Streams

Hands-On:

  • Creating Snapshots
  • Integration with Kafka
  • Integration with Spark

8. Hosting Cassandra Database on Cloud

  • Security
  • Ongoing Support of Cassandra Operational Data
  • Hosting a Cassandra Database on Cloud

Hands-On:

  • Hosting Cassandra Database on Amazon Web Services

Training

Basic Level Training

Duration : 1 Month

Advanced Level Training

Duration : 1 Month

Project Level Training

Duration : 1 Month

Total Training Period

Duration : 3 Months

Course Mode :

Available Online / Offline

Course Fees :

Please contact the office for details

Placement Benefit Services

Provide 100% job-oriented training
Develop multiple skill sets
Assist in project completion
Build ATS-friendly resumes
Add relevant experience to profiles
Build and enhance online profiles
Supply manpower to consultants
Supply manpower to companies
Prepare candidates for interviews
Add candidates to job groups
Send candidates to interviews
Provide job references
Assign candidates to contract jobs
Select candidates for internal projects

Note

100% Job Assurance Only
Daily online batches for employees
New course batches start every Monday