Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
- Section 1: Introduction to Big Data / NoSQL
- Overview of NoSQL databases
- The CAP theorem
- Scenarios where NoSQL is appropriate
- Columnar storage systems
- The NoSQL ecosystem
- Section 2: Cassandra Basics
- Design and architecture
- Cassandra nodes, clusters, and datacenters
- Keyspaces, tables, rows, and columns
- Partitioning, replication, and tokens
- Quorum and consistency levels
- Labs: interacting with Cassandra using CQLSH
- Section 3: Data Modeling – Part 1
- Introduction to CQL
- CQL data types
- Creating keyspaces and tables
- Selecting columns and data types
- Defining primary keys
- Data layout for rows and columns
- Time to live (TTL)
- Executing queries with CQL
- Performing updates in CQL
- Collections (list, map, set)
- Labs: various data modeling exercises using CQL; experimenting with queries and supported data types
- Section 4: Data Modeling – Part 2
- Creating and utilizing secondary indexes
- Composite keys (partition keys and clustering keys)
- Handling time series data
- Best practices for time series data
- Counters
- Lightweight transactions (LWT)
- Labs: creating and using indexes; modeling time series data
- Section 5: Cassandra Internals
- Understanding the underlying design of Cassandra
- SSTables, memtables, and commit log
- Section 6: Administration
- Hardware selection guidelines
- Cassandra distributions
- Communication between Cassandra nodes
- Writing and reading data to/from the storage engine
- Data directories
- Anti-entropy operations
- Cassandra compaction
- Selecting and implementing compaction strategies
- Cassandra best practices (compaction, garbage collection)
- Setting up a test Cassandra instance with a low memory footprint
- Troubleshooting tools and tips
- Lab: installing Cassandra and running benchmarks
Requirements
- Familiarity with the Linux environment, including command-line navigation and file editing using vi or nano
- For in-person courses, a laptop or desktop computer with at least 8 GB of RAM is required
- For remote courses, a functional Cassandra lab environment will be provided; participants need only a web browser to access it
14 Hours
Testimonials (2)
Extensive knowledge of NoSQL environments, not only Cassandra (ex: HADOOP)
Stefan Marcoci - Videotron ltee
Course - Cassandra Administration
The 1:1 style meant the training was tailored to my individual needs.