Course Outline

1. Introduction to Distributed PostgreSQL

  • Challenges associated with scaling single-node PostgreSQL
  • Overview of the Citus extension: purpose, architecture, and components
  • Core concepts: coordinator node, worker nodes, metadata, and distribution keys

2. Cluster Architecture and Setup

  • Node types: distinguishing between coordinator and workers
  • Table types: distributed, reference (replicated to every node), and local tables
  • Installing and configuring Citus within existing PostgreSQL environments
  • Cluster discovery and node management techniques

3. Data Distribution and Sharding Strategies

  • Sharding methodologies: hash versus append sharding
  • Selecting an optimal distribution column for performance
  • Managing distributed and reference tables
  • Rebalancing shards and scaling out infrastructure
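
As a rough illustration of the hash sharding idea listed above, the sketch below maps a distribution-column value to a shard and then to a worker. It is a toy model, not Citus's actual hash function or placement logic: the shard count, the worker names, and the helpers `shard_for` and `worker_for` are all assumptions made for the example.

```python
import hashlib

SHARD_COUNT = 32  # illustrative value (assumption), not read from a cluster
WORKERS = ["worker-1", "worker-2", "worker-3"]  # hypothetical node names


def shard_for(key: str, shard_count: int = SHARD_COUNT) -> int:
    """Map a distribution-column value to a shard index via a stable hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Use the first 4 bytes as an unsigned integer, then bucket by modulo.
    return int.from_bytes(digest[:4], "big") % shard_count


def worker_for(shard: int, workers=WORKERS) -> str:
    """Place shards on workers round-robin (a simplification)."""
    return workers[shard % len(workers)]


if __name__ == "__main__":
    for tenant in ["tenant-a", "tenant-b", "tenant-c"]:
        shard = shard_for(tenant)
        print(tenant, "-> shard", shard, "on", worker_for(shard))
```

Because the hash is deterministic, every row with the same distribution-column value lands in the same shard, which is why the choice of distribution column matters so much for co-location and query performance.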

4. Distributed Query Execution and Optimisation

  • How Citus routes and parallelises queries
  • Understanding distributed query execution plans
  • Query pushdown and execution optimisation techniques

5. Consistency, Transactions and Fault Tolerance

  • Two-Phase Commit (2PC) and atomic operations
  • Handling failures within distributed transactions
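
The two-phase commit topic above can be sketched as a small in-memory model. This is a simplified illustration only, not how Citus or PostgreSQL implement it: the `Participant` class and the `two_phase_commit` function are hypothetical names, and real systems also persist the prepared state so that a failed node can recover.

```python
class Participant:
    """A stand-in for a worker node taking part in a transaction."""

    def __init__(self, name: str, can_prepare: bool = True):
        self.name = name
        self.can_prepare = can_prepare  # simulates a node that fails to prepare
        self.state = "idle"

    def prepare(self) -> bool:
        # Phase 1: vote yes only if the local work can be made durable.
        self.state = "prepared" if self.can_prepare else "aborted"
        return self.can_prepare

    def commit(self) -> None:
        # Phase 2 (success path): make the prepared work visible.
        self.state = "committed"

    def rollback(self) -> None:
        # Phase 2 (failure path): discard the prepared work.
        self.state = "aborted"


def two_phase_commit(participants) -> bool:
    """Commit everywhere only if every participant votes yes in phase 1."""
    votes = [p.prepare() for p in participants]
    if all(votes):
        for p in participants:
            p.commit()
        return True
    for p in participants:
        p.rollback()
    return False
```

For example, calling `two_phase_commit` with one participant created as `Participant("w2", can_prepare=False)` returns `False` and leaves every participant in the `aborted` state, which is the atomicity guarantee the protocol is meant to provide.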

6. Operational Management and Use Cases

  • Monitoring tools and views specific to Citus
  • Maintenance procedures and upgrades in distributed environments

Requirements

  • Completion of the Advanced Administration (High Availability & Replication) course or equivalent experience
  • Solid understanding of PostgreSQL configuration and performance tuning
  • Familiarity with Linux operating systems and fundamental network concepts

Audience

This course is designed for experienced Database Administrators, DevOps Engineers, and System Architects who manage production PostgreSQL environments and need to scale them horizontally.

Duration

7 Hours
