Course Outline

1. Introduction to Distributed PostgreSQL

  • Addressing scaling challenges inherent in single-node PostgreSQL
  • Overview of the Citus extension: its purpose, architecture, and key components
  • Core concepts: coordinator nodes, worker nodes, metadata, and distribution keys

2. Cluster Architecture and Setup

  • Distinctions between node types: coordinators versus workers
  • Types of tables: distributed, reference (replicated to every node), and local
  • Installing and configuring Citus within existing PostgreSQL environments
  • Cluster discovery processes and node management

3. Data Distribution and Sharding Strategies

  • Sharding methodologies: hash-based versus append-based
  • Choosing a distribution column to ensure optimal performance
  • Managing distributed and reference tables
  • Rebalancing shards and scaling out the cluster

4. Distributed Query Execution and Optimisation

  • How Citus routes and parallelises queries
  • Reading and interpreting distributed query plans
  • Query pushdown techniques and execution optimisation

5. Consistency, Transactions and Fault Tolerance

  • Two-Phase Commit (2PC) and atomic operations
  • Strategies for handling failures in distributed transactions

6. Operational Management and Use Cases

  • Monitoring tools and views specific to Citus
  • Maintenance procedures and upgrades within distributed environments

Requirements

  • Completion of Advanced Administration (High Availability & Replication) or equivalent professional experience
  • Strong proficiency in PostgreSQL configuration and performance tuning
  • Familiarity with Linux operating systems and fundamental network concepts

Audience

Experienced Database Administrators, DevOps Engineers, and System Architects who manage production PostgreSQL environments and need to scale them horizontally.

Duration: 7 Hours