How to set up a CDC pipeline to capture and analyze real-time database changes with Parseable
Jen Wike Huger

Jen Wike Huger @jenwikehuger

About: Tech writer with a love for developers, engineers, and open source.

Location:
Raleigh, NC
Joined:
Jun 21, 2019

How to set up a CDC pipeline to capture and analyze real-time database changes with Parseable

Publish Date: Jul 22 '24
4 1

Databases are critical for any application. Data constantly gets updated, inserted, and deleted. In most of cases it is important for the business to keep track of these changes due to security concerns, auditing requirements, and to keep other relevant systems up to date.

Change Data Capture (CDC) has gained popularity, precisely to address this problem. CDC is a technique used to track all changes in a database and capture them in destination systems. Debezium is a popular CDC tool that leverages database logs as the source of truth, and streams the changes to Kafka and compatible systems like Redpanda.

The key thing to note here is that while databases are typically mutable, the changes captured by the CDC system should be immutable. Only then users have a system of record that can be trusted. Parseable is built for immutable logs and streams of events and data. It is a great fit for CDC targets. The way we designed Parseable allows for long term retention and querying of logs, and it also provides a rich set of tools for analysis and monitoring.

In this tutorial, we use Redpanda, PostgreSQL, and Debezium to ingest CDC events into Parseable.

CDC pipeline with Redpanda, PostgreSQL, Debezium, and Parseable

Read more in the full tutorial to:

  • Set up Docker Compose
  • Set up Debezium Connect
  • Set up Parseable
  • Set up Redpanda Connect
  • Test the CDC pipeline

Comments 1 total

  • Martin Baun
    Martin BaunJul 23, 2024

    Great work compiling this, Jen. About to check out the full tutorial!

Add comment