Near real-time CDC using DataStream

Replicate data from Cloud SQL to BigQuery in almost real-time

The Old Ways

The past few years

Cloud
Analytics
Databases

Near real-time change data capturing (CDC)

Change Data Capturing (CDC)

DataStream CDC

Let's have a look at the setup

Summary

DataStream is mostly ready for BigQuery CDC

Make sure the TCP Proxy is setup or it won't work

Use a separate process to manage one-off load

Use Authorised views to present data

Near real-time cdc using DataStream

By Richard He

Near real-time cdc using DataStream

Replicate data from Cloud SQL to BigQuery in almost real-time

  • 469