HBase Cluster Replication for Geographic Data Distribution

HBase provides a cluster replication mechanism which allows you to keep one cluster’s state synchronized with that of another cluster, using the write-ahead log (WAL) of the source cluster to propagate the changes.

The use cases for cluster replication include the following scenarios:

Backup and disaster recovery

Data aggregation

Geographic data distribution, such as data centers

Online data ingestion combined with offline data analytics

Note

Replication is enabled at the granularity of the column family. Before
enabling replication for a column family, create the table and all column
families to be replicated on the destination cluster.