Multi-Datacenter Deployments¶
Strategies for deploying Apache Kafka across multiple datacenters for disaster recovery and global distribution.
Deployment Models¶
| Model | RPO | RTO | Complexity | Use Case |
|---|---|---|---|---|
| Active-Passive | Minutes | Minutes | Low | Disaster recovery |
| Active-Active | Near-zero | Near-zero | High | Global distribution |
| Stretch Cluster | Zero | Seconds | Medium | Low-latency DR |
Active-Passive with MirrorMaker 2¶
Primary datacenter handles all traffic. Secondary datacenter receives replicated data for failover.
MirrorMaker 2 Configuration¶
# mm2.properties
# Define clusters
clusters=primary,secondary
primary.bootstrap.servers=kafka-primary-1:9092,kafka-primary-2:9092
secondary.bootstrap.servers=kafka-secondary-1:9092,kafka-secondary-2:9092
# Replication flows
primary->secondary.enabled=true
primary->secondary.topics=.*
primary->secondary.groups=.*
# Exclude internal topics
primary->secondary.topics.exclude=.*[\-\.]internal,.*\.replica,__.*
# Replication settings
replication.factor=3
checkpoints.topic.replication.factor=3
heartbeats.topic.replication.factor=3
offset-syncs.topic.replication.factor=3
# Consumer offset sync (for failover)
sync.group.offsets.enabled=true
sync.group.offsets.interval.seconds=60
# Emit checkpoints for offset translation
emit.checkpoints.enabled=true
emit.checkpoints.interval.seconds=60
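With the properties file above in place, a dedicated MirrorMaker 2 cluster can be started with the driver script that ships with Kafka:

```shell
# Start a dedicated MirrorMaker 2 cluster from the properties file above
bin/connect-mirror-maker.sh mm2.properties
```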
Failover Procedure¶
1. Detect failure - Monitor primary cluster health
2. Stop MirrorMaker 2 - Prevent a split-brain scenario
3. Translate offsets - Use checkpoint data
4. Redirect producers - Update bootstrap servers to the secondary cluster
5. Start consumers - Resume from translated offsets
# Translate consumer group offsets
kafka-consumer-groups.sh --bootstrap-server kafka-secondary:9092 \
--group my-consumer-group \
--reset-offsets \
--to-offset <translated-offset> \
--topic primary.my-topic \
--execute
Active-Active with MirrorMaker 2¶
Both datacenters handle traffic. Bidirectional replication requires careful handling of data provenance to prevent infinite replication loops and enable correct data aggregation.
The Provenance Problem¶
In active-active replication, the system must track where each record originated. Without provenance tracking, records would replicate infinitely:
1. Producer writes to east.orders in DC East
2. MirrorMaker replicates to DC West as east.orders
3. Without provenance: MirrorMaker replicates back to DC East
4. Infinite loop of replication
MirrorMaker 2 solves this through topic prefixing—each replicated topic carries its origin datacenter in the name.
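The prefixing and loop-prevention rule can be sketched in plain Java. This is a simplified model of what the default replication policy does, not the actual MirrorMaker 2 source:

```java
import java.util.Set;

// Simplified model of MirrorMaker 2's topic-prefix provenance rule
// (illustrative only; not the real DefaultReplicationPolicy code).
public class PrefixPolicy {
    private static final String SEPARATOR = ".";

    // Name a topic receives on the target cluster when replicated
    static String formatRemoteTopic(String sourceCluster, String topic) {
        return sourceCluster + SEPARATOR + topic;
    }

    // Topics already carrying a known cluster prefix are never re-replicated,
    // which is what breaks the infinite loop
    static boolean isRemoteTopic(String topic, Set<String> clusters) {
        int dot = topic.indexOf(SEPARATOR);
        return dot > 0 && clusters.contains(topic.substring(0, dot));
    }
}
```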
How Provenance Works¶
| Topic in DC East | Origin | Description |
|---|---|---|
| `orders` | DC East | Locally produced records |
| `west.orders` | DC West | Replicated from DC West |

| Topic in DC West | Origin | Description |
|---|---|---|
| `orders` | DC West | Locally produced records |
| `east.orders` | DC East | Replicated from DC East |
MirrorMaker 2 never replicates prefixed topics, preventing loops:
- `east.orders` in DC West is not replicated back to DC East
- `west.orders` in DC East is not replicated back to DC West
Consuming from Multiple Origins¶
Consumers that need a global view must subscribe to both local and replicated topics:
// Consumer in DC East wanting all orders globally
consumer.subscribe(Arrays.asList(
"orders", // Local DC East orders
"west.orders" // Replicated DC West orders
));
// Process records with origin awareness
while (true) {
ConsumerRecords<String, Order> records = consumer.poll(Duration.ofMillis(100));
for (ConsumerRecord<String, Order> record : records) {
String origin = record.topic().startsWith("west.") ? "west" : "east";
processOrder(record.value(), origin);
}
}
Provenance in Record Headers¶
For more granular provenance tracking, producers can add origin metadata to record headers:
// Producer adds provenance headers
ProducerRecord<String, Order> record = new ProducerRecord<>("orders", order.getId(), order);
record.headers()
.add("origin-dc", "east".getBytes())
.add("origin-timestamp", Long.toString(System.currentTimeMillis()).getBytes())
.add("origin-producer", producerId.getBytes());
producer.send(record);
Consumers can then extract provenance regardless of topic name:
Header originHeader = record.headers().lastHeader("origin-dc");
// Guard against records produced before provenance tagging was enabled
String originDc = (originHeader != null) ? new String(originHeader.value()) : "unknown";
Aggregation Patterns¶
| Pattern | Implementation | Use Case |
|---|---|---|
| Union | Subscribe to `orders` + `west.orders` | Global view of all orders |
| Local-first | Subscribe to `orders` only | DC-local processing |
| Kafka Streams | Merge streams with origin tracking | Complex aggregations |
Kafka Streams aggregation example:
// Merge streams from both origins
KStream<String, Order> localOrders = builder.stream("orders");
KStream<String, Order> remoteOrders = builder.stream("west.orders");
KStream<String, Order> allOrders = localOrders.merge(remoteOrders);
// Process the merged stream. Note: foreach exposes only key and value;
// reading the origin headers requires the Processor API (KStream#process).
allOrders.foreach((key, order) -> {
    // key/value processing here; use process() when header access is needed
});
Bidirectional Configuration¶
# mm2-active-active.properties
clusters=east,west
east.bootstrap.servers=kafka-east-1:9092,kafka-east-2:9092
west.bootstrap.servers=kafka-west-1:9092,kafka-west-2:9092
# East to West replication
east->west.enabled=true
east->west.topics=orders,events
# West to East replication
west->east.enabled=true
west->east.topics=orders,events
# Prevent replication loops
replication.policy.class=org.apache.kafka.connect.mirror.DefaultReplicationPolicy
# Topic naming (default adds source cluster prefix)
# east.orders in west cluster
# west.orders in east cluster
Conflict Avoidance Strategies¶
| Strategy | Description | Trade-off |
|---|---|---|
| Topic prefixing | Different topic names per DC | Consumers must aggregate |
| Key partitioning | Route keys to owning DC | Requires consistent routing |
| Last-write-wins | Accept all writes, latest wins | Potential data loss |
| Application merge | Application-level conflict resolution | Complexity |
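As an illustration of the key-partitioning strategy, a hypothetical router that assigns each key a single owning DC might look like the following. The class name and hashing scheme are assumptions for illustration, not a Kafka API:

```java
import java.util.List;

// Hypothetical sketch of the "key partitioning" conflict-avoidance strategy:
// every key is owned by exactly one DC, and writers route to the owner.
public class DcRouter {
    private final List<String> dcs;

    DcRouter(List<String> dcs) {
        this.dcs = dcs;
    }

    // Deterministic: the same key always maps to the same owning DC,
    // so two DCs never accept conflicting writes for one key
    String owningDc(String key) {
        return dcs.get(Math.floorMod(key.hashCode(), dcs.size()));
    }
}
```

Because the mapping is deterministic, conflicts are avoided by construction rather than resolved after the fact, at the cost of requiring every producer to apply the same routing logic.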
Stretch Cluster¶
Single Kafka cluster spanning multiple datacenters with synchronous replication.
Configuration¶
# Rack awareness for cross-DC placement
broker.rack=dc1
# Minimum ISR spans DCs
min.insync.replicas=2
default.replication.factor=3
# Allow consumers to fetch from the closest (same-rack) replica, per KIP-392
replica.selector.class=org.apache.kafka.common.replica.RackAwareReplicaSelector
Requirements¶
| Requirement | Threshold |
|---|---|
| Network latency | < 10ms RTT between DCs |
| Network bandwidth | Sufficient for replication traffic |
| Controller quorum | Odd number of controller (or ZooKeeper) nodes, ideally spread across three DCs |
Comparison¶
| Aspect | Active-Passive | Active-Active | Stretch Cluster |
|---|---|---|---|
| RPO | Minutes | Near-zero | Zero |
| RTO | Minutes | Near-zero | Seconds |
| Latency impact | None | None | Cross-DC latency |
| Network requirement | Async-capable | Async-capable | Low-latency |
| Topic namespace | Separate | Separate (prefixed) | Single |
| Failover complexity | Manual/automated | Minimal | Automatic |
Consumer Offset Handling¶
MirrorMaker 2 Offset Sync¶
MirrorMaker 2 synchronizes consumer group offsets using checkpoints.
Offset Translation¶
# View checkpoint topic
kafka-console-consumer.sh --bootstrap-server kafka-secondary:9092 \
--topic primary.checkpoints.internal \
--from-beginning \
--property print.key=true
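The translation that checkpoints enable can be modeled with simple arithmetic: each checkpoint pairs an upstream offset with its downstream equivalent, and a committed upstream offset maps to the downstream offset of the nearest checkpoint at or below it. The sketch below is a simplified model of that idea, not MirrorMaker 2's actual implementation:

```java
import java.util.Map;
import java.util.TreeMap;

// Simplified model of checkpoint-based offset translation
// (illustrative only; not MirrorMaker 2's real translation code).
public class OffsetTranslator {
    // upstream offset -> downstream offset, built from the checkpoint stream
    private final TreeMap<Long, Long> checkpoints = new TreeMap<>();

    void recordCheckpoint(long upstreamOffset, long downstreamOffset) {
        checkpoints.put(upstreamOffset, downstreamOffset);
    }

    // Map a committed upstream offset to the closest safe downstream offset;
    // with no checkpoint at or below it, restart from the beginning
    long translate(long committedUpstream) {
        Map.Entry<Long, Long> e = checkpoints.floorEntry(committedUpstream);
        return e != null ? e.getValue() : 0L;
    }
}
```

Rounding down to the nearest checkpoint means a failed-over consumer may reprocess a few records but never skips any, which is the safe direction for at-least-once semantics.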
Monitoring¶
Key Metrics¶
| Metric | Description | Alert Threshold |
|---|---|---|
| `kafka.connect.mirror.record-count` | Records replicated | Sudden drops |
| `kafka.connect.mirror.record-age-ms` | Replication lag | > 60000 ms |
| `kafka.connect.mirror.checkpoint-latency-ms` | Checkpoint delay | > 120000 ms |
| `kafka.connect.mirror.replication-latency-ms` | End-to-end latency | > 30000 ms |
Health Checks¶
# Check MirrorMaker 2 status
curl http://connect:8083/connectors/mirror-source-connector/status
# Check replication lag
kafka-consumer-groups.sh --bootstrap-server kafka-secondary:9092 \
--group mirror-source-connector \
--describe
Best Practices¶
| Practice | Rationale |
|---|---|
| Test failover regularly | Ensure procedures work |
| Monitor replication lag | Detect issues early |
| Use rack awareness | Distribute replicas across DCs |
| Document failover procedures | Reduce MTTR |
| Automate where possible | Reduce human error |
Related Documentation¶
- Kafka Connect - Connect framework
- Operations - Operational procedures
- Fault Tolerance - HA design
- Backup/Restore - DR procedures