Skip to content

AxonOps Kafka Controller Dashboard Metrics Mapping

Overview

The Kafka Controller Dashboard monitors the health and performance of the Kafka controller, particularly in KRaft mode (Kafka without ZooKeeper). It tracks Raft consensus metrics, metadata operations, authentication rates, and controller state changes.

Metrics Mapping

Dashboard Metric Description Attributes
Controller State Metrics
kaf_KafkaController_FencedBrokerCount Number of fenced (isolated) brokers -
kaf_KafkaController_LastAppliedRecordTimestamp Timestamp of last applied metadata record -
kaf_KafkaController_MetadataErrorCount Count of metadata errors -
Raft Consensus Metrics
kaf_raft_metrics_ (function='current_leader') Current Raft leader node ID function='current_leader'
kaf_raft_metrics_ (function='log_end_offset') End offset of the Raft log function='log_end_offset'
kaf_raft_metrics_ (function='commit_latency_avg') Average commit latency function='commit_latency_avg'
kaf_raft_metrics_ (function='fetch_records_rate') Rate of fetching records function='fetch_records_rate'
Raft Channel Metrics
kaf_raft_channel_metrics_ (function='request_rate') Raft request rate function='request_rate'
kaf_raft_channel_metrics_ (function='successful_authentication_rate') Successful authentication rate function='successful_authentication_rate'
kaf_raft_channel_metrics_ (function='failed_reauthentication_rate') Failed re-authentication rate function='failed_reauthentication_rate'

Query Examples

Controller State

// Fenced broker count
kaf_KafkaController_FencedBrokerCount

// Last applied record timestamp rate of change
kaf_KafkaController_LastAppliedRecordTimestamp{axonfunction='rate'}

// Metadata error rate
kaf_KafkaController_MetadataErrorCount{axonfunction='rate'}

Raft Leader Information

// Current Raft leader
kaf_raft_metrics_{function='current_leader'}

// Raft log end offset changes
sum(kaf_raft_metrics_{function='log_end_offset'}) by (host_id)

Raft Performance

// Commit latency average
kaf_raft_metrics_{function='commit_latency_avg'}

// Fetch records rate
kaf_raft_metrics_{function='fetch_records_rate'}

// Request rate
kaf_raft_channel_metrics_{function='request_rate'}

Authentication Metrics

// Successful authentication rate
kaf_raft_channel_metrics_{function='successful_authentication_rate'}

// Failed re-authentication rate
kaf_raft_channel_metrics_{function='failed_reauthentication_rate'}

Panel Organization

Overview Section

  • Empty row for spacing/organization

Controller

  • Fenced Broker Count (counter)
  • Current Raft Leader (counter)
  • Last record offset timestamp
  • Raft log offset change
  • Commit Latency Avg
  • Fetch Records Rate
  • Raft Request Rate
  • Metadata Error Rate
  • New Active Controllers Count (placeholder)

Authentication

  • Successful Auth Rate
  • Failed Auth Rate

Filters

Note: The controller dashboard has no configurable filters, as controller metrics are cluster-wide.

Best Practices

Controller Health Monitoring

  • Fenced broker count should be 0 for healthy clusters
  • Monitor metadata error rate for controller issues
  • Track last applied record timestamp for activity

Raft Consensus Monitoring

  • Ensure stable Raft leader (minimal changes)
  • Monitor commit latency for consensus performance
  • High fetch records rate indicates active metadata changes

Log Growth Monitoring

  • Track Raft log offset growth rate
  • Rapid growth may indicate frequent metadata changes
  • Monitor for log compaction effectiveness

Authentication Monitoring

  • High failed authentication rates indicate security issues
  • Monitor successful auth rate for normal operations
  • Investigate spikes in failed re-authentication

Performance Tuning

  • Low commit latency ensures fast metadata propagation
  • Monitor request rates for controller load
  • Balance metadata operations across the cluster

Troubleshooting

  • Fenced brokers indicate network or configuration issues
  • Metadata errors suggest controller processing problems
  • Authentication failures may indicate credential issues

KRaft Mode Considerations

  • Controller metrics are specific to KRaft mode
  • In ZooKeeper mode, different metrics apply
  • Monitor for smooth leader elections and stable consensus