Skip to main content

Kafka Performance Metrics

Cluster Metrics

Metric AliasMetric DescriptionUnit
Version
Total PartitionsSum of Partition counts across all nodes in the clusterCount
Total Unreplicated PartitionsSum of unreplicated partitions across all nodes in the clusterCount
Total Offline PartitionsSum of offline partitions across all nodes in the clusterCount
Controller Alive CountNumber of alive Controllers in the clusterCount
Failed Requests (Producer)Sum of producer failed request rates across all nodes in the clusterCount/s
Failed Requests (Consumer)Consumer failed request rate across all nodes in the clusterCount/s
Producer TrafficSum of producer traffic across all nodes in the clusterBytes/s
Consumer TrafficSum of consumer traffic across all nodes in the clusterBytes/s
Producer QPSSum of producer QPS across all nodes in the clusterCount/s
Consumer QPSSum of consumer QPS across all nodes in the clusterCount/s
Alive Node CountNumber of alive nodes in the clusterCount
Incoming Messages per SecondSum of incoming messages per second across all nodes in the clusterCount/s
Max CPU UsageMaximum CPU usage among all nodes in the cluster
Max Memory UsageMaximum memory usage among all nodes in the cluster
Max Heap UsageMaximum heap memory usage among all nodes in the cluster
Max Non-Heap UsageMaximum non-heap memory usage among all nodes in the cluster
Max Minor GC CountMaximum Minor GC count per minute among all nodes in the cluster
Max Major GC CountMaximum Major GC count per minute among all nodes in the cluster
Max Minor GC TimeMaximum Minor GC time per minute among all nodes in the cluster
Max Major GC TimeMaximum Major GC time per minute among all nodes in the cluster

Broker Metrics

Metric AliasMetric DescriptionUnitAlarm MetricDefault Alarm Policy
CPU Usage%
Memory Usage%
Memory UsedBytes
Disk Throughput (Read)Bytes/s
Disk Throughput (Write)Bytes/s
Producer TrafficBytes
Consumer TrafficBytes
Producer QPS
Consumer QPS
Is Controller0: No, 1: Yes
Follower Max Lag From LeaderCount
Total PartitionsTotal number of partitions on this nodeCount
Total Leader PartitionsCount
Total Unreplicated PartitionsCount
ISR Expansion RateCount/s
ISR Shrinkage RateCount/s
Total Offline PartitionsCount
Broker Rejected MessagesByte/s
Producer Failed RequestsCount/s
Consumer Failed RequestsCount/s
Producer Request Response Time Percentilerequest="Produce"ms
Consumer Request Response Time Percentilerequest="FetchConsumer"ms
Fetch Request Rate
Failed Fetch Requests
Zookeeper Disconnection RateClient has disconnected from server and is attempting to reconnect. Session may not have expired.
Zookeeper Session Expiration Rate
Broker Zookeeper Request Response Time Percentile

Topic Metrics

Metric AliasMetric DescriptionUnitAlarm MetricDefault Alarm Policy
Producer Traffic
Consumer Traffic
Producer QPS
Consumer QPS
Topic Size

Consumer Group Metrics

Metric AliasMetric DescriptionUnitAlarm MetricDefault Alarm Policy
Producer QPS
Consumer QPS
Current-Offset
Log-End-Offset
Lag
Consumer Group Status