
Metrics API


NOTE: The metrics API may change in the future; this page serves as a snapshot of the current metrics.

Admin

Administrators can monitor the Serving control plane based on the metrics exposed by each Serving component. The metrics are listed below.

Activator

The following metrics allow the user to understand how an application responds when traffic goes through the Activator, for example when scaling from zero. High request latency, for instance, means that requests are taking too long to be fulfilled.
| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| request_concurrency | Concurrent requests that are routed to Activator.<br>These are requests reported by the concurrency reporter, which may not be done yet.<br>This is the average concurrency over a reporting period | Gauge | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>revision_name<br>service_name | Dimensionless | Stable |
| request_count | The number of requests that are routed to Activator.<br>These are requests that have been fulfilled from the activator handler. | Counter | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Dimensionless | Stable |
| request_latencies | The response time in milliseconds for the fulfilled routed requests | Histogram | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Milliseconds | Stable |
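
For instance, to watch for slow cold starts you could track the upper percentiles of request_latencies per revision. The following is a minimal sketch, assuming these metrics are scraped into a Prometheus server; the Prometheus URL, the `activator_` name prefix, and the 1-second threshold are assumptions for illustration, not part of the metrics API.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

# 95th-percentile activator request latency per revision over the last 5 minutes.
# The "activator_" prefix and "_bucket" suffix assume a typical Prometheus export;
# adjust the name to match how your cluster exposes Knative metrics.
QUERY = (
    "histogram_quantile(0.95, "
    "sum(rate(activator_request_latencies_bucket[5m])) by (le, revision_name))"
)

resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": QUERY}, timeout=10)
resp.raise_for_status()

for sample in resp.json()["data"]["result"]:
    revision = sample["metric"].get("revision_name", "<unknown>")
    p95_ms = float(sample["value"][1])
    if p95_ms > 1000:  # flag revisions whose p95 latency exceeds 1 second (arbitrary threshold)
        print(f"revision {revision}: p95 latency {p95_ms:.0f} ms")
```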

Autoscaler

The Autoscaler component exposes a number of metrics related to its decisions per revision. For example, at any given time you can monitor the number of pods the Autoscaler wants to allocate for a service, the average number of requests per second during the stable window, whether the Autoscaler is in panic mode (KPA), and so on. For more information about how the Autoscaler works, see the Autoscaling documentation.
| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| desired_pods | Number of pods the autoscaler wants to allocate | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| excess_burst_capacity | Excess burst capacity observed over the stable window | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| stable_request_concurrency | Average number of requests per observed pod over the stable window | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| panic_request_concurrency | Average number of requests per observed pod over the panic window | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| target_concurrency_per_pod | The desired number of concurrent requests for each pod | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| stable_requests_per_second | Average requests-per-second per observed pod over the stable window | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| panic_requests_per_second | Average requests-per-second per observed pod over the panic window | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| target_requests_per_second | The desired requests-per-second for each pod | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| panic_mode | 1 if the autoscaler is in panic mode, 0 otherwise | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| requested_pods | Number of pods the autoscaler requested from Kubernetes | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| actual_pods | Number of pods that are allocated and currently in a ready state | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| not_ready_pods | Number of pods that are currently not ready | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| pending_pods | Number of pods that are currently pending | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
| terminating_pods | Number of pods that are currently terminating | Gauge | configuration_name<br>namespace_name<br>revision_name<br>service_name | Dimensionless | Stable |
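
As a rough illustration, the sketch below compares desired_pods with actual_pods and flags revisions where the Autoscaler is in panic mode. It assumes the metrics are available from a Prometheus server; the Prometheus URL and the `autoscaler_` name prefix are assumptions and may differ in your cluster.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

def instant_query(promql: str) -> dict:
    """Run an instant PromQL query and return {revision_name: value}."""
    resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": promql}, timeout=10)
    resp.raise_for_status()
    return {
        s["metric"].get("revision_name", "<unknown>"): float(s["value"][1])
        for s in resp.json()["data"]["result"]
    }

# The "autoscaler_" prefix assumes a typical scrape configuration; adjust as needed.
desired = instant_query("autoscaler_desired_pods")
actual = instant_query("autoscaler_actual_pods")
panic = instant_query("autoscaler_panic_mode")

for revision, want in desired.items():
    have = actual.get(revision, 0.0)
    mode = "PANIC" if panic.get(revision, 0.0) == 1 else "stable"
    if want != have:
        print(f"{revision}: desired={want:.0f} actual={have:.0f} mode={mode}")
```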

Controller

The following metrics are emitted by any component that implements controller logic. They show details about reconciliation operations and the behavior of the workqueue on which reconciliation requests are enqueued.

| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| work_queue_depth | Depth of the work queue | Gauge | reconciler | Dimensionless | Stable |
| reconcile_count | Number of reconcile operations | Counter | reconciler<br>success | Dimensionless | Stable |
| reconcile_latency | Latency of reconcile operations | Histogram | reconciler<br>success | Milliseconds | Stable |
| workqueue_adds_total | Total number of adds handled by workqueue | Counter | name | Dimensionless | Stable |
| workqueue_depth | Current depth of workqueue | Gauge | reconciler | Dimensionless | Stable |
| workqueue_queue_latency_seconds | How long in seconds an item stays in workqueue before being requested | Histogram | name | Seconds | Stable |
| workqueue_retries_total | Total number of retries handled by workqueue | Counter | name | Dimensionless | Stable |
| workqueue_work_duration_seconds | How long in seconds processing an item from a workqueue takes | Histogram | name | Seconds | Stable |
| workqueue_unfinished_work_seconds | How long in seconds the outstanding workqueue items have been in flight (total) | Histogram | name | Seconds | Stable |
| workqueue_longest_running_processor_seconds | How long in seconds the longest outstanding workqueue item has been in flight | Histogram | name | Seconds | Stable |
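
A persistently deep work queue usually means the controller cannot keep up with reconciliation requests. The following is a minimal sketch, assuming the controller metrics are scraped into Prometheus; the Prometheus URL, the `controller_` name prefix, and the depth threshold are assumptions for illustration.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

# Work queue depth per reconciler; the "controller_" prefix is an assumption and
# depends on how the metrics are exported in your cluster.
QUERY = "controller_work_queue_depth"

resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": QUERY}, timeout=10)
resp.raise_for_status()

for sample in resp.json()["data"]["result"]:
    reconciler = sample["metric"].get("reconciler", "<unknown>")
    depth = float(sample["value"][1])
    if depth > 10:  # arbitrary threshold: a sustained backlog suggests the controller is falling behind
        print(f"reconciler {reconciler}: queue depth {depth:.0f}")
```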

Webhook

Webhook metrics report useful information about operations on Serving resources (for example, CREATE operations) and whether admission was allowed. For example, if a large number of operations fail, this could indicate an issue with the submitted user resources.
| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| request_count | The number of requests that are routed to webhook | Counter | admission_allowed<br>kind_group<br>kind_kind<br>kind_version<br>request_operation<br>resource_group<br>resource_namespace<br>resource_resource<br>resource_version | Dimensionless | Stable |
| request_latencies | The response time in milliseconds | Histogram | admission_allowed<br>kind_group<br>kind_kind<br>kind_version<br>request_operation<br>resource_group<br>resource_namespace<br>resource_resource<br>resource_version | Milliseconds | Stable |
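
One way to spot problematic user resources is to watch the fraction of webhook requests that are denied admission. A minimal sketch follows, assuming the metrics are in Prometheus; the Prometheus URL, the `webhook_` name prefix, the "false" value for the admission_allowed tag, and the 5% threshold are all assumptions for illustration.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

# Fraction of webhook requests denied admission over the last 10 minutes, per resource kind.
# The metric name, prefix, and the admission_allowed="false" label value are assumptions;
# check the actual exported series in your cluster.
QUERY = (
    'sum(rate(webhook_request_count{admission_allowed="false"}[10m])) by (kind_kind) '
    "/ sum(rate(webhook_request_count[10m])) by (kind_kind)"
)

resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": QUERY}, timeout=10)
resp.raise_for_status()

for sample in resp.json()["data"]["result"]:
    kind = sample["metric"].get("kind_kind", "<unknown>")
    denied_ratio = float(sample["value"][1])
    if denied_ratio > 0.05:  # more than 5% denied: likely malformed user resources
        print(f"{kind}: {denied_ratio:.1%} of admission requests denied")
```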

Go Runtime - memstats

Each Knative Serving control plane process emits a number of Go runtime memory statistics (shown next). As a baseline for monitoring purposes, you could start with a subset of the metrics: current allocations (go_alloc), total allocations (go_total_alloc), system memory (go_sys), mallocs (go_mallocs), frees (go_frees), total garbage collection pause time (go_total_gc_pause_ns), the next GC target heap size (go_next_gc), and the number of garbage collection cycles (go_num_gc).
| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| go_alloc | The number of bytes of allocated heap objects (same as heap_alloc) | Gauge | name | Dimensionless | Stable |
| go_total_alloc | The cumulative bytes allocated for heap objects | Gauge | name | Dimensionless | Stable |
| go_sys | The total bytes of memory obtained from the OS | Gauge | name | Dimensionless | Stable |
| go_lookups | The number of pointer lookups performed by the runtime | Gauge | name | Dimensionless | Stable |
| go_mallocs | The cumulative count of heap objects allocated | Gauge | name | Dimensionless | Stable |
| go_frees | The cumulative count of heap objects freed | Gauge | name | Dimensionless | Stable |
| go_heap_alloc | The number of bytes of allocated heap objects | Gauge | name | Dimensionless | Stable |
| go_heap_sys | The number of bytes of heap memory obtained from the OS | Gauge | name | Dimensionless | Stable |
| go_heap_idle | The number of bytes in idle (unused) spans | Gauge | name | Dimensionless | Stable |
| go_heap_in_use | The number of bytes in in-use spans | Gauge | name | Dimensionless | Stable |
| go_heap_released | The number of bytes of physical memory returned to the OS | Gauge | name | Dimensionless | Stable |
| go_heap_objects | The number of allocated heap objects | Gauge | name | Dimensionless | Stable |
| go_stack_in_use | The number of bytes in stack spans | Gauge | name | Dimensionless | Stable |
| go_stack_sys | The number of bytes of stack memory obtained from the OS | Gauge | name | Dimensionless | Stable |
| go_mspan_in_use | The number of bytes of allocated mspan structures | Gauge | name | Dimensionless | Stable |
| go_mspan_sys | The number of bytes of memory obtained from the OS for mspan structures | Gauge | name | Dimensionless | Stable |
| go_mcache_in_use | The number of bytes of allocated mcache structures | Gauge | name | Dimensionless | Stable |
| go_mcache_sys | The number of bytes of memory obtained from the OS for mcache structures | Gauge | name | Dimensionless | Stable |
| go_bucket_hash_sys | The number of bytes of memory in profiling bucket hash tables | Gauge | name | Dimensionless | Stable |
| go_gc_sys | The number of bytes of memory in garbage collection metadata | Gauge | name | Dimensionless | Stable |
| go_other_sys | The number of bytes of memory in miscellaneous off-heap runtime allocations | Gauge | name | Dimensionless | Stable |
| go_next_gc | The target heap size of the next GC cycle | Gauge | name | Dimensionless | Stable |
| go_last_gc | The time the last garbage collection finished, as nanoseconds since 1970 (the UNIX epoch) | Gauge | name | Nanoseconds | Stable |
| go_total_gc_pause_ns | The cumulative nanoseconds in GC stop-the-world pauses since the program started | Gauge | name | Nanoseconds | Stable |
| go_num_gc | The number of completed GC cycles | Gauge | name | Dimensionless | Stable |
| go_num_forced_gc | The number of GC cycles that were forced by the application calling the GC function | Gauge | name | Dimensionless | Stable |
| go_gc_cpu_fraction | The fraction of this program's available CPU time used by the GC since the program started | Gauge | name | Dimensionless | Stable |

NOTE: The name tag is empty.
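
As a starting point for the baseline above, the sketch below pulls two of these gauges for each control plane process from Prometheus. The Prometheus URL and the pod/instance labels used for grouping are assumptions; the metric names match the table above but may carry a prefix in your setup.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

def instant_query(promql: str) -> list:
    """Run an instant PromQL query and return the raw result vector."""
    resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": promql}, timeout=10)
    resp.raise_for_status()
    return resp.json()["data"]["result"]

# Heap in use and cumulative GC pause time per scraped control plane process.
for sample in instant_query("go_heap_in_use"):
    target = sample["metric"].get("pod", sample["metric"].get("instance", "<unknown>"))
    print(f"{target}: heap in use {float(sample['value'][1]) / 1024 / 1024:.1f} MiB")

for sample in instant_query("go_total_gc_pause_ns"):
    target = sample["metric"].get("pod", sample["metric"].get("instance", "<unknown>"))
    print(f"{target}: total GC pause {float(sample['value'][1]) / 1e6:.1f} ms")
```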

Developer - User Services

Every Knative Service has a proxy container (queue-proxy) that proxies connections to the application container. A number of metrics are reported for the queue proxy's performance. Using the following metrics, application developers and operators can measure whether requests are queued at the proxy side (indicating a need for backpressure) and what the actual delay is in serving requests at the application side.

Queue proxy

Requests endpoint

| Metric Name | Description | Type | Tags | Unit | Status |
|:-|:-|:-|:-|:-|:-|
| revision_request_count | The number of requests that are routed to queue-proxy | Counter | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Dimensionless | Stable |
| revision_request_latencies | The response time in milliseconds | Histogram | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Milliseconds | Stable |
| revision_app_request_count | The number of requests that are routed to user-container | Counter | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Dimensionless | Stable |
| revision_app_request_latencies | The response time in milliseconds | Histogram | configuration_name<br>namespace_name<br>pod_name<br>response_code<br>response_code_class<br>revision_name<br>service_name | Milliseconds | Stable |
| revision_queue_depth | The current number of items in the serving and waiting queue, or not reported if unlimited concurrency | Gauge | configuration_name<br>container_name<br>namespace_name<br>pod_name<br>response_code_class<br>revision_name<br>service_name | Dimensionless | Stable |
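
To estimate how much time requests spend queued at the proxy rather than in the application, you can compare the two latency histograms above. A minimal sketch follows, assuming these metrics are available in Prometheus; the Prometheus URL and any name prefix added by your exporter are assumptions.

```python
import requests  # third-party HTTP client

PROMETHEUS_URL = "http://prometheus.example.com:9090"  # hypothetical Prometheus endpoint

def p95(metric: str) -> dict:
    """95th-percentile latency per revision over the last 5 minutes, in milliseconds."""
    promql = f"histogram_quantile(0.95, sum(rate({metric}_bucket[5m])) by (le, revision_name))"
    resp = requests.get(f"{PROMETHEUS_URL}/api/v1/query", params={"query": promql}, timeout=10)
    resp.raise_for_status()
    return {
        s["metric"].get("revision_name", "<unknown>"): float(s["value"][1])
        for s in resp.json()["data"]["result"]
    }

proxy = p95("revision_request_latencies")    # measured at queue-proxy
app = p95("revision_app_request_latencies")  # measured at the user-container

for revision, total_ms in proxy.items():
    app_ms = app.get(revision)
    if app_ms is not None:
        # The gap roughly approximates time spent queued in the proxy before reaching the app.
        queueing_ms = max(total_ms - app_ms, 0)
        print(f"{revision}: proxy p95 {total_ms:.0f} ms, app p95 {app_ms:.0f} ms, "
              f"queueing ~{queueing_ms:.0f} ms")
```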