Kubernetes Data Collected

This page lists data collected by the Datadog Agent when deployed on a Kubernetes cluster. The set of metrics collected may vary depending on the version of Kubernetes in use.

Note: For Windows containers, see Limited metrics for Windows deployments.

Metrics

Kubernetes

Note: For more information about kubernetes.cpu.* metrics, see Discrepancies in kubernetes.cpu.* and container.cpu.* metrics.

Kubelet

For more information, see the documentation for the Kubelet integration.

Kubernetes state metrics core

For more information, see the documentation for the Kubernetes state metrics core integration. This check requires Datadog Cluster Agent v1.12 or later.

Kubernetes state

Note: kubernetes_state.* metrics are gathered from the kube-state-metrics API. The kubernetes_state check is a legacy check. For an alternative, see Kubernetes state metrics core. Datadog recommends that you do not enable both checks simultaneously.

Kubernetes DNS

Kubernetes proxy

Kubernetes API server

For more information, see the documentation for the Kubernetes API server integration.

Kubernetes controller manager

For more information, see the documentation for the Kubernetes controller manager integration.

Kubernetes metrics server

For more information, see the documentation for the Kubernetes metrics server integration.

Kubernetes scheduler

For more information, see the documentation for the Kubernetes scheduler integration.

Events

  • Backoff
  • Conflict
  • Delete
  • DeletingAllPods
  • Didn’t have enough resource
  • Error
  • Failed
  • FailedCreate
  • FailedDelete
  • FailedMount
  • FailedSync
  • Failedvalidation
  • FreeDiskSpaceFailed
  • HostPortConflict
  • InsufficientFreeCPU
  • InsufficientFreeMemory
  • InvalidDiskCapacity
  • Killing
  • KubeletsetupFailed
  • NodeNotReady
  • NodeoutofDisk
  • OutofDisk
  • Rebooted
  • TerminatedAllPods
  • Unable
  • Unhealthy

Service checks

Kubelet

For more information, see the documentation for the Kubelet integration.

Kubernetes controller manager

For more information, see the documentation for the Kubernetes controller manager integration.

Kubernetes metrics server

For more information, see the documentation for the Kubernetes metrics server integration.

Kubernetes scheduler

For more information, see the documentation for the Kubernetes scheduler integration.

Kubernetes state metrics core

For more information, see the documentation for the Kubernetes state metrics core integration.

kubernetes_state.cronjob.complete
Whether the last job of the cronjob is failed or not. Tags:kube_cronjob kube_namespace (env service version from standard labels).
kubernetes_state.cronjob.on_schedule_check
Alert if the cronjob’s next schedule is in the past. Tags:kube_cronjob kube_namespace (env service version from standard labels).
kubernetes_state.job.complete
Whether the job is failed or not. Tags:kube_job or kube_cronjob kube_namespace (env service version from standard labels).
kubernetes_state.node.ready
Whether the node is ready. Tags:node condition status.
kubernetes_state.node.out_of_disk
Whether the node is out of disk. Tags:node condition status.
kubernetes_state.node.disk_pressure
Whether the node is under disk pressure. Tags:node condition status.
kubernetes_state.node.network_unavailable
Whether the node network is unavailable. Tags:node condition status.
kubernetes_state.node.memory_pressure
Whether the node network is under memory pressure. Tags:node condition status.

Further Reading

PREVIEWING: may/embedded-workflows