This check monitors Flux through the Datadog Agent. Flux is a set of continuous and progressive delivery solutions for Kubernetes that is open and extensible.
Follow the instructions below to install and configure this check for an Agent running on a host. For containerized environments, see the Autodiscovery Integration Templates for guidance on applying these instructions.
This is an example configuration with Kubernetes annotations on your Flux pods. See the sample configuration file for all available configuration options.
Flux logs can be collected from the different Flux pods through Kubernetes. Collecting logs is disabled by default in the Datadog Agent. To enable it, see Kubernetes Log Collection.
Sum of the duration in seconds of a GitOps Toolkit resource reconciliation. Shown as second
fluxcd.gotk.suspend.status (gauge)
The current suspend status of a GitOps Toolkit resource.
fluxcd.leader_election_master_status (gauge)
Gauge of if the reporting system is master of the relevant lease, 0 indicates backup, 1 indicates master. 'name' is the string used to identify the lease. Make sure to group by name.
fluxcd.process.cpu_seconds.count (count)
Total user and system CPU time spent in seconds. Shown as second
fluxcd.process.max_fds (gauge)
Maximum number of open file descriptors.
fluxcd.process.open_fds (gauge)
Number of open file descriptors.
fluxcd.process.resident_memory (gauge)
Resident memory size in bytes. Shown as byte
fluxcd.process.start_time (gauge)
Start time of the process since unix epoch in seconds. Shown as second
fluxcd.process.virtual_memory (gauge)
Virtual memory size in bytes. Shown as byte
fluxcd.process.virtual_memory.max (gauge)
Maximum amount of virtual memory available in bytes. Shown as byte
fluxcd.rest_client_requests.count (count)
Number of HTTP requests, partitioned by status code, method, and host. Shown as request
The number of seconds that has the longest running processor for a workqueue that has been running. Shown as second
fluxcd.workqueue.retries.count (count)
Total number of retries handled by workqueue.
fluxcd.workqueue.unfinished_work (gauge)
The number of seconds of work that has been done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads. One can deduce the number of stuck threads by observing the rate at which this increases. Shown as second