Service checks monitor the up or down status of a specific service. Alerts are triggered when the monitoring Agent fails to connect to the service in a specified number of consecutive checks. The Check Status widget can visually display service degradation, service failures, cluster-wide issues, drops in throughput, or increases in latency in your dashboard. For more information, see the Service check documentation.
Check status shows the current status or number of results for any check performed:
Choose a reporting time frame. This time frame always includes up to the present, so you can choose an option such as The past 10 minutes or The past 1 day and it reports a status that includes that time frame up to the present moment. If you choose Global Time, the person using the dashboard can select a range using the time frame selector in the upper right, but they must choose one that includes the present moment, that is any past X time frame. Otherwise the widget is blank.
Choose your scope:
A single check: Select this option if your Check Status widget is for a specific element only, for example: one host:<HOSTNAME>, one service:<SERVICE_NAME>.
A cluster of checks: Select this option if your Check Status widget is for a scope of elements as in all hosts, or all services.
After selecting your scope, define your Check Status widget context with the Reported by field.
For the scope A Cluster of checks, you have the option to select a subset with the Group by field. Note: The check status does not show you the count of checks per group, it shows the count of groups running the check. For example, if you are monitoring Agent Up, grouped by env, the check status shows you the number of env that matches your scope configurations and is running the Agent, not the count of Agents in an environment.
List of tag prefixes to group by in the case of a cluster check.
grouping [required]
enum
The kind of grouping to use.
Allowed enum values: check,cluster
tags
[string]
List of tags used to filter the groups reporting a cluster check.
time
<oneOf>
Time setting for the widget.
Option 1
object
Wrapper for live span
live_span
enum
The available timeframes depend on the widget you are using.
Allowed enum values: 1m,5m,10m,15m,30m,1h,4h,1d,2d,1w,1mo,3mo,6mo,week_to_date,month_to_date,1y,alert
Option 2
object
Used for arbitrary live span times, such as 17 minutes or 6 hours.
type [required]
enum
Type "live" denotes a live span in the new format.
Allowed enum values: live
unit [required]
enum
Unit of the time span.
Allowed enum values: minute,hour,day,week,month,year
value [required]
int64
Value of the time span.
Option 3
object
Used for fixed span times, such as 'March 1 to March 7'.
from [required]
int64
Start time in seconds since epoch.
to [required]
int64
End time in seconds since epoch.
type [required]
enum
Type "fixed" denotes a fixed span.
Allowed enum values: fixed
title
string
Title of the widget.
title_align
enum
How to align the text on the widget.
Allowed enum values: center,left,right
title_size
string
Size of the title.
type [required]
enum
Type of the check status widget.
Allowed enum values: check_status