- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
A process check monitor watches the status produced by the Agent check process.up
. At the Agent level you can configure your check thresholds based on the number of matching processes.
To create a process check monitor in Datadog, use the main navigation: Monitors –> New Monitor –> Process Check.
From the dropdown list, select a process to monitor. Filter the list by entering your search criteria.
Select the hosts to monitor by choosing host names, tags, or choose All Monitored Hosts
. Only hosts or tags reporting a status for the selected process are displayed. If you need to exclude certain hosts, use the second field to list names or tags.
AND
logic. All listed host names and tags must be present on a host for it to be included.OR
logic. Any host with a listed name or tag is excluded.A check alert tracks consecutive statuses submitted per check grouping and compares it to your thresholds. For process check monitors, the groups are static: host
and process
.
Set up the check alert:
Trigger the alert after selected consecutive failures: <NUMBER>
Each check run submits a single status of OK
, WARN
, or CRITICAL
. Choose how many consecutive runs with the WARN
and CRITICAL
status trigger a notification. For example, your process might have a single blip where connection fails. If you set this value to > 1
, the blip is ignored but a problem with more than one consecutive failure triggers a notification.
Resolve the alert after selected consecutive successes: <NUMBER>
Choose how many consecutive runs with the OK
status resolves the alert.
A cluster alert calculates the percent of process checks in a given status and compares it to your thresholds.
Set up a cluster alert:
Decide whether or not to group your process checks according to a tag. Ungrouped
calculates the status percentage across all sources. Grouped
calculates the status percentage on a per group basis.
Select the percentage for alert and warn thresholds. Only one setting (alert or warn) is required.
Each check tagged with a distinct combination of tags is considered to be a distinct check in the cluster. Only the status of the last check of each combination of tags is taken into account in the cluster percentage calculation.
For example, a cluster check monitor grouped by environment can alert if more that 70% of the checks on any of the environments submit a CRITICAL
status, and warn if more that 70% of the checks on any of the environments submit a WARN
status.
See the Monitor configuration documentation for information on No data, Auto resolve, and New group delay options.
For detailed instructions on the Configure notifications and automations section, see the Notifications page.