On-Call

This page is not yet available in Spanish. We are working on its translation.
If you have any questions or feedback about our current translation project, feel free to reach out to us!

On-Call is not supported for your selected Datadog site ().

Join the Beta!

On-Call is in private beta. Complete the form to request access.

Request Access

Datadog On-Call integrates monitoring, paging, and incident response into one platform.

Overview of how Pages are routed. From a monitor, incident, security signal, or API call, the Page is sent to a Team (e.g. 'payments-team'), then to processing rules (e.g. based on priority) then to an escalation policy. There, it can be sent to a schedule or directly to a user.

Concepts

  • Pages represent something to get alerted for, such as a monitor, incident, or security signal. A Page can have a status of Triggered, Acknowledged, or Resolved.
  • Teams are groups configured within Datadog to handle specific types of Pages, based on expertise and operational roles.
  • Processing rules allow Teams to finely adjust their reactions to specific types of incoming events. These rules can set a Page’s urgency level and route Pages to different escalation policies depending on the event’s metadata.
  • Escalation policies determine how Pages are escalated within or across Teams.
  • Schedules set timetables for when specific Team members are on-call to respond to Pages.

How it works

Teams are the central organizational unit of Datadog On-Call. When a notification is triggered in Datadog, a Page is sent to the designated On-Call Team.

Notification that mentions an On-Call Team.

Each Team owns escalation policies and schedules. Escalation policies define how a Page is sent to various schedules, such as Checkout Operations - Interrupt Handler, Primary, and Secondary in the following screenshot. Each Team can also configure processing rules to route Pages to different escalation policies.

A sample escalation policy.

A schedule defines specific times when Team members are assigned to respond to Pages. Schedules organize and manage the availability of Team members across different time zones and shifts.

A sample schedule, with multiple layers for JP, EU, and US business hours.

Start using Datadog On-Call

To get started with On-Call, onboard an On-Call Team and ensure that all Team members configure their On-Call profile settings to receive notifications.


Further Reading

Más enlaces, artículos y documentación útiles:

PREVIEWING: brett0000FF/node-compatibility