Monitors

Overview

Datadog Monitors provide vital visibility into your infrastructure, enabling proactive detection and real-time response to performance issues and outages. By configuring monitors to track key metrics and thresholds, organizations can receive immediate alerts and address problems before they impact customers or cause system downtime.

Monitor critical changes by checking metrics, integration availability, and network endpoints through the Alerting platform. With Datadog Monitors you can:

  • Simplify monitoring and response processes
  • Enhance operational efficiency
  • Optimize performance

Get started

The fastest way to start with Datadog Monitors is with Recommended Monitors, a collection of monitors preconfigured by Datadog and its integration partners.

You can also build your own monitors from scratch in lab environments in the Learning Center, or in your application by following the Getting Started with Monitors guide.


Analyze aggregate data

Data should be well-understood, granular, tagged by scope, and long-lived. Use different data types for alerts and diagnostics, based on the level of urgency. Instrument all applications and collect as much relevant data as possible for comprehensive measurements and observability of complex systems.

Measure the health of your applications and the state of your infrastructure with Datadog. Use data from across the Datadog platform to create alerts on potential issues.
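As a rough illustration of how tagged data feeds an alert, the sketch below assembles a metric monitor definition as a plain dictionary. The field names (`name`, `type`, `query`, `message`, `tags`, `options.thresholds`) follow the v1 Monitors API as understood here and should be checked against the current API reference. The helper `build_metric_monitor`, the metric `system.cpu.user`, the tag values, and the thresholds are all hypothetical choices made for this example. Nothing is sent to Datadog.

```python
import json


def build_metric_monitor(metric, scope_tags, critical, warning=None,
                         window="last_5m", message=""):
    """Build a metric monitor definition as a plain dict (hypothetical helper).

    Field names are modelled on the v1 Monitors API and are an assumption;
    verify them against the API reference before submitting anything.
    """
    # Tags narrow the query to a scope; "*" means every reporting source.
    scope = ",".join(scope_tags) if scope_tags else "*"
    # The threshold in the query should match the critical threshold below.
    query = f"avg({window}):avg:{metric}{{{scope}}} > {critical}"

    thresholds = {"critical": critical}
    if warning is not None:
        thresholds["warning"] = warning

    return {
        "name": f"High {metric} on {scope}",
        "type": "metric alert",
        "query": query,
        "message": message,
        "tags": list(scope_tags),
        "options": {"thresholds": thresholds},
    }


monitor = build_metric_monitor(
    "system.cpu.user",
    ["env:prod", "service:web"],
    critical=90,
    warning=80,
    message="CPU usage is high on the web tier.",
)
print(json.dumps(monitor, indent=2))
```

Keeping the definition as data makes it easy to review in version control before it is created through the UI, the API, or an infrastructure-as-code tool.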

Alert on what matters

Set up Monitor Notifications to keep your team informed of issues and provide troubleshooting guidance. Route the notifications to the correct people, leverage template variables to include details, and attach snapshots when sending the alerts by email or Slack.
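A notification message that routes to different recipients and fills in details with template variables might be assembled as in the sketch below. The conditional blocks (`{{#is_alert}}`, `{{#is_recovery}}`) and the variables `{{host.name}}`, `{{value}}`, and `{{threshold}}` are standard monitor template syntax. The helper `build_notification_message`, the runbook URL, and the `@` handles are placeholders invented for this example.

```python
def build_notification_message(runbook_url, alert_handles, recovery_handles=()):
    """Compose a monitor message with conditional sections (hypothetical helper).

    The double-brace tokens are left as literal text; Datadog substitutes
    them when the notification is sent.
    """
    lines = [
        "{{#is_alert}}",
        "CPU on {{host.name}} is at {{value}}, above the threshold of {{threshold}}.",
        "Runbook: " + runbook_url,
        " ".join(alert_handles),       # who is notified when the monitor alerts
        "{{/is_alert}}",
        "{{#is_recovery}}",
        "CPU on {{host.name}} is back to normal.",
        " ".join(recovery_handles),    # who is notified on recovery
        "{{/is_recovery}}",
    ]
    # Drop empty lines, for example when no recovery handles are given.
    return "\n".join(line for line in lines if line)


message = build_notification_message(
    "https://wiki.example.com/runbooks/high-cpu",    # placeholder URL
    alert_handles=["@slack-ops-alerts", "@oncall@example.com"],
    recovery_handles=["@slack-ops-alerts"],
)
print(message)
```

Splitting the message into alert and recovery sections lets the urgent path page a person while the recovery notice goes only to a channel.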

Reduce alert fatigue so teams can focus on the alerts that matter. Create downtimes to mute alerts during planned application maintenance.
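A maintenance window can be described as a scope plus a start and end time. The sketch below computes such a window and returns it as a dictionary. The field names (`scope`, `start`, `end`, `message`) are modelled on the v1 Downtimes API, which is an assumption here; newer API versions use a different shape, so check the API reference before relying on it. The helper `build_downtime` and the tag `env:staging` are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def build_downtime(scope_tags, start, duration, message=""):
    """Describe a downtime window as a plain dict (hypothetical helper).

    Times are converted to POSIX timestamps in seconds. Field names are an
    assumption modelled on the v1 Downtimes API.
    """
    if start.tzinfo is None:
        raise ValueError("start must be timezone-aware to avoid ambiguous windows")
    if duration <= timedelta(0):
        raise ValueError("duration must be positive")

    end = start + duration
    return {
        "scope": list(scope_tags),          # monitors matching these tags are muted
        "start": int(start.timestamp()),
        "end": int(end.timestamp()),
        "message": message,
    }


window_start = datetime(2030, 1, 15, 2, 0, tzinfo=timezone.utc)
downtime = build_downtime(
    ["env:staging"],
    window_start,
    timedelta(hours=2),
    message="Planned database maintenance",
)
print(downtime)
```

Requiring a timezone-aware start time avoids muting alerts at the wrong hour when the script runs on machines in different time zones.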

What’s next

Monitors and alerts are essential tools for ensuring the reliability, performance, and availability of IT systems and applications. They help maintain operational efficiency, improve user experience, and mitigate potential risks by enabling quick detection and response to issues before they escalate. Learn more about Monitor features:

  1. Schedule downtimes to mute monitors.
  2. Organize and manage monitors.
  3. Resolve misconfigured monitors on the Monitor Quality page.

