Serverless Monitoring for AWS Step Functions

このページは日本語には対応しておりません。随時翻訳に取り組んでいます。翻訳に関してご質問やご意見ございましたら、お気軽にご連絡ください。

AWS Step Functions is a serverless orchestration service that lets developers create and manage multi-step application workflows. In addition to getting Cloudwatch metrics from Datadog’s AWS Step Functions integration, Datadog also provides AWS Step Function tracing, logs, and enhanced metrics through the collection of Cloudwatch logs.

An AWS Step Function oveview tab.

How it works

Datadog AWS Step Functions Monitoring makes use of Cloudwatch metrics from the AWS Step Functions integration and Cloudwatch logs sent through the Datadog Forwarder or Amazon Data Firehose. Both Forwarder and Firehose run in your environment. Sending Cloudwatch logs provides tracing and enhanced metrics.

Monitor the overall health of Step Functions in the Serverless view

The Serverless view shows key metrics for your Step Functions in one place to easily provide a snapshot of the health of your Step Functions. You can access a detailed view of each Step Function to see all associated metrics, logs, and traces within a certain time frame and set monitors for problematic executions.

An AWS Step Function visualization with span tags.

Visualize AWS Step Function Traces on a State Machine Map

When Step Function tracing is enabled, you can use a visual representation of an AWS Step Function execution through a state machine map. Get an at-a-glance-view of whether a successful or failed execution took the expected path through the state machine. Drill into any anomalous executions to identify what states are problematic or have a high latency.

An AWS Step Function span displayed in a flame graph visualization.

Reduce Step Function debugging time with detailed execution traces

You can view end-to-end traces for a single Step Function execution and its associated logs, errors, and metrics, which enable you to identify issues in your Step Function logic. Step Function spans also contain rich metadata for step inputs and outputs, associated Lambda traces, and step duration length that help you to reproduce bugs and fix bottlenecks.

To get started, follow the installation instructions.

Further Reading

PREVIEWING: rtrieu/product-analytics-ui-changes