- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- Administrator's Guide
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
Data Pipeline Lineage is available in Preview. If you're interested in this feature, complete the form to request access.
Request AccessDatadog’s Data Pipeline Lineage helps you monitor data flow throughout your pipelines end-to-end, including ingestion, processing, and storage. With expanded visibility into your streaming data pipelines, data jobs, and data warehouses in a unified view, you can detect issues with your data, identify related upstream failures, and troubleshoot faster.
You can visualize lineage of data between components (streaming data, data processing jobs, data warehouses) with upstream and downstream dependencies, monitor throughput, and detect issues such as consumer lag, schema changes, along with the downstream data impacted.
This feature requires both Data Streams Monitoring and Data Jobs Monitoring.
Type | Technology |
---|---|
Streaming |
|
Processing |
|
Storage |
|
Don’t see your tech stack here? Submit a request.
Set up Data Streams Monitoring on your producer and consumer services. Follow the instructions in the Data Streams Monitoring setup documentation. If you are using Java, ensure that you use the Datadog APM client for Java v1.34.0+.
Set up Data Jobs Monitoring on your Spark workloads. See the instructions for Spark on Kubernetes or Spark on EMR.
Enable Data Streams Monitoring for your Spark jobs. Add -Ddd.data.streams.enabled=true
to your spark-submit
command line.
For example:
spark-submit \
--conf spark.driver.extraJavaOptions="-Ddd.data.jobs.enabled=true -Ddd.data.streams.enabled=true" \
--conf spark.executor.extraJavaOptions="-Ddd.data.jobs.enabled=true -Ddd.data.streams.enabled=true" \
application.jar
For Snowflake services, install APM clients. Install Datadog’s Java or Python APM client for any services that interact with Snowflake. Set the DD_TRACE_REMOVE_INTEGRATION_SERVICE_NAMES_ENABLED
environment variable to true
.
After you set up Data Pipeline Lineage, go the Data Streams Monitoring page in Datadog and select Map to see your visualized pipelines.
추가 유용한 문서, 링크 및 기사: