Ambari

Supported OS Linux Mac OS

Integration version4.2.1

Overview

This check monitors Ambari through the Datadog Agent.

Setup

Installation

The Ambari check is included in the Datadog Agent package. No additional installation is needed on your server.

Configuration

Host

To configure this check for an Agent running on a host:

Metric collection
  1. Edit the ambari.d/conf.yaml file, in the conf.d/ folder at the root of your Agent’s configuration directory to start collecting your Ambari performance data. See the sample ambari.d/conf.yaml for all available configuration options.

    init_config:
    
    instances:
      ## @param url - string - required
      ## The URL of the Ambari Server, include http:// or https://
      #
      - url: localhost
    
  2. Restart the Agent.

Log collection

Available for Agent versions >6.0

  1. Collecting logs is disabled by default in the Datadog Agent. Enable it in your datadog.yaml file:

    logs_enabled: true
    
  2. Edit your ambari.d/conf.yaml by uncommenting the logs lines at the bottom. Update the logs path with the correct path to your Ambari log files.

      logs:
        - type: file
          path: /var/log/ambari-server/ambari-alerts.log
          source: ambari
          service: ambari
          log_processing_rules:
              - type: multi_line
                name: new_log_start_with_date
                # 2019-04-22 15:47:00,999
                pattern: \d{4}\-(0?[1-9]|1[012])\-(0?[1-9]|[12][0-9]|3[01])
      ...
    
  3. Restart the Agent.

Containerized

For containerized environments, see the Autodiscovery Integration Templates for guidance on applying the parameters below.

Metric collection
ParameterValue
<INTEGRATION_NAME>ambari
<INIT_CONFIG>blank or {}
<INSTANCE_CONFIG>{"url": "http://%%host%%"}
Log collection

Available for Agent versions >6.0

Collecting logs is disabled by default in the Datadog Agent. To enable it, see Kubernetes Log Collection.

ParameterValue
<LOG_CONFIG>`{“source”: “ambari”, “service”: “<SERVICE_NAME>”, “log_processing_rules”:{“type”:“multi_line”,“name”:“new_log_start_with_date”,“pattern”:"\d{4}-(0?[1-9]

Validation

Run the Agent’s status subcommand and look for ambari under the Checks section.

Data Collected

This integration collects for every host in every cluster the following system metrics:

  • boottime
  • cpu
  • disk
  • memory
  • load
  • network
  • process

If service metrics collection is enabled with collect_service_metrics this integration collects for each included service component the metrics with headers in the inclusion list.

Metrics

ambari.boottime
(gauge)
Host boot time.
Shown as millisecond
ambari.cpu.cpu_idle
(gauge)
Host Idle CPU.
Shown as percent
ambari.cpu.cpu_nice
(gauge)
Host Nice CPU.
Shown as percent
ambari.cpu.cpu_num
(gauge)
Host Idle CPU.
ambari.cpu.cpu_system
(gauge)
Host System CPU.
Shown as percent
ambari.cpu.cpu_user
(gauge)
Host User CPU.
Shown as percent
ambari.cpu.cpu_wio
(gauge)
Host CPU waiting for IO.
Shown as percent
ambari.disk.disk_free
(gauge)
Free disk space.
Shown as byte
ambari.disk.disk_total
(gauge)
Total disk size.
Shown as byte
ambari.disk.read_bytes
(gauge)
Read bytes.
Shown as byte
ambari.disk.read_count
(gauge)
Read count.
ambari.disk.read_time
(gauge)
Disk read time.
Shown as millisecond
ambari.disk.write_bytes
(gauge)
Written bytes.
Shown as byte
ambari.disk.write_count
(gauge)
Written count.
ambari.disk.write_time
(gauge)
Disk write time.
Shown as millisecond
ambari.load_fifteen
(gauge)
Load fifteen.
Shown as percent
ambari.load_five
(gauge)
Load Five.
Shown as percent
ambari.load_one
(gauge)
Load one.
Shown as percent
ambari.memory.mem_cached
(gauge)
Cached Memory.
Shown as byte
ambari.memory.mem_free
(gauge)
Free Memory.
Shown as byte
ambari.memory.mem_shared
(gauge)
Shared Memory.
Shown as byte
ambari.memory.mem_total
(gauge)
Total Memory
Shown as byte
ambari.memory.swap_free
(gauge)
Free Swap
Shown as byte
ambari.memory.swap_total
(gauge)
Total Swap
Shown as byte
ambari.network.bytes_in
(gauge)
Network bytes in.
Shown as byte
ambari.network.bytes_out
(gauge)
Network bytes out.
Shown as byte
ambari.network.pkts_in
(gauge)
Network packets in.
Shown as byte
ambari.network.pkts_out
(gauge)
Network packets out.
Shown as byte
ambari.process.proc_run
(gauge)
Process run.
ambari.process.proc_total
(gauge)
Process total.

Events

Ambari does not include any events.

Service Checks

ambari.can_connect
Returns OK if the cluster is reachable, CRITICAL otherwise.
Statuses: ok, critical

ambari.state
Returns OK if the service is installed or running, WARNING if the service is stopping or uninstalling, CRITICAL if the service is uninstalled or stopped.
Statuses: ok, warning, critical

Troubleshooting

Need help? Contact Datadog support.

PREVIEWING: esther/docs-8632-slo-blog-links