Overview
This check monitors Cisco SD-WAN controllers through the Datadog Agent using an SD-WAN platform provided by Netnology. The
check enables users to monitor the network health and performance of multiple Cisco SD-WAN controllers simultaneously. Collected
information can then be used for aggregated dashboarding and notifications on configured monitors/alerts.
Currently, only Cisco vManage devices are supported as SD-WAN controller targets.
Setup
The Netnology Cisco SD-WAN integration is not included in the Datadog Agent package, so you need to install it manually.
Installation
For Agent v7.21+ / v6.21+, follow the instructions below to install the check on your host. See Use Community Integrations to install with the Docker Agent or earlier versions of the Agent.
Run the following command to install the Agent integration:
datadog-agent integration install -t nn_sdwan==1.0.1
Configure the integration the same way you would configure a core integration.
Configuration
Edit the nn_sdwan.d/conf.yaml file in the conf.d/ folder at the root of your Agent’s configuration directory to start collecting your Cisco SD-WAN performance data. See sample nn_sdwan.d/conf.yaml for all available configuration options.
Restart the Agent.
Validation
Run the Agent’s status subcommand and look for nn_sdwan under the Checks section.
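The validation step above can be scripted. The following is a minimal sketch, not part of the integration: the helper names `check_is_listed` and `agent_status_text` are hypothetical, and the sketch assumes the status output lists each running check on its own line, beginning with the check name (optionally followed by a version in parentheses). Adjust the matching if your Agent version prints a different layout.

```python
import subprocess


def check_is_listed(status_text: str, check_name: str = "nn_sdwan") -> bool:
    """Return True if some line of the status output begins with the check name.

    Assumption: each check appears on its own line, for example
    "nn_sdwan" or "nn_sdwan (1.0.1)". Lines for other checks whose
    names merely share the prefix (e.g. "nn_sdwan_other") do not match.
    """
    for line in status_text.splitlines():
        stripped = line.strip()
        if stripped == check_name or stripped.startswith(check_name + " "):
            return True
    return False


def agent_status_text() -> str:
    """Run `datadog-agent status` and return its standard output.

    Requires the Agent to be installed on the host and may need
    elevated privileges, depending on the platform.
    """
    result = subprocess.run(
        ["datadog-agent", "status"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Usage on a host with the Agent installed:
#     check_is_listed(agent_status_text())
```

Keeping the text matching separate from the subprocess call makes the matching logic easy to exercise against saved status output.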
Data Collected
Metrics
| Metric | Type | Description | Unit |
| --- | --- | --- | --- |
| nn_sdwan.app_aware_routing.jitter | gauge | The change in latency for packets when traversing a link | millisecond |
| nn_sdwan.app_aware_routing.latency | gauge | The time required for packets to traverse a link | millisecond |
| nn_sdwan.app_aware_routing.loss_percentage | gauge | The percentage of packets lost during transmission | percent |
| nn_sdwan.app_aware_routing.rx_octets | gauge | The number of octets received on a link | byte |
| nn_sdwan.app_aware_routing.tx_octets | gauge | The number of octets transmitted on a link | byte |
| nn_sdwan.cert_summary | gauge | The number of normal, warning, and invalid certificates | |
| nn_sdwan.connection_summary_stats_error | gauge | The number of certificate errors for WAN edges, vBond devices, and vSmart devices | |
| nn_sdwan.connection_summary_stats_total | gauge | The number of certificates for WAN edges, vBond devices, and vSmart devices | |
| nn_sdwan.device_control_status | gauge | The number of devices that are up, partially up, and/or down | |
| nn_sdwan.reboot_count | gauge | The number of times in the past 24 hours that the controller has rebooted | |
| nn_sdwan.site_health | gauge | The number of sites that are up, contain warnings, and/or are down | |
| nn_sdwan.top_app_stats | gauge | The number of bytes sent across a link by application | byte |
| nn_sdwan.transport_interface | gauge | The number of links with active throughput within a particular range | |
| nn_sdwan.vmanage_count | gauge | The number of devices for each status | |
| nn_sdwan.wan_edge_health | gauge | The number of normal, warning and error WAN edges | |
| nn_sdwan.wan_edge_inventory | gauge | The number of deployed, authorized, staging, and total WAN edges | |
Events
The Netnology Cisco SD-WAN integration does not include any events.
Service Checks
nn_sdwan.sdwan_controller.online
Returns CRITICAL if the remote host cannot be pinged, WARNING if the controller can be pinged but is not responding to API calls, and OK otherwise.
Statuses: ok, warning, critical
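The decision rule for nn_sdwan.sdwan_controller.online can be written as a small function. This is an illustrative sketch of the rule as described above, not the integration's actual source: the `Status` enum, the function name `controller_online_status`, and the two boolean inputs are hypothetical names chosen for the example.

```python
from enum import Enum


class Status(Enum):
    """The three statuses the service check can report."""
    OK = "ok"
    WARNING = "warning"
    CRITICAL = "critical"


def controller_online_status(ping_ok: bool, api_ok: bool) -> Status:
    """Map reachability results to a service check status.

    ping_ok: whether the remote host answered a ping.
    api_ok:  whether the controller responded to API calls.
    """
    if not ping_ok:
        # Host unreachable: reported as CRITICAL regardless of the API result.
        return Status.CRITICAL
    if not api_ok:
        # Host reachable, but the controller API is not responding.
        return Status.WARNING
    return Status.OK
```

Note that the ping result takes precedence: a host that cannot be pinged is reported as critical even if an API result happens to be available.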
Troubleshooting
Need help? Contact Datadog support.