Fiddler

Supported OS Linux Windows Mac OS

Integration version4.0.0

Agent Check: Fiddler

Overview

Fiddler’s Model Performance Management platform monitors Machine Learning model performance by sending real-time alerts when model performance metrics drop, allowing users to analyze inference data to understand why model performance is degrading. This integration includes metrics and an out-of-the-box dashboard that displays performance metrics such as accuracy, traffic, and drift.

Setup

The Fiddler check is not included in the Datadog Agent package, so you need to install it.

Installation

For Agent v7.21+ / v6.21+, follow the instructions below to install the Fiddler check on your host. See Use Community Integrations to install with the Docker Agent or earlier versions of the Agent.

  1. Run the following command to install the Agent integration:

    datadog-agent integration install -t datadog-fiddler==4.0.0
    
  2. Configure your integration similar to Agent-based integrations.

Configuration

  1. Edit the fiddler.d/conf.yaml file, in the conf.d/ folder at the root of your Agent’s configuration directory to start collecting your Fiddler performance data. See the sample fiddler.d/conf.yaml for all available configuration options. The url, org, and fiddler_api_key parameters need to be updated for the Fiddler environment you wish the integration to query. Fiddler also suggests setting the minimum_collection_interval setting in the conf.yaml file to 300 (5 minutes). Additionally, bin_size, v1compat and enabled_metrics can also be configured.

  2. Restart the Agent.

Validation

Run the Agent’s status subcommand and look for fiddler under the Checks section.

Data Collected

Metrics

fiddler.accuracy
(gauge)
The accuracy of the model
Shown as percent
fiddler.histogram_drift
(gauge)
The histogram drift of the model[deprecated]
Shown as percent
fiddler.psi
(gauge)
The PSI of the model
Shown as percent
fiddler.jsd
(gauge)
The JSD of the model
Shown as percent
fiddler.feature_average
(gauge)
The feature average of the model[deprecated]
Shown as percent
fiddler.output_average
(gauge)
The output average of the model[deprecated]
Shown as percent
fiddler.average
(gauge)
The feature average of the model
Shown as percent
fiddler.traffic_count
(gauge)
The traffic count of the model[deprecated]
Shown as percent
fiddler.traffic
(gauge)
The traffic count of the model
Shown as percent
fiddler.fpr
(gauge)
The false positive rate of the model
Shown as percent
fiddler.tpr
(gauge)
The true positive rate of the model[deprecated]
Shown as percent
fiddler.auc
(gauge)
The AUC of the model
Shown as percent
fiddler.f1_score
(gauge)
The F1 Score of the model
Shown as percent
fiddler.precision
(gauge)
The precision of the model
Shown as percent
fiddler.recall
(gauge)
The recall of the model
Shown as percent
fiddler.mape
(gauge)
The MAPE of the model
Shown as percent
fiddler.wmape
(gauge)
The WMAPE of the model
Shown as percent
fiddler.mae
(gauge)
The MAE of the model
Shown as percent
fiddler.mse
(gauge)
The MSE of the model
Shown as percent
fiddler.r2
(gauge)
The R-squared metric of the model
Shown as percent
fiddler.binary_cross_entropy
(gauge)
The binary cross-entropy metric of the model[deprecated]
Shown as percent
fiddler.log_loss
(gauge)
The log-loss metric of the model
Shown as percent
fiddler.expected_callibration_error
(gauge)
The expected calibration error of the model[deprecated]
Shown as percent
fiddler.data_count
(gauge)
The data count metric of the model
Shown as percent
fiddler.auroc
(gauge)
The AUROC metric of the model
Shown as percent
fiddler.callibrated_threshold
(gauge)
The calibrated threshold metric of the model[deprecated]
Shown as percent
fiddler.calibrated_threshold
(gauge)
The calibrated threshold metric of the model
Shown as percent
fiddler.g_mean
(gauge)
The geometric mean metric of the model[deprecated]
Shown as percent
fiddler.geometric_mean
(gauge)
The geometric mean metric of the model
Shown as percent
fiddler.sum
(gauge)
The SUM of a feature the model
Shown as percent
fiddler.expected_calibration_error
(gauge)
The Expected Callibaration Error of the model
Shown as percent

Service Checks

fiddler.can_connect

Returns CRITICAL if the Agent is unable to connect to and collect metrics from the monitored Fiddler instance. Returns OK otherwise.

Statuses: ok, critical

Troubleshooting

Need help? Contact Fiddler support.

PREVIEWING: may/embedded-workflows