- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
Supported OS
ProphetStor Federator.ai is an AI-based solution designed to enhance computing resource management for Kubernetes and Virtual Machine (VM) clusters. With its holistic observability of IT operations, including multi-tenant Large Language Model (LLM) training, resources for mission-critical applications, namespaces, nodes, and clusters can be efficiently allocated, and KPIs can be effectively achieved with minimum resource wastage.
Using advanced machine learning algorithms to predict application workloads, Federator.ai offers:
ProphetStor Federator.ai provides full-stack observability through its APIs integrated with Datadog Agents, from application-level workloads, including LLM training, to cluster-level resource consumption. This integration fosters a dynamic loop between live monitoring and predictive analytics, continuously improving resource management, optimizing costs, and ensuring efficient application operation. You can easily track and predict the resource usages of Kubernetes containers, namespaces, and cluster nodes to make the right recommendations to prevent costly over-provisioning or performance-impacting under-provisioning. With easy integration to CI/CD pipeline, Federator.ai enables continuous optimization of containers whenever they are deployed in a Kubernetes cluster. Using application workload predictions, Federator.ai auto-scales application containers at the right time and optimizes performance with the right number of container replicas through Kubernetes HPA or Datadog Watermark Pod Autoscaling (WPA).
For additional information on Federator.ai, see the ProphetStor Federator.ai Feature Demo and ProphetStor Federator.ai for Datadog videos.
ProphetStor Federator.ai Cluster Overview
Cluster Resource Usage Predictions and Recommendations
Cluster Node Resource Usage Predictions and Recommendations
Node Current/Predicted Memory Usage (Daily)
Node Current/Predicted Memory Usage (Weekly)
Node Current/Predicted Memory Usage (Monthly)
Node Current/Predicted CPU Usage (Daily)
Node Current/Predicted CPU Usage (Weekly)
Node Current/Predicted CPU Usage (Monthly)
ProphetStor Federator.ai Application Overview
Workload Prediction for Next 24 Hours
Workload Prediction for Next 7 Days
Workload Prediction for Next 30 Days
Current/Predicted CPU Usage (Daily)
Current/Predicted CPU Usage (Weekly)
Current/Predicted CPU Usage (Monthly)
Current/Predicted Memory Usage (Daily)
Current/Predicted Memory Usage (Weekly)
Current/Predicted Memory Usage (Monthly)
Current/Desired/Recommended Replicas
Memory Usage/Request/Limit vs Rec Memory Limit
CPU Usage/Request/Limit vs Rec CPU Limit
CPU Usage/Limit Utilization
ProphetStor Federator.ai Kafka Overview
Recommended Replicas vs Current/Desired Replicas
Production vs Consumption vs Production Prediction
Kafka Consumer Lag
Consumer Queue Latency (msec)
Deployment Memory Usage
Deployment CPU Usage
ProphetStor Federator.ai Multi-Cloud Cost Analysis Overview
Current Cluster Cost and Current Cluster Configuration
Recommended Cluster - AWS and Recommended Cluster Configuration - AWS
Recommended Cluster - Azure and Recommended Cluster Configuration - Azure
Recommended Cluster - GCP and Recommended Cluster Configuration - GCP
Namespace with Highest Cost ($/day)
Namespace with Highest Predicted Cost ($/month)
Log in to your OpenShift/Kubernetes cluster
Install Federator.ai for OpenShift/Kubernetes with the following command:
$ curl https://raw.githubusercontent.com/containers-ai/prophetstor/master/deploy/federatorai-launcher.sh | bash
$ curl https://raw.githubusercontent.com/containers-ai/prophetstor/master/deploy/federatorai-launcher.sh | bash
...
Please enter Federator.ai version tag [default: latest]:latest
Please enter the path of Federator.ai directory [default: /opt]:
Downloading v4.5.1-b1562 tgz file ...
Done
Do you want to use a private repository URL? [default: n]:
Do you want to launch Federator.ai installation script? [default: y]:
Executing install.sh ...
Checking environment version...
...Passed
Enter the namespace you want to install Federator.ai [default: federatorai]:
.........
Downloading Federator.ai alamedascaler sample files ...
Done
========================================
Which storage type you would like to use? ephemeral or persistent?
[default: persistent]:
Specify log storage size [e.g., 2 for 2GB, default: 2]:
Specify AI engine storage size [e.g., 10 for 10GB, default: 10]:
Specify InfluxDB storage size [e.g., 100 for 100GB, default: 100]:
Specify storage class name: managed-nfs-storage
Do you want to expose dashboard and REST API services for external access? [default: y]:
----------------------------------------
install_namespace = federatorai
storage_type = persistent
log storage size = 2 GB
AI engine storage size = 10 GB
InfluxDB storage size = 100 GB
storage class name = managed-nfs-storage
expose service = y
----------------------------------------
Is the above information correct [default: y]:
Processing...
(snipped)
.........
All federatorai pods are ready.
========================================
You can now access GUI through https://<YOUR IP>:31012
Default login credential is admin/admin
Also, you can start to apply alamedascaler CR for the target you would like to monitor.
Review administration guide for further details.
========================================
========================================
You can now access Federatorai REST API through https://<YOUR IP>:31011
The default login credential is admin/admin
The REST API online document can be found in https://<YOUR IP>:31011/apis/v1/swagger/index.html
========================================
Install Federator.ai v4.5.1-b1562 successfully
Downloaded YAML files are located under /opt/federatorai/installation
Downloaded files are located under /opt/federatorai/repo/v4.5.1-b1562
Verify Federator.ai pods are running properly.
$ kubectl get pod -n federatorai
Log in to Federator.ai GUI, URL and login credential could be found in the output of Step 2.
Log in to Datadog with your account and get an API key and application key for using the Datadog API.
Configure Federator.ai for the metrics data source per cluster.
See the Federator.ai - Installation and Configuration Guide and User Guide for more details.
federatorai.integration.status (gauge) | integration status for showing Federator.ai health status. |
federatorai.recommendation (gauge) | recommended deployment/statefulset replicas. |
federatorai.prediction.kafka (gauge) | Workload prediction for Kafka metrics. |
federatorai.kafka.broker_offset_rate (gauge) | The delta of kafka.broker_offset timeseries in one minute. |
federatorai.kafka.consumer_offset_rate (gauge) | The delta of kafka.consumer_offset timeseries in one minute. |
federatorai.prediction.node (gauge) | Workload prediction for a Kubernetes node. |
federatorai.prediction.node.avg (gauge) | The average value of workload predictions for a Kubernetes node over a prediction window. |
federatorai.prediction.node.min (gauge) | The minimum value of workload predictions for a Kubernetes node over a prediction window. |
federatorai.prediction.node.max (gauge) | The maximum value of workload predictions for a Kubernetes node over a prediction window. |
federatorai.prediction.controller (gauge) | Workload prediction for a specific controller |
federatorai.prediction.controller.avg (gauge) | The average value of workload predictions for a specific controller over a prediction window. |
federatorai.prediction.controller.min (gauge) | The minimum value of workload predictions for a specific controller over a prediction window. |
federatorai.prediction.controller.max (gauge) | The maximum value of workload predictions for a specific controller over a prediction window. |
federatorai.prediction.nginx_ingress_controller_request_rate (gauge) | Workload prediction of request rate for the upstream service of Nginx ingress |
federatorai.resource_planning.node (gauge) | Workload predictions for resource planning of a Kubernetes node. |
federatorai.resource_planning.controller (gauge) | Workload predictions for resource planning of a Kubernetes controller. |
federatorai.recommendation.instance (gauge) | Cost of a recommended cloud instance. |
federatorai.cost_analysis.instance.cost (gauge) | Cost analysis for a cloud instance. |
federatorai.cost_analysis.namespace.cost (gauge) | Cost analysis for a namespace in a Kubernetes cluster |
federatorai.prediction.namespace.cost (gauge) | Cost prediction for a namespace in a Kubernetes cluster |
federatorai.kubernetes.cpu.usage.total.controller (gauge) | The number of cores (in millicore) used by the Kubernetes controller. |
federatorai.kubernetes.memory.usage.controller (gauge) | The memory usage (in bytes) of the Kubernetes controller. |
federatorai.kubernetes.cpu.usage.total.node (gauge) | The number of cores (in millicore) used by the Kubernetes node. |
federatorai.kubernetes.memory.usage.node (gauge) | The memory usage (in bytes) of the Kubernetes node. |
federatorai.cost_analysis.resource_alloc_cost.cluster (gauge) | The cost per hour/per 6 hours/per day based on resource allocation of a Kubernetes cluster for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.resource_alloc_cost.node (gauge) | The cost per hour/per 6 hours/ per day based on resource allocation of a Kubernetes node for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.resource_alloc_cost.namespace (gauge) | The cost per hour/per 6 hours/per day based on resource allocation of a Kubernetes namespace for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.resource_usage_cost.cluster (gauge) | The cost per hour/per 6 hours/per day based on resource usage of a Kubernetes cluster for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.resource_usage_cost.node (gauge) | The cost per hour/per 6 hours/per day based on resource usage of a Kubernetes node for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.resource_usage_cost.namespace (gauge) | The cost per hour/per 6 hours/per day based on resource usage of a Kubernetes namespace for daily/weekly/monthly cost analysis |
federatorai.cost_analysis.cost_per_day.cluster (gauge) | The cost of the entire 24 hours based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_per_day.node (gauge) | The cost of the entire 24 hours based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_per_day.namespace (gauge) | The cost of the entire 24 hours based on resource allocation of a Kubernetes namespace |
federatorai.cost_analysis.cost_per_week.cluster (gauge) | The cost of the entire 7 days based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_per_week.node (gauge) | The cost of the entire 7 days based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_per_week.namespace (gauge) | The cost of the entire 7 days based on resource allocation of a Kubernetes namespace |
federatorai.cost_analysis.cost_per_month.cluster (gauge) | The cost of the entire 30 days based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_per_month.node (gauge) | The cost of the entire 30 days based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_per_month.namespace (gauge) | The cost of the entire 30 days based on resource allocation of a Kubernetes namespace |
federatorai.cost_analysis.cost_efficiency_per_day.cluster (gauge) | The cost efficiency for the entire 24 hours based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_efficiency_per_day.node (gauge) | The cost efficiency for the entire 24 hours based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_efficiency_per_day.namespace (gauge) | The cost efficiency for the entire 24 hours based on resource allocation of a Kubernetes namespace |
federatorai.cost_analysis.cost_efficiency_per_week.cluster (gauge) | The cost efficiency for the entire 7 days based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_efficiency_per_week.node (gauge) | The cost efficiency for the entire 7 days based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_efficiency_per_week.namespace (gauge) | The cost efficiency for the entire 7 days based on resource allocation of a Kubernetes namespace |
federatorai.cost_analysis.cost_efficiency_per_month.cluster (gauge) | The cost efficiency for the entire 30 days based on resource allocation of a Kubernetes cluster |
federatorai.cost_analysis.cost_efficiency_per_month.node (gauge) | The cost efficiency for the entire 30 days based on resource allocation of a Kubernetes node |
federatorai.cost_analysis.cost_efficiency_per_month.namespace (gauge) | The cost efficiency for the entire 30 days based on resource allocation of a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_per_day.cluster (gauge) | The estimated cost of the entire 24 hours based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_per_day.node (gauge) | The estimated cost of the entire 24 hours based on Federator.ai recommendation for a Kubernetes node |
federatorai.recommendation.cost_analysis.cost_per_day.namespace (gauge) | The estimated cost of the entire 24 hours based on Federator.ai recommendation for a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_per_week.cluster (gauge) | The estimated cost of the entire 7 days based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_per_week.node (gauge) | The estimated cost of the entire 7 days based on Federator.ai recommendation for a Kubernetes node |
federatorai.recommendation.cost_analysis.cost_per_week.namespace (gauge) | The estimated cost of the entire 7 days based on Federator.ai recommendation for a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_per_month.cluster (gauge) | The estimated cost of the entire 30 days based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_per_month.node (gauge) | The estimated cost of the entire 30 days based on Federator.ai recommendation for a Kubernetes node |
federatorai.recommendation.cost_analysis.cost_per_month.namespace (gauge) | The estimated cost of the entire 30 days based on Federator.ai recommendation for a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_efficiency_per_day.cluster (gauge) | The cost efficiency for the entire 24 hours based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_efficiency_per_day.namespace (gauge) | The cost efficiency for the entire 24 hours based on Federator.ai recommendation for a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_efficiency_per_week.cluster (gauge) | The cost efficiency for the entire 7 days based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_efficiency_per_week.namespace (gauge) | The cost efficiency for the entire 7 days based on Federator.ai recommendation for a Kubernetes namespace |
federatorai.recommendation.cost_analysis.cost_efficiency_per_month.cluster (gauge) | The cost efficiency for the entire 30 days based on Federator.ai recommendation for a Kubernetes cluster |
federatorai.recommendation.cost_analysis.cost_efficiency_per_month.namespace (gauge) | The cost efficiency for the entire 30 days based on Federator.ai recommendation for a Kubernetes namespace |
Federator.ai does not include any service checks.
Federator.ai does not include any events.
Need help? Read the Federator.ai - Installation and Configuration Guide or contact Datadog support.