Threshold-Based Alerts with Edge Delta Monitors

Establish threshold-based alerts on aggregated metrics for faster recognition of issues and proactive response to incidents.

On the edge, the threshold node monitors the values of incoming metrics and triggers an alert signal if specified conditions are met, based on pre-defined limits. That signal can be consumed by various triggering destination nodes such as the webhook output.

Monitors, on the other hand, evaluate consolidated data across all edges. As systems grow in complexity, manually reviewing metrics becomes less feasible. Threshold-based alerting scales with the system, automatically monitoring numerous metrics across many resources or components.

Monitors

Monitors are back end application components that listen for specific events and then trigger notifications. Unlike Edge Delta Fleets, they reside in the centralized Edge Delta back end. This gives them access to aggregated data across all environments. They are similar in principle to threshold triggers in Pipeline configurations but they can monitor for conditions across all Fleets. In addition, they can monitor Fleets for issues with the agents themselves such as downed agents or crash loops.

Threshold-based alerts are a critical component of monitoring systems, automating the detection of anomalies and potential issues, and enabling teams to respond quickly and proactively.

Setting up threshold-based alerts turns raw data into actionable intelligence. When a metric crosses a predefined threshold, it signals that something unusual may be happening, warranting immediate attention. This could indicate a spike in error rates, a drop in throughput, or an abnormal resource consumption pattern.

By alerting on threshold breaches, you can address issues before they escalate into larger problems or outages. This proactive stance can help maintain service levels and business continuity. Threshold-based alerts can also help reduce alert fatigue by ensuring teams are only notified when something significant happens, as opposed to constant notifications for minor fluctuations in the data. This focused alerting helps maintain clarity and ensures that high-priority issues are given the attention they need.

Threshold alerts can also feed into capacity planning processes, revealing when resources are consistently hitting high utilization thresholds and may need to be scaled up to meet demand.

For compliance-heavy industries, evidence of proactive monitoring can be an important part of meeting regulatory requirements, showing that steps are in place to identify and address potential issues promptly.

To set thresholds:

  • Baseline Establishment: Initially, baseline metrics under typical operating conditions should be established to inform the setting of meaningful thresholds.
  • Contextual Relevance: Set thresholds based on the context of the system’s function, understanding that the same metric might have different threshold levels if the underlying system’s behavior is expected to change.
  • Iterative Refinement: Continuously refine thresholds based on historical data and as a response to observed incidents, to ensure that they remain relevant and effective.

To implement this best practice effectively, the thresholds must be carefully considered, taking into account the natural variability of the system and avoiding overly-sensitive settings that lead to frequent, inconsequential alerts. They should also be regularly reviewed and adjusted as systems and their workloads evolve.


Metric Threshold Monitor

Monitor metric thresholds in the Edge Delta web application.

Log Threshold Monitor

Monitor Log Thresholds in the Edge Delta web application.

Pattern Anomaly Monitor

Monitor Pattern Anomalies in the Edge Delta web application.

Monitor Notifications

Configure notifications for monitors in the Edge Delta web application.