Alert Fatigue Is Not a Tool Problem. It Is a Prioritisation Problem.

Your team has too many alerts. But the real problem is not the volume. It is that severity alone is not enough context to know which ones actually matter right now.

Gourav Das
CostObserver Team

Alert Fatigue Is Not a Tool Problem

An engineering team has 1,400 unresolved alerts in their security queue. Not because they are understaffed. Not because they do not care. Because nobody can tell them which of those 1,400 alerts is actively costing money right now.

That is the real alert fatigue problem. Not volume. Prioritisation.

Why Severity Alone Is Not Enough

Most security tools classify alerts by severity: Critical, High, Medium, Low. The assumption is that Critical alerts get addressed first and Low alerts can wait.

The problem is that severity is a measure of potential impact, not current financial impact. A Critical alert on a development environment with no customer data and no production access is less urgent than a Medium alert on a production workload that is currently generating $8,000 in anomalous spend per week.

Severity without financial context is noise with a scary label.

The IBM Cost of a Data Breach 2025 Report found that organisations using AI and automation in their security operations saw $1.9M lower average breach costs compared to organisations without those capabilities. The underlying reason, in our reading of the data, is better prioritisation of the signals that matter, not more alerts.

What a Better Triage Model Looks Like

Here is a concrete example of how cost context changes prioritisation.

Scenario: Two alerts arrive in the same hour.

Alert A: GuardDuty raises a Critical finding — Recon:IAMUser/MaliciousIPCaller. An IAM user is making API calls from a known malicious IP address. No corresponding cost anomaly. The IAM user has read-only permissions on a non-production account.

Alert B: AWS Cost Anomaly Detection flags a Medium anomaly — EC2 spend in eu-west-1 is 340% above the expected baseline. The account does not normally run workloads in eu-west-1. CloudTrail shows RunInstances calls from an IAM role that has not been used in 90 days.

Under a severity-only model, Alert A gets investigated first. It is Critical.

Under a cost-context model, Alert B gets investigated first. The financial impact is active and growing. Alert A is worth investigating, but the IAM user has limited permissions and no anomalous spend. Alert B has both a security signal and a cost signal pointing at the same resource.

That is the difference cost context makes in a triage workflow.
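To make the ranking concrete, here is a minimal sketch of a cost-context score applied to the two alerts above. The weights, field names, and dollar figure are illustrative only, not a product feature; the point is simply that an active cost anomaly outranks a severity label on its own.

```python
from dataclasses import dataclass

# Illustrative severity weights. The key idea: active anomalous spend
# contributes more to the score than any severity level by itself.
SEVERITY_WEIGHT = {"CRITICAL": 4, "HIGH": 3, "MEDIUM": 2, "LOW": 1}

@dataclass
class Alert:
    name: str
    severity: str                   # "CRITICAL" | "HIGH" | "MEDIUM" | "LOW"
    weekly_anomalous_spend: float   # USD attributed to the affected resource
    production: bool

def triage_score(alert: Alert) -> float:
    """Rank alerts by current financial impact first, severity second."""
    score = float(SEVERITY_WEIGHT[alert.severity])
    if alert.weekly_anomalous_spend > 0:
        # Illustrative weighting: any active spend jumps the queue, and
        # every $1,000/week of anomalous spend adds another point.
        score += 5 + alert.weekly_anomalous_spend / 1000
    if alert.production:
        score += 1
    return score

alerts = [
    Alert("GuardDuty Recon:IAMUser/MaliciousIPCaller", "CRITICAL", 0.0, False),
    Alert("Cost anomaly: EC2 eu-west-1 +340%", "MEDIUM", 8000.0, True),
]

for a in sorted(alerts, key=triage_score, reverse=True):
    print(f"{triage_score(a):6.1f}  {a.name}")
```

With any reasonable weighting of active spend, Alert B lands at the top of the queue even though its severity label is lower.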

Why Adding More Tools Does Not Fix This

The instinct when alert fatigue sets in is to buy a better alerting tool: a SIEM, a CSPM, a threat detection platform. Each new tool adds more alerts to the queue.

The threats behind the alerts are real. The Verizon 2025 Data Breach Investigations Report found ransomware present in 44% of the breaches it analysed. But more tools without a prioritisation model produce more noise, not better decisions. The missing layer is not another alert source. It is a filter that connects security signals to financial impact so the team knows which alerts are actively costing money right now.

A Practical Triage Pattern

This does not require new tooling. It requires connecting what you already have.

Step 1: When an alert fires, pull the cost context immediately

For the resource or IAM principal involved in the alert, check AWS Cost Explorer for the last 24 and 72 hours. Is there anomalous spend associated with this resource? If yes, the alert moves to the top of the queue regardless of severity classification.
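As a rough sketch of that check, assuming boto3 credentials are configured and the resource is attributable through a cost allocation tag (the tag key and value below are hypothetical), the lookup is a few lines against the Cost Explorer API. Note that Cost Explorer data can lag by up to about a day, so treat this as a fast signal rather than a real-time meter.

```python
# Sketch of Step 1: pull recent spend for the tagged resource an alert
# points at, then compare the latest day against the trailing average.
from datetime import date, timedelta
import boto3

ce = boto3.client("ce")  # Cost Explorer

def recent_spend(tag_key: str, tag_value: str, days: int = 3) -> float:
    """Sum unblended cost for a cost allocation tag over the last `days` full days."""
    end = date.today()                     # End date is exclusive
    start = end - timedelta(days=days)
    resp = ce.get_cost_and_usage(
        TimePeriod={"Start": start.isoformat(), "End": end.isoformat()},
        Granularity="DAILY",
        Metrics=["UnblendedCost"],
        Filter={"Tags": {"Key": tag_key, "Values": [tag_value]}},
    )
    return sum(
        float(day["Total"]["UnblendedCost"]["Amount"])
        for day in resp["ResultsByTime"]
    )

# Hypothetical tag: compare the last full day against the 3-day average.
last_3d = recent_spend("service", "checkout-api", days=3)
last_1d = recent_spend("service", "checkout-api", days=1)
if last_1d > (last_3d / 3) * 1.5:
    print("Anomalous spend on this resource: move the alert to the top of the queue.")
```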

Step 2: Set up AWS Cost Anomaly Detection as a parallel signal

AWS Cost Anomaly Detection uses machine learning to identify unusual spend patterns and can alert via email or SNS. Configure it to alert on anomalies by service, linked account, or cost category. When a cost anomaly fires, cross-reference it with GuardDuty findings and CloudTrail activity for the same time window.
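A minimal setup sketch with boto3, assuming an existing SNS topic whose policy allows Cost Anomaly Detection to publish to it; the topic ARN, names, and dollar threshold below are placeholders.

```python
# Sketch of Step 2: a per-service anomaly monitor plus an SNS subscription
# so anomalies arrive as machine-readable events you can cross-reference.
import boto3

ce = boto3.client("ce")

monitor = ce.create_anomaly_monitor(
    AnomalyMonitor={
        "MonitorName": "per-service-spend",
        "MonitorType": "DIMENSIONAL",
        "MonitorDimension": "SERVICE",   # one monitor across all AWS services
    }
)

ce.create_anomaly_subscription(
    AnomalySubscription={
        "SubscriptionName": "security-triage-feed",
        "MonitorArnList": [monitor["MonitorArn"]],
        "Subscribers": [
            {"Address": "arn:aws:sns:eu-west-1:123456789012:cost-anomalies",
             "Type": "SNS"},
        ],
        "Threshold": 100.0,     # ignore anomalies below $100 of impact (placeholder)
        "Frequency": "IMMEDIATE",
    }
)
```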

Step 3: Build a shared triage ritual

FinOps and security teams reviewing the same data once a week surfaces patterns neither team sees independently. A cost anomaly that the FinOps team assumed was a scaling issue often has a corresponding security finding that the security team deprioritised because it looked low-severity in isolation.

Note: The goal is not to make every cost anomaly a security investigation. Most cost spikes are not security incidents. The goal is to make sure that when a cost spike and a security signal point at the same resource, someone connects them before the incident compounds.
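As a sketch of what "connecting them" can look like in practice, assuming GuardDuty is enabled in the account, the snippet below pulls findings updated inside a given cost-anomaly window so a human can check whether the two signals point at the same resource. The example window is hypothetical.

```python
# Cross-check sketch: list GuardDuty findings whose last update falls
# inside the time window of a flagged cost anomaly.
from datetime import datetime, timezone
import boto3

gd = boto3.client("guardduty")

def findings_in_window(start: datetime, end: datetime) -> list[dict]:
    """Return GuardDuty findings updated inside the given window."""
    detector_id = gd.list_detectors()["DetectorIds"][0]
    criteria = {
        "Criterion": {
            "updatedAt": {  # GuardDuty expects epoch milliseconds
                "GreaterThanOrEqual": int(start.timestamp() * 1000),
                "LessThanOrEqual": int(end.timestamp() * 1000),
            }
        }
    }
    ids = gd.list_findings(
        DetectorId=detector_id, FindingCriteria=criteria
    )["FindingIds"]
    if not ids:
        return []
    # get_findings accepts up to 50 IDs per call; enough for a sketch.
    return gd.get_findings(DetectorId=detector_id, FindingIds=ids[:50])["Findings"]

# Hypothetical anomaly window flagged by Cost Anomaly Detection.
window_start = datetime(2025, 6, 1, tzinfo=timezone.utc)
window_end = datetime(2025, 6, 2, tzinfo=timezone.utc)
for f in findings_in_window(window_start, window_end):
    print(f["Severity"], f["Type"], f.get("Resource", {}).get("ResourceType"))
```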

The Operational Shift

Alert fatigue is a symptom of a prioritisation model that uses severity as a proxy for urgency. Severity measures potential impact. Cost context measures current impact.

The teams that resolve incidents fastest are not the ones with the fewest alerts. They are the ones who know which alerts are actively costing money right now and address those first.

That is the operational shift SecFinOps enables. Not fewer alerts. Better filters.

Start your free CostObserver beta — read-only access, no credit card, connects in minutes.