Compliance

Harden load balancers (ALB/NLB/CLB)

One capability across Application and Classic Load Balancers and the Auto Scaling groups behind them: reject malformed HTTP, drain connections cleanly, balance traffic evenly and replace genuinely-broken instances, mostly through single attribute flips.

13 min·10 sections·AWS

Last reviewed 16 June 2026

Remediates AWS Security Hub: AutoScaling.1 ELB.4 ELB.7 ELB.9 ELB.12 ELB.14

Hardening load balancers: the basics

Why the front door has several settings that decide how safely and evenly it behaves

A load balancer is the front door of an internet-facing service: it terminates connections, parses HTTP, and forwards each request to a target. The way it parses, drains, and distributes that traffic is governed by a handful of attributes, and the defaults, especially on older load balancers, lean towards permissiveness and convenience rather than safety and efficiency. An Application Load Balancer (ALB) created before AWS tightened the defaults may forward malformed headers or sit in a loose HTTP parsing mode; a Classic Load Balancer (CLB) may cut live connections on every deploy or spread traffic unevenly across zones; an Auto Scaling group behind a load balancer may ignore the load balancer's own health signal entirely.

AWS Security Hub turns each weak default into its own control, which is why a single perimeter can fail several load-balancer checks at once. ELB.4 fails an ALB that forwards invalid HTTP headers; ELB.12 fails an ALB whose desync mitigation mode is monitor rather than defensive or strictest; ELB.14 is the same desync check for Classic Load Balancers; ELB.7 fails a CLB without connection draining; ELB.9 fails a CLB without cross-zone load balancing; AutoScaling.1 fails an Auto Scaling group behind a load balancer that still uses EC2-only health checks. They look like separate problems on the report, but they are one capability: make the front door reject ambiguous traffic, hand off cleanly, balance evenly, and act on the health signal it already has.

The reason these are worth closing as a group is the cost-to-fix asymmetry. Almost every one is a single attribute change with no infrastructure cost, no downtime and no redeploy, while the downsides range from request smuggling (a breach-class attack) to dropped customer requests, lopsided over-provisioning and broken instances that never get replaced. The job is to inventory every load balancer and Auto Scaling group, flip the safe values, then make those the defaults so new load balancers are born compliant.

In this lesson you will learn how a load balancer's attributes govern HTTP parsing, connection handling, traffic distribution and health checking, how to find every weakly-configured load balancer and Auto Scaling group in an account, and how to harden them with single attribute changes that take effect with no downtime. The Controls this lesson covers section lists every Security Hub control in this capability, each linking to a deep page with the exact check and a copy-and-paste fix.

Fun fact

Seventy thousand dollars for one malformed header

In 2019 James Kettle of PortSwigger published the HTTP desync research that put request smuggling on every security team's radar. Within months bug-bounty programmes at major vendors were paying out five-figure rewards, Kettle himself collected 70,000 dollars from a single programme for a smuggling chain that hijacked admin sessions. The attack needs nothing more than a front-end load balancer and a back-end server disagreeing about which header decides where a request ends. AWS shipped the drop_invalid_header_fields attribute and the desync mitigation modes shortly after, and made the safe values the default for new load balancers. Older ones, or ones explicitly set loose, are exactly what ELB.4, ELB.12 and ELB.14 hunt for.

Finding weakly-configured load balancers across an estate

Priya is auditing the perimeter of a fintech account ahead of a SOC 2 renewal. Security Hub has flagged several ALBs and CLBs across the production account, all public-facing, all in front of authenticated traffic.

Rather than work the findings one by one, she sweeps every ALB in the region and reads the two HTTP-hardening attributes together, so she can fix the whole cohort in one change window.

Sweep every Application Load Balancer and read its desync mode and invalid-header setting together. monitor mode fails ELB.12; a false invalid-header flag fails ELB.4.

$ for arn in $(aws elbv2 describe-load-balancers --query 'LoadBalancers[?Type==`application`].LoadBalancerArn' --output text); do echo "$arn"; aws elbv2 describe-load-balancer-attributes --load-balancer-arn "$arn" --query 'Attributes[?Key==`routing.http.desync_mitigation_mode` || Key==`routing.http.drop_invalid_header_fields.enabled`].Value' --output text; done

arn:...:loadbalancer/app/prod-api/abcd1234 monitor false

arn:...:loadbalancer/app/prod-auth/1122aabb monitor false

arn:...:loadbalancer/app/prod-internal/77ee88ff defensive true

# Two public ALBs are in monitor mode and forwarding invalid headers. Fix the cohort in one window.

One pass over the fleet shows which ALBs fail ELB.12 and ELB.4 together, exactly the cohort Security Hub is reporting.

How load-balancer hardening actually worksdeep dive

Most of these controls read a single attribute. On an ALB, routing.http.drop_invalid_header_fields.enabled must be true (ELB.4) and routing.http.desync_mitigation_mode must be defensive or strictest, not monitor (ELB.12). The desync mode is the broader defence: monitor logs ambiguous requests but forwards them, defensive (the AWS default) routes conformant requests and blocks the clearly-malicious ones, strictest rejects anything that is not strictly RFC 7230 compliant. On a Classic Load Balancer the equivalent desync attribute is elb.http.desyncmitigationmode (ELB.14), plus connection draining (ELB.7) and cross-zone load balancing (ELB.9). AutoScaling.1 reads a group's HealthCheckType, which must be ELB rather than EC2 when the group sits behind a load balancer.

The attribute changes are instant and non-disruptive. Flipping the invalid-header or desync attribute on a live ALB does not restart the listener, drain connections or reset in-flight requests; the only side effect is that genuinely malformed requests start being rejected, and in practice the legitimate-use fallout has been essentially zero. Connection draining and cross-zone balancing on a CLB propagate with no downtime, and cross-zone balancing carries no per-request charge on a Classic Load Balancer. AutoScaling.1 is the one with an operational caveat: switching to ELB health checks delegates termination to the load balancer's probe, so confirm that probe reflects real application health and set a HealthCheckGracePeriod above the instance cold-start time, or every fresh instance is killed before it finishes booting.

Security Hub evaluates these change-triggered through AWS Config (alb-http-drop-invalid-header-enabled, alb-desync-mode-check, autoscaling-group-elb-healthcheck-required and siblings), so a fix flips the finding to PASSED on the next evaluation. Note one relationship: AWS recommends disabling ELB.4 once ELB.12 is enabled, because the desync modes supersede plain invalid-header dropping. The durable answer is to encode the safe attribute values in your Terraform or CDK load-balancer module so new load balancers are born compliant, backed by the Config rules to catch any drift.

What is the impact of leaving load balancers unhardened?

The headline impact is HTTP request smuggling. An ALB that forwards invalid headers or sits in monitor mode, or a CLB on the loose desync setting, can let an attacker who finds a parsing disagreement between the load balancer and the backend prepend a hidden request to the next legitimate connection. In published exploits this has been used to steal session cookies, escalate to admin endpoints, and poison shared caches so every subsequent user is served attacker content, all without exploiting a single line of application code. That is a board-level event: customer data exposure, regulatory notification and reputational damage.

The reliability impacts are quieter but real. A Classic Load Balancer without connection draining cuts live requests on every deploy and scale-in, so routine operational events drop a fraction of customer traffic and cluster support tickets around deployment windows. Without cross-zone balancing, instances in one zone run hot while another zone coasts, and teams over-provision the whole fleet to compensate, paying for capacity they already had spare. An Auto Scaling group on EC2-only health checks keeps a broken-but-powered-on instance in the desired-capacity count forever: you pay for a zombie that serves no traffic, and your effective capacity quietly drops below what you provisioned, exactly when a traffic peak makes it an incident.

On the compliance side, ELB.4, ELB.12 and ELB.14 map to NIST 800-53 and PCI DSS v4.0.1 requirements for protecting public-facing web applications, so a load balancer on a permissive setting is documented evidence the perimeter accepts non-RFC traffic. The remediation, by contrast, is mostly a single attribute change. Few controls have a cleaner cost-benefit profile, near-zero effort against breach-class, revenue and availability downsides.

How do you harden load balancers safely?

Work the capability as one loop rather than chasing individual findings. Most steps are a single attribute flip; the two with operational caveats (the desync modes and ELB health checks) just need a quick check before you change them.

1. Inventory every load balancer and Auto Scaling group

Across every region and account, list Application Load Balancers (read drop_invalid_header_fields and desync_mitigation_mode), Classic Load Balancers (read the desync mode, connection draining and cross-zone balancing) and Auto Scaling groups behind a load balancer (read HealthCheckType and HealthCheckGracePeriod). Do not trust the Security Hub count alone, run the describe sweep yourself, because older load balancers and ones created from older IaC modules default to the unsafe values. Record the ARN, account, region and the service each fronts.

2. Confirm the few changes that have a caveat

Most flips are safe to apply blind. Two are not. For the desync modes, check the ALB access-log classification field over recent traffic: only Compliant and Acceptable classifications mean defensive is safe, while Ambiguous or Severe entries mean some (usually legacy) client sends non-conformant requests, investigate before blocking. For AutoScaling.1, confirm the target group's health probe reflects real application health and set a grace period above the cold-start time before switching to ELB checks, or fresh instances are killed in a loop.

3. Flip the safe values, highest impact first

Set drop_invalid_header_fields to true and desync_mitigation_mode to defensive on ALBs, the desync mode to defensive on CLBs, enable connection draining and cross-zone balancing on CLBs, and switch load-balanced Auto Scaling groups to ELB health checks with a sensible grace period. Prioritise the HTTP-hardening controls on public, authenticated services first, since those carry the breach risk. Each change is a single API call per resource and takes effect immediately with no downtime. After enabling cross-zone balancing or ELB health checks, right-size the fleet so you capture the saving rather than leaving the headroom running.

4. Prevent recurrence with AWS Config and IaC defaults

Cleanup without prevention just resets the clock. Enable the AWS Config managed rules (alb-http-drop-invalid-header-enabled, alb-desync-mode-check, autoscaling-group-elb-healthcheck-required and the CLB equivalents) so any new non-compliant resource raises an event within minutes, and set the safe attribute values as defaults in your Terraform or CDK modules with a CI check that refuses a definition leaving them unset. New load balancers are then compliant from creation.

# Harden every Application Load Balancer in the region: reject invalid headers and
# require defensive (or strictest) desync mode. Both are instant, non-disruptive flips.
for arn in $(aws elbv2 describe-load-balancers \
    --query 'LoadBalancers[?Type==`application`].LoadBalancerArn' --output text); do
  aws elbv2 modify-load-balancer-attributes --load-balancer-arn "$arn" \
    --attributes \
      Key=routing.http.drop_invalid_header_fields.enabled,Value=true \
      Key=routing.http.desync_mitigation_mode,Value=defensive
  echo "$arn: hardened"
done

# Switch load-balanced Auto Scaling groups to ELB health checks with a safe grace period
# (confirm the target-group probe reflects real app health first).
for g in $(aws autoscaling describe-auto-scaling-groups \
    --query 'AutoScalingGroups[?(LoadBalancerNames!=`[]` || TargetGroupARNs!=`[]`) && HealthCheckType==`EC2`].AutoScalingGroupName' \
    --output text); do
  aws autoscaling update-auto-scaling-group --auto-scaling-group-name "$g" \
    --health-check-type ELB --health-check-grace-period 300
  echo "$g: now using ELB health checks"
done

Quick quiz

Question 1 of 5

Security Hub shows ELB.4, ELB.12, ELB.7, ELB.9 and AutoScaling.1 findings across the perimeter. What is the most efficient way to think about them?

Keep learning

Go deeper on the load-balancer controls in this capability, the attacks they prevent, and how to enforce the safe values.

You can now treat load-balancer hardening as one capability rather than a scatter of findings: inventory every load balancer and Auto Scaling group, confirm the two changes that carry a caveat, flip the safe attribute values highest-impact first, and prevent recurrence with Config rules and IaC defaults. The Controls this lesson covers section below links every control in this group to its deep page and fix.

Back to the library

Hardening load balancers: the cost and risk view

Mostly free configuration flips against a mix of breach risk, dropped revenue and over-provisioning

Load balancers sit in front of every internet-facing application and decide how traffic is parsed, handed off and spread. Nearly every control in this group is a configuration change with no AWS spend: rejecting malformed headers, tightening the HTTP parsing mode, draining connections and acting on the load balancer's health signal are all zero-cost attribute flips. Two of them, cross-zone balancing (ELB.9) and ELB health checks on Auto Scaling groups (AutoScaling.1), actually point in the same direction as cost optimisation, because they stop teams over-provisioning to work around uneven or zombie capacity.

Frame each failing control by the downside it removes, not by its severity label. ELB.4 and ELB.12/ELB.14 close HTTP request smuggling, a breach-class attack with documented six-figure bounty payouts and real authentication-bypass incidents; ELB.7 stops routine deploys dropping live customer requests; ELB.9 and AutoScaling.1 are availability-and-waste items where the real exposure shows up during a traffic peak. Several map to PCI DSS and NIST controls, so an open finding is also an audit item.

Because the fixes are free, the metric that matters is how fast they close, not their dollar cost. A zero-cost, audit-mapped, breach-class fix aging in a backlog is a process signal: the same backlog discipline that lets a free fix linger is what lets the expensive risks linger. The finance role is to make sure these are closed promptly across the whole estate and prevented from recurring with an IaC default.

This lesson is for the finance partner who sees a cluster of load-balancer findings on the security report and wants to know what the right response is and what it costs. It covers why nearly all of these controls are free to fix, which two also reduce compute spend, why an open finding here can be a breach, revenue or audit cost, and how to make sure the whole estate is closed and kept closed rather than chased a finding at a time.

Fun fact

Seventy thousand dollars for one malformed header

How a finance partner frames the load-balancer hardening decision

Priya is the finance partner reviewing the perimeter findings ahead of the SOC 2 renewal. Security Hub has flagged several ALBs and CLBs (ELB.4, ELB.12, ELB.7, ELB.9) plus an AutoScaling.1 finding. Her instinct is not to weigh these against budget, because nearly every one is a zero-cost attribute flip with no new resources and no usage charge. Her question is which of these removes a balance-sheet downside, and which two might actually reduce the bill.

She sorts the cohort by downside rather than by severity label. ELB.4 and ELB.12 close HTTP request smuggling, a breach-class attack with documented six-figure bounty payouts and real authentication-bypass incidents, so they go on the risk register at full weight even though one is labelled Medium. ELB.9 and AutoScaling.1 she tags as availability-and-waste items that point the same way as cost optimisation, because both stop teams over-provisioning to paper over uneven load or zombie instances. Her output is one line for the finance pack: the fixes are free, two of them save compute, and the only metric that matters is how fast a zero-cost breach-class fix closes, because a free fix aging in a backlog is a process signal.

Why load-balancer hardening belongs on the risk register

The cost model here is unusually one-sided. Remediation is near-zero, no new AWS resources, no usage charge, no downtime, just a sliver of engineering time, and two of the controls (cross-zone balancing and ELB health checks) actually reduce compute spend by removing the reason teams over-provision. So this category rarely competes for budget; it competes only for attention.

The risk side is concrete. Request smuggling has been used to bypass authentication and hijack sessions at scale, with documented bounty payouts in the tens of thousands and real breaches behind them; for a payments or healthcare application that triggers regulatory notification, forensics and customer remediation costs orders of magnitude larger than the free fix. The reliability controls turn into incident cost during a traffic peak, when the gap between provisioned and serving capacity becomes customer-facing. The finance contribution is to ensure these appear on the risk register at the right size, so a free fix is not deprioritised just because it is labelled Medium or Low.

Because the fixes are free, the metric to watch is aging, not dollars. A zero-cost, audit-mapped fix sitting open for weeks is a process signal worth raising. Pair the cleanup with an IaC default and a Config rule so the finding never returns, which is cheaper than re-remediating it every audit cycle.

What finance can drive on load-balancer hardening

Finance cannot flip the attributes, but it owns the framing that keeps free, breach-class fixes from rotting in a backlog because their severity label reads Medium. Three levers, used at the regular review.

1. Put each control on the register at the size of its downside, not its severity label

Cross-reference each finding against what it actually exposes. ELB.4 and ELB.12/ELB.14 close request smuggling, a breach-class attack with regulatory-notification and forensics costs orders of magnitude larger than the free fix, so they belong high on the register even when labelled Medium. ELB.7 is dropped customer revenue on every deploy; ELB.9 and AutoScaling.1 are availability-and-waste. Sizing by downside stops a free, breach-class fix being deprioritised behind a louder but cheaper risk.

2. Capture the two controls that are also cost savings

ELB.9 (cross-zone balancing) and AutoScaling.1 (ELB health checks) point the same way as cost optimisation, because both remove the reason teams over-provision: uneven load and zombie instances paid for but serving no traffic. Insist that after these are enabled the fleet is right-sized so the saving is actually captured rather than left as idle headroom. This is the rare security fix with a positive cash line attached, and finance should make sure it is realised.

3. Track aging, not dollars, and fund the IaC default

Because the fixes are free, the metric to watch is how long a finding sits open, not what it costs. A zero-cost, audit-mapped, breach-class fix aging for weeks is a process signal that the same backlog discipline is letting expensive risks linger too. Approve the one-time work on sight and fund the IaC default plus the Config rule, because encoding the safe values in the Terraform or CDK module is cheaper than re-remediating the same finding every audit cycle.

Quick quiz

Question 1 of 5

Why does load-balancer hardening rarely compete for budget?

Keep learning

Go deeper on the load-balancer controls in this capability, the attacks they prevent, and how to enforce the safe values.

You have finished the finance view of load-balancer hardening. You know the cost model is one-sided (free to fix, breach- or revenue-class to ignore), that two of the controls (cross-zone balancing and ELB health checks) actually reduce compute spend by removing the reason teams over-provision, and that the metric to watch is aging rather than dollars, because a free breach-class fix left open is a process signal. Next time a cluster of ELB findings lands, you will size them by downside and make sure the IaC default closes them for good.

Back to the library

Hardening load balancers: the headline

Whether the perimeter rejects bad traffic and behaves predictably, by default

Every internet-facing application sits behind a load balancer that acts as the first filter for inbound traffic. Some of ours are configured permissively: forwarding malformed requests that enable a recognised attack class (request smuggling), cutting live customer connections during routine deploys, spreading load unevenly so we over-provision to compensate, or failing to replace servers that have actually broken. The report shows these as separate findings across our load balancers and the auto-scaling groups behind them.

Almost all of these are free configuration changes with no downtime. A few of them save money as a side effect by letting the same workload run on fewer instances. The reason they are worth leadership awareness is the asymmetry: a one-line change versus a breach, dropped revenue, or an availability incident at the worst moment.

The defensible end state is that every load balancer rejects ambiguous traffic, hands off and balances cleanly, and acts on real application health, with new ones born that way through our infrastructure templates. The leadership question is whether we have a process that closes free, important fixes promptly and prevents them from recurring.

A short read for the leader who needs to know what a permissively-configured perimeter exposes, why hardening it is mostly free, and what a defensible end state, every load balancer compliant and new ones born that way, looks like across the estate.

Fun fact

Seventy thousand dollars for one malformed header

What it looks like when the perimeter is hardened by default, not by exception

After a vendor in the sector disclosed an authentication bypass traced to an HTTP request-smuggling chain, the CTO asked a blunt question: could a single malformed header get past our front door? The honest answer was maybe, because some public ALBs were still in monitor mode and forwarding invalid headers, configurations that predated AWS tightening the defaults and had never been revisited.

The leadership response was to treat the perimeter as a default rather than a per-load-balancer task. The HTTP-hardening attributes were flipped to defensive across the public, authenticated services first, the two changes with caveats (the desync mode and ELB health checks) were checked against access logs and probe health before being applied, and the safe values were baked into the Terraform load-balancer module so new load balancers are born compliant. Two months on, the same question had a clear answer backed by an enforced SLA on free-and-important security fixes and a Config rule that catches any drift, which is the confidence signal that belongs on an operational review.

Why this is on the report at all

The dollar cost of fixing these is effectively zero, which is exactly why they deserve a moment of attention. When a recognised attack class, or a recurring drop in customer requests, can be closed by a single configuration change at no cost, the only reason it stays open is organisational, it got lost in a backlog or no one owns it. That, not the load-balancer setting itself, is the real signal.

There is a compliance and reputational dimension too. The HTTP-hardening controls map to PCI DSS and NIST, so leaving them open creates an audit finding on infrastructure that handles your customers' traffic, and the downside scenario, request smuggling leading to session hijacking or mass cache poisoning, is the kind of incident that makes headlines and breaks trust. The leadership move is to ensure free-and-important security fixes have a fast, enforced SLA, and that new load balancers are born compliant.

The leadership move on load-balancer hardening

The executive handle is an enforced SLA on free-and-important fixes plus a born-compliant default, not a per-load-balancer approval. When a one-line change closes a recognised attack class at no cost, the only reason it stays open is organisational.

1. Set a fast, enforced deadline for free breach-class fixes

The dollar cost of these is effectively zero, which is precisely why they earn attention: a free, breach-class fix sitting open means it got lost in a backlog or nobody owns it, and that, not the load-balancer setting, is the real signal. Require a short, enforced SLA for free-and-important security fixes, anchored to the next audit window, so the HTTP-hardening controls on public authenticated services close in days rather than drifting between reviews.

2. Require new load balancers to be born compliant

Cleanup without prevention just resets the clock. Mandate that the safe attribute values (invalid-header dropping, the defensive desync mode, connection draining, cross-zone balancing, ELB health checks) are encoded as defaults in the Terraform or CDK module with a CI check that refuses a definition leaving them unset, backed by the AWS Config managed rules to catch drift. New load balancers are then compliant from creation rather than remediated after a Security Hub finding.

3. Demand a checked change for the two controls with caveats, and a recorded exception otherwise

Two changes are not blind flips: the desync mode needs the ALB access-log classification confirmed clear of Ambiguous and Severe entries, and ELB health checks need the target-group probe and grace period confirmed before switching, or fresh instances are killed in a loop. Require evidence these checks were done. Any load balancer intentionally left on a permissive setting (a legacy client genuinely needs loose parsing) carries a recorded, time-bounded exception, never a silently ignored finding.

Quick quiz

Question 1 of 5

Why does a free, breach-class load-balancer fix that stays open earn leadership attention?

Keep learning

Go deeper on the load-balancer controls in this capability, the attacks they prevent, and how to enforce the safe values.

Two takeaways: a permissively-configured perimeter is a one-line change away from a recognised attack class, dropped revenue or an availability incident, and the fact the fix is free is exactly why an open finding is a governance signal rather than a budget one. Set a fast enforced SLA on free-and-important fixes, require new load balancers to be born compliant through the IaC module, and the perimeter hardens by default rather than by exception.

Back to the library

Controls this lesson covers

One capability, many AWS Security Hub controls. This lesson is the shared playbook; each control below keeps its own deep page with the exact check, severity and a copy-and-paste fix.

AutoScaling

AutoScaling.1 Low ASGs with an LB should use ELB health checks

ELB

Part of the learning path Build in resilience

Harden load balancers (ALB/NLB/CLB)

Hardening load balancers: the basics

Seventy thousand dollars for one malformed header

Finding weakly-configured load balancers across an estate

How load-balancer hardening actually worksdeep dive

What is the impact of leaving load balancers unhardened?

How do you harden load balancers safely?

1. Inventory every load balancer and Auto Scaling group

2. Confirm the few changes that have a caveat

3. Flip the safe values, highest impact first

4. Prevent recurrence with AWS Config and IaC defaults

Quick quiz

Keep learning

Hardening load balancers: the cost and risk view

Seventy thousand dollars for one malformed header

How a finance partner frames the load-balancer hardening decision

Why load-balancer hardening belongs on the risk register

What finance can drive on load-balancer hardening

1. Put each control on the register at the size of its downside, not its severity label

2. Capture the two controls that are also cost savings

3. Track aging, not dollars, and fund the IaC default

Quick quiz

Keep learning

Hardening load balancers: the headline

Seventy thousand dollars for one malformed header

What it looks like when the perimeter is hardened by default, not by exception

Why this is on the report at all

The leadership move on load-balancer hardening

1. Set a fast, enforced deadline for free breach-class fixes

2. Require new load balancers to be born compliant

3. Demand a checked change for the two controls with caveats, and a recorded exception otherwise

Quick quiz

Keep learning

Controls this lesson covers

AutoScaling

ELB

Related compliance lessons