Load balancer health checks failing for backend servers in VPC networking setup

brenda_data · November 24, 2024, 6:22am

Our IBM Cloud Load Balancer is marking all backend servers as unhealthy even though the application is running fine. When I curl the health check endpoint directly from within the VPC, it responds correctly with HTTP 200. The health check configuration in the load balancer shows:


Protocol: HTTP
Port: 8080
Path: /health
Interval: 10s
Timeout: 5s

All three backend servers are showing as “failing” in the load balancer dashboard, causing traffic disruption. We’ve verified the application logs and the /health endpoint is being hit and returning 200 OK responses. I’m wondering if this is related to our security group configuration or if there’s something wrong with how the load balancer is configured to reach the backend servers.

brenda_data · November 24, 2024, 8:03am

The most common cause of this issue is security group rules not allowing traffic from the load balancer to reach your backend servers. IBM Cloud Load Balancers use specific source IP ranges for health checks, and these need to be explicitly allowed in your backend server security groups. Check if your security group has an inbound rule allowing HTTP traffic on port 8080 from the load balancer subnet CIDR or from the IBM Cloud load balancer service IP ranges.

ruth_cloud · December 4, 2024, 7:18am

You need to allow traffic from the load balancer’s subnet CIDR to your backend servers. Since your load balancer is in 10.10.1.0/24, add an inbound rule to your backend server security group allowing TCP port 8080 from that CIDR. Also, make sure your backend servers can respond - check that there’s no overly restrictive outbound rule blocking responses back to the load balancer subnet. Health checks are bidirectional communication, so both inbound and outbound need to work.

michelle_coder · December 17, 2024, 12:30pm

If your application takes 3-4 seconds to respond and your health check timeout is set to 5 seconds, you’re cutting it very close. However, that wouldn’t cause all checks to fail consistently. Let me ask - did you verify the security group rule was actually applied and is in effect? Sometimes there’s a delay or the rule gets added to the wrong security group. Also, check if there’s a network ACL on your backend subnet that might be blocking traffic. Network ACLs are stateless and need both inbound and outbound rules configured correctly.

linda_wizard · November 27, 2024, 3:03pm

I checked our security groups and found that we only have rules allowing traffic from our office IP range and from within the VPC CIDR (10.10.0.0/16). The load balancer is in a different subnet (10.10.1.0/24) and the backend servers are in 10.10.2.0/24. Do I need to add a specific rule for the load balancer subnet, or is there a service IP range I should be using instead?

brenda_data · December 11, 2024, 7:44am

Another thing to verify is whether your application is actually listening on all network interfaces. Sometimes applications bind to localhost (127.0.0.1) instead of 0.0.0.0, which would explain why curl from within the same server works but external health checks fail. SSH into one of your backend servers and run netstat -tlnp | grep 8080 to see what interface your application is bound to. If it shows 127.0.0.1:8080, you need to configure your application to listen on 0.0.0.0:8080 instead.

Topic		Replies	Views
Container service ingress fails health check from OCI Load Balancer after updating backend subnet security rules Oracle Cloud question , networking , load-balancer , oci-2020 , container-servi , security-lists , health-check , ingress-controller , vcn-flow-logs	5	0	September 10, 2025
Firewall policy blocking internal VPC traffic between subnets in same zone IBM Cloud question , networking , security , ic-2019 , vpc-routing , ibm-cloud-firewall , internal-block , service-disruption , firewall-policy	5	0	November 25, 2025
Security group rules blocking SSH access to virtual server instances IBM Cloud question , compute , networking , ic-2021 , remote-access , security-group , ibm-cloud-cli , ssh-blocked , port-22	4	0	March 28, 2025
VPC firewall rule conflict blocks secure API access from on-premises IBM Cloud question , networking , security , hybrid-cloud , vpc , ic-2021 , firewall-rules , api-access	5	0	September 17, 2025
Database connection timeout when accessing Db2 across VPC peering link IBM Cloud question , networking , routing , database , ic-2021 , connection-timeout , security-groups , vpc-peering , db2	4	1	December 19, 2024
API Gateway VPC Link returns 502 Bad Gateway when routing to private NLB in production Amazon Web Services (AWS) question , api-gateway , networking , security , rest-api , aws-2019 , apis , http , vpc-link	7	0	June 19, 2025
Db2 connection timeout for ERP reporting jobs due to VPC firewall blocking port 50000 IBM Cloud question , database , sql , ic-2021 , connection-timeout , vpc-firewall , net-connect , db2 , port-config	5	1	June 14, 2025
Network latency spikes causing delayed analytics queries in multi-zone VPC IBM Cloud question , networking , analytics , performance , ic-2020 , vpc-routing , ibm-cloud-monitoring , latency-spike , query-delay	4	1	July 18, 2025
VPC network latency spikes detected but monitoring shows zero packet loss - troubleshooting network performance IBM Cloud question , networking , ic-2020 , flow-logs , network-acl , monitoring-mana , ibm-cloud-vpc-flow , incomplete-metrics , latency-spikes	3	0	October 21, 2025

Load balancer health checks failing for backend servers in VPC networking setup

Related topics