We have a Cloud SQL MySQL instance configured with high availability and a read replica in a different region. During a recent outage when the primary instance became unresponsive, the automatic failover to the standby didn’t trigger as expected, resulting in 45 minutes of downtime.
Our Cloud SQL HA configuration shows enabled, and we have both a standby instance in the same region and a read replica in another region. The failover health checks should have detected the primary failure, but nothing happened. We had to manually promote the read replica, which caused data inconsistency issues.
I’m confused about the read replica limitations in this scenario. Should failover work with read replicas, or is there something specific about the HA standby versus read replicas? The documentation isn’t entirely clear on this distinction.