We’re experiencing significant latency spikes in carrier rate lookup API calls during peak order processing hours. Our transportation management system handles around 15,000 shipments daily, and during morning batch processing (7-10 AM), rate lookup calls that normally complete in 200-300ms are spiking to 4-8 seconds.
The rate_lookup_cache table has grown to 8M+ records, and we’re seeing timeout errors on about 12% of API calls during peak load. This is causing shipment delays and SLA violations.
Here’s a typical API call pattern we’re seeing:
RateLookupRequest request = new RateLookupRequest();
request.setCarrierCode("FEDEX");
request.setServiceLevel("GROUND");
RateLookupResponse response = rateService.getRates(request);
// Timeout after 5000ms during peak hours
We’ve noticed that bulk rating jobs are scheduled to run at the same time as interactive lookups, and thread pool configuration hasn’t been reviewed since initial implementation. Any guidance on optimizing this would be greatly appreciated.