I want to add the infrastructure perspective since network reliability was mentioned. We’ve found that hybrid integration strategies work best when you have clear event classification. Here’s what we’ve learned across multiple implementations:
Network Reliability Considerations:
Real-time integration requires consistent network availability - aim for 99.5% uptime minimum. If your plant networks can’t meet this, real-time will cause more problems than it solves. Key factors:
• Wireless networks are problematic for real-time - latency spikes during interference
• Wired networks are more predictable but still need quality of service (QoS) configuration
• Segment your network - don’t mix machine data traffic with office traffic
• Monitor packet loss - anything above 1% will cause issues with real-time protocols
Latency Requirements by Use Case:
Different scenarios have different tolerance:
• Machine downtime alerts: <5 seconds acceptable, real-time justified
• Quality failures: <30 seconds acceptable, near-real-time sufficient
• Production counts: <2 minutes acceptable, batch processing fine
• Temperature/pressure readings: <5 minutes acceptable, batch processing preferred (reduces noise)
• Tool wear indicators: <10 minutes acceptable, batch definitely sufficient
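These tolerance tiers map naturally onto a small lookup table that routes each event type to a transport tier. A minimal sketch in Python (the event names and thresholds are illustrative, mirroring the list above):

```python
# Map event category -> (latency budget in seconds, transport tier).
# Categories and budgets mirror the tolerances listed above; names are illustrative.
LATENCY_BUDGET = {
    "machine_down":  (5,   "real-time"),
    "quality_fail":  (30,  "near-real-time"),
    "prod_count":    (120, "batch"),
    "temp_pressure": (300, "batch"),
    "tool_wear":     (600, "batch"),
}

def transport_for(event_type: str) -> str:
    """Return the transport tier for an event, defaulting to batch
    so unclassified events never load the real-time path."""
    _, tier = LATENCY_BUDGET.get(event_type, (600, "batch"))
    return tier
```

Defaulting unknown events to batch is deliberate: misclassifying a routine event as critical overloads the real-time path, while the reverse just delays one reading.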
Hybrid Integration Strategy Framework:
The approach you and Susan mentioned works well. Here’s how to structure it:
• Event Classification: Define three tiers - Critical (real-time), Important (near-real-time), Routine (batch)
• Transport Layer: Use message queuing for real-time events (ensures delivery even if MES is briefly unavailable) and REST APIs for batch uploads
• Database Impact: Real-time events write to a hot table with short retention (24 hours); batch processes write to the main tables. This reduces database contention.
• Failover Logic: When the real-time connection fails, queue events locally and switch to batch mode automatically; resume real-time once the connection is restored.
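The failover step is the piece teams most often get wrong, so here is a minimal sketch of the queue-and-degrade logic. `send_realtime` and `send_batch` are stand-ins for your actual transport calls (message-queue publish and REST bulk upload respectively); all names here are illustrative, not a specific product API:

```python
import time
from collections import deque

class HybridSender:
    """Send events real-time when possible; queue locally and fall
    back to batch mode when the real-time connection fails."""

    def __init__(self, send_realtime, send_batch, retry_interval=30):
        self.send_realtime = send_realtime   # e.g. MQ publish
        self.send_batch = send_batch         # e.g. REST bulk upload
        self.retry_interval = retry_interval # seconds between reconnect probes
        self.local_queue = deque()
        self.realtime_ok = True
        self._last_retry = 0.0

    def send(self, event):
        if self.realtime_ok:
            try:
                self.send_realtime(event)
                return
            except ConnectionError:
                self.realtime_ok = False     # degrade to batch mode
        self.local_queue.append(event)       # event is never dropped

    def flush(self):
        """Call periodically: drain the local queue via batch upload and
        probe the real-time path so it resumes once it recovers."""
        if self.local_queue:
            self.send_batch(list(self.local_queue))
            self.local_queue.clear()
        now = time.monotonic()
        if not self.realtime_ok and now - self._last_retry >= self.retry_interval:
            self._last_retry = now
            try:
                self.send_realtime({"type": "probe"})
                self.realtime_ok = True      # connection restored
            except ConnectionError:
                pass                         # stay in batch mode
```

The key property: an event is either delivered real-time or queued, never lost, and the switch back to real-time is automatic once a probe succeeds.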
Performance Optimization:
If you implement hybrid integration:
• Batch your routine updates but vary the batch timing across machines (don’t have 200 machines all sending updates at :00, :05, :10)
• Use delta updates for batch processing - only send values that changed
• Compress batch payloads if sending large datasets
• Implement backpressure handling - if MES is slow to respond, increase batch intervals temporarily
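The first two points above can be sketched together: derive each machine's batch offset deterministically from its ID (so 200 machines spread evenly across the interval instead of clustering at :00/:05/:10), and diff against the previous payload so only changed values go over the wire. The machine IDs and field names are illustrative:

```python
import hashlib

BATCH_INTERVAL = 300  # seconds; a 5-minute routine batch cycle

def batch_offset(machine_id: str, interval: int = BATCH_INTERVAL) -> int:
    """Deterministic per-machine jitter: hash the machine ID to an
    offset within the batch interval, spreading send times evenly."""
    digest = hashlib.sha256(machine_id.encode()).digest()
    return int.from_bytes(digest[:4], "big") % interval

def delta_update(previous: dict, current: dict) -> dict:
    """Return only the fields whose values changed since the last batch."""
    return {k: v for k, v in current.items() if previous.get(k) != v}
```

Hashing (rather than random jitter) keeps each machine's schedule stable across restarts, which makes gaps in the data easy to spot.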
The sweet spot we’ve found is real-time for about 10-15% of events (the critical ones), near-real-time (30-60 second batches) for another 20%, and regular batching (2-5 minutes) for the remaining 65-70%. This balances visibility with system load effectively.