Comparing IoT data lake vs SAP HANA native storage for monetization analytics

We’re designing the storage architecture for our IoT monetization analytics platform and evaluating two approaches: storing raw device telemetry in a data lake (S3/Azure Data Lake) with periodic aggregation to HANA, versus using HANA native storage for everything.

Our scale: 50K connected devices generating billing events, roughly 2 TB of new data per month. We need real-time billing calculations as well as historical trend analysis going back 2+ years. The cost difference is significant: data lake storage runs roughly 1/10th the cost of HANA, but query performance and integration complexity favor HANA native.

Has anyone implemented a hybrid approach where hot data (last 3-6 months) lives in HANA for real-time monetization while cold data sits in the lake for analytics? Curious about the integration patterns, performance trade-offs, and whether the cost savings justify the architectural complexity.
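To make the split I have in mind concrete, here's a minimal routing sketch (names and the 90-day cutoff are hypothetical, just illustrating the idea):

```python
from datetime import date, timedelta

# Hypothetical cutoff: data newer than this lives in HANA, older in the lake.
HOT_WINDOW_DAYS = 90

def route_query(start: date, end: date, today: date) -> list[str]:
    """Decide which storage tiers a date-ranged query must touch."""
    cutoff = today - timedelta(days=HOT_WINDOW_DAYS)
    tiers = []
    if start < cutoff:
        tiers.append("lake")   # historical portion of the range
    if end >= cutoff:
        tiers.append("hana")   # hot portion of the range
    return tiers

# A 2-year trend query would hit both tiers; a current-month billing
# query would hit only HANA.
```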

Good point about schema consistency. How do you handle the historical analytics queries that span both hot and cold data? Do you query both systems and merge results, or do you replicate aggregated summaries back to the lake?
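For context, the merge I'd picture on our side looks something like this (a sketch, assuming both tiers return rows in the same `(month, device_id, amount)` shape and the tiering job deduplicates events so only a boundary month can appear in both):

```python
from collections import defaultdict

def merge_monthly_totals(hot_rows, cold_rows):
    """Merge per-month billing totals from HANA (hot) and the lake (cold).

    Rows are (month, device_id, amount) tuples. Keys that appear in both
    result sets are summed, which assumes the tiering job deduplicates
    events so a month is never double-counted across tiers.
    """
    totals = defaultdict(float)
    for month, device_id, amount in [*cold_rows, *hot_rows]:
        totals[(month, device_id)] += amount
    return dict(totals)
```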

We went full data lake initially and regretted it. Query performance for real-time billing was terrible: 5-8 second latencies on aggregation queries that HANA handles in milliseconds. We ended up moving the last 90 days into HANA and keeping older data in the lake. The integration overhead is real, though: you need solid ETL pipelines and careful partitioning strategies.
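By "careful partitioning" I mean Hive-style date partitions on the lake side so query engines can prune by day, plus a fixed eviction cutoff for the tiering job. A sketch (bucket name, table path, and helper names are hypothetical):

```python
from datetime import date, timedelta

def lake_partition_path(bucket: str, event_day: date) -> str:
    """Hive-style date partitioning so engines can prune by day."""
    return (f"s3://{bucket}/billing_events/"
            f"year={event_day:%Y}/month={event_day:%m}/day={event_day:%d}/")

def eviction_cutoff(today: date, hot_days: int = 90) -> date:
    """Everything on or before this day moves from HANA to the lake."""
    return today - timedelta(days=hot_days)
```

The nightly ETL job computes `eviction_cutoff`, exports the matching HANA partitions to the corresponding lake paths, verifies row counts, then drops them from HANA.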

We use federated queries via HANA Smart Data Access to query the lake directly from HANA. It’s not as fast as native HANA queries, but for historical analytics where sub-second response isn’t critical, it works well. The benefit is a single query interface - your analytics tools just hit HANA, and it transparently federates to the lake for older data. Performance is acceptable for most analytical workloads, though we do pre-aggregate common metrics monthly and store those summaries in HANA to avoid repeated federation overhead.
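The monthly pre-aggregation step is simple in principle; this is roughly the rollup logic, sketched in Python rather than our actual pipeline (field names here are illustrative, not our real schema):

```python
from collections import defaultdict

def preaggregate_monthly(events):
    """Roll raw billing events up to per-(month, device) summaries.

    events: iterable of dicts with 'ts' (ISO date string), 'device_id',
    and 'amount'. The summaries land in a HANA table so federated
    queries against the lake are only needed for ad-hoc drill-downs.
    """
    summary = defaultdict(lambda: {"event_count": 0, "total_amount": 0.0})
    for e in events:
        key = (e["ts"][:7], e["device_id"])  # 'YYYY-MM'
        summary[key]["event_count"] += 1
        summary[key]["total_amount"] += e["amount"]
    return dict(summary)
```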