Data storage SDK query performance issues cause slow responses in aziot-25

We’re experiencing severe query performance issues with the data storage SDK in aziot-25. Queries against our telemetry data table (50M records) take 45-90 seconds to return results, causing unacceptable delays in our analytics dashboards. Simple time-range queries with device ID filters are timing out or returning extremely slowly.

Typical query pattern:

SELECT * FROM telemetry
WHERE deviceId = 'sensor_1234'
AND timestamp BETWEEN '2025-07-01' AND '2025-07-14'

The query returns ~50K records but takes over a minute. We’ve verified the underlying database has sufficient resources. Query optimization documentation mentions indexing and pagination but doesn’t provide clear guidance on proper implementation. How should we optimize these queries for acceptable performance?

You’re doing a SELECT *, which retrieves all columns for all 50K records - that’s a massive data transfer. First, select only the columns you actually need. Second, you absolutely need indexes on the deviceId and timestamp columns - check whether those indexes exist. Third, returning 50K records in one query is inefficient; implement pagination with LIMIT and OFFSET to fetch data in chunks of 1000-5000 records per request.
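The thread’s snippets are C# and SQL against the aziot-25 SDK; as a minimal, language-agnostic sketch of the same idea (explicit column list plus LIMIT/OFFSET chunking), here is a runnable example using Python’s stdlib sqlite3. Table and column names mirror the question; the data is fabricated for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE telemetry "
             "(deviceId TEXT, timestamp TEXT, temperature REAL, humidity REAL)")
conn.executemany(
    "INSERT INTO telemetry VALUES (?, ?, ?, ?)",
    [("sensor_1234", f"2025-07-{d:02d}", 20.0 + d, 50.0 + d) for d in range(1, 15)],
)

# Select only the needed columns and fetch in fixed-size chunks
# instead of pulling the whole result set at once.
PAGE = 5
offset = 0
rows = []
while True:
    batch = conn.execute(
        "SELECT deviceId, timestamp, temperature FROM telemetry "
        "WHERE deviceId = ? AND timestamp BETWEEN ? AND ? "
        "ORDER BY timestamp LIMIT ? OFFSET ?",
        ("sensor_1234", "2025-07-01", "2025-07-14", PAGE, offset),
    ).fetchall()
    if not batch:
        break
    rows.extend(batch)
    offset += PAGE

print(len(rows))  # 14 (fetched in chunks of 5)
```

Note the ORDER BY: OFFSET paging is only stable when the sort order is deterministic. OFFSET also re-scans the skipped rows on later pages, which is why a later answer in this thread recommends keyset pagination instead.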

The data storage SDK in aziot-25 has query optimization features you should enable. Use the query builder API instead of raw SQL - it automatically applies best practices like column selection and pagination. Also enable query result caching for frequently accessed data. We saw a 10x performance improvement by switching from raw SQL to the SDK’s query builder with proper indexing.

Yes, paginate for backend processing too. Processing 50K records in memory at once is inefficient and risky. Use cursor-based pagination or keyset pagination rather than OFFSET-based for better performance. For index verification, query the database metadata tables or use EXPLAIN on your query to see if indexes are being used. The aziot-25 storage SDK has built-in pagination support via the query options parameter.
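The two techniques in this answer (checking the plan with EXPLAIN, and keyset pagination that resumes after the last key seen rather than using OFFSET) can be sketched with Python’s stdlib sqlite3, where EXPLAIN QUERY PLAN plays the role of the database’s EXPLAIN. The SDK’s query-options API is not modeled here; this is a generic illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE telemetry (deviceId TEXT, timestamp TEXT, temperature REAL)")
conn.execute("CREATE INDEX idx_device_time ON telemetry(deviceId, timestamp)")
conn.executemany(
    "INSERT INTO telemetry VALUES (?, ?, ?)",
    [("sensor_1234", f"2025-07-{d:02d}", 20.0 + d) for d in range(1, 15)],
)

# 1) Verify the index is actually used: the plan should name the index,
#    not report a full table scan.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT timestamp, temperature FROM telemetry "
    "WHERE deviceId = ? AND timestamp BETWEEN ? AND ?",
    ("sensor_1234", "2025-07-01", "2025-07-14"),
).fetchall()
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)  # e.g. "SEARCH telemetry USING INDEX idx_device_time (...)"

# 2) Keyset pagination: remember the last timestamp seen and resume
#    strictly after it - no OFFSET, so later pages stay fast.
last_ts, total = "", 0
while True:
    batch = conn.execute(
        "SELECT timestamp, temperature FROM telemetry "
        "WHERE deviceId = ? AND timestamp > ? ORDER BY timestamp LIMIT 5",
        ("sensor_1234", last_ts),
    ).fetchall()
    if not batch:
        break
    total += len(batch)
    last_ts = batch[-1][0]
```

Keyset pagination requires a unique (or effectively unique) sort key; if two rows can share a timestamp, add a tiebreaker column to both the ORDER BY and the resume predicate.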

Your query performance issues require systematic optimization across all three areas:

Query Optimization: Avoid SELECT * and specify only required columns. Use the SDK’s query builder for automatic optimization:

var query = storageClient.CreateQuery()
    .Select("deviceId", "timestamp", "temperature", "humidity")
    .Where("deviceId", deviceId)
    .WhereBetween("timestamp", startDate, endDate)
    .OrderBy("timestamp")
    .Limit(5000);

Indexing: Create composite indexes optimized for your query patterns. For time-range queries with device filtering:

CREATE INDEX idx_device_time
ON telemetry(deviceId, timestamp DESC)
INCLUDE (temperature, humidity);
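The INCLUDE clause above is SQL Server syntax; not every engine supports it. In SQLite, for instance, the same covering effect comes from appending the extra columns as trailing index columns, and the plan then reports a COVERING INDEX (no lookups back to the table). A small sketch, purely illustrative:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE telemetry "
             "(deviceId TEXT, timestamp TEXT, temperature REAL, humidity REAL)")
# SQLite has no INCLUDE clause: add temperature/humidity as trailing
# index columns so the index alone can satisfy the query.
conn.execute("CREATE INDEX idx_device_time "
             "ON telemetry(deviceId, timestamp, temperature, humidity)")
conn.execute("INSERT INTO telemetry VALUES ('sensor_1234', '2025-07-03', 23.0, 53.0)")

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT deviceId, timestamp, temperature, humidity "
    "FROM telemetry WHERE deviceId = 'sensor_1234' "
    "AND timestamp BETWEEN '2025-07-01' AND '2025-07-14'"
).fetchall()
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)  # mentions "COVERING INDEX idx_device_time"
```

The trade-off in either dialect is the same: covering indexes speed up reads at the cost of larger index size and slower writes, so include only the handful of columns the hot query actually selects.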

Pagination: Implement cursor-based pagination for efficient large result set handling:

string continuationToken = null;
do {
    var result = await query.ExecuteAsync(continuationToken);
    ProcessBatch(result.Items);
    continuationToken = result.ContinuationToken;
} while (continuationToken != null);

Detailed implementation strategy:

1. Verify index existence using the storage SDK’s metadata API or database EXPLAIN plans. Your query should show “Index Seek” on idx_device_time, not “Table Scan”. If indexes are missing, create a composite index on (deviceId, timestamp) with included columns for frequently accessed fields; this eliminates key lookups after the index seek.
2. Implement pagination in 5000-record chunks to reduce memory pressure and enable progressive result processing. Use continuation tokens instead of OFFSET-based pagination to avoid performance degradation on later pages.
3. Optimize column selection: if you need 10 columns out of 50, listing them explicitly cuts data transfer by roughly 80%.
4. Enable query result caching in the SDK for repeated queries:

var options = new QueryOptions {
    EnableCache = true,
    CacheDuration = TimeSpan.FromMinutes(5)
};
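The EnableCache/CacheDuration options above are specific to the aziot-25 SDK. As a language-agnostic sketch of what such an option does under the hood, here is a minimal TTL cache keyed by query text and parameters; the class name and shape are hypothetical, not part of any SDK.

```python
import time

class QueryCache:
    """Hypothetical stand-in for the SDK's result cache: stores query
    results keyed by (sql, params) and expires them after a TTL."""

    def __init__(self, ttl_seconds: float = 300.0):
        self.ttl = ttl_seconds
        self._store = {}

    def get(self, key):
        hit = self._store.get(key)
        if hit is None:
            return None
        expires_at, value = hit
        if time.monotonic() > expires_at:  # stale entry: evict and miss
            del self._store[key]
            return None
        return value

    def put(self, key, value):
        self._store[key] = (time.monotonic() + self.ttl, value)

cache = QueryCache(ttl_seconds=300)  # 5-minute TTL, as in the options above
key = ("SELECT ... WHERE deviceId = ?", ("sensor_1234",))
if cache.get(key) is None:
    rows = [("sensor_1234", "2025-07-01", 21.0)]  # pretend: result of running the query
    cache.put(key, rows)
print(cache.get(key))
```

Caching only pays off for repeated identical queries; dashboards that always ask for “the last N minutes” produce a new key each time unless the time range is rounded to a cache-friendly boundary.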

For 50M-record tables, consider time-based partitioning (monthly or weekly). Partition pruning eliminates 90%+ of the data from scans when querying recent time ranges.

Also implement query timeout handling and retry logic for long-running queries, and monitor query execution metrics via the SDK’s telemetry: execution time, rows scanned vs. returned, and cache hit rates.

For your specific query pattern, properly indexed and paginated queries should complete in under 5 seconds for the first page and 2-3 seconds for subsequent pages. If performance doesn’t improve after indexing, check that statistics are up to date (run ANALYZE on the table) and verify the query planner is choosing an optimal execution plan. Consider read replicas if high concurrent query load is impacting write performance.
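The partition-pruning idea can be sketched independently of any engine: with one table (or partition) per month, a time-range query only needs to touch the partitions that overlap the requested range. The naming scheme below is illustrative, not from the SDK.

```python
from datetime import date

def partitions_for_range(start: date, end: date) -> list[str]:
    """Return the monthly partition names (telemetry_YYYY_MM) that a
    [start, end] time-range query must scan; all others are pruned."""
    names = []
    y, m = start.year, start.month
    while (y, m) <= (end.year, end.month):
        names.append(f"telemetry_{y}_{m:02d}")
        y, m = (y + 1, 1) if m == 12 else (y, m + 1)
    return names

# The question's two-week July range touches exactly one monthly partition,
# so the other partitions of the 50M-row table are never scanned.
print(partitions_for_range(date(2025, 7, 1), date(2025, 7, 14)))
# ['telemetry_2025_07']
```

Databases with native partitioning (declarative partitioning, partitioned tables, etc.) perform this pruning automatically in the planner; the sketch just makes the mechanism visible.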

We do need most columns for the analytics processing. Are you suggesting we should paginate even for backend processing, not just UI display? Also, how do we verify if proper indexes exist on the telemetry table?

Check your query execution plan. If the database is doing full table scans instead of index seeks, that’s your problem. Create a composite index on (deviceId, timestamp) for optimal query performance. Also consider partitioning your telemetry table by date if you’re not already - this dramatically improves time-range query performance by eliminating irrelevant partitions from the scan.