Self-service BI dashboard performance drops with large datasets in analytics module

marie_lead · April 4, 2025, 6:39pm

Our self-service analytics dashboards have become unusable over the past month as our datasets grew beyond 5 million records. Load times went from 3-4 seconds to over 30 seconds, and some complex dashboards timeout entirely. This is seriously impacting user adoption - business users are abandoning the platform.

The dashboards use multiple widgets with cross-filtering enabled, showing sales trends, regional breakdowns, and product performance. When users apply filters or drill down, the entire dashboard freezes for 20-30 seconds.

Here’s a typical query pattern we’re using:


q = load "sales_data";
q = filter q by 'Date' in ["current_year"];
q = group q by ('Region', 'Product');
q = foreach q generate sum('Revenue') as 'Total';

We haven’t implemented any specific data aggregation strategies or indexed filter fields. Should we be pre-aggregating data or is there a better query optimization approach for large datasets in tcrm-2022?

william_chief · April 7, 2025, 8:52am

Are you using compact form for your datasets? With 5 million records, storage format makes a huge difference. Also, check if your date filters are using indexed fields. Date ranges are common filter criteria and should absolutely be indexed. You can verify indexing in the dataset metadata and add indexes through the dataflow configuration if they’re missing.

mia_608 · April 7, 2025, 8:40pm

Beyond indexing, you really need to implement aggregation layers for self-service dashboards at this scale. Create pre-aggregated datasets at different grain levels - daily summaries, weekly rollups, monthly aggregates. Then configure your dashboard to use the appropriate aggregation level based on the selected date range. This is standard practice for enterprise-scale self-service BI. Users drilling into daily detail would query the granular dataset, but overview widgets use the aggregated versions. This can reduce query times by 80-90% for typical use cases.

sam3730 · April 15, 2025, 9:26pm

Here’s a comprehensive optimization strategy addressing all three critical areas:

Query Optimization: Restructure your SAQL to filter early and minimize data scanning. Your current query loads everything first - instead:


q = load "sales_data";
q = filter q by 'Date' in ["current_year"];
q = filter q by 'Region' is not null;
q = group q by ('Region', 'Product');
q = foreach q generate sum('Revenue') as 'Total';

Apply all filters immediately after load, before any grouping or aggregation. This reduces the working dataset size early in the query pipeline.

Data Aggregation Strategy: Implement a three-tier aggregation approach in your dataflow:

Raw dataset (5M records) - for detailed drill-downs only
Daily aggregates (500K records) - for week/month views
Monthly aggregates (50K records) - for year/quarter overview

Create separate datasets for each grain level and configure your dashboard to automatically select the appropriate dataset based on date range selection. Use dashboard bindings with conditional logic:


if date_range <= 7 days: use raw_data
if date_range <= 90 days: use daily_aggregates
if date_range > 90 days: use monthly_aggregates

Indexed Filter Fields: Add indexes to your most commonly filtered dimensions. In your dataflow metadata, configure these fields as indexed:

Date (absolutely critical for time-series queries)
Region (common filter criterion)
Product Category (if used in filters)
Any field used in dashboard global filters

Indexing requires dataset rebuild but provides 5-10x performance improvement for filtered queries. The rebuild is a one-time cost for ongoing performance gains.

Additional optimizations for tcrm-2022:

Enable compact storage format (reduces dataset size by 40-60%)
Implement query result caching with 1-hour TTL for common patterns
Use pagination for large result sets (limit initial load to 1000 rows)
Disable cross-filtering on widgets that don’t need it (reduces query cascade)
Schedule dataset refreshes during off-peak hours to maintain compact form

With these changes implemented, you should see dashboard load times drop from 30+ seconds to under 5 seconds for typical queries, even with continued data growth. The aggregation strategy is the biggest win - most business users don’t need transaction-level detail for overview dashboards.

rajesh_guru · April 8, 2025, 5:09am

Also consider implementing result caching for common filter combinations. If multiple users are running similar queries (like current quarter sales by region), cache those results with a reasonable TTL. Tableau CRM in tcrm-2022 has improved caching capabilities that can serve cached results for identical queries, dramatically improving perceived performance for repeated access patterns.

alex_api · April 7, 2025, 11:03am

We’re using the default storage format - didn’t realize compact form was an option. The date field isn’t specifically indexed, just a standard dimension. How much performance improvement can we expect from adding indexes? And does that require rebuilding the entire dataset?

Topic		Replies	Views
Enterprise reporting dashboard slow to load with large datasets exceeding 100K rows SAP Crystal Reports question , performance-opt , sql , query-optimization , dashboard-design , scr-2022 , enterprise-reporting , dashboard-performance , data-pagination	6	3	September 29, 2025
Enterprise reporting dashboard slow to load with large datasets exceeding 500K rows SAP Crystal Reports question , performance , sql , query-optimization , dashboard-design , scr-2022 , enterprise-reporting , dashboard-performance , data-pagination	6	2	May 25, 2025
Ad-hoc reporting response time degrades with large crosstab views and multiple dimension filters Tableau question , performance-opt , tab-2023-3 , ad-hoc-reporting , tableau-server , extract-optimization , context-filters , crosstab , query-performance	6	2	March 5, 2025
Process analytics dashboard loads slowly when joining large datasets from multiple sources Appian question , query-tuning , data-integration , process-analytics , performance-optimization , large-datasets , dashboard-performance , appian-22-4 , data-fabric	7	0	January 16, 2026
Best practices for preparing large datasets for visualization in Crystal Reports SAP Crystal Reports discussion , etl , performance , scr-2016 , data-preparation , aggregation , database-optimization , data-visualization , large-dataset	4	2	July 9, 2025
Project management reporting performance is slow when querying large datasets in Prism Analytics Workday question , proj-mgmt , database-mgt , sql , wd-r1-2024 , performance-tuning , prism-analytics , slow-report , exec-dashboard-delay	7	2	February 22, 2025
Sales dashboard performance slow when loading SuiteAnalytics workbooks NetSuite question , reporting-analytics , sales-mgmt , dashboard-optimization , ns-2024-1 , suiteanalytics , dataset-config , performance-lag , browser-performance	4	2	July 6, 2025
Event management dashboard slow to load after enabling additional event types SAP Customer Experience (SAP CX) question , performance , query-optimization , event-mgmt , scx-2111 , pagination , reporting-dashboards , database-indexing , dashboard-designer	3	1	June 23, 2025
Custom dashboard loads slowly in app enablement module when querying device telemetry Google Cloud IoT question , performance-opt , sql , analytics-report , slow-query , dashboard-performance , bigquery , table-partitioning , app-enablement	6	3	September 2, 2025

Self-service BI dashboard performance drops with large datasets in analytics module

Related topics