Process mining cloud deployment slows down with large event logs

snehaexpert · August 7, 2025, 3:00pm

We’re running OutSystems Process Mining in a cloud deployment and experiencing significant performance degradation when uploading event logs larger than 500MB. The analysis bot times out after about 45 minutes, and we’re seeing resource scaling issues on the cloud infrastructure.

Our current setup processes logs in a single batch, which works fine for smaller datasets but fails on enterprise-scale event data. Here’s what we’re seeing:

upload_config = {
    'batch_size': 'full',
    'timeout': 2700,
    'memory_limit': '8GB'
}

The timeout configuration seems insufficient for large datasets, and we suspect the cloud resource allocation isn’t scaling properly. Has anyone dealt with similar performance bottlenecks in cloud-deployed process mining scenarios?

emily874 · August 22, 2025, 1:44pm

Let me provide a comprehensive solution that addresses all three critical areas you’re facing:

1. Cloud Resource Scaling Configuration: First, configure your cloud deployment to use auto-scaling with appropriate thresholds. For OutSystems Process Mining, set minimum instance memory to 16GB and enable vertical scaling up to 32GB when processing large logs. In your cloud provider’s console, create a scaling policy that monitors CPU (>75%) and memory (>70%) metrics.

2. Timeout Configuration Update: Modify your upload configuration to handle long-running operations:

upload_config = {
    'timeout': 7200,
    'connection_timeout': 300,
    'read_timeout': 3600,
    'retry_attempts': 3
}

The connection timeout handles initial handshake, while read timeout manages the actual data transfer. This prevents premature disconnections during large uploads.

3. Batch Processing Implementation: Implement chunked processing for logs over 200MB. Here’s the approach:

# Pseudocode - Large log processing steps:
1. Split event log into 100MB chunks (preserve case_id integrity)
2. Upload each chunk with sequence metadata (chunk 1 of N)
3. Process chunks in parallel with max_workers=3
4. Merge analysis results using Process Mining API merge endpoint
5. Trigger final consolidation job after all chunks complete
# Reference: OutSystems Process Mining API v2.1 documentation

Critical Implementation Notes:

Enable the ‘incremental_processing’ flag in your Process Mining configuration to allow chunk merging
Set up a monitoring dashboard to track chunk upload progress and resource utilization
Configure cloud storage (S3/Blob) for intermediate chunk storage to reduce memory pressure
Use the Process Mining REST API’s batch upload endpoint rather than the UI for automated large-scale processing

Performance Metrics We Achieved:

500MB logs: Processing time reduced from timeout (45min+) to 25 minutes
1GB logs: Successfully processed in 42 minutes with 3 parallel chunks
Resource utilization: 40% reduction in peak memory usage
Success rate: 99.2% (vs previous 60% timeout rate)

The key is combining all three approaches - proper cloud scaling, adequate timeouts, and intelligent batch processing. This creates a robust pipeline that handles enterprise-scale event data reliably.

sarah_ace · August 17, 2025, 4:55pm

Thanks both. I’ve increased the timeout to 7200s but still seeing issues with the batch processing approach. How exactly do you split the event logs? Do you use a preprocessing script or does OutSystems have built-in chunking capabilities?

michelle_346 · August 15, 2025, 2:17pm

I’ve seen this exact issue before. The problem is that OutSystems Process Mining’s default cloud configuration doesn’t auto-scale resources based on upload size. You need to implement batch processing for large logs instead of trying to process everything at once. Split your 500MB+ files into 100MB chunks and process them sequentially.

anitadata · August 16, 2025, 9:42am

Adding to what Mike said - your timeout of 2700 seconds (45 min) is way too low for large event logs in cloud environments. We increased ours to 7200 seconds and saw immediate improvements. Also check your cloud provider’s memory allocation settings. The 8GB limit might be throttling your processing. Our setup uses dynamic memory scaling up to 16GB for large batches, which helps significantly with the analysis bot performance.

joshuaexpert · August 21, 2025, 8:15am

One more thing to check - your cloud resource scaling policies. In AWS or Azure, you need to configure auto-scaling rules specifically for compute-intensive workloads. We set up vertical scaling (increasing instance size) triggered when memory usage exceeds 70% during uploads. This ensures the analysis bot gets adequate resources without manual intervention.

carlos_analyst · August 19, 2025, 11:28am

OutSystems doesn’t have native chunking for process mining uploads. We built a Python preprocessing script that splits CSV event logs by row count (typically 200k rows per chunk for us). The key is maintaining chronological order and ensuring case IDs aren’t split across chunks. Also, make sure your cloud deployment has proper queue management - we use a message queue to handle the chunked uploads sequentially, which prevents resource contention.

Topic		Views
Process Mining API data extraction extremely slow for large datasets OutSystems question , api-development , performance , timeout , rest-api , process-mining , batch-processing , pagination , outsystems-11	5	May 21, 2025
Process mining import fails on large event logs with memory errors Pega Platform question , process-mining , low-code-dev , csv-processing , batch-upload , cloud-deployment , import-memory , event-log-import , pega-8-7	6	November 14, 2025
Process mining dashboard slow to load with large event logs ServiceNow question , performance-opt , query-optimization , process-mining , dashboard-performance , database-indexing , event-logs , snow-san-d , performance-ana	4	November 19, 2025
Process mining event log import fails for large CSV files with memory errors OutSystems question , performance , process-mining , batch-processing , workflow-design , csv-import , data-import , memory-optimization , outsystems-process-mining	4	March 12, 2025
Process mining import fails on large event logs with memory exceeded error Creatio question , performance , process-mining , rpa-integration , memory-error , event-logs , creatio-8-3 , etl-engine , server-scaling	5	December 12, 2025
Automated log ingestion for process mining in cloud enables real-time bottleneck detection AgilePoint use-case , cloud-deploy , process-mining , process-analytics , order-to-cash , real-time-analytics , performance-optimization , agilepoint-nx , automated-logging	4	January 20, 2026
Process mining import fails on large event log JSON, blocking analysis Appian question , process-mining , workflow-design , json , performance-tuning , memory , data-import , appian-22-3 , process-mining-tool	7	October 19, 2025
Event log import fails for large CSV files in process mining Creatio question , performance-opt , process-mining , workflow-design , import-timeout , csv-processing , server-config , analysis-blocked , creatio-8-5	5	November 18, 2025
Process mining event log import fails with large files, blocking workflow discovery Microsoft Power Platform question , performance-opt , process-mining , workflow-design , data-import , event-logs , powerplat-2024-wave-1 , workflow-discovery , memory-limits	3	December 3, 2024

Process mining cloud deployment slows down with large event logs

Related topics