Extracting event logs via the Process Mining API takes over 15 minutes and frequently times out with 504 Gateway Timeout errors when pulling data for processes with more than 50,000 events. We’re trying to build automated analytics reports that run daily, but the API performance is making this impractical.
The timeout occurs during large data pulls:
GET /api/v1/process-mining/events?processId=12345&limit=10000
Status: 504 Gateway Timeout
Error: "Request exceeded maximum execution time"
We’ve tried adjusting the limit parameter, but even with smaller page sizes the overall extraction time is excessive. The API documentation mentions pagination support and batch processing capabilities, but we’re unclear on the optimal configuration for large dataset extraction. Our timeout configuration seems standard, but perhaps there are specific settings for Process Mining API calls that we’re missing?
Have you looked into using filters to reduce the dataset size before extraction? Instead of pulling all 50,000 events, filter by date range, event type, or process instance status. Most analytics use cases don’t need the complete historical dataset every time. You could implement incremental extraction where you only pull events created or modified since the last successful extraction run.
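To make the incremental idea concrete, here’s a minimal sketch of checkpoint-based extraction. The `modifiedAfter` filter name and the base URL are assumptions for illustration, not confirmed parameters of the Process Mining API; check the actual docs for the supported filter names.

```python
import json
from datetime import datetime, timezone

STATE_FILE = "last_extraction.json"  # local checkpoint file (hypothetical)
BASE_URL = "https://example.com/api/v1/process-mining/events"  # placeholder host

def load_checkpoint():
    """Return the timestamp of the last successful run, or None on the first run."""
    try:
        with open(STATE_FILE) as f:
            return json.load(f)["last_run"]
    except (FileNotFoundError, KeyError):
        return None

def build_incremental_url(process_id, page_limit=2000):
    """Build a filtered query so only events changed since the last run are pulled."""
    params = f"processId={process_id}&limit={page_limit}"
    since = load_checkpoint()
    if since:
        # 'modifiedAfter' is an assumed filter name -- verify against the API docs
        params += f"&modifiedAfter={since}"
    return f"{BASE_URL}?{params}"

def save_checkpoint():
    """Record the current time after a successful extraction run."""
    with open(STATE_FILE, "w") as f:
        json.dump({"last_run": datetime.now(timezone.utc).isoformat()}, f)
```

Run `save_checkpoint()` only after the full extraction succeeds, so a failed run gets retried from the previous checkpoint rather than silently skipping data.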
For the timeout configuration, make sure you’re setting timeouts at multiple levels: the OutSystems HTTP client timeout, the API gateway timeout, and the Process Mining service timeout. They all need to be aligned so an inner layer doesn’t give up before an outer one. I typically set the HTTP client timeout to 120 seconds, the gateway timeout to 150 seconds, and ensure the backend service can complete within those limits. Also consider implementing retry logic with exponential backoff for transient timeout errors.
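The retry side can be as simple as the sketch below. The `fn` callable stands in for whatever HTTP call you’re making; the set of status codes treated as transient is just a reasonable default, not something the API specifies.

```python
import random
import time

def with_retries(fn, max_attempts=5, base_delay=1.0, retryable=(429, 502, 504)):
    """Call fn() -> (status_code, body); on a retryable status, wait
    base_delay * 2**attempt plus a little jitter, then try again."""
    for attempt in range(max_attempts):
        status, body = fn()
        if status not in retryable:
            return status, body
        if attempt < max_attempts - 1:
            delay = base_delay * (2 ** attempt) + random.uniform(0, 0.5)
            time.sleep(delay)
    # All attempts exhausted: surface the last response to the caller
    return status, body
```

The jitter matters when several daily report jobs start at the same time; without it, retries from parallel jobs re-collide on the same schedule.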
We reduced the limit to 2000 events per page, but now we’re making 25+ sequential API calls to get all the data, and the total time is still around 12-15 minutes. Is there a way to parallelize these requests or use batch processing to speed things up? The sequential pagination approach seems inefficient for our use case.
The 504 timeout with large datasets is common when you’re trying to pull too much data in a single request. Even though you’re using pagination with limit=10000, that’s still a lot of events to process in one call. Try reducing the page size to 1000-2000 events and implement proper pagination with offset or cursor-based navigation. Also check if your gateway timeout is set appropriately for data extraction operations.
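On the parallelization question: if the endpoint uses offset-based pagination and you can get a total event count up front, the page requests are independent and can be fanned out over a small thread pool. This is a sketch with a placeholder `fetch(offset, limit)` callable standing in for the actual API call; keep the worker count modest so you don’t trip rate limits.

```python
from concurrent.futures import ThreadPoolExecutor

PAGE_SIZE = 2000

def fetch_all_parallel(fetch, total_events, max_workers=4):
    """Fetch every page concurrently and reassemble the event list in order.

    fetch(offset, limit) -> list of events; offset-based pagination is assumed.
    Executor.map preserves input order, so pages come back in sequence.
    """
    offsets = range(0, total_events, PAGE_SIZE)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        pages = pool.map(lambda off: fetch(off, PAGE_SIZE), offsets)
    events = []
    for page in pages:
        events.extend(page)
    return events
```

With 50,000 events at 2,000 per page and 4 workers, that’s 25 requests in roughly 7 sequential rounds instead of 25, assuming the backend tolerates the concurrency. Note this only works safely on a stable snapshot; if events are being written during extraction, cursor-based pagination is the more reliable option.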
Process Mining APIs often support bulk export operations separate from the standard pagination endpoints. Check if there’s an /export or /bulk endpoint that’s designed for large dataset extraction. These endpoints typically generate a data file asynchronously and provide a download link, which is much more efficient than paginating through thousands of events. You’d poll for completion status rather than waiting synchronously.
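If such an async export endpoint exists, the client-side flow is: start the job, poll for completion, then download the file. Here’s a sketch with the three HTTP calls abstracted as callables, since the exact endpoint paths and response shapes are assumptions.

```python
import time

def export_and_download(start_export, get_status, download,
                        poll_interval=5.0, timeout=900.0):
    """Run an async bulk export end to end.

    start_export() -> job_id
    get_status(job_id) -> (state, download_url); state is one of
                          "pending", "completed", "failed"
    download(url) -> bytes
    All three are thin wrappers around hypothetical /export endpoints.
    """
    job_id = start_export()
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        state, url = get_status(job_id)
        if state == "completed":
            return download(url)
        if state == "failed":
            raise RuntimeError(f"export job {job_id} failed")
        time.sleep(poll_interval)
    raise TimeoutError(f"export job {job_id} did not finish within {timeout}s")
```

Because the heavy lifting happens server-side and you only poll a lightweight status endpoint, the 504s disappear entirely; the gateway never has to hold a long-running request open.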