Genealogy tracking API batch queries timing out when tracing

jessica_sql · May 25, 2025, 9:41pm

We’re experiencing timeout errors when running batch genealogy queries through the REST API to trace material lineage for compliance audits. Our current implementation retrieves full genealogy trees for 50-100 parts at a time, but queries consistently fail after 30 seconds with a 504 Gateway Timeout.

The API call structure looks like this:


POST /api/genealogy/batch-trace
{
  "partIds": ["P-001", "P-002", ...],
  "direction": "both",
  "depth": "unlimited"
}

We’ve noticed that individual queries complete in 2-3 seconds, but batching them causes the timeout. We need this for monthly compliance reports where we trace hundreds of parts. Has anyone dealt with batch query pagination or recursive logic to handle this more efficiently? Also wondering if there are API timeout configuration settings we can adjust or if database indexing might help.

ankit_696 · May 28, 2025, 9:13pm

Thanks for the insights. I checked with our DBA and confirmed we don’t have composite indexes on the genealogy relationship tables. That’s definitely contributing to the problem. However, we still need to process large batches for our compliance reporting. Is there a recommended approach for recursive client logic that would handle this more gracefully? Should we be limiting depth even if we need full traceability?

giovanni_sql · June 2, 2025, 11:58am

Let me provide a comprehensive solution that addresses all the key points mentioned in this thread.

1. Batch Query Pagination Strategy Reduce your batch size from 50-100 parts to 10-15 parts per API call. This keeps individual requests manageable while still providing reasonable throughput. Implement this with a simple loop:


for (batch in partIds.chunked(10)) {
  results.addAll(api.traceGenealogy(batch))
  Thread.sleep(200) // Rate limiting
}

2. Recursive Client Logic Implementation Instead of requesting unlimited depth in one call, implement breadth-first traversal on the client side. Query one level at a time, collect unique part IDs from results, then query the next level. This gives you full traceability while maintaining control over query complexity. Pseudocode approach:


// Pseudocode - Breadth-first genealogy traversal:
1. Start with initial part IDs as currentLevel
2. While currentLevel is not empty:
   a. Query genealogy for currentLevel (batch of 10-15)
   b. Extract all parent/child IDs from results
   c. Filter out already-processed IDs
   d. Set nextLevel = new unique IDs
   e. Add results to master genealogy map
   f. currentLevel = nextLevel
3. Return complete genealogy tree
// Typical depth: 4-6 levels for most products

3. API Timeout Settings Increase the gateway timeout in your Apriso server configuration. Edit the API gateway properties file and set api.gateway.timeout=90000 (90 seconds). Also check the database connection timeout - it should be at least 60 seconds. However, with proper pagination and recursive logic, you shouldn’t need to rely on longer timeouts.

4. Database Indexing Optimization Work with your DBA to create composite indexes on the genealogy tables. Critical indexes needed:

Composite index on (parent_part_id, relationship_type, created_date)
Composite index on (child_part_id, relationship_type, created_date)
Index on (part_id, status) for active part lookups

After indexing, run ANALYZE/UPDATE STATISTICS on the genealogy tables to ensure the query optimizer uses the new indexes effectively.

5. Additional Recommendations

Implement connection pooling with at least 20 connections for parallel batch processing
Add retry logic with exponential backoff for transient failures
Consider caching frequently accessed genealogy paths (24-hour TTL)
Monitor API response times and adjust batch sizes based on actual performance
For compliance reports, run queries during off-peak hours when database load is lower

This combination of client-side pagination, recursive traversal, proper indexing, and configuration tuning should eliminate your timeout issues while maintaining full genealogy traceability. We implemented this exact approach for a pharmaceutical client processing 500+ parts daily for FDA compliance reporting, and query times dropped from 30+ seconds (with timeouts) to 8-12 seconds total for complete genealogy trees.

samadmin · May 30, 2025, 5:18pm

One more thing to consider - implement caching for frequently queried genealogy paths. If you’re running monthly compliance reports, chances are you’re querying the same parts repeatedly. We built a Redis cache layer that stores genealogy results for 24 hours, which cut our API load by 70%. Just make sure to invalidate cache entries when parts are updated or new relationships are created.

Topic		Views
Genealogy tracking API serial number queries are slow - how to optimize? GE Vernova question , performance-opt , api-development , rest-api , indexing , genealogy-tracking , slow-query , gpsf-2022 , traceability-delay	6	April 21, 2025
Genealogy tracking API serial number lookup takes 8+ seconds Honeywell MES question , api-development , performance , rest-api , java , genealogy-tracking , slow-query , hm-2023-2 , traceability-delays	3	March 16, 2025
Genealogy tracking API returns incomplete traceability chain Rockwell FactoryTalk MES question , api-development , rest-api , json , genealogy-tracking , ft-12-0 , traceability-gr , incomplete-lineage , compliance-audit-fail	4	June 13, 2025
Genealogy tracking traceability depth versus system performance trade-offs Rockwell FactoryTalk MES discussion , workflow-process , genealogy-tracking , performance-optimization , query-performance , database-indexing , ft-10-0 , genealogy-engine , archival-strategy	5	January 15, 2025
Genealogy tracking: balancing full material traceability with system performance DELMIA Apriso MES discussion , performance-opt , database-design , traceability , workflow-process , compliance , dam-2021 , genealogy-tracking , data-management	7	February 17, 2025
Best practices for genealogy tracking and product lineage reporting at scale AVEVA MES discussion , traceability , reporting-analytics , compliance , query-optimization , data-architecture , genealogy-tracking , performance-tuning , am-2021-2	4	March 2, 2025
Genealogy tracking API fails to create parent-child links after batch serial import Siemens Opcenter Execution question , traceability , api-development , rest-api , json , genealogy-tracking , batch-import , missing-links , soc-4-1	5	December 8, 2025
Genealogy tracking API returns 429 Too Many Requests after cloud scaling DELMIA Apriso MES question , performance-opt , traceability , api-gateway , cloud-deploy , rest-api , dam-2023 , genealogy-tracking , rate-limiting	6	December 21, 2024
Genealogy traceability vs batch-level reporting: analytics design tradeoffs Rockwell FactoryTalk MES discussion , data-modeling , reporting-analytics , regulatory , genealogy-tracking , design-tradeoff , ft-10-0 , mes-analytics , compliance-vs-performance	4	January 15, 2026

Genealogy tracking API batch queries timing out when tracing

Related topics