We have an automation script that creates non-conformance records from incoming quality events in our manufacturing execution system. The script works fine for individual events, but when multiple non-conformance events are logged in rapid succession during batch imports, duplicate records are created.
Here’s a simplified version of our current logic:
for (QualityEvent event : eventList) {
    NonConformance nc = createNonConformance(event);
    nc.setEventId(event.getId());
    save(nc);
}
The event identifier usage should prevent duplicates, but somehow the same event is generating multiple non-conformance records. I suspect there’s a timing issue with the deduplication logic where the script checks for existing records before the previous save operation completes. This is causing major reporting confusion as our metrics are inflated with duplicate entries.
Has anyone dealt with similar concurrency issues in VVQ 23R2 automation scripts?
Here’s a comprehensive solution that addresses all three focus areas:
Deduplication Logic: Implement a robust deduplication strategy using a combination of database-level unique constraints and application-level checks. First, ensure your non-conformance object has a unique index on the event_id field at the database level. This provides a hard guarantee against duplicates regardless of timing issues.
Event Identifier Usage: Enhance your event identifier strategy to include both the source event ID and a timestamp hash. This creates a compound key that’s truly unique:
String uniqueKey = event.getId() + "_" + generateHash(event.getTimestamp());

if (!recordExists(uniqueKey)) {
    NonConformance nc = createNonConformance(event);
    nc.setUniqueIdentifier(uniqueKey);
    nc.setEventId(event.getId());
    save(nc);  // the original snippet omitted the save, so no record was persisted
}
Automation Script Update: Refactor your script to use a try-catch pattern that gracefully handles duplicate key violations:
for (QualityEvent event : eventList) {
    try {
        String uniqueKey = buildUniqueKey(event);
        if (isNewEvent(uniqueKey)) {
            NonConformance nc = createNC(event, uniqueKey);
            save(nc);
        }
    } catch (DuplicateKeyException e) {
        logDuplicateAttempt(event.getId());
    }
}
Additionally, implement a post-processing cleanup job that runs after batch imports to identify and merge any duplicates that slipped through. This job should compare non-conformance records created within the same time window (e.g., last 5 minutes) and consolidate any that share the same event ID, keeping the first created record and archiving the duplicates.
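The core of such a cleanup job, grouping recent records by event ID and selecting everything except the earliest-created record for archiving, could be sketched like this. The `NcRecord` type and its fields are illustrative stand-ins, not Vault SDK classes:

```java
import java.time.Instant;
import java.util.*;
import java.util.stream.Collectors;

// Hypothetical minimal record type; field names are illustrative, not Vault API.
class NcRecord {
    final String id;
    final String eventId;
    final Instant createdAt;
    NcRecord(String id, String eventId, Instant createdAt) {
        this.id = id; this.eventId = eventId; this.createdAt = createdAt;
    }
}

public class DedupCleanup {
    // Returns the IDs of duplicate records to archive: for each event ID,
    // every record except the earliest-created one survives the skip(1).
    static List<String> duplicatesToArchive(List<NcRecord> recentRecords) {
        return recentRecords.stream()
            .collect(Collectors.groupingBy(r -> r.eventId))
            .values().stream()
            .flatMap(group -> group.stream()
                .sorted(Comparator.comparing((NcRecord r) -> r.createdAt))
                .skip(1))                       // keep the first-created record
            .map(r -> r.id)
            .collect(Collectors.toList());
    }
}
```

In a real job, `recentRecords` would come from a query restricted to the batch-import time window, and the returned IDs would feed the archive step.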
For immediate remediation, run a data cleanup script to identify and remove existing duplicates based on event ID, keeping only the earliest created record for each unique event. Then deploy the updated automation script with proper deduplication logic to prevent future occurrences.
I tried adding a query check, but the duplicates are still appearing occasionally. I think the problem is that the query executes before the previous save commits to the database. Is there a way to force synchronous processing or implement a proper locking mechanism in Veeva Vault automation scripts?
We had the exact same problem. The solution was to add a unique constraint check before the save operation. Query the system for any non-conformance records with the same event ID, and only proceed with creation if the query returns zero results. Also consider adding a small delay between batch operations to reduce the likelihood of concurrent execution.
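The check-then-create pattern described above could be sketched as follows, with an in-memory set standing in for the Vault query on event ID (in a real script this would be a VQL count query). Note that without a database-level unique constraint this check-then-act sequence is still racy under true concurrency, which is why the small delay helps but does not fully eliminate duplicates:

```java
import java.util.HashSet;
import java.util.Set;

public class CheckThenCreate {
    // Stand-in for the record store; a real implementation would query Vault.
    private final Set<String> existingEventIds = new HashSet<>();

    // Returns true if a record was created, false if one already existed.
    public boolean createIfAbsent(String eventId) {
        if (existingEventIds.contains(eventId)) {
            return false;                  // query found an existing record
        }
        existingEventIds.add(eventId);     // stand-in for save(nc)
        return true;
    }
}
```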
You need to implement idempotency in your script. Instead of just checking if a record exists, use a two-phase approach: first, try to create a placeholder record with just the event ID in a locked state, then update it with full details. If the placeholder creation fails due to a duplicate key, you know another process is already handling that event. This pattern works well for high-volume batch processing scenarios where concurrent execution is unavoidable.
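A minimal sketch of this two-phase claim-then-complete pattern, using `ConcurrentHashMap.putIfAbsent` as a stand-in for the atomic placeholder insert (in Vault, the failed duplicate-key insert plays the role of the losing `putIfAbsent`):

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public class PlaceholderIdempotency {
    // Stand-in for the record store; keys are event IDs, values are record
    // states. In Vault the value would be the placeholder record itself.
    private final ConcurrentMap<String, String> records = new ConcurrentHashMap<>();

    // Phase 1: claim the event by inserting a locked placeholder.
    // putIfAbsent is atomic, so exactly one caller wins per event ID.
    public boolean claim(String eventId) {
        return records.putIfAbsent(eventId, "LOCKED") == null;
    }

    // Phase 2: the winning caller fills in full details, releasing the lock.
    public void complete(String eventId, String details) {
        records.replace(eventId, "LOCKED", details);
    }
}
```

Losers of the `claim` call simply skip the event, knowing another process owns it.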
Another approach is to modify your batch import process to deduplicate events before they reach the automation script. Implement a staging area where incoming events are collected and deduplicated based on event ID, then process only unique events through your automation. This shifts the responsibility upstream and ensures your automation script receives clean, unique data to begin with.
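The staging-area deduplication itself can be as simple as an order-preserving set over the incoming event IDs, sketched here under the assumption that events arriving later with the same ID are true duplicates:

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;

public class StagingDedup {
    // Deduplicates a batch of event IDs while preserving arrival order,
    // so the downstream automation script sees each event exactly once.
    static List<String> uniqueEvents(List<String> incomingEventIds) {
        return new ArrayList<>(new LinkedHashSet<>(incomingEventIds));
    }
}
```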