Duplicate bug detection not working in defect-tracking module after bulk import

After performing a bulk import of 2,500 defects into our ALM 24 defect-tracking module, the duplicate bug detection has stopped working. We’re now seeing obvious duplicates being created that should have been flagged and consolidated.

The fuzzy matching configuration appears unchanged:


defect.duplicate.threshold=85
defect.matching.fields=summary,description
defect.similarity.algorithm=levenshtein

Before the bulk import, the system would catch duplicates with 85%+ similarity. Now identical defects with the same summary are being created as separate entries. The bulk import validation completed without errors, but something seems to have broken the duplicate detection logic. Has anyone experienced data quality degradation after large imports?
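For reference, the 85 threshold is a normalized Levenshtein similarity percentage. A minimal sketch of how such a score is typically computed (illustrative only, not the product's actual implementation):

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]

def similarity_pct(a: str, b: str) -> float:
    """Normalize the distance to a percentage: 100.0 means identical."""
    if not a and not b:
        return 100.0
    return 100.0 * (1 - levenshtein(a, b) / max(len(a), len(b)))

# Two near-identical summaries clear an 85% threshold:
assert similarity_pct("Login page crashes on submit",
                      "Login page crashes on Submit") >= 85
```

With this scoring, defects whose summaries differ by only a character or two land well above 85 and should be flagged, which is why the post-import behavior is suspicious.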

Look at the duplicate detection service logs specifically. After bulk imports, the service sometimes gets overloaded and switches to a degraded mode where it only checks exact matches instead of fuzzy matching. You might see log entries about threshold adjustments or algorithm fallbacks that explain why the Levenshtein similarity isn’t being calculated anymore.

I’ve seen this happen when the similarity index needs rebuilding. Large bulk imports can corrupt the index used for fuzzy matching. Try running the index rebuild utility from the admin console. It takes a while with 2,500+ defects, but it should restore duplicate detection functionality.

I’ve diagnosed this exact issue before. Here’s what happened and how to fix it:

Root Cause Analysis: Bulk imports of large defect sets (>2000 records) can overwhelm the duplicate detection service, causing it to enter a protective mode. The system logs show this as a threshold adjustment, but what actually happens is more significant.

Fuzzy Matching Configuration: Your configuration looks correct, but after bulk imports, the system temporarily increases the threshold to prevent false positives during the import flood:


defect.duplicate.threshold=85         # your configured setting
defect.duplicate.threshold.active=95  # actual runtime value

Check the active threshold value in the admin console under Defect Tracking > Detection Settings. If it’s higher than 85, that explains why obvious duplicates aren’t being caught.

Bulk Import Validation: The validation completing without errors is misleading. The bulk import process disables fuzzy matching during the import itself (for performance), which means:

  • Duplicates within the imported batch aren’t detected
  • The similarity index becomes fragmented
  • Post-import duplicate detection uses the fragmented index

Duplicate Consolidation: Here’s the fix process:

  1. Stop the duplicate detection service temporarily
  2. Reset the active threshold to match your configuration
  3. Rebuild the similarity index with full re-indexing:

alm-admin reindex --module=defects --mode=full --algorithm=levenshtein
  4. Run a retroactive duplicate scan on the imported defects:

alm-admin scan-duplicates --date-range=2025-10-18:2025-10-30 --action=flag

This will identify duplicates created during and after the import without auto-merging them.

Defect Lifecycle Management: Configure the duplicate detection to handle all workflow states, not just New defects. Add this to your configuration:


defect.duplicate.check.states=New,Open,InProgress,Reopen
defect.duplicate.merge.states=New,Reopen

This ensures detection works across states but only allows automatic merging for New and Reopened defects (to prevent data loss).
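The two-tier behavior those settings describe can be sketched as a small decision function (the state and setting names mirror the config above; the logic is illustrative, not the product's actual code):

```python
CHECK_STATES = {"New", "Open", "InProgress", "Reopen"}  # defect.duplicate.check.states
MERGE_STATES = {"New", "Reopen"}                        # defect.duplicate.merge.states

def handle_candidate(defect_state: str, similarity: float,
                     threshold: float = 85.0) -> str:
    """Decide what to do with a potential duplicate: detection runs for
    all check states, but auto-merge is restricted to safe states."""
    if defect_state not in CHECK_STATES or similarity < threshold:
        return "ignore"
    return "auto-merge" if defect_state in MERGE_STATES else "flag-for-review"

assert handle_candidate("New", 92.0) == "auto-merge"
assert handle_candidate("InProgress", 92.0) == "flag-for-review"
assert handle_candidate("Closed", 99.0) == "ignore"
```

Flagging rather than merging in-progress defects preserves their history and assignments while still surfacing the duplication.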

Prevention for Future Imports: For bulk imports exceeding 1000 defects, use the staged import mode:


alm-import --file=defects.csv --mode=staged --batch-size=500 --enable-duplicate-check

This imports in smaller batches with duplicate detection enabled between batches, preventing the issue from recurring.
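The staged-mode behavior can be approximated as follows (a minimal sketch assuming a pluggable duplicate predicate; function and parameter names are illustrative, not the importer's API):

```python
def staged_import(records, batch_size=500, is_duplicate=None):
    """Import in batches, checking each incoming record against
    everything already imported so duplicates are flagged, not created."""
    imported, flagged = [], []
    for start in range(0, len(records), batch_size):
        for rec in records[start:start + batch_size]:
            if is_duplicate and any(is_duplicate(rec, old) for old in imported):
                flagged.append(rec)   # flagged instead of created
            else:
                imported.append(rec)
    return imported, flagged

# Usage with a trivial exact-summary check standing in for fuzzy matching:
records = [{"summary": "A"}, {"summary": "B"}, {"summary": "A"}]
done, dupes = staged_import(
    records, batch_size=2,
    is_duplicate=lambda a, b: a["summary"] == b["summary"])
assert len(done) == 2 and len(dupes) == 1
```

Because each batch is validated against the already-imported set, the index never has to absorb thousands of unchecked records at once.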

After following these steps, your duplicate detection should return to normal operation within 2-4 hours as the index stabilizes.

Check if the bulk import bypassed the duplicate detection service. When imports are done in batch mode, they sometimes skip validation steps for performance reasons. You might need to run a post-import duplicate consolidation process to catch the duplicates that were created during the import.

Another thing to verify: was the bulk import done with a regular user account or a service account? Some accounts have permissions that intentionally bypass duplicate checking, which can affect subsequent operations if that account remains the active context for the defect module.

Ran the index rebuild overnight, but duplicate detection still isn’t working. I’m noticing that defects created before the bulk import are being matched correctly, but anything created after the import (whether manual or bulk) doesn’t trigger duplicate detection. Could the import have changed some system-level configuration?

Check the defect lifecycle management settings. If the bulk import included defects in various workflow states, it might have triggered a safeguard that disables duplicate detection for non-New status defects. This is a common issue when importing historical defects that are already closed or in progress.