Fine-tuned VLM extracts BOM data from scanned manufacturing drawings

We automated BOM extraction from scanned engineering drawings using a fine-tuned Vision Language Model integrated with Blue Yonder Luminate 2023.2. Previously, planners manually transcribed component data from PDFs and images, a process that took 2-3 hours per complex assembly.

Our VLM fine-tuning approach uses a training set of 500+ annotated manufacturing drawings. The model extracts part numbers, quantities, descriptions, and hierarchical relationships, then outputs structured JSON matching our ERP schema. We validate extracted data against Blue Yonder’s manufacturing planning module requirements before ingestion.
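As a rough illustration of what that intermediate payload looks like (the field names, part numbers, and confidence values below are invented for the example, not our actual ERP schema):

```python
import json

# Illustrative shape of an extracted BOM payload. Each item carries the
# attributes the model reads off the drawing plus a parent link that
# encodes the assembly tree and a per-item extraction confidence.
extracted_bom = {
    "drawing_id": "DWG-1042",  # hypothetical identifier
    "items": [
        {"part_number": "ASM-100", "quantity": 1,
         "description": "Main assembly", "parent": None, "confidence": 0.97},
        {"part_number": "SUB-110", "quantity": 2,
         "description": "Bracket sub-assembly", "parent": "ASM-100", "confidence": 0.93},
        {"part_number": "P-1101", "quantity": 8,
         "description": "M6 hex bolt", "parent": "SUB-110", "confidence": 0.99},
    ],
}

# Serialize for the downstream validation step.
payload = json.dumps(extracted_bom, indent=2)
print(payload)
```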

The system processes drawings in Swift (our custom automation layer), handles multi-page documents, and maintains 94% extraction accuracy. Integration with BY’s planning workflows reduced BOM entry time by 85% and eliminated transcription errors that previously caused material shortages.

Excellent implementation case study. Let me offer a technical perspective on the critical success factors here.

VLM Fine-Tuning Strategy: Your approach of combining real-world drawings with targeted augmentation is optimal. The 500+ base dataset with hierarchical labeling addresses the core challenge: manufacturing BOMs aren’t flat lists but structured trees. Training the model to recognize parent-child relationships through visual cues (indentation, connector lines, assembly bubbles) is what elevates this beyond simple OCR. The 40-hour training investment with multimodal transformers gives you domain-specific understanding that generic vision models lack. Consider expanding your augmentation to include different CAD software outputs (AutoCAD vs SolidWorks styles) if you handle multi-source drawings.

Document-to-JSON Extraction Pipeline: Your two-stage architecture (VLM→intermediate JSON→BY schema) is the right pattern. Direct VLM-to-ERP mapping creates brittle integrations. The intermediate format gives you flexibility to handle schema evolution in Blue Yonder updates without retraining the model. Your confidence thresholding at 85% with a manual review queue balances automation efficiency with data quality, which is critical in manufacturing, where BOM errors cascade into material procurement and production scheduling failures. The 94% overall accuracy you’re achieving exceeds industry benchmarks for document extraction (typically 85-90% for structured forms).
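The thresholded routing can be sketched in a few lines; the 85% cutoff comes from the write-up, while the BY-style record fields (`ITEM_ID`, `QTY_PER`, and so on) are placeholders, not Blue Yonder’s actual schema:

```python
# Stage two of the pipeline: items at or above the confidence threshold
# are mapped into an ERP-shaped record; the rest go to a review queue.
CONFIDENCE_THRESHOLD = 0.85

def to_by_record(item):
    """Map one intermediate-JSON item to a BY-style record (field names illustrative)."""
    return {
        "ITEM_ID": item["part_number"],
        "QTY_PER": item["quantity"],
        "DESCRIPTION": item["description"],
        "PARENT_ITEM": item.get("parent"),
    }

def route_items(items):
    """Split extracted items into auto-ingest records and a manual review queue."""
    auto, review = [], []
    for item in items:
        if item["confidence"] >= CONFIDENCE_THRESHOLD:
            auto.append(to_by_record(item))
        else:
            review.append(item)
    return auto, review

items = [
    {"part_number": "P-1101", "quantity": 8, "description": "M6 hex bolt",
     "parent": "SUB-110", "confidence": 0.99},
    {"part_number": "P-1102", "quantity": 4, "description": "Washer",
     "parent": "SUB-110", "confidence": 0.62},  # smudged scan, low confidence
]
auto, review = route_items(items)
print(len(auto), len(review))  # 1 1
```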

ERP Schema Adherence: The validation layer checking required fields, data types, and BY-specific conventions (hierarchical level encoding) prevents the silent data corruption that plagues automated integrations. Manufacturing planning modules are particularly sensitive to malformed BOMs: missing UOM fields or incorrect parent linkages break MRP calculations. Your approach of validating against Blue Yonder’s manufacturing planning API schema ensures compatibility with downstream processes like material requirements planning, capacity scheduling, and shop floor execution.
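A minimal sketch of such a validation gate, with hypothetical field names and rules standing in for the real BY requirements (required fields present, types correct, parent references resolving within the document):

```python
# Pre-ingestion validation gate: collect every violation rather than
# failing fast, so reviewers see the full picture for a drawing.
REQUIRED = {"part_number": str, "quantity": int, "description": str}

def validate_bom(items):
    """Return a list of human-readable validation errors (empty = clean)."""
    errors = []
    known = {i.get("part_number") for i in items}
    for idx, item in enumerate(items):
        for field, typ in REQUIRED.items():
            if field not in item:
                errors.append(f"item {idx}: missing {field}")
            elif not isinstance(item[field], typ):
                errors.append(f"item {idx}: {field} should be {typ.__name__}")
        parent = item.get("parent")
        if parent is not None and parent not in known:
            errors.append(f"item {idx}: unknown parent {parent}")
    return errors

good = [{"part_number": "A", "quantity": 1, "description": "root", "parent": None}]
bad = [{"part_number": "B", "quantity": "2", "description": "bolt", "parent": "A"}]
print(validate_bom(good))  # []
print(validate_bom(bad))   # type error plus dangling parent reference
```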

Operational Impact: 85% reduction in BOM entry time translates to significant planner productivity gains, but the elimination of transcription errors is the bigger win. Manual BOM entry errors cause material shortages (stockouts during production runs), excess inventory (over-ordering due to quantity mistakes), and schedule delays (rework when assemblies don’t match). Your 12% manual review rate for edge cases is excellent: you’ve automated the routine 88% while preserving quality controls for exceptions.

Scaling Considerations: As you expand, monitor model drift; manufacturing drawing standards evolve, and periodic retraining with new examples maintains accuracy. Consider implementing active learning where manual review corrections automatically feed back into training data. Also explore multi-language support if you operate globally; technical drawings often mix languages in annotations.
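That feedback loop can be as simple as capturing each reviewer correction as a fresh labeled example for the next retraining run (names below are illustrative):

```python
# Active-learning capture: only corrections that actually changed the
# model's output are worth adding to the retraining set.
retraining_set = []

def record_correction(drawing_id, model_output, corrected_output):
    """Store a reviewer-corrected extraction as a new training example."""
    if model_output != corrected_output:
        retraining_set.append({"drawing": drawing_id, "label": corrected_output})

record_correction("DWG-1042", {"quantity": 6}, {"quantity": 8})  # reviewer fixed qty
record_correction("DWG-1043", {"quantity": 4}, {"quantity": 4})  # no change, skipped
print(len(retraining_set))  # 1
```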

This is a strong example of applied ML in supply chain operations, demonstrating how domain-specific fine-tuning delivers practical business value beyond generic AI capabilities.

What about edge cases such as handwritten annotations, poor scan quality, or non-standard drawing formats? Manufacturing floors often have legacy documents that don’t follow current CAD standards. Does your system handle these gracefully, or does it require preprocessing?

How does your document-to-JSON extraction maintain consistency with Blue Yonder’s schema requirements? We’ve struggled with field mapping mismatches when integrating external data sources. Do you use a validation layer, or does the VLM output directly conform to BY’s expected structure?

We started with a multimodal transformer base and fine-tuned using 500 real drawings plus 200 augmented variations (rotations, quality degradation, annotation styles). Training took about 40 hours on a GPU cluster. The key was labeling hierarchical BOM structures (parent assemblies, sub-assemblies, individual components) so the model learns relational context, not just isolated part numbers. We also trained it to recognize standard drawing symbols and callouts specific to our industry.
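To show why the relational labeling matters, here is a sketch of reconstructing the assembly tree from level-encoded rows the way a drawing’s BOM table typically lists them (levels and part numbers invented for illustration):

```python
# Stack-based tree reconstruction: each row's indentation level decides
# which open assembly it attaches to.
def build_tree(rows):
    """rows: [(level, part_number)] in drawing order -> nested dict tree."""
    root = {"part": None, "children": []}
    stack = [(-1, root)]  # sentinel so level-0 rows attach to the root
    for level, part in rows:
        node = {"part": part, "children": []}
        # Pop back to the nearest ancestor that is shallower than this row.
        while stack and stack[-1][0] >= level:
            stack.pop()
        stack[-1][1]["children"].append(node)
        stack.append((level, node))
    return root

rows = [(0, "ASM-100"), (1, "SUB-110"), (2, "P-1101"), (1, "SUB-120")]
tree = build_tree(rows)
print(tree["children"][0]["part"])  # ASM-100
```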

Great question. We do basic preprocessing (deskewing, contrast enhancement, noise reduction), but the VLM handles moderate quality variations well thanks to the augmented training data. Handwritten annotations are trickier; we trained specifically on those, but accuracy drops to around 78%. For legacy formats, we maintain a fallback queue where low-confidence extractions get flagged for human verification. About 12% of documents need this manual check, which is still far better than the 100% manual entry we had before.
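As a toy illustration of the contrast-enhancement step (a real pipeline would use an imaging library such as OpenCV; this only shows the min-max stretch idea on a strip of 8-bit grayscale pixels):

```python
# Min-max contrast stretch: remap the narrow intensity range of a faded
# scan onto the full 0-255 range.
def stretch_contrast(pixels):
    lo, hi = min(pixels), max(pixels)
    if hi == lo:               # flat image: nothing to stretch
        return list(pixels)
    return [round((p - lo) * 255 / (hi - lo)) for p in pixels]

faded_scan = [100, 110, 120, 130]    # low-contrast strip of a faded scan
print(stretch_contrast(faded_scan))  # [0, 85, 170, 255]
```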