Structured Representation of Simulation and Annotation Data for Machine Learning in Forming Technologies
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The use of machine learning (ML) in manufacturing requires structured, especially standardized, access to both simulation data and domain knowledge. This paper introduces a JSON-based data format for representing synthetic force-time series alongside expert annotations. The schema captures simulation metadata, tool and material parameters, and allows explicit expert knowledge, such as failure indicators, to be linked to signal segments.The proposed structure enables process-aware ML methods that leverage both domain knowledge and raw data for improved learning and generalization. A deep drawing use case illustrates how the format facilitates knowledge-guided learning. The approach aims to bridge the gap between real and simulated production data, supporting scalable integration in modern manufacturing systems.