Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression

Mattera, Giulio; Piscopo, Gianfranco; Longobardi, Maria; Giacalone, Massimiliano; Nele, Luigi

doi:10.3390/math12162559

Open AccessArticle

Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression

by

Giulio Mattera

¹

,

Gianfranco Piscopo

²,

Maria Longobardi

²

,

Massimiliano Giacalone

^3,*

and

Luigi Nele

¹

Department of Chemical, Materials and Production Engineering, University of Naples “Federico II”, 80125 Naples, Italy

²

Department of Mathematics and Applications “Renato Caccioppoli”, University of Naples “Federico II”, 80125 Naples, Italy

³

Department of Economics, University of Campania “Luigi Vanvitelli”, 81043 Capua, CE, Italy

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(16), 2559; https://doi.org/10.3390/math12162559

Submission received: 15 July 2024 / Revised: 10 August 2024 / Accepted: 16 August 2024 / Published: 19 August 2024

(This article belongs to the Section Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

:

Wire Arc Additive Manufacturing (WAAM) represents a disruptive technology in the field of metal additive manufacturing. Understanding the relationship between input factors and layer geometry is crucial for studying the process comprehensively and developing various industrial applications such as slicing software and feedforward controllers. Statistical tools such as clustering and multivariate polynomial regression provide methods for exploring the influence of input factors on the final product. These tools facilitate application development by helping to establish interpretable models that engineers can use to grasp the underlying physical phenomena without resorting to complex physical models. In this study, an experimental campaign was conducted to print steel components using WAAM technology. Advanced statistical methods were employed for mathematical modeling of the process. The results obtained using linear regression, polynomial regression, and a neural network optimized using the Tree-structured Parzen Estimator (TPE) were compared. To enhance performance while maintaining the interpretability of regression models, clusterwise regression was introduced as an alternative modeling technique along with multivariate polynomial regression. The results showed that the proposed approach achieved results comparable to neural network modeling, with a Mean Absolute Error (MAE) of 0.25 mm for layer height and 0.68 mm for layer width compared to 0.23 mm and 0.69 mm with the neural network. Notably, this approach preserves the interpretability of the models; a further discussion on this topic is presented as well.

Keywords:

clustering regression; multivariate regression; applied statistics; additive manufacturing; neural networks; machine learning

MSC:

62P30

1. Introduction

Additive Manufacturing (AM) occupies a crucial role in the Industry 4.0 production paradigm (see Figure 1) [1]. AM involves constructing components by incrementally adding material layer-by-layer, enabling the realization of components with complex geometries that conventional methods find challenging to replicate. Furthermore, AM expedites the prototyping process, decreasing time-to-market for new products. Its capacity to minimize waste by utilizing materials only where required promotes sustainability as well.

Among AM technologies, Wire Arc Additive Manufacturing (WAAM) [2,3] (see Figure 2) has garnered significant attention from the scientific community due to its capability to rapidly construct large metal components at reduced costs. This process, also known as Gas Metal Arc Welding Rapid Prototyping (GMAW-RP), is based on depositing metals layer-by-layer in the form of droplets along a path defined by a slicer using the arc welding process [4]. Welding automation systems are utilized in this process, offering flexibility and a large working volume. Typically, an anthropomorphic six-axis robotic motion platform holds a welding torch and a welding machine that generates arc when the wire contacts the deposition substrate. This approach produces near-net-shape components that often require postprocessing, such as machining or thermal treatments, to meet the final tolerances specified by project requirements and mitigate residual stresses, which are typical defects in the parts [5].

Nowadays, the abundance of resources dedicated to the development of AI applications has facilitated its widespread integration across various industrial applications, including modeling [6,7,8,9], control [10,11,12], optimization [13,14,15], and especially monitoring [16,17,18,19,20,21,22,23]. AI has emerged as a prominent tool, particularly in the modeling and monitoring tasks, with supervised learning approaches predominantly favored due to their demonstrated effectiveness and reliability in achieving desired outcomes [24,25,26,27,28]. These approaches leverage labeled datasets to train models, enabling them to accurately identify patterns and make informed decisions based on past observations. As previously mentioned, one of the most promising applications in the field of AM is predictive modeling. This approach allows for the generation of models that are valuable for simulating what-if scenarios and other applications, especially considering the intricate thermomechanical nature of metal transformation that would otherwise require complex physical models [29]. Due to the complexity, high resource demands, and time-consuming nature of modeling physical phenomena using methods such as finite element analysis, AI is emerging as an alternative tool. However, given the “black-box” nature of the data-driven methods they employ, AI models face challenges related to the low interpretability of their results compared to traditional “white-box” approaches. One of the most important application of process modeling in WAAM is the slicer software, as the geometry of the deposited layers (output) heavily depends on various input factors such as the employed process parameters, welding technique, wire diameter, and material composition utilized for component fabrication [11,30]. When a CAD geometry is obtained, the input factors can be used to determine key layer geometry parameters such as width, height, and overlapping distance (d) [31], as illustrated in Figure 3. Additionally, if a part has a height of N mm, the obtained layer height (h) can be used to determine the required number of slices and generate the path planning for the deposition of subsequent layers by adjusting the position of the robot. Selection of the correct process parameters, which depend on the model employed to correlate the layer geometry and input factors, is important; an incorrect parameterization, as may result from using an inaccurate model to estimate layer height, can lead to error accumulation during part production, resulting in defects such as spatters, humping, and porosity. In fact, if a layer height of h is estimated and a

Δ E

exists, it can accumulate during the deposition until the distance between the torch and the workpiece is different from the planned one, leading to an unstable printing process [32].

As discussed above, estimating the layer geometry with the lowest possible error is crucial to reducing the presence of defects within components realized by WAAM technology. Despite this, several process parameters (which represent the input factors in the regression task) are utilized in WAAM, such the wire feed speed (WFS), welding speed (WS), welding voltage (V), gas flow rate (GFR), inter-pass temperature (T), and contact-to-workpiece distance (CTWD). Each of these parameters influences the layer geometry in opposite ways and with different magnitudes, complicating the parameter selection process [15]. A review of the literature on this topic reveals that researchers have explored various methodologies aiming to model this complex input–output relationship, including statistical methods such as polynomial regression [33] and machine learning techniques such as Support Vector Machine (SVM) [34], neural networks [35], and Advanced Regression Tree (ART) methods [36], which represent the state of the-art. However, while these machine learning methods offer promise in capturing complex relationships, their black-box nature often complicates the interpretability of the resulting models [37]. Understanding how each parameter influences the final components becomes challenging, hindering the ability to optimize the WAAM process effectively. Therefore, the present study aims to address these limitations by conducting a statistical analysis of the layer geometry obtained through depositing layers in carbon steel under different process parameters. The research workflow followed in this work is shown in Figure 4.

In particular, in this work we explore whether the combined use of highly interpretable clustering and regression models leveraging process knowledge can enhance modeling performance without compromising accuracy. After collecting and preprocessing data from an experimental campaign, we compare the results of the proposed technique with alternative modeling approaches from the literature, including polynomial regression, linear regression, random forest, and neural networks, allowing for a methodological comparison with other studies. The rest of this paper is organized as follows: Section 2 delves into the data utilized in the study and outlines the methodology employed for statistical analysis; Section 3 presents the key findings; and Section 4 concludes the paper by emphasizing the significance of the findings and suggesting future research directions.

2. Materials and Methods

2.1. Experimental Setup

In the conducted experimental campaign, data were collected while depositing layers through the WAAM process, specifically employing a stainless steel SS308LSi wire with a diameter of 1.2 mm and a constant-voltage welding process. This process, shown in Figure 5, consists of applying a constant voltage to the welding machine; the raw material in wire form is deposited into the melting pool via electromagnetic force applied to the droplet [38].

For the deposition, a six-axis Omron Adept Vipers850 robot was employed as a motion platform, while the welding equipment comprised a Miller Pheonix 456 welding machine. The final bead geometry at each layer was measured using a Lext 5100 laser scanning confocal microscope. The workflow employed to collect the data is reported in Figure 6, and allowed to obtain the necessary labels for the supervised learning task.

The experimental campaign employed a range of process parameters or input factors, including eleven levels of WFS (2.0, 2.5, 2.8, 3.0, 3.6, 4.0, 4.5, 5.2, 6, 6.5, and 7 mm/min), eight levels of WS (90, 130, 180, 250, 450 550, 700, 760 mm/min), four levels of CTWD (9, 12, 15, 20 mm), and eight levels of welding voltage (14.7, 16, 17.5, 18, 20, 22, 26, 30 V), while the interpass temperature and GFR were maintained fixed to the respective values of 100 °C and 18 L/min. However, certain combinations of parameters, such as low wire feed speed and high welding voltage, resulted in defective parts, leading to their exclusion from the conducted analysis, which comprised a total of 174 samples.

2.2. Multivariate Polynomial Regression

Regression analysis is a statistical modeling technique used to describe the relationship between a dependent continuous variable y (output) and one or more independent variables x (input or features) in the form of a mathematical relationship, as reported in Equation (1); different approaches can be used to estimate the parameters

θ

.

y = f (θ, x)

(1)

The most common form of regression is linear regression, where the relationship between the independent variables and the dependent variable is assumed to be linear, as in Equation (2).

y = W \cdot x

(2)

Because regression is a supervised learning approach, the weights W (M × N) multiplied by the features × (N) can be estimated using the knowledge of the desired outcome y (M). Ordinary Least Squares (OLS) estimation, Lp estimation [39,40,41], gradient descent optimization [42], or other optimization techniques [43] can be used within this scope. Although linear regression is highly interpretable and simple to train via OLS, it is limited to capturing linear relationships between data and it does not consider the correlation between variables, also known as multicollinearity. Lp-norm regression or advanced gradient-based techniques can be used to avoid this problem.

To address linear relationships between data, polynomial regression can be employed as expressed in Equation (3). Polynomial regression assumes an n-th degree polynomial model between the independent variables and the dependent variables. Unlike linear regression, which assumes a linear relationship, polynomial regression can capture nonlinear patterns, offering a more adaptable model for the data.

y = β_{0} + β_{1} x + β_{2} x^{2} + \dots + β_{n} x^{n} + ϵ = Wx

(3)

In the above equation, y is the dependent variable, x is the independent variable,

β_{0}, β_{1}, β_{2}, \dots, β_{n}

are the regression coefficients, n is the degree of the polynomial, and

ϵ

represents the error term. The degree of the polynomial determines the complexity of the relationship that can be captured. Higher-degree polynomials can fit the data more closely but may risk overfitting, especially with limited data. It is essential to strike a balance between model complexity and the risk of overfitting. As in standard linear regression, several methods can be used to estimate the parameters

β

. If no correlation between inputs is assumed, the most popular method is OLS estimation; the parameters can be found as reported in Equation (4).

W = {(x^{T} x)}^{- 1} \cdot y \cdot x^{T}

(4)

2.3. Multicollinearity in Regression

Although polynomial modeling is a statistical tool largely employed to generate simple but effective predictive models, especially from an industrial point of view, as they can be deployed with reduced costs on Programming Logic Controllers (PLCs), it suffers from multicollinearity when the variables exhibit notable interdependence. The OLS method can be applied to find the weight of the regression as the correlation matrix approaches singularity. In a formal context, the diagonal elements of the inverse correlation matrix

{(X^{'} X)}^{- 1}

, corresponding to linearly dependent members of X, tend towards infinity. Consequently, accurately distributing the explained variance among variables becomes challenging, resulting in diminished quality of the parameter estimates and difficulty in discerning the independent contributions of explanatory variables. Several methods can be used to evaluate the degree of multicollinearity. Pairwise correlation is a common approach, with correlation coefficients exceeding 0.9 between features often indicating potential issues. However, the Variance Inflation Factor (VIF) is a more reliable metric, with values above 10 generally signaling significant multicollinearity [44]. Additionally, when employing linear regression models, the eccentricity of the confidence region can be used as an indicator of multicollinearity [45]. When detected, multicollinearity can be reduced. To improve model predictions via gradient-based techniques, meaningful and uncorrelated features are then employed to mitigate this effect. The Spearman correlation index (

ρ

) is computed as reported in Equation (5), where

d_{i}

represents the difference between the ranks of corresponding variables and n is the number of observations. It is a statistical measure used to assess the strength and direction of association between two ranked variables. Spearman correlation is valuable due to its independence from normality assumptions, making it applicable to diverse data types and serving as a correlation index for nonlinear data. By examining the index’s magnitude, it is possible identify low-correlated (0–0.25) and high-correlated (0.75–1) features within the dataset, which can then be selectively utilized or eliminated. This helps to address multicollinearity issues by ensuring that only uncorrelated features are employed to evaluate regression parameters.

ρ = 1 - \frac{6 \sum d_{i}^{2}}{n (n^{2} - 1)}

(5)

This approach is also beneficial for decreasing model complexity and mitigating overfitting while enhancing the ability of the model to generalize.

2.4. Modeling of WAAM Process and Proposal of This Work

In the realm of AM, modeling techniques represent an important tool for both planning and optimization. As discussed in the introduction, the current trend in the literature focuses on using black-box machine learning approaches to address the inherent complexity of AM process modeling [46] in light of the critical role of low estimation errors in planning and optimization tasks. However, while statistical methods can provide valuable insights by elucidating the `black-box’ nature of these models, their performance often falls short of the required standards. This gap has prompted researchers to explore new modeling solutions facilitated by the democratization of machine learning. In the field of WAAM, Xiong et al. [33] introduced a second-order polynomial regression utilizing the wire feed rate, welding speed, arc voltage, and CTWD as inputs, yielding results lower than those obtained by a neural network. Similarly, Naveen Srinivas et al. [47] applied a second-order regression model incorporating the gas flow rate and welding speed along with the wire feed speed for aluminum components. While exhibiting slightly lower performance, these studies have shown that polynomial regression methodologies generate more interpretable models compared to neural networks due to the transparency of their weight parameters. This study aims to demonstrate that combining clustering analysis with polynomial regression (clusterwise regression) can mitigate related problems such as multicollinearity while outperforming standard polynomial models and potentially surpassing the performance of complex black-box neural networks. By enhancing model interpretability, we seek to address a key challenge hindering the industrial adoption of AI [48].

2.5. Proposed Methodology

In a classic regression problem, the estimated parameters are those that minimize the global error. These are unique, and generally describe the general dependencies of the process. However, in complex processes such as WAAM, it is reasonable to expect nonlinear relationships between outputs and variables along with different behaviors in different features spaces. Regarding this particular case, different input factors lead to different metal transfer modes, which introduces changes in the heat input supplied to the material [49]. In Figure 7, it is possible observe the differences in waveform of both the current and voltage signals during the deposition process, which results in different heat inputs, different layer geometries, and the end of deposition.

In the context of a welding-based process such as WAAM, the utilization of cluster-wise regression becomes particularly advantageous due to the diverse impacts that different sets of input factors can have on the heat input delivered to the material and the subsequent heat accumulation. By identifying and accounting for these variations through clusterwise regression [50,51], it is possible to better capture the hidden relationships between input factors and welding outcomes. This not only reduces model complexity and guards against overfitting but also enhances the model’s capacity to generalize across various welding scenarios, leading to more robust and reliable predictive capabilities. Let us first consider a classical linear regression

y = X b + e,

(6)

with

y = (y_{i}; i = 1, \dots, I)

being the vector of the dependent variable of I units,

X = (x_{i j}; i = 1, \dots, I; j = 1, \dots, J)

an

I \times J

matrix of independent regressors,

b = (b_{j}; j = 1, \dots, J)

the vector of J regression coefficients, and

e = (e_{i}; i = 1, \dots, I)

the vector of i.i.d. disturbances. In this particular case,

y

is the vector of the layer geometry expressed by the layer width and height, while

X

is the vector of the input factors or their combination.

It is well known that under the assumption of the error term having a multivariate normal distribution and

E [e^{'} e] = σ^{2} I

, the parameters of the linear regression can be estimated by maximizing the following likelihood function:

L (y ∣ b, σ^{2}) = {(2 π σ^{2})}^{- 1 / 2} exp [- \frac{{(y - X b)}^{'} (y - X b)}{2 σ^{2}}] .

(7)

Following [52,53], we assume that the relationship between the variables of interest is heterogeneous at the cluster level; in other words, assuming the presence of K clusters, we have

b_{j k}

as the value of the j-th regression coefficient in the k-th cluster and

σ_{k}^{2}

as the variance of the error term of the k-th cluster for

j = 1, \dots, J

and

k = 1, \dots, K

. Given the process knowledge mentioned above, we assume the presence of two clusters associated with the different metal transfer modes.

Moreover, we assume that

y_{i}

is distributed as a finite sum or a mixture of conditional normal densities.

y_{i} = \sum_{k = 1}^{K} λ_{k} {(2 π σ_{k}^{2})}^{- 1 / 2} exp [\frac{- {(y_{i} - x_{i} b_{k})}^{2}}{2 σ_{k}^{2}}], i = 1, \dots, I .

(8)

In other words, we assume an independent sample of the observations’ dependent variable

y_{1}, y_{2}, \dots, y_{I}

drawn randomly from a mixture of conditional normal densities of underlying clusters in unknown proportions

λ_{1}, λ_{2}, \dots, λ_{K}

.

Given a sample of I independent subjects/observations, it is possible to form a log-likelihood expression

ln L = \sum_{i = 1}^{I} ln [\sum_{k = 1}^{K} λ_{k} {(2 π σ_{k}^{2})}^{- 1 / 2} exp [\frac{- {(y_{i} - X_{i} b_{k})}^{2}}{2 σ_{k}^{2}}]] .

(9)

Then, we can find the values of

λ_{k}, σ_{k}^{2}

, and

b_{j k}

that maximize the objective function given

K, y

and

X

, and subject to the following constraints:

\begin{matrix} 0 \leq λ_{k} \leq 1, \\ \sum_{k = 1}^{K} λ_{k} = 1, \\ σ_{k}^{2} > 0 . \end{matrix}

The algorithm for estimating the parameters and cluster compositions is explained in detail in [52]. In sum, the computation of the maximum likelihood estimates is obtained by means of the EM algorithm. For given starting values of the parameters, the expectation (E phase) and maximization (M phase) steps of this algorithm are alternated until convergence. Another important aspect is that the maximization of the log-likelihood is conditioned to the a priori specification that the number of clusters

K = 2

.

2.6. Preprocessing and Workflow

The WAAM process is characterized as a multiple-input multiple-output system where the input parameters represent process variables, namely, the Wire Feed Speed (WFS), Welding Speed (WS), Welding Voltage (V), and Contact-to-Workpiece Distance (CTWD), while the outputs correspond to the dimensions of the printed layers, such as the layer width (w) and layer height (h).

As discussed previously, linear regression is not suitable for modeling such systems due to the inherent nonlinearity related to the underlying complex physical phenomena. Additionally, the physical correlation between output variables further complicates the linear modeling approach, which in state-of-art methods consists of assuming two independent equations. Therefore, given the nonlinear nature and correlated outputs, nonlinear regression methods are more appropriate for modeling the WAAM process. In this work, a multivariate polynomial regression with four inputs and two outputs is used as modeling technique, in which we assume no inherent correlation between the output variables. We consider a second-order combination of the input features without considering the intercept, for a total of fourteen features. These features are represented as

x_{i}, x_{j}, x_{i} \cdot x_{j}

, where i and j range from 0 to 3, resulting in a comprehensive set of predictors for modeling the WAAM process, obtaining the model in Equation (10)

\begin{matrix} y & = \sum_{i = 0}^{3} β_{i} x_{i} + \sum_{i = 0}^{3} \sum_{j = i}^{3} β_{i j} x_{i} x_{j}, \end{matrix}

(10)

where:

$β_{i}$ (Linear Coefficients): These coefficients represent the change in the dependent variable for a one-unit change in the corresponding independent variable $x_{i}$ while holding all other variables constant.
$β_{i j}$ (Interaction Coefficients): These coefficients represent the change in the dependent variable resulting from the interaction between the independent variables $x_{i}$ and $x_{j}$ whileholding all other variables constant.

With the model obtained, the OLS was used to estimate the input factors after the feature selection process, for which we used the Spearman correlation index. The final proposed methodology is described in Figure 8. Using the defined set of input factors, the features were extracted using a second-order polynomial. With the aim of mitigating multicollinearity, these features were selected based on the Spearman index to ensure that only low-correlated features were used for regression. The training data were first grouped into clusters using a Gaussian Mixture Model (GMM), then each cluster was individually analyzed using regression models. These regression models represent the relationship between the input factors (inputs) and the target variables (outputs) within the different cluster groups.

The results obtained with the proposed methodology were then compared with other methods mentioned above, such as deep neural networks, linear regression, and polynomial regression. Furthermore, the analysis of both clusters and regression coefficients allows for better analysis of the process and can be easily developed into PLC, as previously discussed.

2.7. Comparison with Optimized Neural Network

Deep Neural Networks (DNNs) represent a powerful class of machine learning models, characterized by their bio-inspired layered architecture of interconnected neurons; these models transform input data through a sequence of nonlinear operations leveraging the gradient descent optimization algorithm, which enables DNNs to generalize complex relationships within continuous functions [42,54]. While they are widely recognized for their performance, especially for supervised models, the training process of DNNs can be challenging due to the significant number of hyperparameters associated with the architecture and training, including:

Number of layers, which determines the depth of the neural network.
Number of neurons per layer, which influences the capacity and complexity of the model.
Learning rate, which controls how much to adjust the model in response to the estimated error each time the model weights are updated.
Activation functions, which are nonlinear functions applied to each neuron’s output to allow the network to model complex relationships in the data.
Batch size, referring to the number of samples processed before the model is updated.
Dropout rate, referring to the probability of randomly ignoring a neuron during training, which helps to prevent overfitting.

Selecting optimal hyperparameters is crucial, as they significantly impact the performance, training time, and generalization ability of DNNs. Manually tuning hyperparameters is a labor-intensive and iterative process that requires domain expertise and considerable computational resources. Traditional methods such as grid search and random search [55,56] are exhaustive and may not efficiently explore the vast hyperparameter space, leading to suboptimal performance. In this work, the Tree-structured Parzen Estimator (TPE) [57] optimization technique is employed to automatically find the optimal hyperparameters, resulting in a reduced validation estimation error when predicting the layer geometry in WAAM. The results suggests that the optimal architecture should contain one hidden layer with 128 neurons and a sigmoid activation function as the hidden layer activation, a learning rate of

α = 0.00778

, a batch size of one step per epoch with no dropout, and training for 600 epochs.

3. Results and Discussion

3.1. Data Preprocessing

Starting from the four input factors of Wire Feed Speed (

x_{0}

), Welding Speed (

x_{1}

), Contact-To-Workpiece Distance (

x_{2}

), and Welding Voltage (

x_{3}

), fourteen features were extracted as described in the previous section to develop the second order multivariate polynomial regression. The Spearman correlation index (

ρ

) was computed by conducting Spearman correlation analysis; the associated color map is shown in Figure 9. From these computed correlations, only those features that presented a low correlation with the others were maintained, thereby avoiding problems related to multicollinearity. At the end of preprocessing, the preserved features were

x_{0}, x_{1}, x_{2}, x_{0}^{2}, x_{1}^{2}, x_{o} \cdot x_{1}, x_{0} \cdot x_{3}, x_{1} \cdot x_{2}, x_{1} \cdot x_{3}

, for a total of nine features. The extracted features were normalized and scaled in the range of 0–1 using Equation (11) for each feature j of the dataset.

x [:, j] = \frac{x [:, j] - m i n (x [:, j])}{m a x (x [:, j]) - m i n (x [:, j])}

(11)

While the eccentricity of the confidence region for the polynomial regression model decreases by 30% after applying the proposed feature selection, it remains elevated, indicating persistent multicollinearity. This is expected due to the inherent interdependence of the WAAM process parameters. For instance, though theoretically independent, the voltage and wire feed speed are typically adjusted together in order to prevent defects such as burn-through related to poor feeding of the wire in the melting pool. Consequently, certain parameter combinations are underrepresented in the dataset, contributing to high correlation and multicollinearity. In light of our primary focus on prediction accuracy, the Mean Absolute Error (MAE) was used to compare model performance.

3.2. Regression Analysis Results

In this study, we compare different modeling techniques using a dataset where 70% of the samples were used for training and the remaining 30% for validation. The simplest technique employed was linear regression using input factors as inputs. The performance metrics, measured in MAE in millimeters, were 0.4 mm for layer height and 1 mm for layer width on the training dataset. For the test dataset, the MAE was 0.39 mm for layer height and 0.917 mm for layer width. These results indicate poor performance, suggesting that linear regression is insufficient for modeling this system. As shown in Table 1, introducing a feature extraction procedure to remove uncorrelated features and polynomial regression introduced an increment in modeling performance. The MAE for layer height decreased to 0.19 mm and 0.55 mm for layer width on the training dataset. On the test dataset, the MAE was 0.24 mm for layer height and 0.66 mm for layer width, indicating better overall performance compared to linear regression. The performance obtained by using the optimized DNN was achieved with an MAE of 0.22 mm for layer height and 0.63 mm for layer width on the training dataset. For the test dataset, the MAE was 0.24 mm for layer height and 0.53 mm for layer width. Compared to the previous models, this method shows slight improvement in layer width estimation while maintaining similar performance for layer height.

Finally the proposed clusterwise regression reached MAE values of 0.21 mm and 0.5 mm on the training dataset for layer height and layer width respectively, along with 0.24 mm and 0.47 mm for layer height and layer width on the testing dataset, leading to the best overall performance among all the developed models. The regression plots of the testing dataset results are shown in Figure 10 and Figure 11.

Next, we provide an analysis of how the coefficient of the polynomial regression changes due to introduction of the clustering technique. As discussed in the previous section, GMM is utilized to cluster the data into two groups associated with two different metal deposition modes, each of which exhibits unique relationships with geometric variables, namely, spray transfer and dip transfer. For each cluster, a distinct regression analysis is conducted to ascertain the parameters linked to the weld bead height (

β_{h}^{i}

) and width (

β_{w}^{i}

), where ’i’ denotes the respective cluster. The final calculation follows Equation (12); it is possible observe the assumption of no correlation between the output variables.

[\begin{matrix} h \\ w \end{matrix}] = [\begin{matrix} β_{h}^{i} & 0 \\ 0 & β_{w}^{i} \end{matrix}] [\begin{matrix} V \\ W F S \\ W S \\ V^{2} \\ W F S^{2} \\ V \cdot W F S \\ C T W D \cdot V \\ W F S \cdot W S \\ W F S \cdot C T W D \end{matrix}]

(12)

In polynomial regression, as described by Equation (13), the coefficients for predicting the height (h) and width (w) of the layers exhibit specific numerical values for each predictor variable (

V, W F S, W S, V^{2}, W F S^{2}, V \cdot W F S, C T W D \cdot V

, and

W F S \cdot W

). For instance, the coefficient for V when predicting height is

0.3685

, while when predicting width it is

0.2819

. This means that by increasing the welding voltage, and consequently the heat input, an increment of the both layer width and layer height is reached, with more influence on layer height than layer width. An opposite relationship is obtained for Welding Speed (WS).

Moving to clusterwise regression, which divides the data into distinct groups (Group 0 and Group 1), we observe notable variations in these coefficients. In Group 0’s clusterwise regression model (see Equation (14)) for height prediction, the coefficient for V is

0.2668

, whereas in Group 1 (see Equation (15)) it increases to

0.4181

. This indicates a different emphasis on the influence of V between the two groups, potentially reflecting different operational conditions or underlying data characteristics that the model captures differently. Similarly, for width prediction in clusterwise regression, the coefficient for V in the Group 0 is

- 0.0417

, contrasting with

- 0.4598

in Group 1. This substantial shift suggests varying importance or directionality in the effect of V on the width across the two groups.

Upon further analysis, the interaction terms such as

V \cdot W F S, C T W D \cdot V

, and

W F S \cdot W S

also exhibit distinct coefficients between polynomial regression and clusterwise regression. These interactions capture nuanced relationships between variables that can significantly impact model performance and interpretation. The differences in coefficients between the polynomial and clusterwise regression models underscore how different modeling techniques interpret and utilize the data. Polynomial regression assumes a global relationship across all data points, while clusterwise regression adapts locally within distinct groups, potentially leading to improved predictive accuracy under heterogeneous data conditions. Moreover, the variations in coefficients between Group 0 and Group 1 within clusterwise regression highlight the importance of considering subgroup-specific effects. These differences may arise from operational nuances or varying experimental setups that affect how predictors contribute to the outcome variables (height and width). In conclusion, while polynomial regression provides a comprehensive overview of the entire dataset, clusterwise regression with subgroup analysis (Group 0 and Group 1) offers deeper insights into localized patterns and subgroup-specific variations. Understanding these differences in coefficients enhances our ability to tailor predictive models to specific operational scenarios or experimental conditions, thereby improving model robustness and interpretability in practical applications.

\{\begin{matrix} h & = 0.3685 * V - 0.5574 * WFS - 0.0015 * WS \\ - 0.0069 * V^{2} - 0.005 * {WFS}^{2} + 0.0721 * V * WFS \\ + 0.0104 * CTWD * V - 0.0002 * WFS * CTWD, \\ w & = 0.2819 * V + 2.8982 * WFS - 0.0291 * WS \\ + 0.0156 * V^{2} - 0.1476 * {WFS}^{2} + 0.0911 * V * WFS \\ - 0.0218 * CTWD * V + 0.0007 * WFS * CTWD . \end{matrix}

(13)

\{\begin{matrix} h & = 0.2668 * V - 0.7124 * WFS + 0.0092 * WS \\ - 0.0010 * V^{2} - 0.0348 * {WFS}^{2} + 0.0502 * V * WFS \\ + 0.0681 * CTWD * V - 0.0001 * WFS * CTWD, \\ w & = - 0.0417 * V + 3.2200 * WFS - 0.0282 WS \\ + 0.0069 * V^{2} - 0.0327 * {WFS}^{2} - 0.0155 * V * WFS \\ - 0.1145 * CTWD * V + 0.0021 * WFS * CTWD . \end{matrix}

(14)

\{\begin{matrix} h & = 0.4181 * V - 0.2369 * WFS - 0.0173 * WS \\ - 0.0149 * V^{2} + 0.0491 * {WFS}^{2} - 0.0274 * V * WFS \\ - 0.0079 * CTWD * V + 0.0002 * WFS * CTWD, \\ w & = - 0.4598 * V + 5.4265 * WFS - 0.0137 * WS \\ + 0.0277 * V^{2} - 0.0919 * {WFS}^{2} - 0.3467 * V * WFS \\ + 0.0406 * CTWD * V - 0.0012 * WFS * CTWD . \end{matrix}

(15)

3.3. Limitations and Future Perspective

While the proposed methodology effectively integrates process knowledge and statistical techniques to create more interpretable models for AM without compromising the quality of the predictions compared to machine learning approaches, a number of limitations introduce aspects in need of further exploration. First, the model overlooks crucial layer geometry factors such as the interpass temperature and overlap distance, which significantly influence subsequent layer formation. Second, despite our feature selection efforts, multicollinearity persists due to the inherent interdependence of AM input factors. For example, the voltage and wire feed speed in WAAM processes are theoretically independent, but must be adjusted in tandem to maintain layer quality. This interdependence also explains data gaps in AM datasets. Finally, the proposed clusterwise regression, while demonstrated to work with the proposed dataset, does not guarantee continuous model behavior across cluster boundaries, particularly when considering different metal transfer modes. Addressing these limitations presents additional avenues for future research.

4. Conclusions

Predictive modeling plays a crucial role in Additive Manufacturing (AM) processes such as Wire Arc Additive Manufacturing (WAAM), allowing for effective process planning and optimization through simulation models. In fact, by accurately estimating geometry based on input factors, it is possible to enhance the precision of WAAM through the use of slicing software. However, given the complexity of AM processes such as WAAM, data-driven techniques such as deep learning models are employed in the current state-of-the-art methods, which provide limited interpretability for engineers who seek to understand the underlying physical phenomena. This represents a critical application of predictive models. In the present study, we compared traditional polynomial regression and an optimized neural network with our novel clustering-based approach. Specifically, we employed a Gaussian Mixture Model to identify two distinct clusters, with each assigned a separate polynomial regression model. This method not only leads to improved performance but also provides better insights into the geometry prediction of the WAAM process. Our findings indicate that “white-box” approaches such as clustering combined with polynomial regression can achieve results comparable to or even surpassing those of complex “black-box” machine learning models. Moreover, our proposed method does not require hyperparameter optimization, unlike many machine learning algorithms, making it more straightforward and easier to implement. Future research will extend the focus from single-layer deposition to the geometry of wall structures, addressing challenges such as neglected physical phenomena, equation dependencies, and model consistency across cluster boundaries. By incorporating complex thermal phenomena and acknowledging the limitations of our current modeling approach, we aim to develop interpretable data-driven models with high performance that are able to reduce the need for feedback control to ensure WAAM process stability.

Author Contributions

Conceptualization, G.M., M.G., M.L., L.N. and G.P.; Methodology, G.M., M.G., M.L., L.N. and G.P.; Software, G.M., M.G., M.L., L.N. and G.P.; Investigation, G.M., M.G., M.L., L.N. and G.P.; Data Curation, G.M., M.G., M.L., L.N. and G.P.; Writing—review and editing, M.G., G.P., M.G., M.L. and L.N.; Supervision, G.M., M.G., M.L., L.N. and G.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

Maria Longobardi is member of the research group GNAMPA of INdAM (Istituto Nazionale di Alta Matematica). Luigi Nele acknowledge the INVITALIA Project NEMESI for their support to this research work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Dilberoglu, U.M.; Gharehpapagh, B.; Yaman, U.; Dolen, M. The role of additive manufacturing in the era of industry 4.0. Procedia Manuf. 2017, 11, 545–554. [Google Scholar] [CrossRef]
Norrish, J. Recent gas metal arc welding (GMAW) process developments: The implications related to international fabrication standards. Weld. World 2017, 61, 755–767. [Google Scholar] [CrossRef]
Mattera, G.; Nele, L.; Paolella, D. Monitoring and control the Wire Arc Additive Manufacturing process using artificial intelligence techniques: A review. J. Intell. Manuf. 2023, 35, 467–497. [Google Scholar] [CrossRef]
Pan, Z.; Ding, D.; Wu, B.; Cuiuri, D.; Li, H.; Norrish, J. Arc welding processes for additive manufacturing: A review. Trans. Intell. Weld. Manuf. 2018, 1, 3–24. [Google Scholar]
Wu, B.; Pan, Z.; Ding, D.; Cuiuri, D.; Li, H.; Xu, J.; Norrish, J. A review of the wire arc additive manufacturing of metals: Properties, defects and quality improvement. J. Manuf. Process. 2018, 35, 127–139. [Google Scholar] [CrossRef]
Mu, H.; He, F.; Yuan, L.; Commins, P.; Wang, H.; Pan, Z. Toward a smart wire arc additive manufacturing system: A review on current developments and a framework of digital twin. J. Manuf. Syst. 2023, 67, 174–189. [Google Scholar] [CrossRef]
Mu, H.; He, F.; Yuan, L.; Hatamian, H.; Commins, P.; Pan, Z. Online distortion simulation using generative machine learning models: A step toward digital twin of metallic additive manufacturing. J. Ind. Inf. Integr. 2024, 38, 100563. [Google Scholar] [CrossRef]
Günther, J.; Pilarski, P.M.; Helfrich, G.; Shen, H.; Diepold, K. First steps towards an intelligent laser welding architecture using deep neural networks and reinforcement learning. Procedia Technol. 2014, 15, 474–483. [Google Scholar] [CrossRef]
Moradi, M.; Karamimoghadam, M.; Meiabadi, S.; Casalino, G.; Ghaleeh, M.; Baby, B.; Ganapathi, H.; Jose, J.; Abdulla, M.S.; Tallon, P.; et al. Mathematical modelling of fused deposition modeling (FDM) 3D printing of poly vinyl alcohol parts through statistical design of experiments approach. Mathematics 2023, 11, 3022. [Google Scholar] [CrossRef]
Xia, C.; Pan, Z.; Polden, J.; Li, H.; Xu, Y.; Chen, S.; Zhang, Y. A review on wire arc additive manufacturing: Monitoring, control and a framework of automated system. J. Manuf. Syst. 2020, 57, 31–45. [Google Scholar] [CrossRef]
Mattera, G.; Caggiano, A.; Nele, L. Optimal data-driven control of manufacturing processes using reinforcement learning: An application to wire arc additive manufacturing. J. Intell. Manuf. 2024. [Google Scholar] [CrossRef]
Masinelli, G.; Le-Quang, T.; Zanoli, S.; Wasmer, K.; Shevchik, S.A. Adaptive laser welding control: A reinforcement learning approach. IEEE Access 2020, 8, 103803–103814. [Google Scholar] [CrossRef]
Qiu, Z.; Wang, Z.; van Duin, S.; Wu, B.; Zhu, H.; Wexler, D.; Pan, Z.; Li, H. A review of challenges and optimization processing during additive manufacturing of trademarked Ni-Cr-based alloys. In Modern Manufacturing Processes for Aircraft Materials; Elsevier: Amsterdam, The Netherlands, 2024; pp. 263–309. [Google Scholar]
Mattera, G.; Mattera, R. Shrinkage estimation with reinforcement learning of large variance matrices for portfolio selection. Intell. Syst. Appl. 2023, 17, 200181. [Google Scholar] [CrossRef]
Mattera, G.; Caggiano, A.; Nele, L. Reinforcement learning as data-driven optimization technique for GMAW process. Weld. World 2023, 68, 805–817. [Google Scholar] [CrossRef]
Li, Y.; Polden, J.; Pan, Z.; Cui, J.; Xia, C.; He, F.; Mu, H.; Li, H.; Wang, L. A defect detection system for wire arc additive manufacturing using incremental learning. J. Ind. Inf. Integr. 2022, 27, 100291. [Google Scholar] [CrossRef]
Li, Y.; Mu, H.; Polden, J.; Li, H.; Wang, L.; Xia, C.; Pan, Z. Towards intelligent monitoring system in wire arc additive manufacturing: A surface anomaly detector on a small dataset. Int. J. Adv. Manuf. Technol. 2022, 120, 5225–5242. [Google Scholar] [CrossRef]
Mu, H.; Chen, Z.; He, F.; Li, Y.; Xia, C.; Commins, P.; Pan, Z. Defect Detection and Process Monitoring for Wire Arc Additive Manufacturing Using Machine Learning. In Proceedings of the International Conference on Robotic Welding, Intelligence and Automation, Shanghai, China, 26–27 December 2020; Springer: Singapore, 2020; pp. 3–22. [Google Scholar]
Caggiano, A.; Zhang, J.; Alfieri, V.; Caiazzo, F.; Gao, R.; Teti, R. Machine learning-based image processing for on-line defect recognition in additive manufacturing. CIRP Ann. 2019, 68, 451–454. [Google Scholar] [CrossRef]
Caggiano, A.; Mattera, G.; Nele, L. Smart Tool Wear Monitoring of CFRP/CFRP Stack Drilling Using Autoencoders and Memory-Based Neural Networks. Appl. Sci. 2023, 13, 3307. [Google Scholar] [CrossRef]
Mattera, G.; Polden, J.; Caggiano, A.; Commins, P.; Nele, L.; Pan, Z. Anomaly Detection of Wire Arc Additively Manufactured Parts via Surface Tension Transfer through Unsupervised Machine Learning Techniques. In Proceedings of the 17th CIRP Conference on Intelligent Computation in Manufacturing Engineering Procedia, Gulf of Naples, Italy, 12–14 July 2023. in press. [Google Scholar]
Mattera, G.; Polden, J.; Caggiano, A.; Nele, L.; Pan, Z.; Norrish, J. Semi-supervised Learning for Real-Time Anomaly Detection in Pulsed Transfer Wire Arc Additive Manufacturing. J. Manuf. Process. 2024, in press. [Google Scholar] [CrossRef]
Mattera, G.; Polden, J.; Nele, L. Monitoring Wire Arc Additive Manufacturing process of Inconel 718 thin-walled structure using wavelet decomposition and clustering analysis of welding signal. J. Adv. Manuf. Sci. Technol. 2024. [CrossRef]
Alcaraz, J.Y.I.; Foqué, W.; Sharma, A.; Tjahjowidodo, T. Indirect porosity detection and root-cause identification in WAAM. J. Intell. Manuf. 2023, 35, 1607–1628. [Google Scholar] [CrossRef]
Zhang, T.; Xu, C.; Cheng, J.; Chen, Z.; Wang, L.; Wang, K. Research of surface oxidation defects in copper alloy wire arc additive manufacturing based on time-frequency analysis and deep learning method. J. Mater. Res. Technol. 2023, 25, 511–521. [Google Scholar] [CrossRef]
Xia, C.; Pan, Z.; Polden, J.; Li, H.; Xu, Y.; Chen, S. Modelling and prediction of surface roughness in wire arc additive manufacturing using machine learning. J. Intell. Manuf. 2022, 33, 1467–1482. [Google Scholar] [CrossRef]
Yaseer, A.; Chen, H. Machine learning based layer roughness modeling in robotic additive manufacturing. J. Manuf. Process. 2021, 70, 543–552. [Google Scholar] [CrossRef]
Saeheaw, T. Comparison of different supervised machine learning algorithms for bead geometry prediction in GMAW process. Eng. Solid Mech. 2023, 11, 175–190. [Google Scholar] [CrossRef]
Zhao, S.; Qiu, X.; Burnett, I.; Rigby, M.; Lele, A. A lumped-parameter model for sound generation in gas metal arc welding. Mech. Syst. Signal Process. 2021, 147, 107085. [Google Scholar] [CrossRef]
Ding, D.; He, F.; Yuan, L.; Pan, Z.; Wang, L.; Ros, M. The first step towards intelligent wire arc additive manufacturing: An automatic bead modelling system using machine learning through industrial information integration. J. Ind. Inf. Integr. 2021, 23, 100218. [Google Scholar] [CrossRef]
Ding, D.; Pan, Z.; Cuiuri, D.; Li, H. A multi-bead overlapping model for robotic wire and arc additive manufacturing (WAAM). Robot. Comput.-Integr. Manuf. 2015, 31, 101–110. [Google Scholar] [CrossRef]
Nele, L.; Mattera, G.; Vozza, M. Deep neural networks for defects detection in gas metal arc welding. Appl. Sci. 2022, 12, 3615. [Google Scholar] [CrossRef]
Xiong, J.; Zhang, G.; Hu, J.; Wu, L. Bead geometry prediction for robotic GMAW-based rapid manufacturing through a neural network and a second-order regression analysis. J. Intell. Manuf. 2014, 25, 157–163. [Google Scholar] [CrossRef]
Surovi, N.A.; Soh, G.S. Process map generation of geometrically uniform beads using support vector machine. Mater. Today Proc. 2022, 70, 113–118. [Google Scholar] [CrossRef]
Tang, S.; Wang, G.; Song, H.; Li, R.; Zhang, H. A novel method of bead modeling and control for wire and arc additive manufacturing. Rapid Prototyp. J. 2021, 27, 311–320. [Google Scholar] [CrossRef]
Chandra, M.; Vimal, K.; Rajak, S. A comparative study of machine learning algorithms in the prediction of bead geometry in wire-arc additive manufacturing. Int. J. Interact. Des. Manuf. (IJIDeM) 2023, 1–14. [Google Scholar] [CrossRef]
Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef]
Doodman Tipi, A.R.; Sani, S.H.; Pariz, N. Improving the dynamic metal transfer model of gas metal arc welding (GMAW) process. Int. J. Adv. Manuf. Technol. 2015, 76, 657–668. [Google Scholar] [CrossRef]
Giacalone, M.; Panarello, D.; Mattera, R. Multicollinearity in regression: An efficiency comparison between L p-norm and least squares estimators. Qual. Quant. 2018, 52, 1831–1859. [Google Scholar] [CrossRef]
Giacalone, M. A combined method based on kurtosis indexes for estimating p in non-linear Lp-norm regression. Sustain. Futur. 2020, 2, 100008. [Google Scholar] [CrossRef]
Giacalone, M. Optimal forecasting accuracy using Lp-norm combination. Metron 2022, 80, 187–230. [Google Scholar] [CrossRef]
Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
Hooti, F.; Ahmadi, J.; Longobardi, M. Optimal extended warranty length with limited number of repairs in the warranty period. Reliab. Eng. Syst. Saf. 2020, 203, 107111. [Google Scholar] [CrossRef]
Chan, J.Y.L.; Leow, S.M.H.; Bea, K.T.; Cheng, W.K.; Phoong, S.W.; Hong, Z.W.; Chen, Y.L. Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review. Mathematics 2022, 10, 1283. [Google Scholar] [CrossRef]
Duarte, B.P.; Atkinson, A.C.; Oliveira, N.M. Optimum design for ill-conditioned models: K–optimality and stable parameterizations. Chemom. Intell. Lab. Syst. 2023, 239, 104874. [Google Scholar] [CrossRef]
Mbodj, N.G.; Abuabiah, M.; Plapper, P.; El Kandaoui, M.; Yaacoubi, S. Bead geometry prediction in laser-wire additive manufacturing process using machine learning: Case of study. Appl. Sci. 2021, 11, 11949. [Google Scholar] [CrossRef]
Naveen Srinivas, M.; Vimal, K.; Manikandan, N.; Sritharanandh, G. Parametric optimization and multiple regression modelling for fabrication of aluminium alloy thin plate using wire arc additive manufacturing. Int. J. Interact. Des. Manuf. (IJIDeM) 2022, 1–11. [Google Scholar] [CrossRef]
Mehrabi, N.; Morstatter, F.; Saxena, N.; Lerman, K.; Galstyan, A. A survey on bias and fairness in machine learning. ACM Comput. Surv. (CSUR) 2021, 54, 1–35. [Google Scholar] [CrossRef]
Giulio, M.; Joseph, P.; Luigi, N. A Time-Frequency Domain Feature Extraction Approach Enhanced by Computer Vision for Wire Arc Additive Manufacturing Monitoring Using Fourier and Wavelet Transform. J. Adv. Manuf. Syst. 2024. [Google Scholar] [CrossRef]
Cerqueti, R.; Giacalone, M.; Mattera, R. Skewed non-Gaussian GARCH models for cryptocurrencies volatility modelling. Inf. Sci. 2020, 527, 1–26. [Google Scholar] [CrossRef]
Cerqueti, R.; Giacalone, M.; Mattera, R. Model-based fuzzy time series clustering of conditional higher moments. Int. J. Approx. Reason. 2021, 134, 34–52. [Google Scholar] [CrossRef]
DeSarbo, W.S.; Cron, W.L. A maximum likelihood methodology for clusterwise linear regression. J. Classif. 1988, 5, 249–282. [Google Scholar] [CrossRef]
Hennig, C. Identifiablity of Models for Clusterwise Linear Regression. J. Classif. 2000, 17, 273. [Google Scholar] [CrossRef]
Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control. Signals Syst. 1989, 2, 303–314. [Google Scholar] [CrossRef]
Bergstra, J.; Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012, 13. [Google Scholar]
Alibrahim, H.; Ludwig, S.A. Hyperparameter optimization: Comparing genetic algorithm against grid search and bayesian optimization. In Proceedings of the 2021 IEEE Congress on Evolutionary Computation (CEC), Kraków, Poland, 28 June–1 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1551–1559. [Google Scholar]
Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-Generation Hyperparameter Optimization Framework. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Anchorage, AK, USA, 4–8 August 2019. [Google Scholar]

Figure 1. In contrast with traditional manufacturing methods that entail removing material through machining processes, additive manufacturing involves constructing a component by adding material onto a substrate.

Figure 2. WAAM employs an electric arc to selectively melt a wire feedstock, allowing for layer-by-layer deposition to fabricate near-net-shape components following a predefined path generated by slicer software.

Figure 3. The geometric parameters for path planning in the WAAM process include the layer geometry and the overlap distance (source: own elaboration).

Figure 4. Illustration of the research workflow employed in this study (source: own elaboration).

Figure 5. Welding voltage and current sampled at 100 kHz frequency during the constant-voltage welding process. For certain process parameters, this may reveal the natural short-circuit transfer process, allowing metal wire deposition into the melting pool (source: own elaboration).

Figure 6. The workflow employed to collect data consisted of deposit wall structures with different process parameters. Example measurements show layer geometries of different samples obtained using the microscope (source: own elaboration).

Figure 7. Comparison of the welding voltage and welding current signals for different transfer modes obtained by employing a different set of input factors.

Figure 8. Proposed data processing workflow from data collection to layer geometry estimation via cluster wise regression (source: own elaboration).

Figure 9. Plot showing the correlation matrix heatmap; high correlations are represented by red and blue shading, while lower correlations are represented by white shading. Notably, because the matrix is symmetrical, the upper and lower triangular areas exhibit blue shading, reflecting lower correlations between those features (source: own elaboration).

Figure 10. Regression plot of the relationship between the predicted and true values of the layer width for the different models developed in this study (source: own elaboration).

Figure 11. Regression plot of the relationship between predicted and true values of the layer height for the different models developed in this study (source: own elaboration).

Table 1. Mean Absolute Error (MAE) for different modeling techniques (source: own elaboration).

Model	Training Dataset		Test Dataset
Model	Height (mm)	Width (mm)	>Height (mm)	Width (mm)
Linear Regression (LR)	0.40	1.00	0.39	0.92
Polynomial Regression (PR)	0.19	0.55	0.24	0.66
Optimized DNN	0.22	0.63	0.24	0.53
Clusterwise Regression	0.21	0.50	0.24	0.47

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mattera, G.; Piscopo, G.; Longobardi, M.; Giacalone, M.; Nele, L. Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression. Mathematics 2024, 12, 2559. https://doi.org/10.3390/math12162559

AMA Style

Mattera G, Piscopo G, Longobardi M, Giacalone M, Nele L. Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression. Mathematics. 2024; 12(16):2559. https://doi.org/10.3390/math12162559

Chicago/Turabian Style

Mattera, Giulio, Gianfranco Piscopo, Maria Longobardi, Massimiliano Giacalone, and Luigi Nele. 2024. "Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression" Mathematics 12, no. 16: 2559. https://doi.org/10.3390/math12162559

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Improving the Interpretability of Data-Driven Models for Additive Manufacturing Processes Using Clusterwise Regression

Abstract

1. Introduction

2. Materials and Methods

2.1. Experimental Setup

2.2. Multivariate Polynomial Regression

2.3. Multicollinearity in Regression

2.4. Modeling of WAAM Process and Proposal of This Work

2.5. Proposed Methodology

2.6. Preprocessing and Workflow

2.7. Comparison with Optimized Neural Network

3. Results and Discussion

3.1. Data Preprocessing

3.2. Regression Analysis Results

3.3. Limitations and Future Perspective

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI