Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

ML-based identification of the interface regions for coupling local and nonlocal models

Noujoud Nader
Louisiana State University
Baton Rouge, 70803, LA, USA
nnader@lsu.edu
&Patrick Diehl
Louisiana State University
Baton Rouge, 70803, LA, USA
pdiehl@cct.lsu.edu
&Marta D’Elia
Pasteur Labs
Brooklyn, NY 11205
marta.delia@simulation.science
&Christian Glusa
Center for Computing Research, Sandia National Laboratories
Albuquerque, NM 87321
caglusa@sandia.gov
&Serge Prudhomme
Department of Mathematics and Industrial Engineering, Polytechnique Montréal,
C.P. 6079, succ. Centre-ville Montréal, QC H3C 3A7, Canada
serge.prudhomme@polymtl.ca
Abstract

Local-nonlocal coupling approaches provide a means to combine the computational efficiency of local models and the accuracy of nonlocal models. However, the coupling process can be challenging, requiring expertise to identify the interface between local and nonlocal regions. This study introduces a machine learning-based approach to automatically detect the regions in which the local and nonlocal models should be used in a coupling approach. This identification process takes as input the loading functions evaluated at the grid points and provides as output the selected model at those points. Training of the networks is based on datasets provided by classes of loading functions for which reference coupling configurations are computed using accurate coupled solutions, where accuracy is measured in terms of the relative error between the solution to the coupling approach and the solution to the nonlocal model. We study two approaches that differ from one another in terms of the data structure. The first approach, referred to as the full-domain input data approach, inputs the full load vector and outputs a full label vector. In this case, the classification process is carried out globally. The second approach consists of a window-based approach, where loads are preprocessed and partitioned into windows and the problem is formulated as a node-wise classification approach in which the central point of each window is treated individually. The classification problems are solved via deep learning algorithms based on convolutional neural networks. The performance of these approaches is studied on one-dimensional numerical examples using F1-scores and accuracy metrics. In particular, it is shown that the windowing approach provides promising results, achieving an accuracy of 0.96 and an F1-score of 0.97. These results underscore the potential of the approach to automate coupling processes, leading to more accurate and computationally efficient solutions for material science applications.

Keywords coupling methods, local and nonlocal models, loading functions, machine learning, convolutional neural networks

1 Introduction

Recent advances in engineering mechanics have seen an increasing emphasis on coupling nonlocal models with classical local models, an approach driven by the need to address the excessive computational costs of high fidelity models, such as nonlocal equations, for modeling and simulation of complex behaviors in materials science. Nonlocal models, such as nonlocal diffusion and peridynamics, offer a more comprehensive representation of phenomena that are not adequately captured by classical Partial Differential Equations (PDEs). These phenomena include fracture and cracks in mechanics [14]. The scope of nonlocal models extends beyond diffusion and mechanics, finding applications in diverse areas like subsurface transport [3, 46, 25, 24, 49], phase transitions [2, 6, 8] and image processing [17, 31].

However, the implementation and simulation of nonlocal models present several challenges. These include the complexity of handling nonlocal boundary conditions [16] and the high computational costs associated with their numerical solutions. To address these challenges, coupling between local and nonlocal models approach has been proposed. Coupling methods aim to merge the computational efficiency of PDEs with the accuracy of nonlocal models, particularly in cases where nonlocal effects are confined to specific domain areas, and the rest of the system can be effectively described by a PDE.

For a general overview of local-nonlocal coupling methods, classified as either force-based or energy-based formulations, we refer to the recent survey [18]. Our focus is on the coupling of the bond-based peridynamic model with the classical linear elasticity continuum model, using a force-based coupling formulation. In this context, various coupling approaches have been proposed; these are based on matching the displacements (the Matching Displacement Coupling Method) or the stresses (the Matching Stress Coupling Method) either over an overlap region or a sharp interface obtained by shrinking the size of the nonlocal horizon when transitioning from a nonlocal to a local region. The latter approach is known as the variable horizon coupling method (VHCM); the interested reader can find an extended description in [15].

A key aspect of the coupling process is that it requires domain expertise to effectively choose the interface between the local and nonlocal regions. In this work, we propose to circumvent this challenge by letting machine learning detect such an interface region using a supervised approach. Specifically, the use of machine learning involves training neural network architectures on datasets containing local-nonlocal coupling regions that yield a certain accuracy of the coupled solution. Once trained, the networks can automatically identify the regions of the computational domain where nonlocal effects are likely to be significant, solely based on external loading information. This approach represents a significant advance, as it automates the identification of regions by removing the expert-in-the-loop.

Although to the best of our knowledge the use of machine learning for local-nonlocal coupling has never been pursued before, machine learning for nonlocal simulations and/or computational mechanics is not a new concept. In what follows, we briefly review the most relevant research. We first focus on machine learning for peridynamics. Haghighat et al. [22] presented a nonlocal physics-informed framework with the peridyanmic differential operator [33] using deep learning. Kim et al. [26] presented Peri-net to analyze crack patterns using Deep Neural Networks. Nguyen et al. [34] presented peridynamic-based machine learning approach for up to two-dimensional structures. Xu et al. [50] presented a machine-learning-based framework for peridynamic material models with physical constraints. Concerning nonlocal models, the following literature is available. Tao et al. [47] investigated nonlocal neural networks, nonlocal diffusion, and nonlocal modeling. Feng et al. [19] analyzed stochastic nonlocal damage using machine learning. Zhou et al. [56] learned nonlocal constitutive models using neural networks. You et al. [54, 52, 53] used data-driven learning of nonlocal physics from high-fidelity synthetic data. In [9], de Moraes et al. used machine learning to predict the evolution of nonlocal micro-structural defects in crystalline materials. Lal et al. [29] predicted nonlocal elasticity parameters using machine learning and molecular dynamics simulations.

Concerning local models, we restrict ourselves to classical continuum mechanics. For a review of machine learning and data-miming approaches for continuum material mechanics, we refer the reader to [5]; whereas for computational solid mechanics, we refer to [28]. A machine-learning approach for physics-based material models for multiscale solid mechanics was presented in [40].

Finally, the use of machine learning for coupling methods (not involving nonlocal or peridynamics models) is also not new; in fact, Raymond [39] used machine learning to model coupled solid and fluid mechanics with mesh-free methods.

The paper is organized as follows. Section 2 briefly introduces the models, coupling approaches, and discretization. In Section 3 the methodology and the ML model which is the convolution neural network (CNN) are described. Section 4 covers the data generation. In Section 5, we present numerical results for two cases using multiple node classification in Subsection 5.1 and the main case of this work using node-wise classification in Subsection 5.2. Finally, Section 6 concludes the paper.

2 Description of the models and coupling approach

The model problem that we consider in this work deals with the quasi-static simulation of a one-dimensional bar with mixed boundary conditions at the two extremities. The deformations within the bar are assumed to be small enough to be described by linear elasticity. This section briefly describes the local and nonlocal models: linear elasticity and linearized bond-based peridynamics [44]. We present the continuous formulations first and then provide their discretizations using finite differences for the local model and a collocation approach for peridynamics.

2.1 Classical linear elasticity model

We assume a one-dimensional bar of length \ellroman_ℓ occupying the open domain Ω=(0,)Ω0\Omega=(0,\ell)\subset\mathbb{R}roman_Ω = ( 0 , roman_ℓ ) ⊂ blackboard_R. Denoting the closure of ΩΩ\Omegaroman_Ω as Ω¯=[0,]¯Ω0\bar{\Omega}=[0,\ell]over¯ start_ARG roman_Ω end_ARG = [ 0 , roman_ℓ ], we seek the displacement ul=ul(x)subscript𝑢𝑙subscript𝑢𝑙𝑥u_{l}=u_{l}(x)italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ), xΩ¯for-all𝑥¯Ω\forall x\in\bar{\Omega}∀ italic_x ∈ over¯ start_ARG roman_Ω end_ARG, such that:

EAul′′(x)=fb(x),𝐸𝐴superscriptsubscript𝑢𝑙′′𝑥subscript𝑓𝑏𝑥\displaystyle-EAu_{l}^{\prime\prime}(x)=f_{b}(x),- italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x ) = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ) , xΩ,for-all𝑥Ω\displaystyle\quad\forall x\in\Omega,∀ italic_x ∈ roman_Ω , (1)
ul(x)=0,subscript𝑢𝑙𝑥0\displaystyle u_{l}(x)=0,italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) = 0 , atx=0,at𝑥0\displaystyle\quad\text{at}\ x=0,at italic_x = 0 , (2)
EAul(x)=g,𝐸𝐴superscriptsubscript𝑢𝑙𝑥𝑔\displaystyle EAu_{l}^{\prime}(x)=g,italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) = italic_g , atx=,at𝑥\displaystyle\quad\text{at}\ x=\ell,at italic_x = roman_ℓ , (3)

where E𝐸Eitalic_E is the modulus of elasticity, A𝐴Aitalic_A denotes the cross-sectional area of the bar, fb=fb(x)subscript𝑓𝑏subscript𝑓𝑏𝑥f_{b}=f_{b}(x)italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ) is the external body force density (per unit length), and g𝑔g\in\mathbb{R}italic_g ∈ blackboard_R is a traction force applied at x=𝑥x=\ellitalic_x = roman_ℓ. We assume that fbsubscript𝑓𝑏f_{b}italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is chosen smoothly enough so that the solution ulsubscript𝑢𝑙u_{l}italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT can be differentiated as many times as needed. For the sake of simplicity in the presentation, but without loss of generality, we assume a constant Young’s modulus E𝐸Eitalic_E and a constant cross-sectional area A𝐴Aitalic_A for the bar. The model is presented here with a homogeneous Dirichlet boundary condition at x=0𝑥0x=0italic_x = 0 and a Neumann condition at x=𝑥x=\ellitalic_x = roman_ℓ, but we shall also consider examples with homogeneous Dirichlet conditions at both ends in the numerical experiments.

2.2 Linearized bond-based peridynamic model

The static peridynamic equation [44] reads in one dimension as

Hδ(x)κu(y)u(x)|yx|𝑑y=fb(x),subscriptsubscript𝐻𝛿𝑥𝜅𝑢𝑦𝑢𝑥𝑦𝑥differential-d𝑦subscript𝑓𝑏𝑥-\int_{H_{\delta}(x)}\kappa\frac{u(y)-u(x)}{|y-x|}dy=f_{b}(x),- ∫ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT italic_κ divide start_ARG italic_u ( italic_y ) - italic_u ( italic_x ) end_ARG start_ARG | italic_y - italic_x | end_ARG italic_d italic_y = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ) , (4)

where the so-called horizon δ>0𝛿0\delta>0italic_δ > 0 determines the neighborhood of point x𝑥xitalic_x

Hδ(x)={y;|yx|<δ}.subscript𝐻𝛿𝑥formulae-sequence𝑦𝑦𝑥𝛿.\displaystyle H_{\delta}(x)=\{y\in\mathbb{R};\ |y-x|<\delta\}\text{.}italic_H start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ( italic_x ) = { italic_y ∈ blackboard_R ; | italic_y - italic_x | < italic_δ } .

In addition, κ=κ(x)𝜅𝜅𝑥\kappa=\kappa(x)italic_κ = italic_κ ( italic_x ) is the peridynamic stiffness parameter evaluated at point x𝑥xitalic_x and u𝑢uitalic_u is the displacement for the deformed configuration. Following [15], the peridynamic stiffness parameter κ𝜅\kappaitalic_κ is chosen such that the solution to the linearized peridynamic bond-based model is compatible with that to the continuum local model, i.e.,

κδ22=EA,𝜅superscript𝛿22𝐸𝐴\frac{\kappa\delta^{2}}{2}=EA,divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG = italic_E italic_A , (5)

or, equivalently,

κ=2EAδ2.𝜅2𝐸𝐴superscript𝛿2\kappa=\frac{2EA}{\delta^{2}}.italic_κ = divide start_ARG 2 italic_E italic_A end_ARG start_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (6)

As far as the boundary conditions are concerned, we consider the same local boundary conditions (2) and (3), which are imposed using the variable horizon method described below. The solution to this problem will serve as the reference solution. For more details about boundary conditions for the peridynamic model, we refer the reader to, e.g., [12, 37].

2.3 Variable horizon coupling approach

00a𝑎aitalic_axδ𝑥𝛿x-\deltaitalic_x - italic_δx𝑥xitalic_xx+δ𝑥𝛿x+\deltaitalic_x + italic_δb𝑏bitalic_b\ellroman_ℓΩΩ\Omegaroman_ΩΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPTΩ1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTΩ2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPTHδ(x)subscript𝐻𝛿𝑥H_{\delta}(x)italic_H start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ( italic_x )
Figure 1: Definition of the computational domains for the coupled model. Adapted from author’s previous work [15].

We consider in this work the Variable Horizon Coupling Method (VHCM) as introduced in [15]. A similar coupling approach was proposed in [43, 35]. For a broader review of coupling methods, including optimization-based and energy-based coupling methods, we refer to [11]. As shown in Figure 1, we define the region in which we shall use the nonlocal model (4) as Ωδ=(a,b)subscriptΩ𝛿𝑎𝑏\Omega_{\delta}=(a,b)roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT = ( italic_a , italic_b ) and the region for the local model (1) as Ωe=Ω\Ω¯δsubscriptΩ𝑒\Ωsubscript¯Ω𝛿\Omega_{e}=\Omega\backslash\bar{\Omega}_{\delta}roman_Ω start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = roman_Ω \ over¯ start_ARG roman_Ω end_ARG start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT. We suppose here that ΩesubscriptΩ𝑒\Omega_{e}roman_Ω start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT is decomposed into the two subdomains Ω1=(0,a)subscriptΩ10𝑎\Omega_{1}=(0,a)roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( 0 , italic_a ) and Ω2=(b,)subscriptΩ2𝑏\Omega_{2}=(b,\ell)roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( italic_b , roman_ℓ ) so that the end points of the bar are both adjacent to a domain governed by the local model. The main idea of the method is to make the horizon δ𝛿\deltaitalic_δ decrease to zero as one approaches the interfaces x=a𝑥𝑎x=aitalic_x = italic_a and x=b𝑥𝑏x=bitalic_x = italic_b so as to avoid the need to introduce an overlapping region around the interfaces. In this section, we describe the main ingredients of the VHCM method.

In the VHCM, the horizon δv(x)subscript𝛿𝑣𝑥\delta_{v}(x)italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) depends on the position xΩ𝑥Ωx\in\Omegaitalic_x ∈ roman_Ω; it is usually constant in the nonlocal region far from the local-nonlocal interface and shrinks to zero when approaching the interfaces x=a𝑥𝑎x=aitalic_x = italic_a and x=b𝑥𝑏x=bitalic_x = italic_b. Several choices for the transition of δv(x)subscript𝛿𝑣𝑥\delta_{v}(x)italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) are possible. In one dimension, it was suggested in [43, 37, 15] to consider a piece-wise linear function, that is, given δ+𝛿superscript\delta\in\mathbb{R}^{+}italic_δ ∈ blackboard_R start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT constant,

δv(x)={xa,a<xa+δ,δ,a+δ<xbδ,bx,bδ<x<b,subscript𝛿𝑣𝑥cases𝑥𝑎𝑎𝑥𝑎𝛿𝛿𝑎𝛿𝑥𝑏𝛿𝑏𝑥𝑏𝛿𝑥𝑏\delta_{v}(x)=\left\{\begin{array}[]{ll}x-a,&\quad a<x\leq a+\delta,\\ \delta,&\quad a+\delta<x\leq b-\delta,\\ b-x,&\quad b-\delta<x<b,\end{array}\right.italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) = { start_ARRAY start_ROW start_CELL italic_x - italic_a , end_CELL start_CELL italic_a < italic_x ≤ italic_a + italic_δ , end_CELL end_ROW start_ROW start_CELL italic_δ , end_CELL start_CELL italic_a + italic_δ < italic_x ≤ italic_b - italic_δ , end_CELL end_ROW start_ROW start_CELL italic_b - italic_x , end_CELL start_CELL italic_b - italic_δ < italic_x < italic_b , end_CELL end_ROW end_ARRAY (7)

as shown in Figure 2. It follows that Equation (6) for the stiffness constant κ(x)𝜅𝑥\kappa(x)italic_κ ( italic_x ) now becomes:

κ¯(x)=2EAδv(x)2.¯𝜅𝑥2𝐸𝐴subscript𝛿𝑣superscript𝑥2.\displaystyle\bar{\kappa}(x)=\frac{2EA}{\delta_{v}(x)^{2}}\text{.}over¯ start_ARG italic_κ end_ARG ( italic_x ) = divide start_ARG 2 italic_E italic_A end_ARG start_ARG italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (8)
δvsubscript𝛿𝑣\delta_{v}italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPTδ𝛿\deltaitalic_δ00a𝑎aitalic_aa+δ𝑎𝛿a+\deltaitalic_a + italic_δbδ𝑏𝛿b-\deltaitalic_b - italic_δb𝑏bitalic_bx𝑥xitalic_x
Figure 2: Example of a variable horizon function δv(x)subscript𝛿𝑣𝑥\delta_{v}(x)italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) in one dimension. The circles centered at points x(a,a+δ)𝑥𝑎𝑎𝛿x\in(a,a+\delta)italic_x ∈ ( italic_a , italic_a + italic_δ ) are representations of the associated domains Hδ(x)subscript𝐻𝛿𝑥H_{\delta}(x)italic_H start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ( italic_x ) in terms of δv(x)subscript𝛿𝑣𝑥\delta_{v}(x)italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ). Adapted from [15].

Using the variable horizon method, the coupling problem consists then in finding the displacements ulΩ¯esubscript𝑢𝑙subscript¯Ω𝑒u_{l}\in\bar{\Omega}_{e}italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∈ over¯ start_ARG roman_Ω end_ARG start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT and uΩ¯δ𝑢subscript¯Ω𝛿u\in\bar{\Omega}_{\delta}italic_u ∈ over¯ start_ARG roman_Ω end_ARG start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT such that

EAul′′(x)𝐸𝐴superscriptsubscript𝑢𝑙′′𝑥\displaystyle-EAu_{l}^{\prime\prime}(x)- italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x ) =fb(x),xΩe,formulae-sequenceabsentsubscript𝑓𝑏𝑥for-all𝑥subscriptΩ𝑒\displaystyle=f_{b}(x),\quad\forall x\in\Omega_{e},= italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ) , ∀ italic_x ∈ roman_Ω start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT , (9)
xδv(x)x+δv(x)κ¯(x)u(y)u(x)|yx|𝑑ysuperscriptsubscript𝑥subscript𝛿𝑣𝑥𝑥subscript𝛿𝑣𝑥¯𝜅𝑥𝑢𝑦𝑢𝑥𝑦𝑥differential-d𝑦\displaystyle-\int_{x-\delta_{v}(x)}^{x+\delta_{v}(x)}\bar{\kappa}(x)\frac{u(y% )-u(x)}{|y-x|}dy- ∫ start_POSTSUBSCRIPT italic_x - italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x + italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) end_POSTSUPERSCRIPT over¯ start_ARG italic_κ end_ARG ( italic_x ) divide start_ARG italic_u ( italic_y ) - italic_u ( italic_x ) end_ARG start_ARG | italic_y - italic_x | end_ARG italic_d italic_y =fb(x),xΩδ,formulae-sequenceabsentsubscript𝑓𝑏𝑥for-all𝑥subscriptΩ𝛿\displaystyle=f_{b}(x),\quad\forall x\in\Omega_{\delta},= italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ) , ∀ italic_x ∈ roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ,
ul(x)subscript𝑢𝑙𝑥\displaystyle u_{l}(x)italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) =0,atx=0,formulae-sequenceabsent0at𝑥0\displaystyle=0,\quad\text{at}\ x=0,= 0 , at italic_x = 0 ,
EAul(x)𝐸𝐴superscriptsubscript𝑢𝑙𝑥\displaystyle EAu_{l}^{\prime}(x)italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) =g,atx=,formulae-sequenceabsent𝑔at𝑥\displaystyle=g,\quad\text{at}\ x=\ell,= italic_g , at italic_x = roman_ℓ ,
u(x)ul(x)𝑢𝑥subscript𝑢𝑙𝑥\displaystyle u(x)-u_{l}(x)italic_u ( italic_x ) - italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) =0,atx=a,formulae-sequenceabsent0at𝑥𝑎\displaystyle=0,\quad\text{at}\ x=a,= 0 , at italic_x = italic_a ,
u(x)ul(x)𝑢𝑥subscript𝑢𝑙𝑥\displaystyle u(x)-u_{l}(x)italic_u ( italic_x ) - italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) =0,atx=b,formulae-sequenceabsent0at𝑥𝑏\displaystyle=0,\quad\text{at}\ x=b,= 0 , at italic_x = italic_b ,
σ+(u)(x)EAul(x)superscript𝜎𝑢𝑥𝐸𝐴superscriptsubscript𝑢𝑙𝑥\displaystyle\sigma^{+}(u)(x)-EAu_{l}^{\prime}(x)italic_σ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_u ) ( italic_x ) - italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) =0,atx=a,formulae-sequenceabsent0at𝑥𝑎\displaystyle=0,\quad\text{at}\ x=a,= 0 , at italic_x = italic_a ,
σ(u)(x)EAul(x)superscript𝜎𝑢𝑥𝐸𝐴superscriptsubscript𝑢𝑙𝑥\displaystyle\sigma^{-}(u)(x)-EAu_{l}^{\prime}(x)italic_σ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_u ) ( italic_x ) - italic_E italic_A italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) =0,atx=b.formulae-sequenceabsent0at𝑥𝑏\displaystyle=0,\quad\text{at}\ x=b.= 0 , at italic_x = italic_b .

In other words, the displacements and stresses from the two models are matched at the interfaces, i.e. ul(x)=u(x)subscript𝑢𝑙𝑥𝑢𝑥u_{l}(x)=u(x)italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) = italic_u ( italic_x ) and σ±(x)=Eul(x)superscript𝜎plus-or-minus𝑥𝐸superscriptsubscript𝑢𝑙𝑥\sigma^{\pm}(x)=Eu_{l}^{\prime}(x)italic_σ start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT ( italic_x ) = italic_E italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ), where the peridynamic stresses σ±(x)superscript𝜎plus-or-minus𝑥\sigma^{\pm}(x)italic_σ start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT ( italic_x ) [42] are defined as

σ±(u)(x)=δ2xδxxz±δκu(y)u(z)|yz|𝑑y𝑑zκδ448u′′′(x).superscript𝜎plus-or-minus𝑢𝑥𝛿2superscriptsubscript𝑥𝛿𝑥superscriptsubscript𝑥plus-or-minus𝑧𝛿𝜅𝑢𝑦𝑢𝑧𝑦𝑧differential-d𝑦differential-d𝑧𝜅superscript𝛿448superscript𝑢′′′𝑥\sigma^{\pm}(u)(x)=\frac{\delta}{2}\int_{x-\delta}^{x}\int_{x}^{z\pm\delta}% \kappa\frac{u(y)-u(z)}{|y-z|}dydz-\frac{\kappa\delta^{4}}{48}u^{\prime\prime% \prime}(x).italic_σ start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT ( italic_u ) ( italic_x ) = divide start_ARG italic_δ end_ARG start_ARG 2 end_ARG ∫ start_POSTSUBSCRIPT italic_x - italic_δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z ± italic_δ end_POSTSUPERSCRIPT italic_κ divide start_ARG italic_u ( italic_y ) - italic_u ( italic_z ) end_ARG start_ARG | italic_y - italic_z | end_ARG italic_d italic_y italic_d italic_z - divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG start_ARG 48 end_ARG italic_u start_POSTSUPERSCRIPT ′ ′ ′ end_POSTSUPERSCRIPT ( italic_x ) . (10)

2.4 Discretization

Several approaches for discretizing the coupled problem (9) are available [14]. Here we choose to discretize the classical linear elasticity model by a second-order finite differences scheme and the peridynamic model by a collocation approach, as introduced by Silling and Askari [45].

We now describe the discrete model associated with (9) on the configuration shown in Figure 3. As customarily done in the literature [45, 36], we fix the value of the horizon δ𝛿\deltaitalic_δ and introduce a uniform grid spacing hhitalic_h chosen such that δ𝛿\deltaitalic_δ be a multiple of hhitalic_h, i.e. δ/h=m𝛿𝑚\delta/h=mitalic_δ / italic_h = italic_m, with m𝑚mitalic_m a positive integer. Moreover, we choose the grid size hhitalic_h to be the same in each of the subregions of the computational domain. In other words, the numbers of intervals nδsubscript𝑛𝛿n_{\delta}italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT, n1subscript𝑛1n_{1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and n2subscript𝑛2n_{2}italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT, Ω1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and Ω2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, respectively, are taken such that

h=banδ=an1=bn2.𝑏𝑎subscript𝑛𝛿𝑎subscript𝑛1𝑏subscript𝑛2h=\frac{b-a}{n_{\delta}}=\frac{a}{n_{1}}=\frac{\ell-b}{n_{2}}.italic_h = divide start_ARG italic_b - italic_a end_ARG start_ARG italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_a end_ARG start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG = divide start_ARG roman_ℓ - italic_b end_ARG start_ARG italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG . (11)

Since we assume that the grid points from the two models coincide at the interfaces x=a𝑥𝑎x=aitalic_x = italic_a and x=b𝑥𝑏x=bitalic_x = italic_b and that the grid is uniform in the three regions ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT, Ω1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and Ω2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, the grid points are simply given by:

xk=kh,k=0,1,,n,formulae-sequencesubscript𝑥𝑘𝑘𝑘01𝑛x_{k}=kh,\quad k=0,1,\ldots,n,italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_k italic_h , italic_k = 0 , 1 , … , italic_n , (12)

where n=n1+nδ+n2𝑛subscript𝑛1subscript𝑛𝛿subscript𝑛2n=n_{1}+n_{\delta}+n_{2}italic_n = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. The numbering of the nodes is shown in Figure 3. In particular, we note that xn1=asubscript𝑥subscript𝑛1𝑎x_{n_{1}}=aitalic_x start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_a and xn1+nδ=bsubscript𝑥subscript𝑛1subscript𝑛𝛿𝑏x_{n_{1}+n_{\delta}}=bitalic_x start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_b. We consider the same numbering for the degrees of freedom associated with the displacements, that is, the nodal values will be denoted by:

uk{ul(xk),k=0,,n11ul(xk)=u(xk),k=n1u(xk),k=n1+1,,n1+nδ1ul(xk)=u(xk),k=n1+nδul(xk),k=n1+nδ+1,,nu_{k}\approx\left\{\begin{aligned} &u_{l}(x_{k}),&&\quad k=0,\ldots,n_{1}-1\\ &u_{l}(x_{k})=u(x_{k}),&&\quad k=n_{1}\\ &u(x_{k}),&&\quad k=n_{1}+1,\ldots,n_{1}+n_{\delta}-1\\ &u_{l}(x_{k})=u(x_{k}),&&\quad k=n_{1}+n_{\delta}\\ &u_{l}(x_{k}),&&\quad k=n_{1}+n_{\delta}+1,\ldots,n\end{aligned}\right.italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≈ { start_ROW start_CELL end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , end_CELL start_CELL end_CELL start_CELL italic_k = 0 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - 1 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_u ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , end_CELL start_CELL end_CELL start_CELL italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_u ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , end_CELL start_CELL end_CELL start_CELL italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT - 1 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_u ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , end_CELL start_CELL end_CELL start_CELL italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , end_CELL start_CELL end_CELL start_CELL italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT + 1 , … , italic_n end_CELL end_ROW (13)

where we have used the continuity of the displacement at the two interfaces, that is, ul(x)=u(x)subscript𝑢𝑙𝑥𝑢𝑥u_{l}(x)=u(x)italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_x ) = italic_u ( italic_x ) at x=a𝑥𝑎x=aitalic_x = italic_a and x=b𝑥𝑏x=bitalic_x = italic_b. The total number of degrees of freedom is then equal to n+1𝑛1n+1italic_n + 1, being each degree of freedom associated with one grid point.

00a𝑎aitalic_ab𝑏bitalic_b\ellroman_ℓx0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTx1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTx2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT\ldotsxn1subscript𝑥subscript𝑛1x_{n_{1}}italic_x start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT\ldotsxn1+nδsubscript𝑥subscript𝑛1subscript𝑛𝛿x_{n_{1}+n_{\delta}}italic_x start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT\ldotsxnsubscript𝑥𝑛x_{n}italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPTΩΩ\Omegaroman_ΩΩ1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPTΩ2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT
Figure 3: Definition of the grid points and degrees of freedom (represented by \bullet for the degrees of freedom associated with the classical linear elasticity model and by \circ for the degrees of freedom associated with the peridynamic model) in VHCM. Adapted from [15].

The discretization of the VHC problem (9) leads to the following system of equations, in which we assume m=2𝑚2m=2italic_m = 2:

  1. 1.

    Dirichlet BC at x=0𝑥0x=0italic_x = 0:

    u0=0.subscript𝑢00u_{0}=0.italic_u start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 . (14)
  2. 2.

    In Ω1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT:

    EAuk12uk+uk+1h2=fb(xk),k=1,,n11.formulae-sequence𝐸𝐴subscript𝑢𝑘12subscript𝑢𝑘subscript𝑢𝑘1superscript2subscript𝑓𝑏subscript𝑥𝑘𝑘1subscript𝑛11-EA\frac{u_{k-1}-2u_{k}+u_{k+1}}{h^{2}}=f_{b}(x_{k}),\quad k=1,\ldots,n_{1}-1.- italic_E italic_A divide start_ARG italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - 2 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , italic_k = 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - 1 . (15)
  3. 3.

    At interface x=a𝑥𝑎x=aitalic_x = italic_a:

    EA2uk3+9uk218uk1+11uk6hσh+(xk)=0,k=n1.formulae-sequence𝐸𝐴2subscript𝑢𝑘39subscript𝑢𝑘218subscript𝑢𝑘111subscript𝑢𝑘6superscriptsubscript𝜎subscript𝑥𝑘0𝑘subscript𝑛1EA\frac{-2u_{k-3}+9u_{k-2}-18u_{k-1}+11u_{k}}{6h}-\sigma_{h}^{+}(x_{k})=0,% \quad k=n_{1}.italic_E italic_A divide start_ARG - 2 italic_u start_POSTSUBSCRIPT italic_k - 3 end_POSTSUBSCRIPT + 9 italic_u start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT - 18 italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT + 11 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 6 italic_h end_ARG - italic_σ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 0 , italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . (16)
  4. 4.

    In ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT:

    κδ22uk12uk+uk+1hδ2=fb(xk),𝜅superscript𝛿22subscript𝑢𝑘12subscript𝑢𝑘subscript𝑢𝑘1superscriptsubscript𝛿2subscript𝑓𝑏subscript𝑥𝑘\displaystyle-\frac{\kappa\delta^{2}}{2}\frac{u_{k-1}-2u_{k}+u_{k+1}}{h_{% \delta}^{2}}=f_{b}(x_{k}),- divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - 2 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_h start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , k=n1+1,𝑘subscript𝑛11\displaystyle\quad k=n_{1}+1,italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 , (17)
    κδ22uk2+4uk110uk+4uk+1+uk+28hδ2=fb(xk),𝜅superscript𝛿22subscript𝑢𝑘24subscript𝑢𝑘110subscript𝑢𝑘4subscript𝑢𝑘1subscript𝑢𝑘28superscriptsubscript𝛿2subscript𝑓𝑏subscript𝑥𝑘\displaystyle-\frac{\kappa\delta^{2}}{2}\frac{u_{k-2}+4u_{k-1}-10u_{k}+4u_{k+1% }+u_{k+2}}{8h_{\delta}^{2}}=f_{b}(x_{k}),- divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG italic_u start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT + 4 italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - 10 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + 4 italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k + 2 end_POSTSUBSCRIPT end_ARG start_ARG 8 italic_h start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , k=n1+2,,n1+nδ2,𝑘subscript𝑛12subscript𝑛1subscript𝑛𝛿2\displaystyle\quad k=n_{1}+2,\ldots,n_{1}+n_{\delta}-2,italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT - 2 ,
    κδ22uk12uk+uk+1h2=fb(xk),𝜅superscript𝛿22subscript𝑢𝑘12subscript𝑢𝑘subscript𝑢𝑘1superscript2subscript𝑓𝑏subscript𝑥𝑘\displaystyle-\frac{\kappa\delta^{2}}{2}\frac{u_{k-1}-2u_{k}+u_{k+1}}{h^{2}}=f% _{b}(x_{k}),- divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - 2 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , k=n1+nδ1.𝑘subscript𝑛1subscript𝑛𝛿1\displaystyle\quad k=n_{1}+n_{\delta}-1.italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT - 1 .
  5. 5.

    At interface x=b𝑥𝑏x=bitalic_x = italic_b:

    EA11uk+18uk+19uk+2+2uk+36hσh(xk)=0,k=n1+nδ.formulae-sequence𝐸𝐴11subscript𝑢𝑘18subscript𝑢𝑘19subscript𝑢𝑘22subscript𝑢𝑘36superscriptsubscript𝜎subscript𝑥𝑘0𝑘subscript𝑛1subscript𝑛𝛿EA\frac{-11u_{k}+18u_{k+1}-9u_{k+2}+2u_{k+3}}{6h}-\sigma_{h}^{-}(x_{k})=0,% \quad k=n_{1}+n_{\delta}.italic_E italic_A divide start_ARG - 11 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + 18 italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - 9 italic_u start_POSTSUBSCRIPT italic_k + 2 end_POSTSUBSCRIPT + 2 italic_u start_POSTSUBSCRIPT italic_k + 3 end_POSTSUBSCRIPT end_ARG start_ARG 6 italic_h end_ARG - italic_σ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 0 , italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT . (18)
  6. 6.

    In Ω2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT:

    EAuk12uk+uk+1h2=fb(xk),k=n1+nδ+1,,n1.formulae-sequence𝐸𝐴subscript𝑢𝑘12subscript𝑢𝑘subscript𝑢𝑘1superscript2subscript𝑓𝑏subscript𝑥𝑘𝑘subscript𝑛1subscript𝑛𝛿1𝑛1-EA\frac{u_{k-1}-2u_{k}+u_{k+1}}{h^{2}}=f_{b}(x_{k}),\quad k=n_{1}+n_{\delta}+% 1,\ldots,n-1.- italic_E italic_A divide start_ARG italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - 2 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT + 1 , … , italic_n - 1 . (19)
  7. 7.

    Neumann BC at x=𝑥x=\ellitalic_x = roman_ℓ:

    EA2uk3+9uk218uk1+11uk6h=g,k=n.formulae-sequence𝐸𝐴2subscript𝑢𝑘39subscript𝑢𝑘218subscript𝑢𝑘111subscript𝑢𝑘6𝑔𝑘𝑛EA\frac{-2u_{k-3}+9u_{k-2}-18u_{k-1}+11u_{k}}{6h}=g,\quad k=n.italic_E italic_A divide start_ARG - 2 italic_u start_POSTSUBSCRIPT italic_k - 3 end_POSTSUBSCRIPT + 9 italic_u start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT - 18 italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT + 11 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 6 italic_h end_ARG = italic_g , italic_k = italic_n . (20)

We recall that if E𝐸Eitalic_E and A𝐴Aitalic_A are constant, then the quantity κ¯(x)δv(x)2=κδ2¯𝜅𝑥subscript𝛿𝑣superscript𝑥2𝜅superscript𝛿2\bar{\kappa}(x)\delta_{v}(x)^{2}=\kappa\delta^{2}over¯ start_ARG italic_κ end_ARG ( italic_x ) italic_δ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT remains also constant. Moreover, the stresses σh±superscriptsubscript𝜎plus-or-minus\sigma_{h}^{\pm}italic_σ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT in Equations (16) and (18) are obtained as approximations of σ±subscript𝜎plus-or-minus\sigma_{\pm}italic_σ start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT (10). More specifically, assuming that the solution u(x)𝑢𝑥u(x)italic_u ( italic_x ) is sufficiently regular in ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT, the stresses are approximated using one-sided third-order finite differences stencils of the first derivative as

σh+(xk)=κδ2211uk+18uk+19uk+2+2uk+36h,k=n1,formulae-sequencesuperscriptsubscript𝜎subscript𝑥𝑘𝜅superscript𝛿2211subscript𝑢𝑘18subscript𝑢𝑘19subscript𝑢𝑘22subscript𝑢𝑘36𝑘subscript𝑛1\displaystyle\sigma_{h}^{+}(x_{k})=\frac{\kappa\delta^{2}}{2}\frac{-11u_{k}+18% u_{k+1}-9u_{k+2}+2u_{k+3}}{6h},\quad k=n_{1},italic_σ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG - 11 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + 18 italic_u start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - 9 italic_u start_POSTSUBSCRIPT italic_k + 2 end_POSTSUBSCRIPT + 2 italic_u start_POSTSUBSCRIPT italic_k + 3 end_POSTSUBSCRIPT end_ARG start_ARG 6 italic_h end_ARG , italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,
σh(xk)=κδ222uk3+9uk218uk1+11uk6h,k=n1+nδ.formulae-sequencesuperscriptsubscript𝜎subscript𝑥𝑘𝜅superscript𝛿222subscript𝑢𝑘39subscript𝑢𝑘218subscript𝑢𝑘111subscript𝑢𝑘6𝑘subscript𝑛1subscript𝑛𝛿\displaystyle\sigma_{h}^{-}(x_{k})=\frac{\kappa\delta^{2}}{2}\frac{-2u_{k-3}+9% u_{k-2}-18u_{k-1}+11u_{k}}{6h},\quad k=n_{1}+n_{\delta}.italic_σ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG italic_κ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG - 2 italic_u start_POSTSUBSCRIPT italic_k - 3 end_POSTSUBSCRIPT + 9 italic_u start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT - 18 italic_u start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT + 11 italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 6 italic_h end_ARG , italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT .

This choice ensures that the discretization method has a degree of precision of three, in other words, that any polynomial solution of degree up to three will be computed exactly.

3 Methodology

We propose to employ deep learning techniques to identify the regions in which one should use the local model or the nonlocal model, by solely using load information (i.e. external forces), as illustrated in Figure 4. More specifically, we shall use CNNs as our deep learning tool. We describe below CNNs and the specific architecture that we considered to classify the nature of the grid points.

3.1 Overview

Our main objective is to develop an automated approach to identify the nonlocal region ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT and the local regions Ω1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Ω2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, shown in Figure 3, to be used in a coupling method. From now on, the nodal values of the solution located in the regions Ω1subscriptΩ1\Omega_{1}roman_Ω start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Ω2subscriptΩ2\Omega_{2}roman_Ω start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT will be indexed by LM (Local Model), whereas the nodal values located in ΩδsubscriptΩ𝛿\Omega_{\delta}roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT will be indexed by NLM (Nonlocal Model). Although this is a simplified setting that does not fully reflect the numerical tests presented below, we choose to describe the method using one single nonlocal region for the sake of clarity. The extension to more than one nonlocal domains is straightforward.

An overview of the proposed automated identification process is shown in Figure 4. The input data consists of the external load vector fb(xk)subscript𝑓𝑏subscript𝑥𝑘f_{b}(x_{k})italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ), k=1,,n1𝑘1𝑛1k=1,\ldots,n-1italic_k = 1 , … , italic_n - 1, evaluated at all interior grid points xksubscript𝑥𝑘x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, see Figure 4a. The output of the process is a label associated with each node that takes the value 0, if the node is classified as belonging to the local region, or 1 otherwise (i.e. the node belongs to the nonlocal region). It is essential to note that the nature of our approach is supervised learning, that is, each load input during training corresponds to a known classification. As this requires training data, the load input (see Figure 4a) is also used to generate the ground-truth data that will serve as the reference configuration during training. The generation, as shown in Figure 4b, is conducted using the following procedure: a coupling configuration (local-nonlocal domain assignment) is added to the training set when it provides a coupled solution whose error, with respect to the fully nonlocal solution, is below a given tolerance. More specifically, we first compute the reference displacement uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT using the nonlocal model in which all nodal points are NLM nodes \circ. Then, given a coupling configuration (in which the nodes are either LM nodes \bullet or NLM nodes \circ), we compute the displacement uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using the method described in the previous section, see Figure 4b. As mentioned above, the reference configuration is included in the training dataset if the relative error in the 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm is below a given tolerance ε𝜀\varepsilonitalic_ε, namely

(uNLM,uVHCM)=uNLMuVHCM2uNLM2<ε.subscript𝑢NLMsubscript𝑢VHCMsubscriptdelimited-∥∥subscript𝑢NLMsubscript𝑢VHCMsubscript2subscriptdelimited-∥∥subscript𝑢NLMsubscript2𝜀\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})=\frac{\lVert u_{\text{NLM}}-u_{% \text{VHCM}}\rVert_{\ell_{2}}}{\lVert u_{\text{NLM}}\rVert_{\ell_{2}}}<\varepsilon.caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) = divide start_ARG ∥ italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG < italic_ε . (21)

For more details about the data generation, the reader is referred to Section 4. The loads and the reference configurations of the local-nonlocal splitting provide the training data for the CNN in a supervised manner. Once trained, the CNN model takes as input an (unseen) external load fb(x)subscript𝑓𝑏𝑥f_{b}(x)italic_f start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_x ), see Figure 4c, and outputs the predicted configuration (i.e. it classifies the nodes as either LM \bullet or NLM \circ at each grid point) in the domain, see Figure 4d. More details of the CNN architecture are given in the following section.

Refer to caption
Figure 4: Overview of the training and prediction: (a) Input Load values used for data generation and as network inputs; (b) Numerical simulations: fully nonlocal solution and coupled solution obtained with the reference local-nonlocal domain splitting; (c) CNN network model; (d) Network output: predicted local-nonlocal splitting configuration. Filled circles \bullet indicate local nodes and empty circles \circ indicate nonlocal nodes.

3.2 Convolution neural networks

In recent years, deep learning has yielded impressive results across a spectrum of challenges, such as in visual recognition, speech understanding, natural language processing, and even inference of physical systems responses. Among different deep neural network architectures, CNNs have been investigated and analyzed in depth [21]. CNNs are particularly effective for tasks involving data with a grid-like topology, such as images, audio spectrograms, and sequential data arising for instance in natural language processing, with applications ranging from image classification and object tracking to speech and natural language processing [21, 30].

The CNN structure consists of various fundamental components, including convolutional, pooling, and fully connected layers. The standard structure involves a series of convolution layers followed by a pooling layer. The layers are then succeeded by one or more fully connected layers. The phase in which input data undergoes transformation into output through the layers is the so-called forward propagation process [51]. In the next two sections, we give a brief description of CNNs’ features for the non-expert reader.

  1. 1.

    Convolutional Layers: Convolutional layers are the cornerstone of CNNs. These layers use learnable kernels that convolve over the input data, enabling the network to detect local patterns and features. More precisely, a kernel is typically a small array, usually of size 3 ×\times× 3, which is systematically passed through an input array, also referred to as a tensor. At each position within the tensor, an element-wise multiplication is performed between the elements of the kernel and the corresponding elements in the input tensor. A bias is then added to the sum of these products in order to yield the output value, which occupies the corresponding position in the output tensor—termed “feature map”. This process is reiterated by employing multiple kernels to generate diverse feature maps. These feature maps encapsulate various attributes of the input tensors, such as distinct kernel functions serving as distinct feature extractors. The operation of a convolutional layer can be expressed as:

    Aj=σReLU(i=1NCiKi,j+Bj),subscript𝐴𝑗subscript𝜎ReLUsuperscriptsubscript𝑖1𝑁subscript𝐶𝑖subscript𝐾𝑖𝑗subscript𝐵𝑗A_{j}=\sigma_{\text{ReLU}}\bigg{(}\sum_{i=1}^{N}C_{i}K_{i,j}+B_{j}\bigg{)},italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT ReLU end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT + italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , (22)

    where Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes an input matrix, Ki,jsubscript𝐾𝑖𝑗K_{i,j}italic_K start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT a kernel matrix, N𝑁Nitalic_N is the number of feature maps from the previous layer, and Bjsubscript𝐵𝑗B_{j}italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT a bias value. Finally, the rectified linear unit activation function σReLUsubscript𝜎ReLU\sigma_{\text{ReLU}}italic_σ start_POSTSUBSCRIPT ReLU end_POSTSUBSCRIPT [41],

    σReLU(x)=max(0,x),subscript𝜎ReLU𝑥0𝑥\sigma_{\text{ReLU}}(x)=\max(0,x),italic_σ start_POSTSUBSCRIPT ReLU end_POSTSUBSCRIPT ( italic_x ) = roman_max ( 0 , italic_x ) ,

    is applied element-wise to the matrix, resulting in the output matrix Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. The primary task of the learning process involves identifying sets of suitable kernel matrices capable of extracting distinctive and discriminative features for the output tasks. To achieve this, the back-propagation algorithm, commonly employed to optimize the connection weights in neural networks, can be adapted. In this context, it serves to train both the kernel matrices and biases, functioning as shared neuron connection weights.

  2. 2.

    Pooling Layers: Pooling layers downsample the dimensions of feature maps while retaining the salient information; thus, their main effect is an intelligent feature-dimension reduction. To effectively reduce the number of output neurons, pooling algorithms usually combine neighboring elements in the convolution output matrices. Notable among these algorithms are max-pooling and average-pooling [1]. In this study, we implement a max-pooling layer with a 2 × 1 size, which selects the highest value from the two neighboring elements of the input feature map, thereby generating a single element for the output feature map.

  3. 3.

    Fully Connected Layers: Fully connected layers process the flattened feature vectors to make the final predictions. A fully connected layer operation with input z𝑧zitalic_z and output youtsubscript𝑦outy_{\text{out}}italic_y start_POSTSUBSCRIPT out end_POSTSUBSCRIPT is represented as

    yout=σ(Gz+v)subscript𝑦out𝜎𝐺𝑧𝑣y_{\text{out}}=\sigma(Gz+v)italic_y start_POSTSUBSCRIPT out end_POSTSUBSCRIPT = italic_σ ( italic_G italic_z + italic_v ) (23)

    where youtsubscript𝑦outy_{\text{out}}italic_y start_POSTSUBSCRIPT out end_POSTSUBSCRIPT is the output, z𝑧zitalic_z the input vector, G𝐺Gitalic_G the weight matrix, v𝑣vitalic_v the bias vector, and σ𝜎\sigmaitalic_σ a given activation function [41].

The selected CNN architecture:

We propose to use the following CNN architecture for the identification of local and nonlocal regions. First, as a preprocessing step, the input of the network is normalized to a load vector with unit variance and zero mean. The architecture is composed of convolutional layers with ReLU activation functions for feature extraction, followed by pooling layers for spatial reduction, and fully connected layers for the final output. This is shown in Fig. 5. The first convolution layer uses 32 filters with 3×1313\times 13 × 1 kernels. The second convolution layer has 64 filters with 3 kernels, and the third convolution layer has 128 filters with 3×1313\times 13 × 1 kernels. Each convolution layer is followed by a max pooling layer with pool size of two. The output tensor from these three conv-pooling blocks has a dimension of 30×1283012830\times 12830 × 128. The following two layers consist of one flattened layer, followed by a fully connected layer with 64 neurons. The last layer of our CNN model is a fully connected (dense) layer, where the number of neurons matches the output size. In our study, we explore two cases with distinct output sizes. In the first case, the CNN receives as input the full vector of the second derivative of the load functions at the grid points. The resulting output is a vector of labels, having the same size of the input load vector. Herein, the output size is 257, leading to a fully connected layer with 257 neurons. Meanwhile, in the second case, the CNN receives a window as input. The resulting output is a label for the central node. Herein, the output size is then one, corresponding to a single neuron in the final layer. Further details on these cases are discussed in Section 4.4. The sigmoid activation function is used in the last layer across both cases. It is important to note that our experimental investigations explore a variety of network architectures, using different numbers of layers and neurons. Out of all the architectures evaluated, the one that produces the best results is the one that is presented herein.

Computational resources and model parameters:

The CNN model is trained by using the Adam optimizer [27] and a batch size of 32. The training proceeds for a maximum of 200 epochs; however, an early stopping callback is implemented to avoid overfitting. When early stopping is used, training will end as soon as the validation dataset’s loss does not improve [4]. The learning rate is set to the commonly used value of 0.001. The proposed model is implemented in TensorFlow 2.10, with Keras in Python 3.10.11.

Refer to caption

Figure 5: Architecture of the proposed CNN model.

3.3 Classification evaluation metrics

In the context of binary classification problems such as ours, the classification capability of the trained network is usually evaluated with a confusion matrix. The confusion matrix displays the correctly or incorrectly predicted results. It is based on four key elements: true positives (TP, correctly predicted LM), false positives (FP, wrongly predicted LM), true negatives (TN, accurately predicted NLM), and false negatives (FN, wrongly predicted NLM). To evaluate the confusion matrix, we use two induced metrics, namely the accuracy metric and the F1-score, which are the most used metrics in classification [23, 32]:

  1. 1.

    The accuracy metric measures the ratio of correct predictions over the total number of instances evaluated. Higher values for accuracy are indicative of better performance. Formally,

    Accuracy=TP+TNTP+FP+TN+FN.AccuracyTPTNTP+FP+TN+FN\displaystyle\text{Accuracy}=\frac{\text{TP}+\text{TN}}{\text{TP+FP+TN+FN}}.Accuracy = divide start_ARG TP + TN end_ARG start_ARG TP+FP+TN+FN end_ARG . (24)
  2. 2.

    The F1-score is the harmonic mean of “precision” and “recall”. It ranges from 0 to 1, where a higher value indicates better classification performance. Formally,

    F1-score=2×Precision×RecallPrecision+Recall,F1-score2PrecisionRecallPrecisionRecall\displaystyle\text{F1-score}=2\times\frac{\text{Precision}\times\text{Recall}}% {\text{Precision}+\text{Recall}},F1-score = 2 × divide start_ARG Precision × Recall end_ARG start_ARG Precision + Recall end_ARG , (25)

    where

    Precision=TPTP+FP,PrecisionTPTPFP\displaystyle\text{Precision}=\frac{\text{TP}}{\text{TP}+\text{FP}},Precision = divide start_ARG TP end_ARG start_ARG TP + FP end_ARG ,
    Recall=TPTP+FN.RecallTPTPFN\displaystyle\text{Recall}=\frac{\text{TP}}{\text{TP}+\text{FN}}.Recall = divide start_ARG TP end_ARG start_ARG TP + FN end_ARG .

We note that in case 1, see Section 5.1, we report the average over all samples for all metrics (i.e. confusion matrix, accuracy, and F1-score).

4 Data generation

The CNN used for classification is trained on input-output pairs of external loads and corresponding coupling configurations (inducing a coupled solution with a certain accuracy); this pipeline is illustrated in Figure 4. In this section, we introduce the definition of the load functions and present the steps involved in the generation of the coupling regions.

4.1 Families of loading functions

In this section, we consider several classes of loading functions, including singular functions that induce discontinuous solutions with a finite jump to mimic fracture phenomena in higher dimensions. One advantage of peridynamics is that the model does not feature any spatial derivatives thus allowing for solutions with jumps.

  1. 1.

    The first family of load functions, f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, is based on functions [7] featuring a singularity at x=0.5𝑥0.5x=0.5italic_x = 0.5 and defined as:

    f1(x)={0,forx[0,0.5δ),12δ2δ+38+(2δ32lnδ)x+(32+lnδ)x2(x2x)ln((12x)),forx[0.5δ,0.5),12δ2δ+38+(2δ+32+lnδ)x(32+lnδ)x2+(x2x)ln(x12),forx[0.5,0.5+δ),1,forx[0.5+δ,1.0].subscript𝑓1𝑥cases0for𝑥00.5𝛿12superscript𝛿2𝛿382𝛿32𝛿𝑥otherwise32𝛿superscript𝑥2superscript𝑥2𝑥12𝑥for𝑥0.5𝛿0.512superscript𝛿2𝛿382𝛿32𝛿𝑥otherwise32𝛿superscript𝑥2superscript𝑥2𝑥𝑥12for𝑥0.50.5𝛿1for𝑥0.5𝛿1.0f_{1}(x)=\begin{cases}0,&\mbox{for}\ x\in[0,0.5-\delta),\\ \frac{1}{2}\delta^{2}-\delta+\frac{3}{8}+(2\delta-\frac{3}{2}-\ln{\delta})x\\ \quad\quad+(\frac{3}{2}+\ln{\delta})x^{2}-(x^{2}-x)\ln({(\frac{1}{2}-x)}),&% \mbox{for}\ x\in[0.5-\delta,0.5),\\ \frac{1}{2}\delta^{2}-\delta+\frac{3}{8}+(2\delta+\frac{3}{2}+\ln{\delta})x\\ \quad\quad-(\frac{3}{2}+\ln{\delta})x^{2}+(x^{2}-x)\ln({x-\frac{1}{2}}),&\mbox% {for}\ x\in[0.5,0.5+\delta),\\ 1,&\mbox{for}\ x\in[0.5+\delta,1.0].\end{cases}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) = { start_ROW start_CELL 0 , end_CELL start_CELL for italic_x ∈ [ 0 , 0.5 - italic_δ ) , end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_δ + divide start_ARG 3 end_ARG start_ARG 8 end_ARG + ( 2 italic_δ - divide start_ARG 3 end_ARG start_ARG 2 end_ARG - roman_ln italic_δ ) italic_x end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL + ( divide start_ARG 3 end_ARG start_ARG 2 end_ARG + roman_ln italic_δ ) italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ( italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_x ) roman_ln ( ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_x ) ) , end_CELL start_CELL for italic_x ∈ [ 0.5 - italic_δ , 0.5 ) , end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_δ + divide start_ARG 3 end_ARG start_ARG 8 end_ARG + ( 2 italic_δ + divide start_ARG 3 end_ARG start_ARG 2 end_ARG + roman_ln italic_δ ) italic_x end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - ( divide start_ARG 3 end_ARG start_ARG 2 end_ARG + roman_ln italic_δ ) italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_x ) roman_ln ( italic_x - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) , end_CELL start_CELL for italic_x ∈ [ 0.5 , 0.5 + italic_δ ) , end_CELL end_ROW start_ROW start_CELL 1 , end_CELL start_CELL for italic_x ∈ [ 0.5 + italic_δ , 1.0 ] . end_CELL end_ROW (26)

    The function f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and the corresponding fully nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT are shown in Figure 6(a-b) for δ𝛿\deltaitalic_δ = 0.03 and h=δ/8𝛿8h=\delta/8italic_h = italic_δ / 8.

    In cases where the singularity is located at a point xjumpsubscript𝑥jumpx_{\text{jump}}italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT different from 0.50.50.50.5, we will consider the loading function f1(x+(0.5xjump))subscript𝑓1𝑥0.5subscript𝑥jumpf_{1}(x+(0.5-x_{\text{jump}}))italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x + ( 0.5 - italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT ) ).

    Refer to caption
    (a)
    Refer to caption
    (b)
    Refer to caption
    (c)
    Refer to caption
    (d)
    Refer to caption
    (e)
    Refer to caption
    (f)
    Figure 6: RHS functions fi(x)subscript𝑓𝑖𝑥f_{i}(x)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ), i=1𝑖1i=1italic_i = 1, 2222, and 3333 with and without discontinuity used during training and their associated nonlocal/local solution. (a) Forcing function f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in Eq. (26); (b) Corresponding full nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT; (c) Forcing function f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in Eq. (27); (d) Corresponding full nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT; (e) Forcing function f3subscript𝑓3f_{3}italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT in Eq. (28) for c1=18subscript𝑐118c_{1}=18italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 18 and c2=4subscript𝑐24c_{2}=4italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 4; (f) Corresponding full local solution uLMsubscript𝑢LMu_{\text{LM}}italic_u start_POSTSUBSCRIPT LM end_POSTSUBSCRIPT.
  2. 2.

    The second family of load functions, f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, featuring a singularity at x=0.5𝑥0.5x=0.5italic_x = 0.5 [10], is defined by:

    f2(x)={0,forx[0,0.5δ),12δ2(lnδln(12x)),forx[0.5δ,0.5),12δ2(ln(x12)lnδ)),forx[0.5,0.5+δ),0,forx[0.5+δ,1.0].f_{2}(x)=\begin{cases}0,&\mbox{for}\ x\in[0,0.5-\delta),\\ \frac{1}{2\delta^{2}}\big{(}\ln{\delta}-\ln({\frac{1}{2}-x})\big{)},&\mbox{for% }\ x\in[0.5-\delta,0.5),\\ \frac{1}{2\delta^{2}}\big{(}\ln({x-\frac{1}{2}})-\ln{\delta})\big{)},&\mbox{% for}\ x\in[0.5,0.5+\delta),\\ 0,&\mbox{for}\ x\in[0.5+\delta,1.0].\end{cases}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) = { start_ROW start_CELL 0 , end_CELL start_CELL for italic_x ∈ [ 0 , 0.5 - italic_δ ) , end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( roman_ln italic_δ - roman_ln ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_x ) ) , end_CELL start_CELL for italic_x ∈ [ 0.5 - italic_δ , 0.5 ) , end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( roman_ln ( italic_x - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) - roman_ln italic_δ ) ) , end_CELL start_CELL for italic_x ∈ [ 0.5 , 0.5 + italic_δ ) , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL for italic_x ∈ [ 0.5 + italic_δ , 1.0 ] . end_CELL end_ROW (27)

    The function f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and the corresponding full nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT are plotted in Figure 6(c-d) for δ𝛿\deltaitalic_δ = 0.03 and h=δ/8𝛿8h=\delta/8italic_h = italic_δ / 8.

    In cases where the singularity is located at a point xjumpsubscript𝑥jumpx_{\text{jump}}italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT different from 0.50.50.50.5, we will evaluate the loading function as f2(x+(0.5xjump))subscript𝑓2𝑥0.5subscript𝑥jumpf_{2}(x+(0.5-x_{\text{jump}}))italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x + ( 0.5 - italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT ) ).

  3. 3.

    The study also includes the family of loads characterized by polynomial solutions of degree three and lower. The loading functions f3(x)subscript𝑓3𝑥f_{3}(x)italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) are in this case linear of the form:

    f3(x)=c1x+c2,subscript𝑓3𝑥subscript𝑐1𝑥subscript𝑐2f_{3}(x)=c_{1}x+c_{2},italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) = italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , (28)

    an example of which is shown in Fig. 6(e-f) with c1=18subscript𝑐118c_{1}=18italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 18 and c2=4subscript𝑐24c_{2}=4italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 4.

  4. 4.

    We consider as well the following family of load functions:

    f4(x)=tanh1t(x12),subscript𝑓4𝑥1𝑡𝑥12f_{4}(x)=\tanh\frac{1}{t}\bigg{(}x-\frac{1}{2}\bigg{)},italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( italic_x ) = roman_tanh divide start_ARG 1 end_ARG start_ARG italic_t end_ARG ( italic_x - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) , (29)

    where the parameter t𝑡titalic_t controls the variation in the solution around the point x=0.5𝑥0.5x=0.5italic_x = 0.5. An example of function f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT is shown in Fig. 7(a-b) for t=0.0005𝑡0.0005t=0.0005italic_t = 0.0005.

  5. 5.

    Finally, we also include a family of smooth load functions that induce continuous solutions with local behavior:

    f5(x)=e400(xc)2.subscript𝑓5𝑥superscript𝑒400superscript𝑥𝑐2f_{5}(x)=e^{-400(x-c)^{2}}.italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ( italic_x ) = italic_e start_POSTSUPERSCRIPT - 400 ( italic_x - italic_c ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (30)

    Such a function is a smooth representation of a point-wise load applied at the point x=c𝑥𝑐x=citalic_x = italic_c. The load and corresponding solution is shown in Fig. 7(c-d) for c=0.6𝑐0.6c=0.6italic_c = 0.6.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 7: RHS functions fi(x)subscript𝑓𝑖𝑥f_{i}(x)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ), i=4𝑖4i=4italic_i = 4, 5555 used as test functions and their associated nonlocal solution. (a) Forcing function f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT in Eq. (29) for t=0.0005𝑡0.0005t=0.0005italic_t = 0.0005; (b) Corresponding full nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT; (c) Forcing function f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT in Eq. (30) for c=0.6𝑐0.6c=0.6italic_c = 0.6; (d) Corresponding full nonlocal solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT.

4.2 Generation of the coupling regions

We present the steps used to generate the output data for the CNN training, i.e. we describe how the reference configurations of the coupling regions are constructed based on the loading functions.

For the sake of simplicity, all experiments focus on a one-dimensional bar with a material property of E=1𝐸1E=1italic_E = 1 and cross-sectional area A=1𝐴1A=1italic_A = 1, resulting in κ=2/δ2𝜅2superscript𝛿2\kappa=2/\delta^{2}italic_κ = 2 / italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. We consider the configuration of Figure 3 with =11\ell=1roman_ℓ = 1 and interface locations a𝑎aitalic_a and b𝑏bitalic_b. We choose the values of a𝑎aitalic_a and b𝑏bitalic_b such that grid points coincide the two interfaces. The domain is partitioned into sub-intervals of size h=1/n1𝑛h=1/nitalic_h = 1 / italic_n, with n𝑛nitalic_n given. The horizon is taken as a multiple of hhitalic_h, i.e. δ=mh𝛿𝑚\delta=mhitalic_δ = italic_m italic_h, with m=8𝑚8m=8italic_m = 8. We acknowledge that implementing non-uniform grids in our approach poses significant challenges, particularly in integrating them with the standard CNN architecture and our coupling method. Therefore, and for the purposes of this proof-of-concept, we will employ a fixed discretization grid to evaluate the innovative aspects of our proposed approach. To avoid any issues with applying the local boundary conditions, we assume that Ωδ=(a,b)subscriptΩ𝛿𝑎𝑏\Omega_{\delta}=(a,b)roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT = ( italic_a , italic_b ) is such that a>2δ𝑎2𝛿a>2\deltaitalic_a > 2 italic_δ and b<2δ𝑏2𝛿b<\ell-2\deltaitalic_b < roman_ℓ - 2 italic_δ. We will nevertheless vary the positions a𝑎aitalic_a and b𝑏bitalic_b of the interfaces in order to consider solutions with a discontinuity located in the interval (2δ,2δ)2𝛿2𝛿(2\delta,\ell-2\delta)( 2 italic_δ , roman_ℓ - 2 italic_δ ).

We now describe the process of generating the reference coupling regions used in the training for the special cases f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Since these loads are smooth away from their respective singularities, it seems sensible that the region where one would use the nonlocal model in the local-nonlocal coupling approach should contain the singular point. We thus compute the fully nonlocal reference solution uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and choose the smallest interval around the singularity, i.e. Ωδ=(xjumpα,xjump+α)subscriptΩ𝛿subscript𝑥jump𝛼subscript𝑥jump𝛼\Omega_{\delta}=(x_{\text{jump}}-\alpha,x_{\text{jump}}+\alpha)roman_Ω start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT = ( italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT - italic_α , italic_x start_POSTSUBSCRIPT jump end_POSTSUBSCRIPT + italic_α ) where α>0𝛼0\alpha>0italic_α > 0 is a multiple of the mesh size hhitalic_h, such that the relative error (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) (21) between the fully nonlocal and coupled solutions is below a given tolerance ϵitalic-ϵ\epsilonitalic_ϵ. In our experiments, we fix ϵitalic-ϵ\epsilonitalic_ϵ to 0.010.010.010.01. It is clear that the proposed process is directly influenced by the specific features of the chosen loading functions.

In Figure 8, we show an example of an optimal nonlocal region configuration using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with singularity at x=0.5𝑥0.5x=0.5italic_x = 0.5. The displacements uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are shown in Figure 8(a). The value of (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is 7.026×1037.026superscript1037.026\times 10^{-3}7.026 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. Figure 8(b) presents the displacement uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using the loading type f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. The value of (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) in this case is 7.027×1037.027superscript1037.027\times 10^{-3}7.027 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. In both cases, the error in the coupled solution is below the selected threshold. We also show the pointwise error between the nonlocal solution (NLM) and the coupled solution (VHCM) in Figure 8(c-d), to illustrate that the local error remains relatively small at all points.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 8: Peridynamic region configuration with discontinuity at x=0.5𝑥0.5x=0.5italic_x = 0.5, and corresponding pointwise errors between the solutions. uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT is represented by \blacksquare and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT is represented by \bullet. (a) Displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT obtained with f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and (uNLM,uVHCM)=7.026×103subscript𝑢NLMsubscript𝑢VHCM7.026superscript103\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})=7.026\times 10^{-3}caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) = 7.026 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT; (b) Displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT obtained with f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and (uNLM,uVHCM)=7.027×103subscript𝑢NLMsubscript𝑢VHCM7.027superscript103\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})=7.027\times 10^{-3}caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) = 7.027 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. Note that the two curves superpose in both cases. Pointwise errors between uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using (c) f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and (d) f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

As mentioned in Section 2, the solutions to the local and the nonlocal models coincide for polynomials of degree up to three; furthermore, it is known that the fourth derivative of the solution is an indicator of the extent of the nonlocality of the solution [7]. As a pre-processing step, we leverage this information by taking the second derivative of the loading functions rather than the original functions themselves. Here, the second derivatives are simply evaluated using finite differences approximations of the loading functions. This step will help in accurately capturing the underlying behavior for the load and serve as a nonlocality indicator. The second derivatives of these loads are then used as inputs for our ML model.

We now proceed to explain the process when using load f3subscript𝑓3f_{3}italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. The coefficients c1subscript𝑐1c_{1}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and c2subscript𝑐2c_{2}italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in Eq. (28) are random values within the range [30,30]3030[-30,30][ - 30 , 30 ] and [10,10]1010[-10,10][ - 10 , 10 ], respectively. In this case, the reference configuration is characterized by a fully local region as the local model and the nonlocal model lead to the same solution. This choice is made to anchor the training of our model on an instance where the behavior is fully local. This is pivotal for enriching the learning process of the model, enabling it to discern and accurately predict the nuances of fully local configurations alongside the more complex local-nonlocal configurations. Notably, the second derivative of f3subscript𝑓3f_{3}italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is always zero and is used as input for our ML model.

4.3 Approaches for input-output selection

We recall that we consider two strategies. In the first one, we utilize the complete dataset over the full computational domain as input. In this scenario, the nature of the problem is akin to the simultaneous classification of all nodes. In the second approach, we transform the datasets into windowed inputs, and solve a point-wise classification problem.

4.3.1 Full-domain input data

The first strategy consists of considering the second derivative of the load functions at all grid points simultaneously following the generation of data as explained in Section 4.2. In this context, we increase the model’s data volume and enhance its ability to generalize by introducing specific transformations to the functions f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (26) and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (27). Specifically, we incorporate the negation operation, which complements our existing data by providing contrasting cases. The second derivatives of f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and f3subscript𝑓3f_{3}italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are estimated and used here as input vectors. In this approach, the input-output pairs are given by a full second derivative of the load vector-full labels vector. The CNN receives the full second derivative vector as input. The resulting output is a vector of labels, having the same size of the input vector. Each output node has either the LM or NLM label, according to the region (local or nonlocal) they belong to. After prediction, we implement a post-processing step where nonlocal regions containing fewer than eight nodes are treated as local (LM) since we are using the variable horizon coupling method (VHCM) with m=8𝑚8m=8italic_m = 8.

Table 1: Overview of dataset distribution and associated processing times.
A. Dataset Distribution
Case 1: Full-domain Data Case 2: Windowing Data
Train 589 22802
Test 119 4498
Validation 78 3476
Total 786 30776
B. Processing Time (seconds)
Case 1: Full-domain Data Case 2: Windowing Data
Generation Time 138.77 182.37
Train Time 30.19 137.17
Test Time 0.6 10.9

4.3.2 Window-based input data

The second (and more effective) strategy proposed in this paper is the windowing approach. This approach proves to be a highly effective strategy for handling data with several singularities. In this approach, each node, denoted as xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, is treated individually. In the windowing approach, we reformulate the classification problem as point-wise classification; an illustration of the window selection is presented in Figure 9. By utilizing window configurations around each data point, we are effectively segmenting the load’s second derivative, enabling a focused and precise classification strategy at each point. This method allows for a more focused classification approach, providing a detailed insight into the load characteristics within specific windowed segments. The induced input-output pairs are then given by window-label. The CNN receives a window of load’s second derivative as input. The resulting output is a label for the central node. The label is either the LM or NLM, according to the region (local or nonlocal) the central node belong to. In a post-processing step, we apply the same process detailed in Section 4.3.1. This approach aims at enhancing the model’s capability to detect patterns and variations in the data, making our analysis more precise and adaptable to different load scenarios.

We present in Figure 9 an illustration of the window selection for the window-based approach. Considering two points and their corresponding windows, the window spans two horizons on the left and on the right of the central point, thereby covering a total of four horizons. Figure 9 shows two central nodes xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and xjsubscript𝑥𝑗x_{j}italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, each with its associated windows W(xi)𝑊subscript𝑥𝑖W(x_{i})italic_W ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and W(xj)𝑊subscript𝑥𝑗W(x_{j})italic_W ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ). These windows are used as input data for the CNN enabling, classification labels based on their region. The output labels will be then LM for xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT since it is located in the local region (represented by \bullet) and NLM for xjsubscript𝑥𝑗x_{j}italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT since it is located in the nonlocal region (represented by \circ).

x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTxisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPTW(xi)𝑊subscript𝑥𝑖W(x_{i})italic_W ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )xjsubscript𝑥𝑗x_{j}italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPTW(xj)𝑊subscript𝑥𝑗W(x_{j})italic_W ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT )xnsubscript𝑥𝑛x_{n}italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT
Figure 9: Schematic representation of data points and their corresponding windows. Two specific data points, xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and xjsubscript𝑥𝑗x_{j}italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, are highlighted, each associated with its own window, denoted as W(xi)𝑊subscript𝑥𝑖W(x_{i})italic_W ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and W(xj)𝑊subscript𝑥𝑗W(x_{j})italic_W ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ).

We note that this approach can be also applied and used in higher dimensions. For instance, in two or three dimensions, the windows might transition from one-dimensional segments to two-dimensional areas or even three-dimensional volumes. By varying these windows, we aim to capture and understand how nonlocal effects act within specific regions or volumes across the broader dimensional space.

4.4 Case studies

In this section, we describe the different scenarios and case studies considered in our tests. A summary is reported in Table 2.

4.4.1 Case 1: Full-domain input data

In this case, we generate the data as explained in Section 4.3.1. These simulations result in a total of 3,084 data samples, as shown in Table 1(A). The data is first split into training, testing, and validation sets. The training set is used to train the CNN model, where the model learns the relationship between inputs and outputs, adjusting its parameters (like weights and biases) accordingly. The validation set is used during the training process; it helps in evaluating the performance of the model on a dataset it has not be subjected to before. The testing set is used to check how well the model can generalize and helps prevent overfitting. In other words, the testing set allows one to evaluate the performance of the model on new, unseen data [38]. We split the dataset by assigning 75% of the samples to the training set, 10% to the validation set and 15% to the testing set. The size of each data sample is 257, resulting in a training set of size 589×257589257589\times 257589 × 257 and a validation set of size 78×2577825778\times 25778 × 257, while the testing set has size 119×257119257119\times 257119 × 257. The generation and partitioning of the dataset into training, validation, and testing sets necessitates a total of 138.77 seconds (Table 1(B)).

In the full-domain input data approach, the CNN receives as input the full vector of the second derivatives of the load functions at the grid points. The resulting output is a vector of labels, having the same size of the input load vector. Each output node has either the LM or NLM label, according to the region (local or nonlocal) they belong to. The CNN model is structured as multiple node classification model; the results are detailed in Section 5.1. A summary of this test case is shown in Table 2.

Table 2: Summary for training case studies.
Case Study Case 1 Case 2
Section Section 5.1 Section 5.2
Input Full Load’s Second Derivative Vector Windows
Output Full Labels Vector One Node Label
Load Dataset fi(x)subscript𝑓𝑖𝑥f_{i}(x)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ), i=1,2,3𝑖123i=1,2,3italic_i = 1 , 2 , 3. fi(x)subscript𝑓𝑖𝑥f_{i}(x)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ), i=1,2,3𝑖123i=1,2,3italic_i = 1 , 2 , 3.
ML Model Type Multiple node Classification Node wise Classification
Interpretation Demonstrated the feasibility of the proposed approach for region detection, as a kind of a “proof of concept". Highly effective strategy for handling data with varying numbers of discontinuities without the need for retraining the model for each specific scenario.

4.4.2 Case 2: Window-based input data

In this case, we use the window-based approach, in which the dataset is generated and preprocessed using the methodology explained in Section 4.3.2. The initial dataset, consisting of 786 complete input-output vector pairs, is partitioned into distinct sets for training, validation, and testing. The training set constitutes 75% of the total data, the validation set allocates 10%, and the remaining 15% is designated for the testing set. We then implement the windowing approach, segmenting each vector into windows of data. Subsequently, we perform a deduplication process, removing duplicated input windows-output label in each set and between sets (train-test, train-validation, and validation-test), ensuring unique samples across all sets. The resultant windowed dataset comprises a total of 30776 data samples (Table 1(A)). The training set, the validation set, and the testing set contain 22802, 3476, and 4498 data samples, respectively. We note that the size of input samples depends on the window size. The generation and data splitting into windows require a total of 182.37 seconds (Table 1(B)). We point out that we introduced the deduplication step to ensure fairness in the testing process. While straightforward in 1D experiments with uniform grids, in practice and especially in higher dimensions, deduplication is nontrivial. However, it is often the case that solutions have common features in certain regions of a computational domain so that having duplicates is very natural and actually helps the learning process. Thus, our results show that even when we stress test the model by removing favorable data, the model still performs well (Section 5.2).

The CNN inputs are the windows and the corresponding output is the label (local or nonlocal) of the central node. Thus, the model has been restructured as a node-wise classification model. We recall that the labels are either LM or NLM depending on the classification (local or nonlocal). An overview of this case is summarized in Table 2.

5 Numerical results

5.1 Case 1: Full-domain input data

In the process of training our CNN model, both training and validation loss were computed to monitor the model’s learning progress and its ability to generalize. The training proceed for a maximum of 200 epochs; however, an early stopping callback is implemented to avoid overfitting. The training time is notably brief, amounting to only 30.19 seconds. However, testing the model requires a duration of 0.6 seconds (Table 1(B)). Figure 10(a) presents the training and validation losses over the training epochs. The performance of the model is assessed using the evaluation metrics in Section 3.3 and is shown in Table 3. The model’s performance evaluation produces an average accuracy of 0.99 and an F1-score value of 0.94 (Table 3). We present the corresponding average confusion matrix in Figure 10(b). Within the confusion matrix, the model predicts correctly 99.73% of the LM instances and 96.34% of the NLM ones. These percentage-based values illustrate the model’s high precision and recall scores, and show its robustness in accurately classifying nodes within local and nonlocal regions for the training dataset.

Refer to caption
(a)
Refer to caption
(b)
Figure 10: (a) Training (blue) and validation (orange) loss versus the number of epochs in case 1. (b) Average confusion matrix for case 1.

To evaluate the results after regions detection, we estimate the error in Eq (21) between the fully nonlocal displacement uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and the coupled displacement uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT, whose nonlocal region is the one predicted by the CNN. An example of the results for both f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is reported in Figure 11, where, in both cases, the discontinuity is at x=0.71𝑥0.71x=0.71italic_x = 0.71. The value of (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is 5.5282×1055.5282superscript1055.5282\times 10^{-5}5.5282 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT; while using f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is 5.5285×1055.5285superscript1055.5285\times 10^{-5}5.5285 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT.

For the test set, the predicted nonlocal region consistently induces an error in the coupled solution much lower than the tolerance used during training . Across all cases, the estimated errors fall within the range of 2.18×1062.18superscript1062.18\times 10^{-6}2.18 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT to 5.32×1035.32superscript1035.32\times 10^{-3}5.32 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 2.17×1062.17superscript1062.17\times 10^{-6}2.17 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT to 1.36×1041.36superscript1041.36\times 10^{-4}1.36 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT using f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (Table 3).

Table 3: CNN evaluation metrics and error summary for the test set.
Accuracy F1-score Load (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT )
Case 1: Full-domain Data 0.99 0.94 f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT 2.18×1062.18superscript1062.18\times 10^{-6}2.18 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT5.32×1035.32superscript1035.32\times 10^{-3}5.32 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT
f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT 2.17×1062.17superscript1062.17\times 10^{-6}2.17 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT1.36×1041.36superscript1041.36\times 10^{-4}1.36 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT
f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT(t=0.05𝑡0.05t=0.05italic_t = 0.05) 8.8×104*8.8superscriptsuperscript104*{8.8\times 10^{-4}}^{\text{*}}8.8 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT
f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT(t=0.0005𝑡0.0005t=0.0005italic_t = 0.0005) 1.03×103*1.03superscriptsuperscript103*{1.03\times 10^{-3}}^{\text{*}}1.03 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT
f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT 8.2×104*8.2superscriptsuperscript104*{8.2\times 10^{-4}}^{\text{*}}8.2 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT
Case 2: Windowing data 0.96 0.97 f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT 3.50×1063.50superscript1063.50\times 10^{-6}3.50 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT2.34×1032.34superscript1032.34\times 10^{-3}2.34 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT
f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT 3.49×1063.49superscript1063.49\times 10^{-6}3.49 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT1.97×1051.97superscript1051.97\times 10^{-5}1.97 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT (t=0.05𝑡0.05t=0.05italic_t = 0.05) 5.30×1055.30superscript1055.30\times 10^{-5}5.30 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT (t=0.0005𝑡0.0005t=0.0005italic_t = 0.0005) 2.55×1052.55superscript1052.55\times 10^{-5}2.55 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT 1.30×1041.30superscript1041.30\times 10^{-4}1.30 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT

Here the coupled solution uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT corresponds to the fully local solution uLMsubscript𝑢LMu_{\text{LM}}italic_u start_POSTSUBSCRIPT LM end_POSTSUBSCRIPT. The estimated error is then computed as the difference between uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uLMsubscript𝑢LMu_{\text{LM}}italic_u start_POSTSUBSCRIPT LM end_POSTSUBSCRIPT.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Figure 11: Error estimation after prediction with discontinuity at x=0.71𝑥0.71x=0.71italic_x = 0.71. (a) Comparison between the reference and predicted configurations of local and nonlocal regions. (b-c) Peridynamic region configuration after prediction with discontinuity at x=0.71𝑥0.71x=0.71italic_x = 0.71. uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT is represented by \blacksquare and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT is represented by \bullet, (b) displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using load f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. (uNLM,uVHCM)=5.5282×105subscript𝑢NLMsubscript𝑢VHCM5.5282superscript105\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})=5.5282\times 10^{-5}caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) = 5.5282 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT; (c) displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT using load f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. (uNLM,uVHCM)=5.5285×105subscript𝑢NLMsubscript𝑢VHCM5.5285superscript105\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})=5.5285\times 10^{-5}caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) = 5.5285 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT.

5.2 Case 2: Window-based input data

The results in the previous Section 5.1 showed promise that machine learning could support the selection of local-nonlocal coupling regions; however, the full-load strategy does not have satisfactory generalization properties when testing using unseen data. In this section, we illustrate how the windowing approach circumvents these limitations.

As before, the number of epochs is set to of 200; however, an early stopping callback is used to avoid overfitting. Training the model with windowed data take 137.17 seconds while testing requires a duration of 10.9 seconds (Table 1(B)). Figure 12(a) presents the training and validation losses over the training epochs. While both losses decrease overall, the close tracking between training and validation loss suggests that the model is not overfitting to the training data. The plot shows that the callback effectively prevented overfitting by stopping training at epoch 50 when the validation loss showed no further improvement. The performance of the model was assessed using the evaluation metrics are shown in Table 3. The performance metrics show an accuracy of 0.96 and an F1-score of 0.97; the corresponding confusion matrix, reported in Figure 12(c), shows that the model predicted correctly 94.03% of the LM instances and 97.18% of of the NLM ones. Also in this case, the model has high precision and recall scores.

For the test set, the predicted nonlocal region induces an error in the coupled solution much lower than the tolerance ϵitalic-ϵ\epsilonitalic_ϵ used during training all cases using both f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Across all cases, the estimated errors fall within the range of 3.50×1063.50superscript1063.50\times 10^{-6}3.50 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT to 2.34×1032.34superscript1032.34\times 10^{-3}2.34 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 3.49×1063.49superscript1063.49\times 10^{-6}3.49 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT to 1.97×1051.97superscript1051.97\times 10^{-5}1.97 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT using f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (Table 3). The lowest errors are obtained when using the original version of loads f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, which means without transformations.

Refer to caption
(a)
Refer to caption
(b)
Figure 12: (a) Training (blue) and validation (orange) loss versus the number of epochs in case 2. (b) Confusion matrix for windowing dataset in case 2.
Refer to caption
(a)
Refer to caption
(b)
Figure 13: Nonlocal region detection and error estimation after prediction for test case with two jumps using f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. (a) Predicted configuration of local and nonlocal regions. The nodes located in the region where the local model will be used are labeled LM and represented in black. The nodes located in the region where the nonlocal model will be employed are labeled NLM and represented in light grey. (b) Displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT, represented by \blacksquare and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT, represented by \bullet. The corresponding error is 3.299×1033.299superscript1033.299\times 10^{-3}3.299 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT.

To evaluate the windowing approach’s ability to identify multiple discontinuities, we test our pre-trained model, which has not been trained on loads with more than one discontinuity, using a load with two discontinuities. The results are presented in Figure 13. The predicted configurations of local and nonlocal regions are shown in Figure 13(a), while the displacements uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT (represented by \blacksquare) and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT (represented by \bullet) are shown in Figure 13(b). The corresponding error, denoted as (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ), is 3.299×1033.299superscript1033.299\times 10^{-3}3.299 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT.

In addition, we test our model using load f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT (Eq. 29) and f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT (Eq. 30); these loading functions do not belong to the training set. Thus, with these examples, we test mild generalization properties of the algorithm.

For f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, we consider two scenarios; one with t=0.05 and another with t=0.0005. In all cases, the second derivative is estimated and subsequently segmented into windows, serving as the input for the pre-trained model. Once again, to evaluate the precision of the region detection, we estimate the error between the fully nonlocal displacement (uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT) and the coupled displacement (uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT) using Eq. (21). Figure 14(Left) shows the predicted configurations of local and nonlocal regions. Figure 14(Right) shows uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT, represented by \blacksquare and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT, represented by \bullet.

For f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT with t=0.05, the prediction labels and configurations of local and nonlocal regions are showcased in Figure 14(a-b). The corresponding error value, denoted as (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ), is 5.30×1055.30superscript1055.30\times 10^{-5}5.30 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT. When testing f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT with t=0.0005, the findings are similarly reported in Figures 14(c-d), with an error value of 2.55×1052.55superscript1052.55\times 10^{-5}2.55 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT.

For f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT, as defined in Eq. 30, we follow the same process. The results are presented in Figure 14(e-f), and the corresponding error (uNLM,uVHCM)subscript𝑢NLMsubscript𝑢VHCM\mathcal{E}(u_{\text{NLM}},u_{\text{VHCM}})caligraphic_E ( italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT ) is 1.30×1041.30superscript1041.30\times 10^{-4}1.30 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT.

Refer to caption
(a) t=0.05
Refer to caption
(b) t=0.05
Refer to caption
(c) t=0.0005
Refer to caption
(d) t=0.0005
Refer to caption
(e)
Refer to caption
(f)
Figure 14: Nonlocal region detection and error estimation after prediction for general test cases f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT and f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT in Eq. (29) and Eq. (30). (Left) Predicted configuration of local and nonlocal regions. The nodes located in the region where the local model will be used are labeled LM and represented in black. The nodes located in the region where the nonlocal model will be employed are labeled NLM and represented in light grey. (Right) Displacement fields uNLMsubscript𝑢NLMu_{\text{NLM}}italic_u start_POSTSUBSCRIPT NLM end_POSTSUBSCRIPT, represented by \blacksquare and uVHCMsubscript𝑢VHCMu_{\text{VHCM}}italic_u start_POSTSUBSCRIPT VHCM end_POSTSUBSCRIPT, represented by \bullet. (a-b) Results for f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT with t=0.05𝑡0.05t=0.05italic_t = 0.05, the corresponding error is 5.30×1055.30superscript1055.30\times 10^{-5}5.30 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT. (c-d) Results for f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT with t=0.0005𝑡0.0005t=0.0005italic_t = 0.0005, the corresponding error is 2.55×1052.55superscript1052.55\times 10^{-5}2.55 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT. (e-f) Results for f5subscript𝑓5f_{5}italic_f start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT, the corresponding error is 1.30×1041.30superscript1041.30\times 10^{-4}1.30 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT.

6 Conclusion

This paper shows a proof of concept for a ML-based identification of local-nonlocal coupling regions that is independent of the local and nonlocal models utilized. Specifically, the load used as identification input is independent of the two models and the method used to generate local and nonlocal solutions is irrelevant. Furthermore, while we only considered a non-overlapping approach, a coupling approach with overlapping regions could also be used (examples can be found in  [15]).

Among the two approaches we presented, the full-domain approach yields promising results for the classification of the local and nonlocal regions as long as the solutions do not feature multiple discontinuities. To overcome this limitation, we introduced the windowing approach based on a node-wise classification. The latter shows promising results even in the presence of several discontinuities without retraining (see Section 5.2). This model demonstrated robust performance, as reflected by high evaluation metrics, with an accuracy of 0.98 and an F1-score of 0.97. The results were further validated by the corresponding confusion matrix (see Figure 12(c)), which provides a detailed breakdown of the model’s classification accuracy. Nevertheless, a minor limitation in our model’s predictive precision is noticed. Specifically, our analysis identified a misclassification rate of 5.21% of instances as LM (nodes in Local region) and 1.14% as NLM (nodes in nonlocal region). These relatively low values of the total predictions highlight areas of potential improvement.

The proof of concept approach introduced in this work opens several avenues of improvement and extension. In the results presented in this study, the horizon δ=mh𝛿𝑚\delta=m\cdot hitalic_δ = italic_m ⋅ italic_h is constant; for more realistic tests, varying values should be investigated (the work in [13] can be used as a guide for the choice of hhitalic_h and δ𝛿\deltaitalic_δ). As for the boundary conditions, we use homogeneous Dirichlet boundary conditions, while in real-world applications mixed conditions are usually prescribed. More importantly, an extension to two- and three-dimensional problems is mandatory and it is the subject of our current work. Furthermore, stress testing the approach by using more complex forcing functions from a larger subspace should also be explored (e.g. randomly generated functions with prescribed regularity properties).

Another relevant aspect not directly addressed in this work is the presence of cracks and fractures. These features were mimicked in the current study by the presence of discontinuities. In two- and three-dimensional settings, we plan to incorporate these features into the machine learning model and validate our results against real experiments.

In terms of machine learning algorithms, the windowing approach showed promising results and is the candidate strategy to proceed with in higher dimensions. However, the open question is the choice of the shape of the window. Possible candidates in two dimensions could be rectangles or ellipses. More generally, the shape of the window should be chosen such that it matches the support of the nonlocal kernel function. Alternatively, one could parameterize a family of two-dimensional shapes, in which case the parameters would become learnable parameters in the training process. Another open question is the architecture of the CNN for higher dimensions as the input layer and output layer need to be adapted to the higher dimensions.

Additionally, due to the fact that CNNs fail in the presence of nonuniform unstructured grids, our future exploration includes the use of graph neural networks (GNNs) [55, 48, 20]. Given the graph-like structure of our problem, leveraging GNNs could offer a compelling approach to model and predict complex interactions within nodes in local and nonlocal regions. The adaptability of GNNs to capture relationships among various regions’ nodes makes them a promising approach especially for higher dimensions.

7 Funding statement

Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525.

This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government.

References

  • [1] S. Albawi, T. A. Mohammed, and S. Al-Zawi. Understanding of a convolutional neural network. In 2017 international conference on engineering and technology (ICET), pages 1–6. Ieee, 2017.
  • [2] P. W. Bates and A. Chmaj. An integrodifferential model for phase transitions: stationary solutions in higher space dimensions. Journal of statistical physics, 95:1119–1139, 1999.
  • [3] D. A. Benson, S. W. Wheatcraft, and M. M. Meerschaert. Application of a fractional advection-dispersion equation. Water resources research, 36(6):1403–1412, 2000.
  • [4] E. Bisong and E. Bisong. Regularization for deep learning. Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, pages 415–421, 2019.
  • [5] F. E. Bock, R. C. Aydin, C. J. Cyron, N. Huber, S. R. Kalidindi, and B. Klusemann. A review of the application of machine learning and data mining approaches in continuum materials mechanics. Frontiers in Materials, 6:110, 2019.
  • [6] C. Chen and P. C. Fife. Nonlocal models of phase transitions in solids. Advances in Mathematical Sciences and Applications, 10(2):821–849, 2000.
  • [7] X. Chen and M. Gunzburger. Continuous and discontinuous finite element methods for a peridynamics model of mechanics. Computer Methods in Applied Mechanics and Engineering, 200(9-12):1237–1250, 2011.
  • [8] K. Dayal and K. Bhattacharya. Kinetics of phase transformations in the peridynamic formulation of continuum mechanics. Journal of the Mechanics and Physics of Solids, 54(9):1811–1842, 2006.
  • [9] E. A. B. de Moraes, M. D’Elia, and M. Zayernouri. Machine learning of nonlocal micro-structural defect evolutions in crystalline materials. Computer Methods in Applied Mechanics and Engineering, 403:115743, 2023.
  • [10] M. D’Elia and M. Gunzburger. Optimal distributed control of nonlocal steady diffusion problems. SIAM Journal on Control and Optimization, 52(1):243–273, 2014.
  • [11] M. D’Elia, X. Li, P. Seleson, X. Tian, and Y. Yu. A review of local-to-nonlocal coupling methods in nonlocal diffusion and nonlocal mechanics. Journal of Peridynamics and Nonlocal Modeling, 2021.
  • [12] M. D’Elia, X. Tian, and Y. Yu. A physically-consistent, flexible and efficient strategy to convert local boundary conditions into nonlocal volume constraints. SIAM Journal of Scientific Computing, 42(4):A1935–A1949, 2020.
  • [13] P. Diehl, F. Franzelin, D. Pflüger, and G. C. Ganzenmüller. Bond-based peridynamics: a quantitative study of mode i crack opening. International Journal of Fracture, 201:157–170, 2016.
  • [14] P. Diehl, R. Lipton, T. Wick, and M. Tyagi. A comparative review of peridynamics and phase-field models for engineering fracture mechanics. Computational Mechanics, 69(6):1259–1293, 2022.
  • [15] P. Diehl and S. Prudhomme. Coupling approaches for classical linear elasticity and bond-based peridynamic models. Journal of Peridynamics and Nonlocal Modeling, 4(3):336–366, 2022.
  • [16] P. Diehl, S. Prudhomme, and M. Lévesque. A review of benchmark experiments for the validation of peridynamics models. Journal of Peridynamics and Nonlocal Modeling, 1:14–35, 2019.
  • [17] M. D’Elia, J. C. De Los Reyes, and A. Miniguano-Trujillo. Bilevel parameter learning for nonlocal image denoising models. Journal of Mathematical Imaging and Vision, 63(6):753–775, 2021.
  • [18] M. D’Elia, X. Li, P. Seleson, X. Tian, and Y. Yu. A review of local-to-nonlocal coupling methods in nonlocal diffusion and nonlocal mechanics. Journal of Peridynamics and Nonlocal Modeling, pages 1–50, 2021.
  • [19] Y. Feng, Q. Wang, D. Wu, W. Gao, and F. Tin-Loi. Stochastic nonlocal damage analysis by a machine learning approach. Computer Methods in Applied Mechanics and Engineering, 372:113371, 2020.
  • [20] R. J. Gladstone, H. Rahmani, V. Suryakumar, H. Meidani, M. D’Elia, and A. Zareei. Gnn-based physics solver for time-independent pdes. arXiv preprint arXiv:2303.15681, 2023.
  • [21] J. Gu, Z. Wang, J. Kuen, L. Ma, A. Shahroudy, B. Shuai, T. Liu, X. Wang, G. Wang, J. Cai, et al. Recent advances in convolutional neural networks. Pattern recognition, 77:354–377, 2018.
  • [22] E. Haghighat, A. C. Bekar, E. Madenci, and R. Juanes. A nonlocal physics-informed deep learning framework using the peridynamic differential operator. Computer Methods in Applied Mechanics and Engineering, 385:114012, 2021.
  • [23] M. Hossin and M. N. Sulaiman. A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process, 5(2):1, 2015.
  • [24] A. Katiyar, S. Agrawal, H. Ouchi, P. Seleson, J. T. Foster, and M. M. Sharma. A general peridynamics model for multiphase transport of non-newtonian compressible fluids in porous media. Journal of Computational Physics, 402:109075, 2020.
  • [25] A. Katiyar, J. T. Foster, H. Ouchi, and M. M. Sharma. A peridynamic formulation of pressure driven convective fluid transport in porous media. Journal of Computational Physics, 261:209–229, 2014.
  • [26] M. Kim, N. Winovich, G. Lin, and W. Jeong. Peri-net: analysis of crack patterns using deep neural networks. Journal of Peridynamics and Nonlocal Modeling, 1:131–142, 2019.
  • [27] D. P. Kingma and J. Ba. Adam: A method for stochastic optimization, 2014.
  • [28] S. Kumar and D. M. Kochmann. What machine learning can do for computational solid mechanics. In Current trends and open problems in computational mechanics, pages 275–285. Springer, 2022.
  • [29] H. P. Lal, B. Abhiram, and D. Ghosh. Prediction of nonlocal elasticity parameters using high-throughput molecular dynamics simulations and machine learning. European Journal of Mechanics-A/Solids, page 105175, 2023.
  • [30] Z. Li, F. Liu, W. Yang, S. Peng, and J. Zhou. A survey of convolutional neural networks: analysis, applications, and prospects. IEEE transactions on neural networks and learning systems, 2021.
  • [31] Y. Lou, X. Zhang, S. Osher, and A. Bertozzi. Image recovery via nonlocal operators. Journal of Scientific Computing, 42(2):185–197, 2010.
  • [32] J. Lu, L. Tan, and H. Jiang. Review on convolutional neural network (CNN) applied to plant leaf disease classification. Agriculture, 11(8):707, 2021.
  • [33] E. Madenci, A. Barut, and M. Futch. Peridynamic differential operator and its applications. Computer Methods in Applied Mechanics and Engineering, 304:408–451, 2016.
  • [34] C. T. Nguyen, S. Oterkus, and E. Oterkus. A peridynamic-based machine learning model for one-dimensional and two-dimensional structures. Continuum Mechanics and Thermodynamics, 35(3):741–773, 2023.
  • [35] J. Nikpayam and M. A. Kouchakzadeh. A variable horizon method for coupling meshfree peridynamics to FEM. Computer Methods in Applied Mechanics and Engineering, 355:308–322, 2019.
  • [36] M. L. Parks, R. B. Lehoucq, S. J. Plimpton, and S. A. Silling. Implementing peridynamics within a molecular dynamics code. Computer Physics Communications, 179(11):777–783, 2008.
  • [37] S. Prudhomme and P. Diehl. On the treatment of boundary conditions for bond-based peridynamic models. Computer Methods in Applied Mechanics and Engineering, 372:113391, 2020.
  • [38] S. Raschka. Model evaluation, model selection, and algorithm selection in machine learning. arXiv:1811.12808, 2018.
  • [39] S. J. Raymond. Combining numerical simulation and machine learning-modeling coupled solid and fluid mechanics using mesh free methods. PhD thesis, Massachusetts Institute of Technology, 2020.
  • [40] I. Rocha, P. Kerfriden, and F. van der Meer. Machine learning of evolving physics-based material models for multiscale solid mechanics. Mechanics of Materials, page 104707, 2023.
  • [41] S. Sharma, S. Sharma, and A. Athaiya. Activation functions in neural networks. Towards Data Sci, 6(12):310–316, 2017.
  • [42] S. Silling. Local-nonlocal coupling in Emu/PDMS. Sandia Report (SAND2020-11382), 2020.
  • [43] S. Silling, D. Littlewood, and P. Seleson. Variable horizon in a peridynamic medium. Journal of Mechanics of Materials and Structures, 10(5):591–612, 2015.
  • [44] S. A. Silling. Reformulation of elasticity theory for discontinuities and long-range forces. Journal of the Mechanics and Physics of Solids, 48(1):175–209, 2000.
  • [45] S. A. Silling and E. Askari. A meshfree method based on the peridynamic model of solid mechanics. Computers & Structures, 83(17-18):1526–1535, 2005.
  • [46] J. L. Suzuki, M. Gulian, M. Zayernouri, and M. D’Elia. Fractional modeling in action: A survey of nonlocal models for subsurface transport, turbulent flows, and anomalous materials. Journal of Peridynamics and Nonlocal modeling, 5(3):392–459, 2023.
  • [47] Y. Tao, Q. Sun, Q. Du, and W. Liu. Nonlocal neural networks, nonlocal diffusion and nonlocal modeling. Advances in Neural Information Processing Systems, 31, 2018.
  • [48] Z. Wu, S. Pan, F. Chen, G. Long, C. Zhang, and S. Y. Philip. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, 32(1):4–24, 2020.
  • [49] X. Xu, M. D’Elia, C. Glusa, and J. T. Foster. Machine-learning of nonlocal kernels for anomalous subsurface transport from breakthrough curves. arXiv preprint arXiv:2201.11146, 2022.
  • [50] X. Xu, M. D’Elia, and J. T. Foster. A machine-learning framework for peridynamic material models with physical constraints. Computer Methods in Applied Mechanics and Engineering, 386:114062, 2021.
  • [51] R. Yamashita, M. Nishio, R. K. G. Do, and K. Togashi. Convolutional neural networks: an overview and application in radiology. Insights into imaging, 9:611–629, 2018.
  • [52] H. You, Y. Yu, S. Silling, and M. D’Elia. Data-driven learning of nonlocal models: from high-fidelity simulations to constitutive laws. Accepted in AAAI Spring Symposium: MLPS, 2021.
  • [53] H. You, Y. Yu, S. Silling, and M. D’Elia. A data-driven peridynamic continuum model for upscaling molecular dynamics. Computer Methods in Applied Mechanics and Engineering, 389:114400, 2022.
  • [54] H. You, Y. Yu, N. Trask, M. Gulian, and M. D’Elia. Data-driven learning of nonlocal physics from high-fidelity synthetic data. Computer Methods in Applied Mechanics and Engineering, 374:113553, 2021.
  • [55] J. Zhou, G. Cui, S. Hu, Z. Zhang, C. Yang, Z. Liu, L. Wang, C. Li, and M. Sun. Graph neural networks: A review of methods and applications. AI open, 1:57–81, 2020.
  • [56] X.-H. Zhou, J. Han, and H. Xiao. Learning nonlocal constitutive models with neural networks. Computer Methods in Applied Mechanics and Engineering, 384:113927, 2021.