1. Introduction
High-voltage direct current (HVDC) gas-insulated lines (GILs) enable the transmission of power over long distances with high system stability, with advantages such as ecological aspect and economic benefits and are increasingly attractive due to the energy turnaround that requires the transport of energy in some cases for hundreds of kilometers [
1,
2].
Positive and negative ion pairs are responsible for the gas conductivity. Under normal circumstances, natural background radiation generates these ion pairs. SF
6 gas insulation systems have a big issue when operating in the presence of radiation, such as near a tokamak, which emits neutrons, X-rays, and gamma rays. Radiation increases the ion-pair generation of the gas by ionization and dissociation, consequently increasing the conductivity of the gas [
3,
4,
5,
6].
The increase in conductivity affects the system’s electric field distribution and the insulators’ surface charge accumulation, resulting in a modification of the time to a steady state and the breakdown voltage, for example. It gives rise to dark currents, i.e., leakage current through the insulating system flowing out of the ground electrode, which can cause power losses, heating, and breakdowns [
6,
7]. Therefore, the design and operation of HVDC-GIS systems must consider radiation-induced effects. In modeling and simulation, a source parameter denoted as
S accounts for the ionization process, representing the volumetric rate of ion-pair generation [
3].
Many scenarios of radiation fields are possible. For example, in the ITER Tokamak, there are two orders of magnitude uncertainties in the radiation to which the gas insulator will be subjected, from 0.02 Gy/s to 2 Gy/s [
4]. A. De Lorenzi et al. (2009) [
8] suggested that more detailed modeling of the RIC (Radiation Induced Conductivity) phenomenon is needed.
Besides the ionization of the gas, the main issue that affects HVDC-GIS systems is the accumulation of electric charges on the interface between the gas and solid insulators, which may lead to a decrease in the flashover voltage and breakdowns [
1]. The dynamics of the surface charge accumulation is a critical factor in the design of the systems, and the enhanced SF
6 conductivity caused by radiation modifies the dynamics and increases the hetero-charge distribution in the insulator surface [
8].
The investigations in this work use data-driven methods to learn the dynamics and relations in HVDC-GIS using SF
6 as gas insulation in different radiation conditions. The data were collected through simulations in COMSOL Multiphysics
® using the electric currents and transport diffusion of diluted species physics, as described in [
9]. Migration of charges in the solid insulator and gas models the surface charge accumulation; the models did not implement surface conductivity for simplicity but can be easily implemented.
The systems studied were at 500 kV DC, the operating voltage of the Neutral Beam Injector of the Divertor Tokamak Test (DTT) facility [
3,
10]. The parameters that varied in the simulations were the ion-pair generation rate and the potential applied.
The ML (machine learning) models were trained and validated with simulation data. The SR [
11] and SINDy [
12,
13,
14,
15] algorithms were evaluated for learning the dynamics of surface charge and predicting time to steady state, conductivity, and dark current at steady state. The SR algorithm aimed to find the best functional form to predict the target variable, while the SINDy one sought to find the best sparse regression of the dynamics of the target variable. The present work employed both methods and evaluated the results.
Previous works in obtaining an analytical solution for the steady-state time for the system demonstrated that the expression computed fails when the conductivity of the gas is affected by radiation, i.e., the ion-pair generation parameter
S is high. The difference between the analysis time and the time computed by the analytical expression was in one order of magnitude [
8].
Building an analytical expression for the steady-state conditions aims to provide important information in advance without performing simulations. Further works can use dynamic models built with SINDy, such as optimizing the design of HVDC structures in RIC conditions. The optimization of electromagnetic problems is a very demanding task, as the computational cost is high and the function to be optimized is unknown [
8]. Simulation and optimization using surrogate models built with ML, such as the surface charge density dynamics, can reduce the time needed to obtain a feasible solution.
Data-driven strategies for modeling physical and engineering systems have often proven successful. Machine learning and artificial intelligence can tackle complex, large datasets that normally appear in engineering and physical problems, as it is the case of the system evaluated in this work. The data-driven framework can help discover missing physics [
16], improve control [
17], find reduced models [
18,
19], and model discrepancies between the real systems and simulation results that use first-principle laws [
16].
A practical example of how analytical expressions might accelerate simulations in HVDC RIC systems was presented in the paper [
9]. The authors demonstrated the feasibility of simplifying the drift diffusion reaction (DDR) equations of ions in the gas using a conductivity analytical expression. The analytical expression is a compromise between the most detailed simulation using DDR equations and the simplest one using a constant gas conductivity, which does not work when RIC conditions occur in the system.
Although many works on HVDC using machine learning exist, they mainly focus on monitoring [
20], control [
21,
22], partial discharges [
23,
24,
25], and fault detection and prediction [
26,
27,
28,
29] in the systems. One previous work in the literature involves directly predicting the flashover voltage of an AC gas-insulated system, but it uses experimental data for learning [
30]. New forms of applying machine learning to the HVDC-GIL system, such as the present work, might bring new paths for optimizations using surrogate models, fast assessments with analytical expressions, and learning and understanding more of the systems with the derived interpretable representations, especially in high-radiation environments.
The main contributions of this work are summarized as follows:
Symbolic expressions for key variables: Developed interpretable symbolic models for dark currents, gas conductivity, and time to steady state using symbolic regression.
Symbolic dynamic expression: Applied SINDy to derive a sparse surface charge density model, resulting in an interpretable expression that models a complicated phenomenon evolution in time.
Method’s applicability: Demonstrated how machine learning-derived models can be obtained from simulated data, showcasing their capability in high-complexity HVDC systems. Extensions for experimental data are trivial.
2. Electromagnetic Modeling
This section recalls the approach for electromagnetic (EM) modeling of HVDC gas-insulated equipment, such as HVDC-GIS and HVDC-GIL. The capacitive to the resistive transition of the electric field following the application of DC voltage is described by the electro-quasistatic (EQS) approximation of Maxwell equations. Particularly, during the time evolution, the surface charge density
accumulates along the interface between dielectric gas and solid insulator according to the law [
31]:
where
is the normal vector pointing out from the solid insulator,
and
the current densities within the insulator and the gas,
is the interface electric conductivity, and
the electric field tangent to the interface. It is well recognized that the accumulation of
is related to the main fault phenomena within HVDC gas-insulated systems [
32].
A complete description of the electric field in the dielectric gas (
) requires the solution of a self-consistent model, described by the following set of Partial Differential Equations (PDEs) [
1,
33]:
Equations (
2) and (
3) describe the generation, drift, diffusion, and recombination of positive (
) and negative (
) ion number densities through a set of transport parameters: the mobility (
), diffusion coefficient (
D), and the recombination ratio
R. The source of ion pairs is driven by the parameter
S,
e is the elementary charge and
is the permittivity of the gas. A detailed description of the boundary and initial conditions for the solution of the model, as well as the coupling with the model for the conduction in the solid insulator, is out of the scope of the paper; however, the interested reader can find details, e.g., in [
34,
35].
Combining Equations (
2) and (
3), the expression of the current density in the gas
is inferred:
3. Machine Learning Methods
This section briefly describes the machine learning methods. Sparse Identification of Nonlinear Dynamics and symbolic regression search for a symbolic representation to fit the target variable. This learning process uses data that might be generated by simulations or collected in experiments and operations. This work uses only simulated data from a physical model. The methods permit the search for parsimonious models, either by promoting the sparseness of the fitted coefficients or evaluating the expression’s complexity. The learning tasks are supervised, i.e., the target is known and provided to the algorithm. The model space comprises analytic expressions that use predictive variables collected or generated simultaneously with the target.
3.1. SINDy
The SINDy (Sparse Identification of Nonlinear Dynamics) used in this work was proposed by [
13] to find the governing equations from measurement data of nonlinear dynamical systems. The method uses machine learning and promotes the sparsity of the parameters, determining the fewest terms sufficient to describe the observed data accurately. It might be classified as a symbolic regression method, focusing on fitting polynomial expressions to describe derivative data, as presented in Equation (
6), assuming that only a few terms are essential for the regression to build low-order models that tend to generalize and avoid overfitting [
14,
36].
Later, robust dynamics learning with rational polynomial candidates was proposed and developed by [
36], increasing the robustness of the algorithm to noise, and constrained SINDy was proposed and developed by [
14], which enables the constrain on the coefficients given a prior knowledge, for example, from physics. The SINDy method was successfully applied for fluid dynamics combined with model order reduction (POD) [
14], complex mechanical problems [
13,
36], and reaction kinetics [
36]. The identified models are parsimonious, accurately equilibrating complexity, and remain interpretable as a symbolic representation [
14].
The sparsity-promoting characteristics of the method make it a good algorithm for finding simplified or reduced models of apparently complicated/complex systems [
13,
14]. Equation (
7) presents the general regression of the nonlinear dynamical system, and an example of a sparse, regularized, regression problem to find
, using LASSO, is presented in Equation (
8) [
13].
where
∈
represents the feature library, which is the candidate functions to describe the evolution in time of the system;
X∈
is the matrix with the row vector of state
∈
at each
;
are the vector coefficients
∈
where
weights the sparsity constraint of the optimization. Each column of
is learned separately to find a sparse vector of coefficients for each variable of interest.
The state vector
∈
can be collected during the simulation or from measurement data, sampled at different times
. The derivative of the interested dynamic state comes from the time-series data collected, and the values of the state of the system at the time
t build the feature library [
13]. An example of a feature library is presented in matrix (
9), where each column in
represents a candidate function for the right-hand side on Equation (
6).
where
represents all candidate polynomial functions of order
. For example, matrix (
10) presents the library when
.
The feature library can be a nonlinear function built and chosen before optimization. No assumptions or priors are needed to obtain a sparse system representation. During the fitting of
, the optimization algorithm promotes the system’s sparseness. The algorithm for learning the coefficients is the sequential thresholded least-squares (STLSs) [
13]. If the feature library consists only of linear terms and the optimization is not penalized, the SINDy algorithm reduces to the dynamic mode decomposition one [
14].
The sequential thresholded least-squares (STLSs) algorithm begins with a least-squares solution for
, where all coefficients smaller than a cutoff value
are thresholded. This thresholding process repeats iteratively until the non-zero coefficients converge. The method is computationally efficient, rapidly achieving a sparse solution in only a few iterations. Furthermore, it is remarkably robust to noise, performing well even when the derivatives come from noisy data [
13,
14]. The best
and the model selection should be chosen by varying the parameter and searching at the Pareto front [
14].
As evident from the algorithm, the
cutoff threshold controls the number of predictive variables in the model, bringing a tradeoff between complexity and accuracy in the learned model. Generally, the higher the
, the lower the complexity and accuracy; it might be underfitting the data in this case. Choosing simpler models with a sparse matrix of coefficients and good accuracy is advisable. Physical systems generally have only a few relevant terms in the governing Equations [
13], and this enables the bypassing of the intractable brute force search to sweep through all possible model structures [
14] by promoting the sparseness in the solution.
This work uses the PySINDy library, which implements different SINDy methods readily available in Python [
37,
38].
3.2. Symbolic Regression
Discovering equations from observed data is one of the pillars of science. Symbolic regression, a machine learning algorithm that discovers generalizable and interpretable symbolic models, can automate this process.
The software used in this article is the package described in [
11], the library PySR that implements an efficient and distributed symbolic regression. PySR is a generalizable and open-source algorithm that can be passed custom losses and operators to the optimization process by default, aimed at “accelerating the discovery of interpretable symbolic models from data” [
11].
In [
11], implementation, a genetic algorithm searches for potential equations with a multi-population evolution in a loop of evolve–simplify–optimize. Although real-value coefficients appear in the expressions, they are optimized using classical methods. The genetic algorithm is used to optimize the presence of operations, represented by integers. This way, symbolic regression is optimized, as discovering equations from data is complicated and time-consuming [
11].
Figure 1 presents a tree-based representation of the expression
, where ‘+’ and ‘*’ nodes are the binary operators of addition and multiplication, respectively. A “node” in a tree represents each operation and operand of an analytical expression. The default complexity used for the fittings in this article in PySR is the total number of nodes in the expression independently of its content [
11].
The generalization and interpretability of the models are promoted by jointly minimizing the prediction error and the model complexity. A penalization term on the loss function related to the frecency of the complexity of the expression promotes parsimony in the learning. Frecency measures the frequency and recency of a given complexity in the population [
11].
The analysis of the best model in the fitting considers a score value as presented in Equation (
11). This score combines the reduced loss of the discovered equation and its increased complexity related to the precedent candidate.
The optimization process of PySR is the classical genetic algorithm based on the tournament for individual selection and mutations and crossovers to generate new ones. The software has some adaptations of the algorithms, mainly due to the nature of the model being optimized [
11].
A simple evolutionary algorithm starts with a population of individuals, a fitness function, and a set of mutation operators. The algorithm evaluates each individual’s fitness in a randomly chosen subset of individuals (e.g., ns-sized, where ns = 2 for standard tournament selection), and the fittest one is selected with a probability p. If it is not selected, the process repeats until only one remains. A copy of the designated winner individual undergoes a randomly chosen mutation. Finally, the mutated individual replaces the weakest member of the population. PySR, at this last step, replaces the eldest member, known as “age-regularized” evolution. Simulated annealing algorithm selects individuals, controlling individuals’ increase and narrowing diversity by setting a high or low temperature, respectively, [
11].
The second modification of PySR from the classical evolution algorithm is adapting for the mathematical expressions, introducing the loop of evolve–simplify–optimize. The evolve step is the evolution selection as previously described; simplify is the simplification phase of reducing equations to an equivalent, simpler form using algebraic transformations, Finally, the optimization step uses a classical optimization algorithm to fine-tune constants within the equations.
The third and last modification is the adaptative parsimony metric already described previously, i.e., using frecency instead of the complexity of the expression in the loss function, which gives the adaptability and results in the number of expressions at different complexities being roughly the same.
4. Methodology
This section describes the steps performed in this work and details some important aspects of the methodology. The work can be split into three macro steps: data generation, transformation, and modeling. The flowchart in
Figure 2 presents the steps and subtasks performed.
4.1. Data Generation
The data generation was performed in the commercial Finite Element Software (FEM) COMSOL
® Multiphysics 6.2, which can be used to simulate many engineering problems, from heat transfer to electromagnetic, by modeling the physical phenomena. The physics used was the
electric current (ec) interface in the whole domain, setting the suitable boundary conditions in the electrodes. The drift diffusion reaction model, which implements Equations (
2) and (3) in the gas domain, was set up using the
Transport of Diluted Species (tds) interface. The
ec and
tds interfaces are coupled to solve for electric field and charge concentration. The current density in the gas, contributing to the surface charge accumulation and described by Equation (
5), was implemented as a variable in the physics, using the solutions of
ec and
tds. Further details of implementation can be consulted in [
9]. All simulations are time-dependent studies, i.e., the solution of the system’s evolution in time. The java code of the time-dependent simulations can be accessed in the repository [
39].
The model described in
Section 2, whose geometry depicted in
Figure 3 inspired by [
40], was run 120 times on COMSOL
®, varying the ion-pair generation rate and the potential applied. The results from the time-dependent simulations compose the final datasets in the learning tasks, from which the target and predictive variables values were extracted. The total simulation time was 8000 h for each pair of varied parameters, with 1000 time sampling points uniformly distributed.
Additional electrostatic simulations, which solve the Poisson PDE for potential without the DDR formulation and surface charging, were performed with varying potential applied in the same geometry of
Figure 3 and the same material properties used in the previous 120 simulation runs described. The results from the electrostatic simulations were only used as predictive variables in the learning tasks, not being used as targets.
4.2. Data Transformation
This work involved learning tasks focused on different targets to predict. Three were related to the system’s steady-state conditions, and one focused on modeling the dynamics. In the latter one, the surface charge density derivative in time was the target variable, specifically, the derivative of the time series. The targets for the steady-state condition were the time to steady state, the value of dark currents, and the equivalent gas conductivity. The predictive variables used were the simulations’ input conditions, the system’s time-dependent solutions using the model described in
Section 2, and the system’s electrostatic solutions.
The steady state in the time series data was identified by comparing the standard deviation of the time series in an analyzed rolling window and the whole collected time. It is important to note that the timestep in the simulations was roughly 10 h, so the steady-state time was inferred by this method, and the data had up to this precision.
The number of data points used for each rolling window in steady-state identification was 20, and the step was 1, so sequential rolling windows overlapped. A threshold = was applied to identify the reach of the steady state based on a relative deviation for each window. All simulations reached a steady state, so there was no problem with unidentified cases. Therefore, the value considered for the steady state was the one at the last step for all variables.
The analyzed dataset for dynamics was surface charge densities, so each sampled coordinate in the arc length of the gas–solid insulator interface had a computed time to reach the steady state. The maximum value calculated on the arc was used as the time to reach the steady state for each simulation.
There was some scaling of variables to make predictive and target close in order of magnitude. This was important as the algorithms used were sensitive to large differences. Feature engineering was performed as well, specifically for dynamics learning. The list below presents the scaling and feature engineering that was performed.
Conductivity and currents: transformed units to pico multiplying by or nano multiplying by ;
Time to steady state: converted to hours divided by 3600;
Feature engineering: the reciprocal of exponential of time divided by 300, i.e., ; the reciprocal of variables such as potential and surface electric field.
4.3. Modelling
The SR was used to learn the variables’ values at steady state, and SINDy was used to predict the derivative of surface charge density over time. The learning algorithms were applied to 80% of the data collected in the simulations, called the training set, 15% was used as the validation dataset, and the test error was computed in the 5% not used in training and validation, called the test set. Random sampling without replacement of all collected data split the sets.
Each algorithm, SR and SINDy, had hyperparameters to be tuned to perform the learning task better. The validation dataset was used in this tuning procedure. The process involved try-and-error evaluating the final validation error in each iteration and using a grid search to find the optimal combination of parameters. It is important to highlight that the test dataset was never used in this process; it was only evaluated at the end when all hyperparameters were already chosen.
The hyperparameters tuned in each learning case for SR are presented in the list below. Descriptions of all other parameters can be consulted in the doc strings of the code in the repository indicated on [
11]. Other available input parameters in the software were not tuned, and their default value was used.
niterations: maximum number of iterations of the learning algorithm;
unary_operators: allowed unary operators;
binary_operators: allowed binary operators;
maxsize: maximum complexity, related to the number of operations allowed;
model_selection: model selection criteria;
parsimony: punishes complexity;
constraints: set constraints for the equations.
The hyperparameters tuned in the learning for SINDy, mostly related to the STLSQ optimizer and library, are presented in the list below. Descriptions of all other parameters can be consulted in the doc strings of the code in the repository indicated on [
37,
38]. Other available input parameters in the software were not tuned, and their default value was used.
threshold: minimum coefficient value to not be set to zero;
alpha: weight on the regularization;
max_iter: maximum iterations of the optimization algorithm;
polynomial_degree: polynomial degree of the feature library;
include_bias: include the 1s column in the feature library.
The SR implementation in Python used the library PySR described in the paper [
11], version 0.19.4; and the SINDy implementation, used the library PySINDy described in the papers [
37,
38], version 1.7.5.
6. Discussion
The learning tasks were successfully performed using the symbolic regression and SINDy algorithms, yielding relatively good results based on the validation and test errors. Dark currents and conductivity required low-complexity symbolic equations to fit the data, while time to steady state, even with highly complex equations, could not reach very good results.
The final Equation (
12) for dark current in the gas is plausible and has the same behavior related to radiation as observed in the literature [
3], i.e., the dark current increases linearly with ion-pair generation rate in the saturation condition reached in steady state. In the lower voltage conditions, the dark current through the gas–electrode is higher in the steady state as there is a higher concentration of ions in the gas that generate the current, which is caused by the lower flux in the interface of the gas–solid insulator. The higher the voltage, the higher the current in the solid insulator and the flux in the gas–solid insulator boundary.
The results derived in [
6] disagree with Equation (
12) as the dark current increases with the increase in potential applied. Still, it is important to note that the model constructed by the authors is for air and for a different geometry, which does not incorporate the intricate dynamics of the flux in the boundary gas–solid insulator. The authors also observe the linear relation of radiation dosage related to the ion-pair generation rate and the dark current.
The time to steady state does not vary much between the conditions simulated, demonstrating it is not strongly dependent on ion-pair generation rate and voltage. Further analysis, including simulations with varying conductivities and permittivities in the gas and solid, might improve the model with more significant variations in the time to steady state, as different conductivities will influence the system time constant. The relatively high test error indicates that the model will perform even worse for other simulation conditions not present in the training dataset.
Conductivity on the steady state was fitted and has a relatively low test error. Still, it can significantly impact time-dependent simulations as the errors might add up over time. Furthermore, the values of the electric field and ion concentrations might differ considerably in the initial times of the simulation compared to the steady-state ones in which the model was fitted, so it is not currently advisable to use it as a surrogate model. Further investigations are needed to compare simulations using DDR and the analytical expression. The DDR shows that the conductivity of the gas cannot be modeled as constant (see [
9] for further reference), so the current analytical expression is a better alternative.
Good model performance is observed in SINDy surface charge density dynamics fitting. This learning task was more demanding, as it had more data points—it is a time series that varies in space, and the surface charging couples gas dynamics and electric field solutions. However, work is needed to improve the expressions for use as a surrogate model in the simulations. The current one demonstrated the power of the algorithm in discovering dynamics from data; even for large amounts of data and a big library, it efficiently fitted a sparse representation.
The dynamic of saturation of the surface charge density observed in
Figure 6 is called capacitive-resistive transition, described in many previous papers [
1,
9] and expected. The charging of positive or negative particles is influenced by the geometry of the solid insulator interface and its proximity to the electrodes, as seen in
Figure 5. The fast saturation behavior, observed in the curves on
Figure 6, is related to neglecting the interface conductivity of the boundary, so all current from the gas that cannot be dissipated by the current in the solid insulator accumulates at the interface, as can be seen in Equation (
1) excluding the last term. The geometry and distance from the electrode create specific points of accumulation on the boundary; the same preferred accumulation near the positive electrode of negative particles is observed in [
9]. The varying time to steady state observed in
Figure 5b is discussed and derived by [
40] as well, although with a different method. As discussed previously, the linear increase in current in the solid with potential is expected and can be seen in
Figure 4b.
The physical units of the learned Equations (
12) and (
14) can be used to infer the physical units of the learned coefficients using dimensional analysis. These units and values can be coupled with prior knowledge about the system to develop its symbolic representation further, for example, by combining system conditions that were not varied in the study to compute the learned coefficients. This further step is out of the scope of this work, but it might be an interesting path to explore with combinatorics and symbolic programming.
One of the challenges in the learning tasks was the considerable discrepancy in the variables’ order of dimension. It interferes mainly with applying the STLS in SINDy, which depends on the absolute value of the coefficients in the pruning based on the threshold step. The discrepancy was overcome by normalizing the variables or transforming the units, for example, in kilo, nano, and pico.
It is important to point out that the system’s physics is simplified, and other phenomena might be important in the simulated conditions, such as the anode’s emittance of electrons, the surface conductivity of the gas–solid insulator interface, and spatial and temporal variation in temperature. The conditions simulated are restricted to the same geometry and material properties. Further enrichment of the dataset with variations of these properties is interesting for building more generally applicable analytical expressions for the HVDC-GIS in RIC conditions.