-
A Direct Construction of Solitary Waves for a Fractional Korteweg-de Vries Equation With an Inhomogeneous Symbol
Authors:
Swati Yadav,
Jun Xue
Abstract:
We construct solitary waves for the fractional Korteweg-De Vries type equation $u_t + (Λ^{-s}u + u^2)_x = 0$, where $Λ^{-s}$ denotes the Bessel potential operator $(1 + |D|^2)^{-\frac{s}{2}}$ for $s \in (0,\infty)$. The approach is to parameterise the known periodic solution curves through the relative wave height. Using a priori estimates, we show that the periodic waves locally uniformly converg…
▽ More
We construct solitary waves for the fractional Korteweg-De Vries type equation $u_t + (Λ^{-s}u + u^2)_x = 0$, where $Λ^{-s}$ denotes the Bessel potential operator $(1 + |D|^2)^{-\frac{s}{2}}$ for $s \in (0,\infty)$. The approach is to parameterise the known periodic solution curves through the relative wave height. Using a priori estimates, we show that the periodic waves locally uniformly converge to waves with negative tails, which are transformed to the desired branch of solutions. The obtained branch reaches a highest wave, the behavior of which varies with $s$. The work is a generalisation of recent work by Ehrnström-Nik-Walker, and is as far as we know the first simultaneous construction of small, intermediate and highest solitary waves for the complete family of (inhomogeneous) fractional KdV equations with negative-order dispersive operators. The obtained waves display exponential decay rate as $|x| \to \infty$.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
A new characterization of the dissipation structure and the relaxation limit for the compressible Euler-Maxwell system
Authors:
Timothée Crin-Barat,
Yue-Jun Peng,
Ling-Yun Shou,
Jiang Xu
Abstract:
We investigate the three-dimensional compressible Euler-Maxwell system, a model for simulating the transport of electrons interacting with propagating electromagnetic waves in semiconductor devices. First, we show the global well-posedness of classical solutions being a sharp small perturbation of constant equilibrium in a critical regularity setting, uniformly with respect to the relaxation param…
▽ More
We investigate the three-dimensional compressible Euler-Maxwell system, a model for simulating the transport of electrons interacting with propagating electromagnetic waves in semiconductor devices. First, we show the global well-posedness of classical solutions being a sharp small perturbation of constant equilibrium in a critical regularity setting, uniformly with respect to the relaxation parameter $\varepsilon>0$. Then, for all times $t>0$, we derive quantitative error estimates at the rate $O(\varepsilon)$ between the rescaled Euler-Maxwell system and the limit drift-diffusion model. To the best of our knowledge, this work provides the first global-in-time strong convergence for the relaxation procedure in the case of ill-prepared data.
In order to prove our results, we develop a new characterization of the dissipation structure for the linearized Euler-Maxwell system with respect to the relaxation parameter $\varepsilon$. This is done by partitioning the frequency space into three distinct regimes: low, medium and high frequencies, each associated with a different behaviour of the solution. Then, in each regime, the use of efficient unknowns and Lyapunov functionals based on the hypocoercivity theory leads to uniform a priori estimates.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
The pressureless damped Euler-Riesz system in the critical regularity framework
Authors:
Meiling Chi,
Ling-Yun Shou,
Jiang Xu
Abstract:
We are concerned with a system governing the evolution of the pressureless compressible Euler equations with Riesz interaction and damping in $\mathbb{R}^{d}$ ($d\geq1$), where the interaction force is given by $\nabla(-Δ)^{\smash{\frac{α-d}{2}}}(ρ-\barρ)$ with $d-2<α<d$. Referring to the standard dissipative structure of first-order hyperbolic systems, the purpose of this paper is to investigate…
▽ More
We are concerned with a system governing the evolution of the pressureless compressible Euler equations with Riesz interaction and damping in $\mathbb{R}^{d}$ ($d\geq1$), where the interaction force is given by $\nabla(-Δ)^{\smash{\frac{α-d}{2}}}(ρ-\barρ)$ with $d-2<α<d$. Referring to the standard dissipative structure of first-order hyperbolic systems, the purpose of this paper is to investigate the weaker dissipation effect arising from the interaction force and to establish the global existence and large-time behavior of solutions to the Cauchy problem in the critical $L^p$ framework. More precisely, it is observed by the spectral analysis that the density behaves like fractional heat diffusion at low frequencies. Furthermore, if the low-frequency part of the initial perturbation is bounded in some Besov space $\dot{B}^{σ_1}_{p,\infty}$ with $-d/p-1\leq σ_1<d/p-1$, it is shown that the $L^p$-norm of the $σ$-order derivative for the density converges to its equilibrium at the rate $(1+t)^{-\smash{\frac{σ-σ_1}{α-d+2}}}$, which coincides with that of the fractional heat kernel.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Precompactness in matrix weighted Bourgain-Morrey spaces
Authors:
Tengfei Bai,
Jingshi Xu
Abstract:
In this paper, we introduce matrix weighted Bourgain-Morrey spaces and obtain two sufficient conditions for precompact sets in matrix weighted Bourgain-Morrey spaces. We prove that the dyadic average operator is bounded on some matrix weighted Bourgain-Morrey spaces. With this result, we obtain the necessity for precompact sets in some matrix weighted Bourgain-Morrey spaces.
In this paper, we introduce matrix weighted Bourgain-Morrey spaces and obtain two sufficient conditions for precompact sets in matrix weighted Bourgain-Morrey spaces. We prove that the dyadic average operator is bounded on some matrix weighted Bourgain-Morrey spaces. With this result, we obtain the necessity for precompact sets in some matrix weighted Bourgain-Morrey spaces.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes
Authors:
Yan Huang,
Xiang Li,
Yipeng Shen,
Niao He,
Jinming Xu
Abstract:
In this paper, we show that applying adaptive methods directly to distributed minimax problems can result in non-convergence due to inconsistency in locally computed adaptive stepsizes. To address this challenge, we propose D-AdaST, a Distributed Adaptive minimax method with Stepsize Tracking. The key strategy is to employ an adaptive stepsize tracking protocol involving the transmission of two ex…
▽ More
In this paper, we show that applying adaptive methods directly to distributed minimax problems can result in non-convergence due to inconsistency in locally computed adaptive stepsizes. To address this challenge, we propose D-AdaST, a Distributed Adaptive minimax method with Stepsize Tracking. The key strategy is to employ an adaptive stepsize tracking protocol involving the transmission of two extra (scalar) variables. This protocol ensures the consistency among stepsizes of nodes, eliminating the steady-state error due to the lack of coordination of stepsizes among nodes that commonly exists in vanilla distributed adaptive methods, and thus guarantees exact convergence. For nonconvex-strongly-concave distributed minimax problems, we characterize the specific transient times that ensure time-scale separation of stepsizes and quasi-independence of networks, leading to a near-optimal convergence rate of $\tilde{\mathcal{O}} \left( ε^{-\left( 4+δ\right)} \right)$ for any small $δ> 0$, matching that of the centralized counterpart. To our best knowledge, D-AdaST is the first distributed adaptive method achieving near-optimal convergence without knowing any problem-dependent parameters for nonconvex minimax problems. Extensive experiments are conducted to validate our theoretical results.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
A dissimilarity measure for semidirected networks
Authors:
Michael Maxfield,
Jingcheng Xu,
Cécile Ané
Abstract:
Semidirected networks have received interest in evolutionary biology as the appropriate generalization of unrooted trees to networks, in which some but not all edges are directed. Yet these networks lack proper theoretical study. We define here a general class of semidirected phylogenetic networks, with a stable set of leaves, tree nodes and hybrid nodes. We prove that for these networks, if we lo…
▽ More
Semidirected networks have received interest in evolutionary biology as the appropriate generalization of unrooted trees to networks, in which some but not all edges are directed. Yet these networks lack proper theoretical study. We define here a general class of semidirected phylogenetic networks, with a stable set of leaves, tree nodes and hybrid nodes. We prove that for these networks, if we locally choose the direction of one edge, then globally the set of paths starting by this edge is stable across all choices to root the network. We define an edge-based representation of semidirected phylogenetic networks and use it to define a dissimilarity between networks, which can be efficiently computed in near-quadratic time. Our dissimilarity extends the widely-used Robinson-Foulds distance on both rooted trees and unrooted trees. After generalizing the notion of tree-child networks to semidirected networks, we prove that our edge-based dissimilarity is in fact a distance on the space of tree-child semidirected phylogenetic networks.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Embedding and Duality of Matrix-Weighted Modulation Spaces
Authors:
Shengrong Wang,
Pengfei Guo,
Jingshi Xu
Abstract:
In this paper, we give a approximation characterization, the lifting property, embedding properties and the duality of matrix weighted modulation spaces.
In this paper, we give a approximation characterization, the lifting property, embedding properties and the duality of matrix weighted modulation spaces.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Input gradient annealing neural network for solving low-temperature Fokker-Planck equations
Authors:
Liangkai Hang,
Dan Hu,
Zin-Qin John Xu
Abstract:
We present a novel yet simple deep learning approach, called input gradient annealing neural network (IGANN), for solving stationary Fokker-Planck equations. Traditional methods, such as finite difference and finite elements, suffer from the curse of dimensionality. Neural network based algorithms are meshless methods, which can avoid the curse of dimensionality. However, at low temperature, when…
▽ More
We present a novel yet simple deep learning approach, called input gradient annealing neural network (IGANN), for solving stationary Fokker-Planck equations. Traditional methods, such as finite difference and finite elements, suffer from the curse of dimensionality. Neural network based algorithms are meshless methods, which can avoid the curse of dimensionality. However, at low temperature, when directly solving a stationary Fokker-Planck equation with more than two metastable states in the generalized potential landscape, the small eigenvalue introduces numerical difficulties due to a large condition number. To overcome these problems, we introduce the IGANN method, which uses a penalty of negative input gradient annealing during the training. We demonstrate that the IGANN method can effectively solve high-dimensional and low-temperature Fokker-Planck equations through our numerical experiments.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Research on the Evaluation Index System of Enterprise Production Efficiency
Authors:
W. Li,
J. Cai,
C. Wang,
Y. Chen,
J. Xu,
J. Zhao,
Y. Chen
Abstract:
This paper focuses on studying the evaluation index system for the production efficiency of tobacco enterprises. Considering the limitations of existing evaluation methods in accurately assessing the production quality of cigarette enterprises, a mathematical model based on the Analytic Hierarchy Process (AHP) is established. This model constructs an evaluation framework for the production efficie…
▽ More
This paper focuses on studying the evaluation index system for the production efficiency of tobacco enterprises. Considering the limitations of existing evaluation methods in accurately assessing the production quality of cigarette enterprises, a mathematical model based on the Analytic Hierarchy Process (AHP) is established. This model constructs an evaluation framework for the production efficiency of cigarette enterprises and subsequently analyzes the significance of each index within this framework. To comprehensively analyze the multi-index and feasibility aspects of the selected projects, the AHP method is employed to establish a comprehensive feasibility research and evaluation structure model. The result of this feasibility study provides the conclusion that the construction of an evaluation index system for the production efficiency of cigarette enterprises can indeed promote the enhancement of their production efficiency.
△ Less
Submitted 28 April, 2024;
originally announced April 2024.
-
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Authors:
Jianyu Xu,
Qiuzhuang Sun,
Yang Yang,
Huadong Mo,
Daoyi Dong
Abstract:
The 2019-20 Australia bushfire incurred numerous economic losses and significantly affected the operations of power systems. A power station or transmission line can be significantly affected due to bushfires, leading to an increase in operational costs. We study a fundamental but challenging problem of planning the optimal power flow (OPF) for power systems subject to bushfires. Considering the s…
▽ More
The 2019-20 Australia bushfire incurred numerous economic losses and significantly affected the operations of power systems. A power station or transmission line can be significantly affected due to bushfires, leading to an increase in operational costs. We study a fundamental but challenging problem of planning the optimal power flow (OPF) for power systems subject to bushfires. Considering the stochastic nature of bushfire spread, we develop a model to capture such dynamics based on Moore's neighborhood model. Under a periodic inspection scheme that reveals the in-situ bushfire status, we propose an online optimization modeling framework that sequentially plans the power flows in the electricity network. Our framework assumes that the spread of bushfires is non-stationary over time, and the spread and containment probabilities are unknown. To meet these challenges, we develop a contextual online learning algorithm that treats the in-situ geographical information of the bushfire as a 'spatial context'. The online learning algorithm learns the unknown probabilities sequentially based on the observed data and then makes the OPF decision accordingly. The sequential OPF decisions aim to minimize the regret function, which is defined as the cumulative loss against the clairvoyant strategy that knows the true model parameters. We provide a theoretical guarantee of our algorithm by deriving a bound on the regret function, which outperforms the regret bound achieved by other benchmark algorithms. Our model assumptions are verified by the real bushfire data from NSW, Australia, and we apply our model to two power systems to illustrate its applicability.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
The Cattaneo-Christov approximation of Fourier heat-conductive compressible fluids
Authors:
Timothée Crin-Barat,
Shuichi Kawashima,
Jiang Xu
Abstract:
We investigate the Navier-Stokes-Cattaneo-Christov (NSC) system in $\mathbb{R}^d$ ($d\geq3$), a model of heat-conductive compressible flows serving as a finite speed of propagation approximation of the Navier-Stokes-Fourier (NSF) system. Due to the presence of Oldroyd's upper-convected derivatives, the system (NSC) exhibits a \textit{lack of hyperbolicity} which makes it challenging to establish i…
▽ More
We investigate the Navier-Stokes-Cattaneo-Christov (NSC) system in $\mathbb{R}^d$ ($d\geq3$), a model of heat-conductive compressible flows serving as a finite speed of propagation approximation of the Navier-Stokes-Fourier (NSF) system. Due to the presence of Oldroyd's upper-convected derivatives, the system (NSC) exhibits a \textit{lack of hyperbolicity} which makes it challenging to establish its well-posedness, especially in multi-dimensional contexts. In this paper, within a critical regularity functional framework, we prove the global-in-time well-posedness of (NSC) for initial data that are small perturbations of constant equilibria, uniformly with respect to the approximation parameter $\varepsilon>0$. Then, building upon this result, we obtain the sharp large-time asymptotic behaviour of (NSC) and, for all time $t>0$, we derive quantitative error estimates between the solutions of (NSC) and (NSF). To the best of our knowledge, our work provides the first strong convergence result for this relaxation procedure in the three-dimensional setting and for ill-prepared data.
The (NSC) system is partially dissipative and incorporates both partial diffusion and partial damping mechanisms. To address these aspects and ensure the large-time stability of the solutions, we construct localized-in-frequency perturbed energy functionals based on the hypocoercivity theory. More precisely, our analysis relies on partitioning the frequency space into \textit{three} distinct regimes: low, medium and high frequencies. Within each frequency regime, we introduce effective unknowns and Lyapunov functionals, revealing the spectrally expected dissipative structures.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
On the flag structure and classification of the holomorphic curves on C*-algebras
Authors:
Zhimeng Chen,
Jing Xu
Abstract:
In this note, we will define the formulas of curvature and it's covariant derivatives for holomorphic curves on C*-algebras for the multivariable case. As applications, the unitarily and similarly classification theorems for holomorphic bundle and commuting operator tuples in Cowen-Douglas class are given.
In this note, we will define the formulas of curvature and it's covariant derivatives for holomorphic curves on C*-algebras for the multivariable case. As applications, the unitarily and similarly classification theorems for holomorphic bundle and commuting operator tuples in Cowen-Douglas class are given.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
$K_{0}$-groups and strongly irreducible decompositions of operator tuples
Authors:
Jing Xu
Abstract:
An operator tuple $\mathbf{T}=(T_{1},\ldots,T_{n})$ is called strongly irreducible (SI), if the joint commutant of $\mathbf{T}$ does not any nontrivial idempotent operator. In this paper, we study the uniqueness of finitely strong irreducible decomposition of operator tuples up to similarity by $K$-theory of operator algebra, and give the algebraically similarity invariants of the Cowen-Douglas tu…
▽ More
An operator tuple $\mathbf{T}=(T_{1},\ldots,T_{n})$ is called strongly irreducible (SI), if the joint commutant of $\mathbf{T}$ does not any nontrivial idempotent operator. In this paper, we study the uniqueness of finitely strong irreducible decomposition of operator tuples up to similarity by $K$-theory of operator algebra, and give the algebraically similarity invariants of the Cowen-Douglas tuple with index 1 by using $K_{0}$-group of the commutant of operator tuples. As an application, we calculate $K_{0}$-groups of some multiplier algebras, and describe the similarity of backwards multishifts on Drury-Arveson space by means of inflation theory.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Galilean symmetry of the KdV hierarchy
Authors:
Jianghao Xu,
Di Yang
Abstract:
By solving the infinitesimal Galilean symmetry for the KdV hierarchy, we obtain an explicit expression for the corresponding one-parameter Lie group, which we call the Galilean symmetry of the KdV hierarchy. As an application, we establish an explicit relationship between the non-abelian Born--Infeld partition function and the generalized Brézin--Gross--Witten partition function.
By solving the infinitesimal Galilean symmetry for the KdV hierarchy, we obtain an explicit expression for the corresponding one-parameter Lie group, which we call the Galilean symmetry of the KdV hierarchy. As an application, we establish an explicit relationship between the non-abelian Born--Infeld partition function and the generalized Brézin--Gross--Witten partition function.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
A general $q$-series transformation and its applications to multi-sum Rogers-Ramanujan-Slater identities
Authors:
Jianan Xu,
Xinrong Ma
Abstract:
In the present paper, we establish a general transformation for $q$-series which contains L. Wang et al's transformation involved in Nahm series. As direct applications, some concrete new transformation formulas for the ${}_{r+1}φ_r$ series as well as $q$-identities of multi-sum Rogers-Ramanujan-Slater type are presented.
In the present paper, we establish a general transformation for $q$-series which contains L. Wang et al's transformation involved in Nahm series. As direct applications, some concrete new transformation formulas for the ${}_{r+1}φ_r$ series as well as $q$-identities of multi-sum Rogers-Ramanujan-Slater type are presented.
△ Less
Submitted 27 May, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
FPM-WSI: Fourier ptychographic whole slide imaging via feature-domain backdiffraction
Authors:
Shuhe Zhang,
Aiye Wang,
Jinghao Xu,
Tianci Feng,
Jinhua Zhou,
An Pan
Abstract:
Fourier ptychographic microscopy (FPM), characterized by high-throughput computational imaging, theoretically provides a cunning solution to the trade-off between spatial resolution and field of view (FOV), which has a promising prospect in the application of digital pathology. However, block reconstruction and then stitching has currently become an unavoidable procedure due to vignetting effects.…
▽ More
Fourier ptychographic microscopy (FPM), characterized by high-throughput computational imaging, theoretically provides a cunning solution to the trade-off between spatial resolution and field of view (FOV), which has a promising prospect in the application of digital pathology. However, block reconstruction and then stitching has currently become an unavoidable procedure due to vignetting effects. The stitched image tends to present color inconsistency in different image segments, or even stitching artifacts. In response, we reported a computational framework based on feature-domain backdiffraction to realize full-FOV, stitching-free FPM reconstruction. Different from conventional algorithms that establish the loss function in the image domain, our method formulates it in the feature domain, where effective information of images is extracted by a feature extractor to bypass the vignetting effect. The feature-domain error between predicted images based on estimation of model parameters and practically captured images is then digitally diffracted back through the optical system for complex amplitude reconstruction and aberration compensation. Through massive simulations and experiments, the method presents effective elimination of vignetting artifacts, and reduces the requirement of precise knowledge of illumination positions. We also found its great potential to recover the data with a lower overlapping rate of spectrum and to realize automatic blind-digital refocusing without a prior defocus distance.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
A Unified-Field Monolithic Fictitious Domain-Finite Element Method for Fluid-Structure-Contact Interactions and Applications to Deterministic Lateral Displacement Problems
Authors:
Cheng Wang,
Pengtao Sun,
Yumiao Zhang,
Jinchao Xu,
Yan Chen,
Jiarui Han
Abstract:
Based upon two overlapped, body-unfitted meshes, a type of unified-field monolithic fictitious domain-finite element method (UFMFD-FEM) is developed in this paper for moving interface problems of dynamic fluid-structure interactions (FSI) accompanying with high-contrast physical coefficients across the interface and contacting collisions between the structure and fluidic channel wall when the stru…
▽ More
Based upon two overlapped, body-unfitted meshes, a type of unified-field monolithic fictitious domain-finite element method (UFMFD-FEM) is developed in this paper for moving interface problems of dynamic fluid-structure interactions (FSI) accompanying with high-contrast physical coefficients across the interface and contacting collisions between the structure and fluidic channel wall when the structure is immersed in the fluid. In particular, the proposed novel numerical method consists of a monolithic, stabilized mixed finite element method within the frame of fictitious domain/immersed boundary method (IBM) for generic fluid-structure-contact interaction (FSCI) problems in the Eulerian-updated Lagrangian description, while involving the no-slip type of interface conditions on the fluid-structure interface, and the repulsive contact force on the structural surface when the immersed structure contacts the fluidic channel wall. The developed UFMFD-FEM for FSI or FSCI problems can deal with the structural motion with large rotational and translational displacements and/or large deformation in an accurate and efficient fashion, which are first validated by two benchmark FSI problems and one FSCI model problem, then by experimental results of a realistic FSCI scenario -- the microfluidic deterministic lateral displacement (DLD) problem that is applied to isolate circulating tumor cells (CTCs) from blood cells in the blood fluid through a cascaded filter DLD microchip in practice, where a particulate fluid with the pillar obstacles effect in the fluidic channel, i.e., the effects of fluid-structure interaction and structure collision, play significant roles to sort particles (cells) of different sizes with tilted pillar arrays.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Identifying circular orders for blobs in phylogenetic networks
Authors:
John A. Rhodes,
Hector Banos,
Jingcheng Xu,
Cécile Ané
Abstract:
Interest in the inference of evolutionary networks relating species or populations has grown with the increasing recognition of the importance of hybridization, gene flow and admixture, and the availability of large-scale genomic data. However, what network features may be validly inferred from various data types under different models remains poorly understood. Previous work has largely focused o…
▽ More
Interest in the inference of evolutionary networks relating species or populations has grown with the increasing recognition of the importance of hybridization, gene flow and admixture, and the availability of large-scale genomic data. However, what network features may be validly inferred from various data types under different models remains poorly understood. Previous work has largely focused on level-1 networks, in which reticulation events are well separated, and on a general network's tree of blobs, the tree obtained by contracting every blob to a node. An open question is the identifiability of the topology of a blob of unknown level. We consider the identifiability of the circular order in which subnetworks attach to a blob, first proving that this order is well-defined for outer-labeled planar blobs. For this class of blobs, we show that the circular order information from 4-taxon subnetworks identifies the full circular order of the blob. Similarly, the circular order from 3-taxon rooted subnetworks identifies the full circular order of a rooted blob. We then show that subnetwork circular information is identifiable from certain data types and evolutionary models. This provides a general positive result for high-level networks, on the identifiability of the ordering in which taxon blocks attach to blobs in outer-labeled planar networks. Finally, we give examples of blobs with different internal structures which cannot be distinguished under many models and data types.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
Sharp Information-Theoretic Thresholds for Shuffled Linear Regression
Authors:
Leon Lufkin,
Yihong Wu,
Jiaming Xu
Abstract:
This paper studies the problem of shuffled linear regression, where the correspondence between predictors and responses in a linear model is obfuscated by a latent permutation. Specifically, we consider the model $y = Π_* X β_* + w$, where $X$ is an $n \times d$ standard Gaussian design matrix, $w$ is Gaussian noise with entrywise variance $σ^2$, $Π_*$ is an unknown $n \times n$ permutation matrix…
▽ More
This paper studies the problem of shuffled linear regression, where the correspondence between predictors and responses in a linear model is obfuscated by a latent permutation. Specifically, we consider the model $y = Π_* X β_* + w$, where $X$ is an $n \times d$ standard Gaussian design matrix, $w$ is Gaussian noise with entrywise variance $σ^2$, $Π_*$ is an unknown $n \times n$ permutation matrix, and $β_*$ is the regression coefficient, also unknown. Previous work has shown that, in the large $n$-limit, the minimal signal-to-noise ratio ($\mathsf{SNR}$), $\lVert β_* \rVert^2/σ^2$, for recovering the unknown permutation exactly with high probability is between $n^2$ and $n^C$ for some absolute constant $C$ and the sharp threshold is unknown even for $d=1$. We show that this threshold is precisely $\mathsf{SNR} = n^4$ for exact recovery throughout the sublinear regime $d=o(n)$. As a by-product of our analysis, we also determine the sharp threshold of almost exact recovery to be $\mathsf{SNR} = n^2$, where all but a vanishing fraction of the permutation is reconstructed.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Ground states of planar Schrödinger-Poisson systems with an unbounded potential
Authors:
Miao Du,
Jiaxin Xu
Abstract:
In this paper, we deal with a class of planar Schrödinger-Poisson systems, namely, $-Δu+V(x)u+\fracγ{2π}\bigl(\log(|\cdot|)\ast|u|^{2}\bigr)u=b|u|^{p-2}u\ \text{in}\ \mathbb{R}^{2}$, where $γ> 0$, $b \geq 0$, $p>2$ and $V \in C(\mathbb{R}^2, \mathbb{R})$ is an unbounded potential function with $\inf_{\mathbb{R}^2} V >0$. Suppose moreover that the potential $V$ satisfies…
▽ More
In this paper, we deal with a class of planar Schrödinger-Poisson systems, namely, $-Δu+V(x)u+\fracγ{2π}\bigl(\log(|\cdot|)\ast|u|^{2}\bigr)u=b|u|^{p-2}u\ \text{in}\ \mathbb{R}^{2}$, where $γ> 0$, $b \geq 0$, $p>2$ and $V \in C(\mathbb{R}^2, \mathbb{R})$ is an unbounded potential function with $\inf_{\mathbb{R}^2} V >0$. Suppose moreover that the potential $V$ satisfies $\left|\{x \in \mathbb{R}^2:\: V(x)\leq M\}\right| < \infty$ for every $M>0$, we establish the existence of ground state solutions for this system via variational methods. Furthermore, we also explore the minimax characterization of ground state solutions. Our main results can be viewed as a counterpart of the result from Molle and Sardilli (Proc. Edinb. Math. Soc. 65:1133-1146, 2022), where the authors studied the existence of ground state solutions for the above planar Schrödinger-Poisson system in the case where $b>0$ and $p >4$.
△ Less
Submitted 24 June, 2024; v1 submitted 18 January, 2024;
originally announced February 2024.
-
A Mathematical Proof of the Four-Color Conjecture (1): Transformation Step
Authors:
Jin Xu
Abstract:
The four-color conjecture has puzzled mathematicians for over 170 years and has yet to be proven by purely mathematical methods. This series of articles provides a purely mathematical proof of the four-color conjecture, consisting of two parts: the transformation step and the decycle step. The transformation step uses two innovative tools, contracting and extending operations and unchanged bichrom…
▽ More
The four-color conjecture has puzzled mathematicians for over 170 years and has yet to be proven by purely mathematical methods. This series of articles provides a purely mathematical proof of the four-color conjecture, consisting of two parts: the transformation step and the decycle step. The transformation step uses two innovative tools, contracting and extending operations and unchanged bichromatic cycles, to transform the proof of the four-color conjecture into the decycle problem of 4-base modules. Moreover, the decycle step solves the decycle problem of 4-base modules using two other innovative tools: the color-connected potential and the pocket operations. This article presents the proof of the transformation step.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
LQ Optimal Control of First-Order Hyperbolic PDE Systems with Final State Constraints
Authors:
Xiaomin Xue,
Juanjuan Xu,
Huanshui Zhang,
Long Hu
Abstract:
This paper studies the linear-quadratic (LQ) optimal control problem of a class of systems governed by the first-order hyperbolic partial differential equations (PDEs) with final state constraints. The main contribution is to present the solvability condition and the corresponding explicit optimal controller by using the Lagrange multiplier method and the technique of solving forward and backward…
▽ More
This paper studies the linear-quadratic (LQ) optimal control problem of a class of systems governed by the first-order hyperbolic partial differential equations (PDEs) with final state constraints. The main contribution is to present the solvability condition and the corresponding explicit optimal controller by using the Lagrange multiplier method and the technique of solving forward and backward partial differential equations (FBPDEs). In particular, the result is reduced to the case with zero-valued final state constraints. Several numerical examples are provided to demonstrate the performance of the designed optimal controller.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Maximum-Norm Error Estimates of Fourth-Order Compact and ADI Compact Finite Difference Methods for Nonlinear Coupled Bacterial Systems
Authors:
Jie Xu,
Shusen Xie,
Hongfei Fu
Abstract:
In this paper, by introducing two temporal-derivative-dependent auxiliary variables, a linearized and decoupled fourth-order compact finite difference method is developed and analyzed for the nonlinear coupled bacterial systems. The temporal-spatial error splitting technique and discrete energy method are employed to prove the unconditional stability and convergence of the method in discrete maxim…
▽ More
In this paper, by introducing two temporal-derivative-dependent auxiliary variables, a linearized and decoupled fourth-order compact finite difference method is developed and analyzed for the nonlinear coupled bacterial systems. The temporal-spatial error splitting technique and discrete energy method are employed to prove the unconditional stability and convergence of the method in discrete maximum norm. Furthermore, to improve the computational efficiency, an alternating direction implicit (ADI) compact difference algorithm is proposed, and the unconditional stability and optimal-order maximum-norm error estimate for the ADI scheme are also strictly established. Finally, several numerical experiments are conducted to validate the theoretical convergence and to simulate the phenomena of bacterial extinction as well as the formation of endemic diseases.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
On the metric of the jet bundle and similarity on Dirichlet spaces
Authors:
Kui Ji,
Shanshan Ji,
Hyun-Kyoung Kwon,
Xiaoceng Liu,
Jing Xu
Abstract:
In general, it is more difficult to formulate a sufficient condition for similarity than a necessary condition. We give a sufficient condition for a Cowen-Douglas operator with a positivity condition to be similar to the backward shift operator on weighted Dirichlet space. This condition involves the holomorphic jet bundle of the eigenvector bundle of the operator.
In general, it is more difficult to formulate a sufficient condition for similarity than a necessary condition. We give a sufficient condition for a Cowen-Douglas operator with a positivity condition to be similar to the backward shift operator on weighted Dirichlet space. This condition involves the holomorphic jet bundle of the eigenvector bundle of the operator.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Solving multiscale dynamical systems by deep learning
Authors:
Zhi-Qin John Xu,
Junjie Yao,
Yuxiao Yi,
Liangkai Hang,
Weinan E,
Yaoyu Zhang,
Tianhan Zhang
Abstract:
Multiscale dynamical systems, modeled by high-dimensional stiff ordinary differential equations (ODEs) with wide-ranging characteristic timescales, arise across diverse fields of science and engineering, but their numerical solvers often encounter severe efficiency bottlenecks. This paper introduces a novel DeePODE method, which consists of a global multiscale sampling method and a fitting by deep…
▽ More
Multiscale dynamical systems, modeled by high-dimensional stiff ordinary differential equations (ODEs) with wide-ranging characteristic timescales, arise across diverse fields of science and engineering, but their numerical solvers often encounter severe efficiency bottlenecks. This paper introduces a novel DeePODE method, which consists of a global multiscale sampling method and a fitting by deep neural networks to handle multiscale systems. DeePODE's primary contribution is to address the multiscale challenge of efficiently uncovering representative training sets by combining the Monte Carlo method and the ODE system's intrinsic evolution without suffering from the ``curse of dimensionality''. The DeePODE method is validated in multiscale systems from diverse areas, including a predator-prey model, a power system oscillation, a battery electrolyte auto-ignition, and turbulent flames. Our methods exhibit strong generalization capabilities to unseen conditions, highlighting the power of deep learning in modeling intricate multiscale dynamical processes across science and engineering domains.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Exact Controllability of Discrete-Time Stochastic System with Multiplicative Noise
Authors:
Juanjuan Xu,
Huanshui Zhang
Abstract:
This paper is concerned with the exact controllability of discrete-time stochastic system which is one of the basic problems of modern control theory. Though the exact controllability of continuous-time system governed by Ito stochastic differential equations has been well studied in S. Peng, Progress in Natural Science, 1994, the counterpart of the discrete-time case is still open due to the adap…
▽ More
This paper is concerned with the exact controllability of discrete-time stochastic system which is one of the basic problems of modern control theory. Though the exact controllability of continuous-time system governed by Ito stochastic differential equations has been well studied in S. Peng, Progress in Natural Science, 1994, the counterpart of the discrete-time case is still open due to the adaptiveness constraint of the controllers and the solvability challenging of stochastic difference equation with terminal value. The main contribution in this paper is to present both the Gramian matrix criterion and the Rank criterion for the exact controllability of discrete-time stochastic system. The novelty lies in the transformation of the forward stochastic difference equation into a novel backward one.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Expressivity and Approximation Properties of Deep Neural Networks with ReLU$^k$ Activation
Authors:
Juncai He,
Tong Mao,
Jinchao Xu
Abstract:
In this paper, we investigate the expressivity and approximation properties of deep neural networks employing the ReLU$^k$ activation function for $k \geq 2$. Although deep ReLU networks can approximate polynomials effectively, deep ReLU$^k$ networks have the capability to represent higher-degree polynomials precisely. Our initial contribution is a comprehensive, constructive proof for polynomial…
▽ More
In this paper, we investigate the expressivity and approximation properties of deep neural networks employing the ReLU$^k$ activation function for $k \geq 2$. Although deep ReLU networks can approximate polynomials effectively, deep ReLU$^k$ networks have the capability to represent higher-degree polynomials precisely. Our initial contribution is a comprehensive, constructive proof for polynomial representation using deep ReLU$^k$ networks. This allows us to establish an upper bound on both the size and count of network parameters. Consequently, we are able to demonstrate a suboptimal approximation rate for functions from Sobolev spaces as well as for analytic functions. Additionally, through an exploration of the representation power of deep ReLU$^k$ networks for shallow networks, we reveal that deep ReLU$^k$ networks can approximate functions from a range of variation spaces, extending beyond those generated solely by the ReLU$^k$ activation function. This finding demonstrates the adaptability of deep ReLU$^k$ networks in approximating functions within various variation spaces.
△ Less
Submitted 10 January, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
Distributed Stochastic Bilevel Optimization: Improved Complexity and Heterogeneity Analysis
Authors:
Youcheng Niu,
Jinming Xu,
Ying Sun,
Yan Huang,
Li Chai
Abstract:
This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient estimation, leading to computational inefficiency. Moreover, the impact of data heterogeneity on convergence in bilevel problems is not explicitly characterized y…
▽ More
This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient estimation, leading to computational inefficiency. Moreover, the impact of data heterogeneity on convergence in bilevel problems is not explicitly characterized yet. To address these issues, we propose LoPA, a loopless personalized distributed algorithm that leverages a tracking mechanism for iterative approximation of inner-level solutions and Hessian-inverse matrices without relying on extra computation loops. Our theoretical analysis explicitly characterizes the heterogeneity across nodes denoted by $b$, and establishes a sublinear rate of ${\mathcal{O}}( {\frac{1}{(1-ρ)^2K} + \frac{{b^{\frac{2}{3}} }}{{\left( {1 - ρ} \right)^{\frac{2}{3}}K^{\frac{2}{3}} }} + \frac{1}{\sqrt{ K }} ( σ_{\rm{p}} + \frac{1}{\sqrt{m}}σ_{\rm{c}}) } )$ without the boundedness of local hypergradients, where $σ_{\rm p}$ and $σ_{\rm c}$ represent the gradient sampling variances associated with the inner- and outer-level variables, respectively. We also develop a variant of LoPA based on gradient tracking to eliminate the impact of data heterogeneity, yielding an improved rate of ${\mathcal{O}}(\frac{1}{ (1-ρ)^4K } + \frac{1}{\sqrt{K}}( σ_{\rm{p}} + \frac{1}{\sqrt{m}}σ_{\rm{c}} ) )$. The computational complexity of LoPA is of ${\mathcal{O}}({ε^{-2}})$ to an $ε$-stationary point, matching the communication complexity due to the loopless structure, which outperforms existing counterparts for DSBO. Numerical experiments validate the effectiveness of the proposed algorithm.
△ Less
Submitted 8 February, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Deep Neural Networks and Finite Elements of Any Order on Arbitrary Dimensions
Authors:
Juncai He,
Jinchao Xu
Abstract:
In this study, we establish that deep neural networks employing ReLU and ReLU$^2$ activation functions can effectively represent Lagrange finite element functions of any order on various simplicial meshes in arbitrary dimensions. We introduce two novel formulations for globally expressing the basis functions of Lagrange elements, tailored for both specific and arbitrary meshes. These formulations…
▽ More
In this study, we establish that deep neural networks employing ReLU and ReLU$^2$ activation functions can effectively represent Lagrange finite element functions of any order on various simplicial meshes in arbitrary dimensions. We introduce two novel formulations for globally expressing the basis functions of Lagrange elements, tailored for both specific and arbitrary meshes. These formulations are based on a geometric decomposition of the elements, incorporating several insightful and essential properties of high-dimensional simplicial meshes, barycentric coordinate functions, and global basis functions of linear elements. This representation theory facilitates a natural approximation result for such deep neural networks. Our findings present the first demonstration of how deep neural networks can systematically generate general continuous piecewise polynomial functions on both specific or arbitrary simplicial meshes.
△ Less
Submitted 11 January, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Robust Functional Principal Component Analysis for Non-Euclidean Random Objects
Authors:
Jiazhen Xu,
Andrew T. A. Wood,
Tao Zou
Abstract:
Functional data analysis offers a diverse toolkit of statistical methods tailored for analyzing samples of real-valued random functions. Recently, samples of time-varying random objects, such as time-varying networks, have been increasingly encountered in modern data analysis. These data structures represent elements within general metric spaces that lack local or global linear structures, renderi…
▽ More
Functional data analysis offers a diverse toolkit of statistical methods tailored for analyzing samples of real-valued random functions. Recently, samples of time-varying random objects, such as time-varying networks, have been increasingly encountered in modern data analysis. These data structures represent elements within general metric spaces that lack local or global linear structures, rendering traditional functional data analysis methods inapplicable. Moreover, the existing methodology for time-varying random objects does not work well in the presence of outlying objects. In this paper, we propose a robust method for analysing time-varying random objects. Our method employs pointwise Fréchet medians and then constructs pointwise distance trajectories between the individual time courses and the sample Fréchet medians. This representation effectively transforms time-varying objects into functional data. A novel robust approach to functional principal component analysis based on a Winsorized U-statistic estimator of the covariance structure is introduced. The proposed robust analysis of these distance trajectories is able to identify key features of time-varying objects and is useful for downstream analysis. To illustrate the efficacy of our approach, numerical studies focusing on dynamic networks are conducted. The results indicate that the proposed method exhibits good all-round performance and surpasses the existing approach in terms of robustness, showcasing its superior performance in handling time-varying objects data.
△ Less
Submitted 28 November, 2023;
originally announced December 2023.
-
On a class of planar Schrödinger-Poisson system with a bounded potential well
Authors:
Miao Du,
Jiaxin Xu
Abstract:
In this paper, we deal with the planar Schrödinger-Poisson system \begin{equation*}\begin{cases} -Δu + V(x) u + φu = b|u|^{p-2} u \ &\text{in}\ \mathbb{R}^{2},\\Δφ= u^{2} &\text{in}\ \mathbb{R}^{2},\end{cases} \end{equation*} where $b \geq 0$, $p > 2 $ and $V \in C(\mathbb{R}^2, \mathbb{R})$ is a potential function with $\inf_{\mathbb{R}^2} V >0$. Suppose moreover that $V$ exhibits a bounded poten…
▽ More
In this paper, we deal with the planar Schrödinger-Poisson system \begin{equation*}\begin{cases} -Δu + V(x) u + φu = b|u|^{p-2} u \ &\text{in}\ \mathbb{R}^{2},\\Δφ= u^{2} &\text{in}\ \mathbb{R}^{2},\end{cases} \end{equation*} where $b \geq 0$, $p > 2 $ and $V \in C(\mathbb{R}^2, \mathbb{R})$ is a potential function with $\inf_{\mathbb{R}^2} V >0$. Suppose moreover that $V$ exhibits a bounded potential well in the sense that $\lim_{|x|\rightarrow \infty} V(x)$ exists and is equal to $\sup_{\mathbb{R}^2} V$. By using variational methods, we obtain the existence of ground state solutions for this system in the case where $p \geq 3$. Furthermore, we also present a minimax characterization of ground state solutions. The main feature of this work is that we do not assume any periodicity or symmetry condition on the external potential $V$, which is essential to establish the compactness condition of Cerami sequences.
△ Less
Submitted 24 June, 2024; v1 submitted 12 December, 2023;
originally announced December 2023.
-
Sensitivity analysis for mixed binary quadratic programming
Authors:
Diego Cifuentes,
Santanu S. Dey,
Jingye Xu
Abstract:
We consider sensitivity analysis for Mixed Binary Quadratic Programs (MBQPs) with respect to changing right-hand-sides (rhs). We show that even if the optimal solution of a given MBQP is known, it is NP-hard to approximate the change in objective function value with respect to changes in rhs. Next, we study algorithmic approaches to obtaining dual bounds for MBQP with changing rhs. We leverage Bur…
▽ More
We consider sensitivity analysis for Mixed Binary Quadratic Programs (MBQPs) with respect to changing right-hand-sides (rhs). We show that even if the optimal solution of a given MBQP is known, it is NP-hard to approximate the change in objective function value with respect to changes in rhs. Next, we study algorithmic approaches to obtaining dual bounds for MBQP with changing rhs. We leverage Burer's completely-positive (CPP) reformulation of MBQPs. Its dual is an instance of co-positive programming (COP), and can be used to obtain sensitivity bounds. We prove that strong duality between the CPP and COP problems holds if the feasible region is bounded or if the objective function is convex, while the duality gap can be strictly positive if neither condition is met. We also show that the COP dual has multiple optimal solutions, and the choice of the dual solution affects the quality of the bounds with rhs changes. We finally provide a method for finding good nearly optimal dual solutions, and we present preliminary computational results on sensitivity analysis for MBQPs.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Reducibility of 1-D quantum harmonic oscillator with new unbounded oscillatory perturbations
Authors:
Jin Xu,
Jiawen Luo,
Zhiqiang Wang,
Zhenguo Liang
Abstract:
Enlightened by Lemma 1.7 in \cite{LiangLuo2021}, we prove a similar lemma which is based upon oscillatory integrals and Langer's turning point theory. From it we show that the Schr{ö}dinger equation $${\rm i}\partial_t u = -\partial_x^2 u+x^2 u+ε\langle x\rangle^μ\sum_{k\inΛ}\left(a_k(ωt)\sin(k|x|^β)+b_k(ωt) \cos(k|x|^β)\right) u,\quad u=u(t,x),~x\in\mathbb{R},~ β>1,$$ can be reduced in…
▽ More
Enlightened by Lemma 1.7 in \cite{LiangLuo2021}, we prove a similar lemma which is based upon oscillatory integrals and Langer's turning point theory. From it we show that the Schr{ö}dinger equation $${\rm i}\partial_t u = -\partial_x^2 u+x^2 u+ε\langle x\rangle^μ\sum_{k\inΛ}\left(a_k(ωt)\sin(k|x|^β)+b_k(ωt) \cos(k|x|^β)\right) u,\quad u=u(t,x),~x\in\mathbb{R},~ β>1,$$ can be reduced in $\mathcal{H}^1(\mathbb{R})$ to an autonomous system for most values of the frequency vector $ω$, where $Λ\subset\mathbb R\setminus\{0\}$, $|Λ|<\infty$ and $\langle x\rangle:=\sqrt{1+x^2}$. The functions $a_k(θ)$ and $b_k(θ)$ are analytic on $\mathbb T^n_σ$ and $μ\geq 0$ will be chosen according to the value of $β$. Comparing with \cite{LiangLuo2021}, the novelty is that the phase functions of oscillatory integral are more degenerate when $β>1$.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
On the supersingular locus of Shimura varieties for quaternionic unitary groups
Authors:
Yasuhiro Terakado,
Jiangwei Xue,
Chia-Fu Yu
Abstract:
We study a Shimura variety attached to a unitary similitude group of a skew-Hermitian form over a totally indefinite quaternion algebra over a totally real number field. We give a necessary and sufficient condition for the existence of skew-Hermitian self-dual lattices. Under this condition we show that the superspecial locus in the fiber at $p$ of the associated Shimura variety is non-empty. We a…
▽ More
We study a Shimura variety attached to a unitary similitude group of a skew-Hermitian form over a totally indefinite quaternion algebra over a totally real number field. We give a necessary and sufficient condition for the existence of skew-Hermitian self-dual lattices. Under this condition we show that the superspecial locus in the fiber at $p$ of the associated Shimura variety is non-empty. We also give an explicit formula for the number of irreducible components of the supersingular locus when $p$ is odd and unramified in the quaternion algebra.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
Mean Field Games with infinitely degenerate diffusion and non-coercive Hamiltonian
Authors:
Yiming Jiang,
Jingchuang Ren,
Yawei Wei,
Jie Xue
Abstract:
In this paper, we consider a class of infinitely degenerate partial differential systems to obtain the Nash equilibria in the mean field games. The degeneracy in the diffusion and the Hamiltonian may be different. This feature brings difficulties to the uniform boundness of the solutions, which is central to the existence and regularity results. First, from the perspective of the value function in…
▽ More
In this paper, we consider a class of infinitely degenerate partial differential systems to obtain the Nash equilibria in the mean field games. The degeneracy in the diffusion and the Hamiltonian may be different. This feature brings difficulties to the uniform boundness of the solutions, which is central to the existence and regularity results. First, from the perspective of the value function in the stochastic optimal control problems, we prove the Lipschitz continuity and the semiconcavity for the solutions of the Hamilton-Jacobi equations (HJE). Then the existence of the weak solutions for the degenerate systems is obtained via a vanishing viscosity method. Furthermore, by constructing an auxiliary function, we conclude the regularity of the viscosity solution for the HJE in the almost everywhere sense.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
A Feasible Conjugate Gradient Method for Calculating $\mathcal B$-Eigenpairs of Symmetric Tensors
Authors:
Jiefeng Xu,
Can Li,
Dong-Hui Li
Abstract:
In this paper, we propose a feasible conjugate gradient (FCG) method for calculating ${\mathcal B}$-eigenpairs of a symmetric tensor ${\mathcal A}$. The method is an extension of the well-known conjugate gradient method for unconstrained optimization problems to some curve constrained optimization problems. The proposed FCG method can find a ${\mathcal B}$-eigenpair of a symmetric tensor…
▽ More
In this paper, we propose a feasible conjugate gradient (FCG) method for calculating ${\mathcal B}$-eigenpairs of a symmetric tensor ${\mathcal A}$. The method is an extension of the well-known conjugate gradient method for unconstrained optimization problems to some curve constrained optimization problems. The proposed FCG method can find a ${\mathcal B}$-eigenpair of a symmetric tensor ${\mathcal A}$ without the requirement that the orders of ${\mathcal A}$ and $\mathcal B$ are equal. We pay particular attention to the Polak-Ribíre-Polyak (PRP) type conjugate gradient method. We show that the FCG method with some Armijo-type line search is globally convergent. Our numerical experiments indicate the promising performance of the proposed method.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Likelihood ratio tests in random graph models with increasing dimensions
Authors:
Ting Yan,
Yuanzhang Li,
Jinfeng Xu,
Yaning Yang,
Ji Zhu
Abstract:
We explore the Wilks phenomena in two random graph models: the $β$-model and the Bradley-Terry model. For two increasing dimensional null hypotheses, including a specified null $H_0: β_i=β_i^0$ for $i=1,\ldots, r$ and a homogenous null $H_0: β_1=\cdots=β_r$, we reveal high dimensional Wilks' phenomena that the normalized log-likelihood ratio statistic,…
▽ More
We explore the Wilks phenomena in two random graph models: the $β$-model and the Bradley-Terry model. For two increasing dimensional null hypotheses, including a specified null $H_0: β_i=β_i^0$ for $i=1,\ldots, r$ and a homogenous null $H_0: β_1=\cdots=β_r$, we reveal high dimensional Wilks' phenomena that the normalized log-likelihood ratio statistic, $[2\{\ell(\widehat{\mathbfβ}) - \ell(\widehat{\mathbfβ}^0)\} -r]/(2r)^{1/2}$, converges in distribution to the standard normal distribution as $r$ goes to infinity. Here, $\ell( \mathbfβ)$ is the log-likelihood function on the model parameter $\mathbfβ=(β_1, \ldots, β_n)^\top$, $\widehat{\mathbfβ}$ is its maximum likelihood estimator (MLE) under the full parameter space, and $\widehat{\mathbfβ}^0$ is the restricted MLE under the null parameter space. For the homogenous null with a fixed $r$, we establish Wilks-type theorems that $2\{\ell(\widehat{\mathbfβ}) - \ell(\widehat{\mathbfβ}^0)\}$ converges in distribution to a chi-square distribution with $r-1$ degrees of freedom, as the total number of parameters, $n$, goes to infinity. When testing the fixed dimensional specified null, we find that its asymptotic null distribution is a chi-square distribution in the $β$-model. However, unexpectedly, this is not true in the Bradley-Terry model. By developing several novel technical methods for asymptotic expansion, we explore Wilks type results in a principled manner; these principled methods should be applicable to a class of random graph models beyond the $β$-model and the Bradley-Terry model. Simulation studies and real network data applications further demonstrate the theoretical results.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
An Unsupervised Deep Learning Approach for the Wave Equation Inverse Problem
Authors:
Xiong-Bin Yan,
Keke Wu,
Zhi-Qin John Xu,
Zheng Ma
Abstract:
Full-waveform inversion (FWI) is a powerful geophysical imaging technique that infers high-resolution subsurface physical parameters by solving a non-convex optimization problem. However, due to limitations in observation, e.g., limited shots or receivers, and random noise, conventional inversion methods are confronted with numerous challenges, such as the local-minimum problem. In recent years, a…
▽ More
Full-waveform inversion (FWI) is a powerful geophysical imaging technique that infers high-resolution subsurface physical parameters by solving a non-convex optimization problem. However, due to limitations in observation, e.g., limited shots or receivers, and random noise, conventional inversion methods are confronted with numerous challenges, such as the local-minimum problem. In recent years, a substantial body of work has demonstrated that the integration of deep neural networks and partial differential equations for solving full-waveform inversion problems has shown promising performance. In this work, drawing inspiration from the expressive capacity of neural networks, we provide an unsupervised learning approach aimed at accurately reconstructing subsurface physical velocity parameters. This method is founded on a re-parametrization technique for Bayesian inference, achieved through a deep neural network with random weights. Notably, our proposed approach does not hinge upon the requirement of the labeled training dataset, rendering it exceedingly versatile and adaptable to diverse subsurface models. Extensive experiments show that the proposed approach performs noticeably better than existing conventional inversion methods.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
MgNO: Efficient Parameterization of Linear Operators via Multigrid
Authors:
Juncai He,
Xinliang Liu,
Jinchao Xu
Abstract:
In this work, we propose a concise neural operator architecture for operator learning. Drawing an analogy with a conventional fully connected neural network, we define the neural operator as follows: the output of the $i$-th neuron in a nonlinear operator layer is defined by $O_i(u) = σ\left( \sum_j W_{ij} u + B_{ij}\right)$. Here, $ W_{ij}$ denotes the bounded linear operator connecting $j$-th in…
▽ More
In this work, we propose a concise neural operator architecture for operator learning. Drawing an analogy with a conventional fully connected neural network, we define the neural operator as follows: the output of the $i$-th neuron in a nonlinear operator layer is defined by $O_i(u) = σ\left( \sum_j W_{ij} u + B_{ij}\right)$. Here, $ W_{ij}$ denotes the bounded linear operator connecting $j$-th input neuron to $i$-th output neuron, and the bias $ B_{ij}$ takes the form of a function rather than a scalar. Given its new universal approximation property, the efficient parameterization of the bounded linear operators between two neurons (Banach spaces) plays a critical role. As a result, we introduce MgNO, utilizing multigrid structures to parameterize these linear operators between neurons. This approach offers both mathematical rigor and practical expressivity. Additionally, MgNO obviates the need for conventional lifting and projecting operators typically required in previous neural operators. Moreover, it seamlessly accommodates diverse boundary conditions. Our empirical observations reveal that MgNO exhibits superior ease of training compared to other CNN-based models, while also displaying a reduced susceptibility to overfitting when contrasted with spectral-type neural operators. We demonstrate the efficiency and accuracy of our method with consistently state-of-the-art performance on different types of partial differential equations (PDEs).
△ Less
Submitted 25 June, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
ADMM Training Algorithms for Residual Networks: Convergence, Complexity and Parallel Training
Authors:
Jintao Xu,
Yifei Li,
Wenxun Xing
Abstract:
We design a series of serial and parallel proximal point (gradient) ADMMs for the fully connected residual networks (FCResNets) training problem by introducing auxiliary variables. Convergence of the proximal point version is proven based on a Kurdyka-Lojasiewicz (KL) property analysis framework, and we can ensure a locally R-linear or sublinear convergence rate depending on the different ranges o…
▽ More
We design a series of serial and parallel proximal point (gradient) ADMMs for the fully connected residual networks (FCResNets) training problem by introducing auxiliary variables. Convergence of the proximal point version is proven based on a Kurdyka-Lojasiewicz (KL) property analysis framework, and we can ensure a locally R-linear or sublinear convergence rate depending on the different ranges of the Kurdyka-Lojasiewicz (KL) exponent, in which a necessary auxiliary function is constructed to realize our goal. Moreover, the advantages of the parallel implementation in terms of lower time complexity and less (per-node) memory consumption are analyzed theoretically. To the best of our knowledge, this is the first work analyzing the convergence, convergence rate, time complexity and (per-node) runtime memory requirement of the ADMM applied in the FCResNets training problem theoretically. Experiments are reported to show the high speed, better performance, robustness and potential in the deep network training tasks. Finally, we present the advantage and potential of our parallel training in large-scale problems.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
A second-order SO(3)-preserving and energy-stable scheme for orthonormal frame gradient flow model of biaxial nematic liquid crystals
Authors:
Hanbin Wang,
Jie Xu,
Zhiguo Yang
Abstract:
In this paper, we present a novel second-order generalised rotational discrete gradient scheme for numerically approximating the orthonormal frame gradient flow of biaxial nematic liquid crystals. This scheme relies on reformulating the original gradient flow system into an equivalent generalised "rotational" form. A second-order discrete gradient approximation of the energy variation is then devi…
▽ More
In this paper, we present a novel second-order generalised rotational discrete gradient scheme for numerically approximating the orthonormal frame gradient flow of biaxial nematic liquid crystals. This scheme relies on reformulating the original gradient flow system into an equivalent generalised "rotational" form. A second-order discrete gradient approximation of the energy variation is then devised such that it satisfies an energy difference relation. The proposed numerical scheme has two remarkable properties: (i) it strictly obeys the orthonormal property of the tensor field and (ii) it satisfies the energy dissipation law at the discrete level, regardless of the time step sizes. We provide ample numerical results to validate the accuracy, efficiency, unconditional stability and SO(3)-preserving property of this scheme. In addition, comparisons of the simulation results between the biaxial orthonormal frame gradient flow model and uniaxial Oseen-Frank gradient flow are made to demonstrate the ability of the former to characterize non-axisymmetric local anisotropy.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Sparse critical graphs for defective $(1,3)$-coloring
Authors:
Alexandr Kostochka,
Jingwei Xu,
Xuding Zhu
Abstract:
A graph $G$ is $(1,3)$-colorable if its vertices can be partitioned into subsets $V_1$ and $V_2$ so that every vertex in $G[V_1]$ has degree at most $1$ and every vertex in $G[V_2]$ has degree at most $3$. We prove that every graph with maximum average degree at most 28/9 is $(1, 3)$-colorable.
A graph $G$ is $(1,3)$-colorable if its vertices can be partitioned into subsets $V_1$ and $V_2$ so that every vertex in $G[V_1]$ has degree at most $1$ and every vertex in $G[V_2]$ has degree at most $3$. We prove that every graph with maximum average degree at most 28/9 is $(1, 3)$-colorable.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
NP-Hardness of Tensor Network Contraction Ordering
Authors:
Jianyu Xu,
Hanwen Zhang,
Ling Liang,
Lei Deng,
Yuan Xie,
Guoqi Li
Abstract:
We study the optimal order (or sequence) of contracting a tensor network with a minimal computational cost. We conclude 2 different versions of this optimal sequence: that minimize the operation number (OMS) and that minimize the time complexity (CMS). Existing results only shows that OMS is NP-hard, but no conclusion on CMS problem. In this work, we firstly reduce CMS to CMS-0, which is a sub-pro…
▽ More
We study the optimal order (or sequence) of contracting a tensor network with a minimal computational cost. We conclude 2 different versions of this optimal sequence: that minimize the operation number (OMS) and that minimize the time complexity (CMS). Existing results only shows that OMS is NP-hard, but no conclusion on CMS problem. In this work, we firstly reduce CMS to CMS-0, which is a sub-problem of CMS with no free indices. Then we prove that CMS is easier than OMS, both in general and in tree cases. Last but not least, we prove that CMS is still NP-hard. Based on our results, we have built up relationships of hardness of different tensor network contraction problems.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Uncertainty quantification and complex analyticity of the nonlinear Poisson-Boltzmann equation for the interface problem with random domains
Authors:
Trevor Norton,
Jie Xu,
Brian Choi,
Mark Kon,
Julio Enrique Castrillón-Candás
Abstract:
The nonlinear Poisson-Boltzmann equation (NPBE) is an elliptic partial differential equation used in applications such as protein interactions and biophysical chemistry (among many others). It describes the nonlinear electrostatic potential of charged bodies submerged in an ionic solution. The kinetic presence of the solvent molecules introduces randomness to the shape of a protein, and thus a mor…
▽ More
The nonlinear Poisson-Boltzmann equation (NPBE) is an elliptic partial differential equation used in applications such as protein interactions and biophysical chemistry (among many others). It describes the nonlinear electrostatic potential of charged bodies submerged in an ionic solution. The kinetic presence of the solvent molecules introduces randomness to the shape of a protein, and thus a more accurate model that incorporates these random perturbations of the domain is analyzed to compute the statistics of quantities of interest of the solution. When the parameterization of the random perturbations is high-dimensional, this calculation is intractable as it is subject to the curse of dimensionality. However, if the solution of the NPBE varies analytically with respect to the random parameters, the problem becomes amenable to techniques such as sparse grids and deep neural networks. In this paper, we show analyticity of the solution of the NPBE with respect to analytic perturbations of the domain by using the analytic implicit function theorem and the domain mapping method. Previous works have shown analyticity of solutions to linear elliptic equations but not for nonlinear problems. We further show how to derive \emph{a priori} bounds on the size of the region of analyticity. This method is applied to the trypsin molecule to demonstrate that the convergence rates of the quantity of interest are consistent with the analyticity result. Furthermore, the approach developed here is sufficiently general enough to be applied to other nonlinear problems in uncertainty quantification.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Analytic regularity of strong solutions for the complexified stochastic non-linear Poisson Boltzmann Equation
Authors:
Brian Choi,
Jie Xu,
Trevor Norton,
Mark Kon,
Julio Enrique Castrillon-Candas
Abstract:
Semi-linear elliptic Partial Differential Equations (PDEs) such as the non-linear Poisson Boltzmann Equation (nPBE) is highly relevant for non-linear electrostatics in computational biology and chemistry. It is of particular importance for modeling potential fields from molecules in solvents or plasmas with stochastic fluctuations. The extensive applications include ones in condensed matter and so…
▽ More
Semi-linear elliptic Partial Differential Equations (PDEs) such as the non-linear Poisson Boltzmann Equation (nPBE) is highly relevant for non-linear electrostatics in computational biology and chemistry. It is of particular importance for modeling potential fields from molecules in solvents or plasmas with stochastic fluctuations. The extensive applications include ones in condensed matter and solid state physics, chemical physics, electrochemistry, biochemistry, thermodynamics, statistical mechanics, and materials science, among others. In this paper we study the complex analytic properties of semi-linear elliptic Partial Differential Equations with respect to random fluctuations on the domain. We first prove the existence and uniqueness of the nPBE on a bounded domain in $\mathbb{R}^3$. This proof relies on the application of a contraction mapping reasoning, as the standard convex optimization argument for the deterministic nPBE no longer applies. Using the existence and uniqueness result we subsequently show that solution to the nPBE admits an analytic extension onto a well defined region in the complex hyperplane with respect to the number of stochastic variables. Due to the analytic extension, stochastic collocation theory for sparse grids predict algebraic to sub-exponential convergence rates with respect to the number of knots. A series of numerical experiments with sparse grids is consistent with this prediction and the analyticity result. Finally, this approach readily extends to a wide class of semi-linear elliptic PDEs.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
Semidefinite Programming Approximation for a Matrix Optimization Problem over an Uncertain Linear System
Authors:
Jintao Xu,
Shu-Cherng Fang,
Wenxun Xing
Abstract:
A matrix optimization problem over an uncertain linear system on finite horizon (abbreviated as MOPUL) is studied, in which the uncertain transition matrix is regarded as a decision variable. This problem is in general NP-hard. By using the given reference values of system outputs at each stage, we develop a polynomial-time solvable semidefinite programming (SDP) approximation model for the proble…
▽ More
A matrix optimization problem over an uncertain linear system on finite horizon (abbreviated as MOPUL) is studied, in which the uncertain transition matrix is regarded as a decision variable. This problem is in general NP-hard. By using the given reference values of system outputs at each stage, we develop a polynomial-time solvable semidefinite programming (SDP) approximation model for the problem. The upper bound of the cumulative error between reference outputs and the optimal outputs of the approximation model is theoretically analyzed. Two special cases associated with specific applications are considered. The quality of the SDP approximate solutions in terms of feasibility and optimality is also analyzed. Results of numerical experiments are presented to show the influences of perturbed noises at reference outputs and control levels on the performance of SDP approximation.
△ Less
Submitted 30 October, 2023; v1 submitted 24 September, 2023;
originally announced September 2023.
-
Distributed Optimal Control and Application to Consensus of Multi-Agent Systems
Authors:
Liping Zhang,
Juanjuan Xu,
Huanshui Zhang,
Lihua Xie
Abstract:
This paper develops a novel approach to the consensus problem of multi-agent systems by minimizing a weighted state error with neighbor agents via linear quadratic (LQ) optimal control theory. Existing consensus control algorithms only utilize the current state of each agent, and the design of distributed controller depends on nonzero eigenvalues of the communication topology. The presented optima…
▽ More
This paper develops a novel approach to the consensus problem of multi-agent systems by minimizing a weighted state error with neighbor agents via linear quadratic (LQ) optimal control theory. Existing consensus control algorithms only utilize the current state of each agent, and the design of distributed controller depends on nonzero eigenvalues of the communication topology. The presented optimal consensus controller is obtained by solving Riccati equations and designing appropriate observers to account for agents' historical state information. It is shown that the corresponding cost function under the proposed controllers is asymptotically optimal. Simulation examples demonstrate the effectiveness of the proposed scheme, and a much faster convergence speed than the conventional consensus methods. Moreover, the new method avoids computing nonzero eigenvalues of the communication topology as in the traditional consensus methods.
△ Less
Submitted 16 March, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Trace Monomial Boolean Functions with Large High-Order Nonlinearities
Authors:
Jinjie Gao,
Haibin Kan,
Yuan Li,
Jiahua Xu,
Qichun Wang
Abstract:
Exhibiting an explicit Boolean function with a large high-order nonlinearity is an important problem in cryptography, coding theory, and computational complexity. We prove lower bounds on the second-order, third-order, and higher-order nonlinearities of some trace monomial Boolean functions.
We prove lower bounds on the second-order nonlinearities of functions $\mathrm{tr}_n(x^7)$ and…
▽ More
Exhibiting an explicit Boolean function with a large high-order nonlinearity is an important problem in cryptography, coding theory, and computational complexity. We prove lower bounds on the second-order, third-order, and higher-order nonlinearities of some trace monomial Boolean functions.
We prove lower bounds on the second-order nonlinearities of functions $\mathrm{tr}_n(x^7)$ and $\mathrm{tr}_n(x^{2^r+3})$ where $n=2r$. Among all trace monomials, our bounds match the best second-order nonlinearity lower bounds by \cite{Car08} and \cite{YT20} for odd and even $n$ respectively. We prove a lower bound on the third-order nonlinearity for functions $\mathrm{tr}_n(x^{15})$, which is the best third-order nonlinearity lower bound. For any $r$, we prove that the $r$-th order nonlinearity of $\mathrm{tr}_n(x^{2^{r+1}-1})$ is at least $2^{n-1}-2^{(1-2^{-r})n+\frac{r}{2^{r-1}}-1}- O(2^{\frac{n}{2}})$. For $r \ll \log_2 n$, this is the best lower bound among all explicit functions.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Three new $q$-Abel transformations and their applications
Authors:
Jianan Xu,
Xinrong Ma
Abstract:
In the present paper, we establish three special $q$-Abel transformation formulae of $q$-series via the use of Abel's lemma on summation by parts. As direct applications, we set up the corresponding $q$-contiguous relations for three kinds of truncated $q$-series. Several new transformations are consequently established.
In the present paper, we establish three special $q$-Abel transformation formulae of $q$-series via the use of Abel's lemma on summation by parts. As direct applications, we set up the corresponding $q$-contiguous relations for three kinds of truncated $q$-series. Several new transformations are consequently established.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Injective edge colorings of degenerate graphs and the oriented chromatic number
Authors:
Peter Bradshaw,
Alexander Clow,
Jingwei Xu
Abstract:
Given a graph $G$, an injective edge-coloring of $G$ is a function $ψ:E(G) \rightarrow \mathbb N$ such that if $ψ(e) = ψ(e')$, then no third edge joins an endpoint of $e$ and an endpoint of $e'$. The injective chromatic index of a graph $G$, written $χ_{inj}'(G)$, is the minimum number of colors needed for an injective edge coloring of $G$. In this paper, we investigate the injective chromatic ind…
▽ More
Given a graph $G$, an injective edge-coloring of $G$ is a function $ψ:E(G) \rightarrow \mathbb N$ such that if $ψ(e) = ψ(e')$, then no third edge joins an endpoint of $e$ and an endpoint of $e'$. The injective chromatic index of a graph $G$, written $χ_{inj}'(G)$, is the minimum number of colors needed for an injective edge coloring of $G$. In this paper, we investigate the injective chromatic index of certain classes of degenerate graphs. First, we show that if $G$ is a $d$-degenerate graph of maximum degree $Δ$, then $χ_{inj}'(G) = O(d^3 \log Δ)$. Next, we show that if $G$ is a graph of Euler genus $g$, then $χ_{inj}'(G) \leq (3+o(1))g$, which is tight when $G$ is a clique. Finally, we show that the oriented chromatic number of a graph is at most exponential in its injective chromatic index. Using this fact, we prove that the oriented chromatic number of a graph embedded on a surface of Euler genus $g$ has oriented chromatic number at most $O(g^{6400})$, improving the previously known upper bound of $2^{O(g^{\frac{1}{2} + ε})}$ and resolving a conjecture of Aravind and Subramanian.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.