Search | arXiv e-print repository

On the backward stability of s-step GMRES

Abstract: Communication, i.e., data movement, is a critical bottleneck for the performance of classical Krylov subspace method solvers on modern computer architectures. Variants of these methods which avoid communication have been introduced, which, while equivalent in exact arithmetic, can be unstable in finite precision. In this work, we address the backward stability of s-step GMRES, also known as commun… ▽ More Communication, i.e., data movement, is a critical bottleneck for the performance of classical Krylov subspace method solvers on modern computer architectures. Variants of these methods which avoid communication have been introduced, which, while equivalent in exact arithmetic, can be unstable in finite precision. In this work, we address the backward stability of s-step GMRES, also known as communication-avoiding GMRES. We present a framework for simplifying the analysis of s-step GMRES, which includes standard GMRES (s=1) as a special case, by isolating the effects of rounding errors in the QR factorization and the solution of the least squares problem. Using this framework, we analyze s-step GMRES with popular block orthogonalization methods: block modified Gram--Schmidt and reorthogonalized block classical Gram--Schmidt algorithms. An example illustrates the resulting instability of s-step GMRES when paired with the classical s-step Arnoldi process and shows the limitations of popular strategies for resolving this instability. To address this issue, we propose a modified Arnoldi process that allows for much larger block size s while maintaining satisfactory accuracy, as confirmed by our numerical experiments. △ Less

Submitted 4 September, 2024; originally announced September 2024.

Comments: 32 pages, 9 figures

MSC Class: 65F10; 65F50; 65G50

arXiv:2409.00281 [pdf]

Quasi-Steady-State Approach for Efficient Multiscale Simulation and Optimization of mAb Glycosylation in CHO Culture

Authors: Yingjie Ma, Jing Guo, Richard Braatz

Abstract: Glycosylation is a critical quality attribute for monoclonal antibody, and multiscale mechanistic models, spanning from the bioreactor to the Golgi apparatus, have been proposed for analyzing the glycosylation process. However, these models are computationally intensive to solve, making optimization and control challenging. In this work, we propose a quasi-steady-state (QSS) approach for efficient… ▽ More Glycosylation is a critical quality attribute for monoclonal antibody, and multiscale mechanistic models, spanning from the bioreactor to the Golgi apparatus, have been proposed for analyzing the glycosylation process. However, these models are computationally intensive to solve, making optimization and control challenging. In this work, we propose a quasi-steady-state (QSS) approach for efficiently solving the multiscale glycosylation model. By introducing the QSS assumption and assuming negligible nucleotide sugar donor flux for glycosylation in the Golgi, the large-scale partial differential algebraic equation system is converted into a series of independent differential algebraic equation systems. Based on that representation, we develop a three-step QSS simulation method and further reduce computational time through parallel computing and nonuniform time grid strategies. Case studies in simulation, parameter estimation, and dynamic optimization demonstrate that the QSS approach can be more than 300-fold faster than the method of lines, with less than 1.6% relative errors. △ Less

Submitted 30 August, 2024; originally announced September 2024.

arXiv:2408.15552 [pdf, other]

Characterization of Equimatchable Even-Regular Graphs

Authors: Xiao Zhao, Haojie Zheng, Fengming Dong, Hengzhe Li, Yingbin Ma

Abstract: A graph is called equimatchable if all of its maximal matchings have the same size. Due to Eiben and Kotrbcik, any connected graph with odd order and independence number $α(G)$ at most $2$ is equimatchable. Akbari et al. showed that for any odd number $r$, a connected equimatchable $r$-regular graph must be either the complete graph $K_{r+1}$ or the complete bipartite graph $K_{r,r}$. They also de… ▽ More A graph is called equimatchable if all of its maximal matchings have the same size. Due to Eiben and Kotrbcik, any connected graph with odd order and independence number $α(G)$ at most $2$ is equimatchable. Akbari et al. showed that for any odd number $r$, a connected equimatchable $r$-regular graph must be either the complete graph $K_{r+1}$ or the complete bipartite graph $K_{r,r}$. They also determined all connected equimatchable $4$-regular graphs and proved that for any even $r$, any connected equimatchable $r$-regular graph is either $K_{r,r}$ or factor-critical. In this paper, we confirm that for any even $r\ge 6$, there exists a unique connected equimatchable $r$-regular graph $G$ with $α(G)\geq 3$ and odd order. △ Less

Submitted 28 August, 2024; originally announced August 2024.

Comments: 22 Pages and 5 figures

MSC Class: 05C70; 05C75

arXiv:2408.13548 [pdf, ps, other]

Admissible weak factorization systems on extriangulated categories

Authors: Yajun Ma, Hanyang You, Dongdong Zhang, Panyue Zhou

Abstract: Extriangulated categories, introduced by Nakaoka and Palu, serve as a simultaneous generalization of exact and triangulated categories. In this paper, we first introduce the concept of admissible weak factorization systems and establish a bijection between cotorsion pairs and admissible weak factorization systems in extriangulated categories. Consequently, we give the equivalences between heredita… ▽ More Extriangulated categories, introduced by Nakaoka and Palu, serve as a simultaneous generalization of exact and triangulated categories. In this paper, we first introduce the concept of admissible weak factorization systems and establish a bijection between cotorsion pairs and admissible weak factorization systems in extriangulated categories. Consequently, we give the equivalences between hereditary cotorsion pairs and compatible cotorsion pairs via admissible weak factorization systems under certain conditions in extriangulated categories, thereby generalizing a result by Di, Li, and Liang. △ Less

Submitted 27 August, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

Comments: 13 pages

arXiv:2408.10214 [pdf, other]

A Memory Reduction Compact Gas Kinetic Scheme on 3D Unstructured Meshes

Authors: Hongyu Liu, Xing Ji, Yunpeng Mao, Zhe Qian, Kun Xu

Abstract: This paper introduces a memory-reduction third-order compact gas-kinetic scheme (CGKS) for solving compressible Euler and Navier-Stokes equations on 3D unstructured meshes. The scheme utilizes a time-evolution gas distribution function to provide a time-evolution solution at cell interfaces, enabling the implementation of Hermite WENO techniques for high-order reconstruction. However, the HWENO me… ▽ More This paper introduces a memory-reduction third-order compact gas-kinetic scheme (CGKS) for solving compressible Euler and Navier-Stokes equations on 3D unstructured meshes. The scheme utilizes a time-evolution gas distribution function to provide a time-evolution solution at cell interfaces, enabling the implementation of Hermite WENO techniques for high-order reconstruction. However, the HWENO method needs to store a coefficients matrix for the quadratic polynomial to achieve third-order accuracy, resulting in high memory usage. A novel reconstruction method, built upon HWENO reconstruction, has been designed to enhance computational efficiency and reduce memory usage compared to the original CGKS. The simple idea is that the first-order and second-order terms of the quadratic polynomials are determined in a two-step way. In the first step, the second-order terms are obtained from the reconstruction of a linear polynomial of the first-order derivatives by only using the cell-averaged slopes, since the second-order derivatives are nothing but the "derivatives of derivatives". Subsequently, the first-order terms left can be determined by the linear reconstruction only using cell-averaged values. Thus, we successfully split one quadratic least-square regression into several linear least-square regressions, which are commonly used in a second-order finite volume code. Since only a small matrix inversion is needed in a 3-D linear least-square regression, the computational cost for the new reconstruction is dramatically reduced and the storage of the reconstruction-coefficient matrix is no longer necessary. The proposed new reconstruction technique can reduce the overall computational cost by about 20 to 30 percent. The challenging large-scale unsteady numerical simulation is performed, which demonstrates that the current improvement brings the CGKS to a new level for industrial applications. △ Less

Submitted 22 July, 2024; originally announced August 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2402.02075

arXiv:2408.10109 [pdf, other]

On the loss of orthogonality in low-synchronization variants of reorthogonalized block classical Gram-Schmidt

Authors: Erin Carson, Kathryn Lund, Yuxin Ma, Eda Oktay

Abstract: Interest in communication-avoiding orthogonalization schemes for high-performance computing has been growing recently. This manuscript addresses open questions about the numerical stability of various block classical Gram-Schmidt variants that have been proposed in the past few years. An abstract framework is employed, the flexibility of which allows for new rigorous bounds on the loss of orthogon… ▽ More Interest in communication-avoiding orthogonalization schemes for high-performance computing has been growing recently. This manuscript addresses open questions about the numerical stability of various block classical Gram-Schmidt variants that have been proposed in the past few years. An abstract framework is employed, the flexibility of which allows for new rigorous bounds on the loss of orthogonality in these variants. We first analyze a generalization of (reorthogonalized) block classical Gram-Schmidt and show that a "strong" intrablock orthogonalization routine is only needed for the very first block in order to maintain orthogonality on the level of the unit roundoff. Then, using this variant, which has four synchronization points per block column, we remove the synchronization points one at a time and analyze how each alteration affects the stability of the resulting method. Our analysis shows that the variant requiring only one synchronization per block column cannot be guaranteed to be stable in practice, as stability begins to degrade with the first reduction of synchronization points. Our analysis of block methods also provides new theoretical results for the single-column case. In particular, it is proven that DCGS2 from [Bielich, D. et al. Par. Comput. 112 (2022)] and CGS-2 from [Świrydowicz, K. et al, Num. Lin. Alg. Appl. 28 (2021)] are as stable as Householder QR. Numerical examples from the BlockStab toolbox are included throughout, to help compare variants and illustrate the effects of different choices of intraorthogonalization subroutines. △ Less

Submitted 19 August, 2024; originally announced August 2024.

MSC Class: 65-04; 65F25; 65G50; 65Y20

arXiv:2408.05662 [pdf, ps, other]

Quasi-stationary distributions for single death processes with killing

Authors: Zhe-Kang Fang, Yong-Hua Mao

Abstract: This paper studies the quasi-stationary distributions for a single death process (or downwardly skip-free process) with killing defined on the non-negative integers, corresponding to a non-conservative transition rate matrix. The set $\{1,2,3,\cdots\}$ constitutes an irreducible class and $0$ is an absorbing state. For the single death process with three kinds of killing term, we obtain the existe… ▽ More This paper studies the quasi-stationary distributions for a single death process (or downwardly skip-free process) with killing defined on the non-negative integers, corresponding to a non-conservative transition rate matrix. The set $\{1,2,3,\cdots\}$ constitutes an irreducible class and $0$ is an absorbing state. For the single death process with three kinds of killing term, we obtain the existence and uniqueness of the quasi-stationary distribution. Moreover, we derive the conditions for exponential convergence to the quasi-stationary distribution in the total variation norm. Our main approach is based on the Doob's $h$-transform, potential theory and probabilistic methods. △ Less

Submitted 10 August, 2024; originally announced August 2024.

arXiv:2408.01479 [pdf, ps, other]

On the two problems in Ramsey achievement games

Authors: Zhong Huang, Yusuke Kobayashi, Yaping Mao, Bo Ning, Xiumin Wang

Abstract: Let $p,q$ be two integers with $p\geq q$. Given a finite graph $F$ with no isolated vertices, the generalized Ramsey achievement game of $F$ on the complete graph $K_n$, denoted by $(p,q;K_n,F,+)$, is played by two players called Alice and Bob. In each round, Alice firstly chooses $p$ uncolored edges $e_1,e_2,...,e_p$ and colors it blue, then Bob chooses $q$ uncolored edge $f_1,f_2,...,f_q$ and co… ▽ More Let $p,q$ be two integers with $p\geq q$. Given a finite graph $F$ with no isolated vertices, the generalized Ramsey achievement game of $F$ on the complete graph $K_n$, denoted by $(p,q;K_n,F,+)$, is played by two players called Alice and Bob. In each round, Alice firstly chooses $p$ uncolored edges $e_1,e_2,...,e_p$ and colors it blue, then Bob chooses $q$ uncolored edge $f_1,f_2,...,f_q$ and colors it red; the player who can first complete the formation of $F$ in his (or her) color is the winner. The generalized achievement number of $F$, denoted by ${a}(p,q;F)$ is defined to be the smallest $n$ for which Alice has a winning strategy. If $p=q=1$, then it is denoted by ${a}(F)$, which is the classical achievement number of $F$ introduced by Harary in 1982. If Alice aims to form a blue $F$, and the goal of Bob is to try to stop him, this kind of game is called the first player game by Bollobás. Let ${a}^*(F)$ be the smallest positive integer $n$ for which Alice has a winning strategy in the first player game. A conjecture due to Harary states that the minimum value of ${a}(T)$ is realized when $T$ is a path and the maximum value of ${a}(T)$ is realized when $T$ is a star among all trees $T$ of order $n$. He also asked which graphs $F$ satisfy $a^*(F)=a(F)$? In this paper, we proved that $n\leq {a}(p,q;T)\leq n+q\left\lfloor (n-2)/p \right\rfloor$ for all trees $T$ of order $n$, and obtained a lower bound of ${a}(p,q;K_{1,n-1})$, where $K_{1,n-1}$ is a star. We proved that the minimum value of ${a}(T)$ is realized when $T$ is a path which gives a positive solution to the first part of Harary's conjecture, and ${a}(T)\leq 2n-2$ for all trees of order $n$. We also proved that for $n\geq 3$, we have $2n-2-\sqrt{(4n-8)\ln (4n-4)}\leq a(K_{1,n-1})\leq 2n-2$ with the help of a theorem of Alon, Krivelevich, Spencer and Szabó. We proved that $a^*(P_n)=a(P_n)$ for a path $P_n$. △ Less

Submitted 2 August, 2024; originally announced August 2024.

Comments: 13 pages

arXiv:2407.19964 [pdf, other]

A Markov representation of Perron-Frobenius eigenvector for infinite non-negative matrix and Metzler-matrix

Authors: Qian Du, Yong-Hua Mao

Abstract: We will represent the so-called Perron-Frobenius eigenvector (if exists) for infinite non-negative matrix $A$ and Metzler matrix by using its corresponding Markov chain with probability transition function. We will represent the so-called Perron-Frobenius eigenvector (if exists) for infinite non-negative matrix $A$ and Metzler matrix by using its corresponding Markov chain with probability transition function. △ Less

Submitted 29 July, 2024; originally announced July 2024.

arXiv:2407.19803 [pdf, ps, other]

Quasi-stationary distributions for continuous-time $λ$-recurrent jump processes

Authors: Qian Du, Yong-Hua Mao

Abstract: For the continuous-time $λ$-recurrent jump process, the $λ$-recurrence assures the existence of quasi-stationary distribution when it has finite exit states (the states that have positive killing rates). And we give an explicit representation for this quasi-stationary distribution through $Q$-matrix, where the components of the quasi-stationary distribution outside the set $H$ of exit states can b… ▽ More For the continuous-time $λ$-recurrent jump process, the $λ$-recurrence assures the existence of quasi-stationary distribution when it has finite exit states (the states that have positive killing rates). And we give an explicit representation for this quasi-stationary distribution through $Q$-matrix, where the components of the quasi-stationary distribution outside the set $H$ of exit states can be represented by those within $H$. Sufficient condition is also provided for quasi-stationary distribution when the exit states are infinite. △ Less

Submitted 29 July, 2024; originally announced July 2024.

arXiv:2407.07436 [pdf, other]

Alternating Subspace Approximate Message Passing

Authors: Xu Zhu, Yufei Ma, Xiaoguang Li, Tiejun Li

Abstract: Numerous renowned algorithms for tackling the compressed sensing problem employ an alternating strategy, which typically involves data matching in one module and denoising in another. Based on an in-depth analysis of the connection between the message passing and operator splitting, we present a novel approach, the Alternating Subspace Method (ASM), which intuitively combines the principles of the… ▽ More Numerous renowned algorithms for tackling the compressed sensing problem employ an alternating strategy, which typically involves data matching in one module and denoising in another. Based on an in-depth analysis of the connection between the message passing and operator splitting, we present a novel approach, the Alternating Subspace Method (ASM), which intuitively combines the principles of the greedy methods (e.g., the orthogonal matching pursuit type methods) and the splitting methods (e.g., the approximate message passing type methods). Essentially, ASM modifies the splitting method by achieving fidelity in a subspace-restricted fashion. We reveal that such confining strategy still yields a consistent fixed point iteration and establish its local geometric convergence on the lasso problem. Numerical experiments on both the lasso and channel estimation problems demonstrate its high convergence rate and its capacity to incorporate different prior distributions. Further theoretical analysis also demonstrates the advantage of the motivated message-passing splitting by incorporating quasi-variance degree of freedom even for the classical lasso optimization problem. Overall, the proposed method is promising in efficiency, accuracy and flexibility, which has the potential to be competitive in different sparse recovery applications. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: 19 pages, 6 figures

MSC Class: 94A12; 65F10; 90C06

arXiv:2406.16499 [pdf, other]

Mixed precision iterative refinement for least squares with linear equality constraints and generalized least squares problems

Authors: Bowen Gao, Yuxin Ma, Meiyue Shao

Abstract: Recent development on mixed precision techniques has largely enhanced the performance of various linear algebra solvers, one of which being the least squares problem $\min_{x}\lVert b-Ax\rVert_{2}$. By transforming the least squares problem into an augmented linear system, mixed precision techniques are capable of refining the lower precision solution to the working precision. In this paper, we pr… ▽ More Recent development on mixed precision techniques has largely enhanced the performance of various linear algebra solvers, one of which being the least squares problem $\min_{x}\lVert b-Ax\rVert_{2}$. By transforming the least squares problem into an augmented linear system, mixed precision techniques are capable of refining the lower precision solution to the working precision. In this paper, we propose mixed precision iterative refinement algorithms for two variants of the least squares problem -- the least squares problem with linear equality constraints (LSE) and the generalized least squares problem (GLS). Both classical and GMRES-based iterative refinement can be applied to augmented systems of these two problems to improve the accuracy of the solution. For reasonably well-conditioned problems our algorithms reduce the execution time by a factor of 40% in average compared to the fixed precision ones from LAPACK on the x86-64 architecture. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: 32 pages, 7 figures

MSC Class: 65F05; 65F08; 65F10

arXiv:2406.01908 [pdf, other]

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Authors: Bingheng Li, Linxin Yang, Yupeng Chen, Senmiao Wang, Qian Chen, Haitao Mao, Yao Ma, Akang Wang, Tian Ding, Jiliang Tang, Ruoyu Sun

Abstract: Solving large-scale linear programming (LP) problems is an important task in various areas such as communication networks, power systems, finance and logistics. Recently, two distinct approaches have emerged to expedite LP solving: (i) First-order methods (FOMs); (ii) Learning to optimize (L2O). In this work, we propose an FOM-unrolled neural network (NN) called PDHG-Net, and propose a two-stage L… ▽ More Solving large-scale linear programming (LP) problems is an important task in various areas such as communication networks, power systems, finance and logistics. Recently, two distinct approaches have emerged to expedite LP solving: (i) First-order methods (FOMs); (ii) Learning to optimize (L2O). In this work, we propose an FOM-unrolled neural network (NN) called PDHG-Net, and propose a two-stage L2O method to solve large-scale LP problems. The new architecture PDHG-Net is designed by unrolling the recently emerged PDHG method into a neural network, combined with channel-expansion techniques borrowed from graph neural networks. We prove that the proposed PDHG-Net can recover PDHG algorithm, thus can approximate optimal solutions of LP instances with a polynomial number of neurons. We propose a two-stage inference approach: first use PDHG-Net to generate an approximate solution, and then apply PDHG algorithm to further improve the solution. Experiments show that our approach can significantly accelerate LP solving, achieving up to a 3$\times$ speedup compared to FOMs for large-scale LP problems. △ Less

Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: Accepted by ICML 2024

arXiv:2406.01200 [pdf, ps, other]

Probabilistic Lah numbers and Lah-Bell polynomials

Authors: Yuankui Ma, Taekyun Kim, Dae San Kim

Abstract: Let Y be a random variable whose moment generating function exists in some neighborhood of the origin. The aim of this paper is to study the probabilistic Lah numbers associated with Y and the probabilistic Lah-Bell polynomials associated with Y, as probabilistic versions of the Lah numbers and the Lah-Bell polynomials, respectively. We derive some properties, explicit expressions, recurrence rela… ▽ More Let Y be a random variable whose moment generating function exists in some neighborhood of the origin. The aim of this paper is to study the probabilistic Lah numbers associated with Y and the probabilistic Lah-Bell polynomials associated with Y, as probabilistic versions of the Lah numbers and the Lah-Bell polynomials, respectively. We derive some properties, explicit expressions, recurrence relations and certain identities for those numbers and polynomials. In addition, we treat the special cases that Y is the Poisson random variable with parameter α > 0 and the Bernoulli random variable with probability of success p. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 10 pages

MSC Class: 11B73; 11B83

arXiv:2406.00920 [pdf, ps, other]

Demystifying SGD with Doubly Stochastic Gradients

Authors: Kyurae Kim, Joohwan Ko, Yi-An Ma, Jacob R. Gardner

Abstract: Optimization objectives in the form of a sum of intractable expectations are rising in importance (e.g., diffusion models, variational autoencoders, and many more), a setting also known as "finite sum with infinite data." For these problems, a popular strategy is to employ SGD with doubly stochastic gradients (doubly SGD): the expectations are estimated using the gradient estimator of each compone… ▽ More Optimization objectives in the form of a sum of intractable expectations are rising in importance (e.g., diffusion models, variational autoencoders, and many more), a setting also known as "finite sum with infinite data." For these problems, a popular strategy is to employ SGD with doubly stochastic gradients (doubly SGD): the expectations are estimated using the gradient estimator of each component, while the sum is estimated by subsampling over these estimators. Despite its popularity, little is known about the convergence properties of doubly SGD, except under strong assumptions such as bounded variance. In this work, we establish the convergence of doubly SGD with independent minibatching and random reshuffling under general conditions, which encompasses dependent component gradient estimators. In particular, for dependent estimators, our analysis allows fined-grained analysis of the effect correlations. As a result, under a per-iteration computational budget of $b \times m$, where $b$ is the minibatch size and $m$ is the number of Monte Carlo samples, our analysis suggests where one should invest most of the budget in general. Furthermore, we prove that random reshuffling (RR) improves the complexity dependence on the subsampling noise. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: Accepted to ICML'24

arXiv:2405.17527 [pdf, other]

Unisolver: PDE-Conditional Transformers Are Universal PDE Solvers

Authors: Hang Zhou, Yuezhou Ma, Haixu Wu, Haowen Wang, Mingsheng Long

Abstract: Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizabil… ▽ More Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizability of neural solvers, which is widely recognized as its major advantage over numerical solvers. In this paper, we present the Universal PDE solver (Unisolver) capable of solving a wide scope of PDEs by leveraging a Transformer pre-trained on diverse data and conditioned on diverse PDEs. Instead of simply scaling up data and parameters, Unisolver stems from the theoretical analysis of the PDE-solving process. Our key finding is that a PDE solution is fundamentally under the control of a series of PDE components, e.g. equation symbols, coefficients, and initial and boundary conditions. Inspired by the mathematical structure of PDEs, we define a complete set of PDE components and correspondingly embed them as domain-wise (e.g. equation symbols) and point-wise (e.g. boundaries) conditions for Transformer PDE solvers. Integrating physical insights with recent Transformer advances, Unisolver achieves consistent state-of-the-art results on three challenging large-scale benchmarks, showing impressive gains and endowing favorable generalizability and scalability. △ Less

Submitted 1 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.06889 [pdf, other]

Tuning parameter selection for the adaptive nuclear norm regularized trace regression

Authors: Pan Shang, Lingchen Kong, Yiting Ma

Abstract: Regularized models have been applied in lots of areas, with high-dimensional data sets being popular. Because tuning parameter decides the theoretical performance and computational efficiency of the regularized models, tuning parameter selection is a basic and important issue. We consider the tuning parameter selection for adaptive nuclear norm regularized trace regression, which achieves by the B… ▽ More Regularized models have been applied in lots of areas, with high-dimensional data sets being popular. Because tuning parameter decides the theoretical performance and computational efficiency of the regularized models, tuning parameter selection is a basic and important issue. We consider the tuning parameter selection for adaptive nuclear norm regularized trace regression, which achieves by the Bayesian information criterion (BIC). The proposed BIC is established with the help of an unbiased estimator of degrees of freedom. Under some regularized conditions, this BIC is proved to achieve the rank consistency of the tuning parameter selection. That is the model solution under selected tuning parameter converges to the true solution and has the same rank with that of the true solution in probability. Some numerical results are presented to evaluate the performance of the proposed BIC on tuning parameter selection. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2405.04730 [pdf, ps, other]

A rigidity property for a type of wave-Klein-Gordon system

Authors: Yan-Tao Li, Yue Ma

Abstract: In this paper we investigate the rigidity property of a wave component coupled in a wave-Klein-Gordon system. We prove that when the radiation field of the wave component vanishes at the null infinity, the initial data of this component also vanish, therefor there is no wave in the whole spacetime In this paper we investigate the rigidity property of a wave component coupled in a wave-Klein-Gordon system. We prove that when the radiation field of the wave component vanishes at the null infinity, the initial data of this component also vanish, therefor there is no wave in the whole spacetime △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: 29 pages

arXiv:2405.03931 [pdf, ps, other]

Incorporating changeable attitudes toward vaccination into an SIR infectious disease model

Authors: Yi Jiang, Kristin M. Kurianski, Jane HyoJin Lee, Yanping Ma, Daniel Cicala, Glenn Ledder

Abstract: We develop a mechanistic model that classifies individuals both in terms of epidemiological status (SIR) and vaccination attitude (willing or unwilling), with the goal of discovering how disease spread is influenced by changing opinions about vaccination. Analysis of the model identifies existence and stability criteria for both disease-free and endemic disease equilibria. The analytical results,… ▽ More We develop a mechanistic model that classifies individuals both in terms of epidemiological status (SIR) and vaccination attitude (willing or unwilling), with the goal of discovering how disease spread is influenced by changing opinions about vaccination. Analysis of the model identifies existence and stability criteria for both disease-free and endemic disease equilibria. The analytical results, supported by numerical simulations, show that attitude changes induced by disease prevalence can destabilize endemic disease equilibria, resulting in limit cycles. △ Less

Submitted 14 August, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

Comments: 30 pages, 3 tables, 10 figures

MSC Class: 37N25 (Primary) 92D30 (Secondary)

arXiv:2405.01298 [pdf, other]

Reorthogonalized Pythagorean variants of block classical Gram-Schmidt

Authors: Erin Carson, Kathryn Lund, Yuxin Ma, Eda Oktay

Abstract: Block classical Gram-Schmidt (BCGS) is commonly used for orthogonalizing a set of vectors $X$ in distributed computing environments due to its favorable communication properties relative to other orthogonalization approaches, such as modified Gram-Schmidt or Householder. However, it is known that BCGS (as well as recently developed low-synchronization variants of BCGS) can suffer from a significan… ▽ More Block classical Gram-Schmidt (BCGS) is commonly used for orthogonalizing a set of vectors $X$ in distributed computing environments due to its favorable communication properties relative to other orthogonalization approaches, such as modified Gram-Schmidt or Householder. However, it is known that BCGS (as well as recently developed low-synchronization variants of BCGS) can suffer from a significant loss of orthogonality in finite-precision arithmetic, which can contribute to instability and inaccurate solutions in downstream applications such as $s$-step Krylov subspace methods. A common solution to improve the orthogonality among the vectors is reorthogonalization. Focusing on the "Pythagorean" variant of BCGS, introduced in [E. Carson, K. Lund, & M. Rozložník. SIAM J. Matrix Anal. Appl. 42(3), pp. 1365--1380, 2021], which guarantees an $O(\varepsilon)κ^2(X)$ bound on the loss of orthogonality as long as $O(\varepsilon)κ^2(X)<1$, where $\varepsilon$ denotes the unit roundoff, we introduce and analyze two reorthogonalized Pythagorean BCGS variants. These variants feature favorable communication properties, with asymptotically two synchronization points per block column, as well as an improved $O(\varepsilon)$ bound on the loss of orthogonality. Our bounds are derived in a general fashion to additionally allow for the analysis of mixed-precision variants. We verify our theoretical results with a panel of test matrices and experiments from a new version of the \texttt{BlockStab} toolbox. △ Less

Submitted 2 May, 2024; originally announced May 2024.

MSC Class: 65-04; 65F25; 65G50; 65Y20

arXiv:2405.00703 [pdf, ps, other]

Local and Global Log-Gradient estimates of solutions to $Δ_pv+bv^q+cv^r =0$ on manifolds and applications

Authors: Jie He, Yuanqing Ma, Youde Wang

Abstract: In this paper, we employ the Nash-Moser iteration technique to study local and global properties of positive solutions to the equation $$Δ_pv+bv^q+cv^r =0$$ on complete Riemannian manifolds with Ricci curvature bounded from below, where $b, c\in\mathbb R$, $p>1$, and $q\leq r$ are some real constants. Assuming certain conditions on $b,\, c,\, p,\, q$ and $r$, we derive succinct Cheng-Yau type grad… ▽ More In this paper, we employ the Nash-Moser iteration technique to study local and global properties of positive solutions to the equation $$Δ_pv+bv^q+cv^r =0$$ on complete Riemannian manifolds with Ricci curvature bounded from below, where $b, c\in\mathbb R$, $p>1$, and $q\leq r$ are some real constants. Assuming certain conditions on $b,\, c,\, p,\, q$ and $r$, we derive succinct Cheng-Yau type gradient estimates for positive solutions, which is of sharp form. These gradient estimates allow us to obtain some Liouville-type theorems and Harnack inequalities. Our Liouville-type results are novel even in Euclidean spaces. Based on the local gradient estimates and a trick of Sung and Wang, we also obtain the global gradient estimates for such solutions. As applications we show the uniqueness of positive solutions to some generalized Allen-Cahn equation and Fisher-KPP equation. △ Less

Submitted 22 April, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2311.02568; text overlap with arXiv:2311.13179

arXiv:2404.08913 [pdf, ps, other]

On the best approximation by finite Gaussian mixtures

Authors: Yun Ma, Yihong Wu, Pengkun Yang

Abstract: We consider the problem of approximating a general Gaussian location mixture by finite mixtures. The minimum order of finite mixtures that achieve a prescribed accuracy (measured by various $f$-divergences) is determined within constant factors for the family of mixing distributions with compactly support or appropriate assumptions on the tail probability including subgaussian and subexponential.… ▽ More We consider the problem of approximating a general Gaussian location mixture by finite mixtures. The minimum order of finite mixtures that achieve a prescribed accuracy (measured by various $f$-divergences) is determined within constant factors for the family of mixing distributions with compactly support or appropriate assumptions on the tail probability including subgaussian and subexponential. While the upper bound is achieved using the technique of local moment matching, the lower bound is established by relating the best approximation error to the low-rank approximation of certain trigonometric moment matrices, followed by a refined spectral analysis of their minimum eigenvalue. In the case of Gaussian mixing distributions, this result corrects a previous lower bound in [Allerton Conference 48 (2010) 620-628]. △ Less

Submitted 13 April, 2024; originally announced April 2024.

arXiv:2404.08830 [pdf, ps, other]

Strongly Gauduchon Hyperbolicity and Two Other Types of Hyperbolicity

Authors: Yi Ma

Abstract: This paper proposes sG-hyperbolicity as a new tool for studying hyperbolicity on complex manifolds. It demonstrates that this notion leads to a wider class of divisorially hyperbolic manifolds compared to balanced hyperbolicity. We also introduce weakly p-Kähler hyperbolic structures and pluriclosed star split hyperbolic metrics as possible new avenues for exploration. This paper proposes sG-hyperbolicity as a new tool for studying hyperbolicity on complex manifolds. It demonstrates that this notion leads to a wider class of divisorially hyperbolic manifolds compared to balanced hyperbolicity. We also introduce weakly p-Kähler hyperbolic structures and pluriclosed star split hyperbolic metrics as possible new avenues for exploration. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 12 pages

arXiv:2404.04507 [pdf, other]

Irrational-window-filter projection method and application to quasiperiodic Schrödinger eigenproblems

Authors: Kai Jiang, Xueyang Li, Yao Ma, Juan Zhang, Pingwen Zhang, Qi Zhou

Abstract: In this paper, we propose a new algorithm, the irrational-window-filter projection method (IWFPM), for solving arbitrary dimensional global quasiperiodic systems. Based on the projection method (PM), IWFPM further utilizes the concentrated distribution of Fourier coefficients to filter out relevant spectral points using an irrational window. Moreover, a corresponding index-shift transform is desig… ▽ More In this paper, we propose a new algorithm, the irrational-window-filter projection method (IWFPM), for solving arbitrary dimensional global quasiperiodic systems. Based on the projection method (PM), IWFPM further utilizes the concentrated distribution of Fourier coefficients to filter out relevant spectral points using an irrational window. Moreover, a corresponding index-shift transform is designed to make the Fast Fourier Transform available. The corresponding error analysis on the function approximation level is also given. We apply IWFPM to 1D, 2D, and 3D quasiperiodic Schrödinger eigenproblems to demonstrate its accuracy and efficiency. IWFPM exhibits a significant computational advantage over PM for both extended and localized quantum states. Furthermore, the widespread existence of such spectral point distribution feature can endow IWFPM with significant potential for broader applications in quasiperiodic systems. △ Less

Submitted 30 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

MSC Class: 35P05; 35J10; 65D15; 65T50; 81-08

arXiv:2403.19871 [pdf, other]

Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis

Abstract: We consider the task of retraining machine learning (ML) models when new batches of data become available. Existing methods focus largely on greedy approaches to find the best-performing model for each batch, without considering the stability of the model's structure across retraining iterations. In this study, we propose a methodology for finding sequences of ML models that are stable across retr… ▽ More We consider the task of retraining machine learning (ML) models when new batches of data become available. Existing methods focus largely on greedy approaches to find the best-performing model for each batch, without considering the stability of the model's structure across retraining iterations. In this study, we propose a methodology for finding sequences of ML models that are stable across retraining iterations. We develop a mixed-integer optimization formulation that is guaranteed to recover Pareto optimal models (in terms of the predictive power-stability trade-off) and an efficient polynomial-time algorithm that performs well in practice. We focus on retaining consistent analytical insights - which is important to model interpretability, ease of implementation, and fostering trust with users - by using custom-defined distance metrics that can be directly incorporated into the optimization problem. Our method shows stronger stability than greedily trained models with a small, controllable sacrifice in predictive power, as evidenced through a real-world case study in a major hospital system in Connecticut. △ Less

Submitted 22 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.19122 [pdf, other]

Safety-Critical Planning and Control for Dynamic Obstacle Avoidance Using Control Barrier Functions

Authors: Shuo Liu, Yihui Mao

Abstract: Dynamic obstacle avoidance is a challenging topic for optimal control and optimization-based trajectory planning problems, especially when in a tight environment. Many existing works use control barrier functions (CBFs) to enforce safety constraints within control systems. Inside these works, CBFs are usually formulated under model predictive control (MPC) framework to anticipate future states and… ▽ More Dynamic obstacle avoidance is a challenging topic for optimal control and optimization-based trajectory planning problems, especially when in a tight environment. Many existing works use control barrier functions (CBFs) to enforce safety constraints within control systems. Inside these works, CBFs are usually formulated under model predictive control (MPC) framework to anticipate future states and make informed decisions, or integrated with path planning algorithms as a safety enhancement tool. However, these approaches usually require knowledge of the obstacle boundary equations or have very slow computational efficiency. In this paper, we propose a novel framework to the iterative MPC with discrete-time CBFs (DCBFs) to generate a collision-free trajectory. The DCBFs are obtained from convex polyhedra generated in sequential grid maps, without the need to know the boundary equations of obstacles. Additionally, a path planning algorithm is incorporated into this framework to ensure the global optimality of the generated trajectory. We demonstrate through numerical examples that our framework enables a unicycle robot to safely and efficiently navigate through tight and dynamically changing environments, tackling both convex and nonconvex obstacles with remarkable computing efficiency and reliability in control and trajectory generation. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: 9 pages, 4 figures. arXiv admin note: text overlap with arXiv:2210.04361

arXiv:2403.11217 [pdf]

Research on Personal Credit Risk Assessment Methods Based on Causal Inference

Authors: Jiaxin Wang, YiLong Ma

Abstract: The discussion on causality in human history dates back to ancient Greece, yet to this day, there is still no consensus. Fundamentally, this stems from the nature of human cognition, as understanding causality requires abstract tools to transcend the limitations of human cognition. In recent decades, the rapid development of mathematical and computational tools has provided new theoretical and tec… ▽ More The discussion on causality in human history dates back to ancient Greece, yet to this day, there is still no consensus. Fundamentally, this stems from the nature of human cognition, as understanding causality requires abstract tools to transcend the limitations of human cognition. In recent decades, the rapid development of mathematical and computational tools has provided new theoretical and technical means for exploring causality, creating more avenues for investigation. Based on this, this paper introduces a new definition of causality using category theory, proposed by Samuel Eilenberg and Saunders Mac Lane in 1945 to avoid the self-referential contradictions in set theory, notably the Russell paradox. Within this framework, the feasibility of indicator synthesis in causal inference is demonstrated. Due to the limitations in the development of category theory-related technical tools, this paper adopts the widely-used probabilistic causal graph tool proposed by Judea Pearl in 1995 to study the application of causal inference in personal credit risk management. The specific work includes: research on the construction method of causal inference index system, definition of causality and feasibility proof of indicator synthesis causal inference within this framework, application methods of causal graph model and intervention alternative criteria in personal credit risk management, and so on. △ Less

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2403.11163 [pdf, ps, other]

doi 10.1080/24754269.2024.2343151

A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

Authors: Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

Abstract: This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas… ▽ More This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first class of literature is about distributed computing and focuses on the situation, where the dataset size is too huge to be comfortably handled by one single computer. In this case, a distributed computation system with multiple computers has to be utilized. The second class of literature is about subsampling methods and concerns about the situation, where the sample size of dataset is small enough to be placed on one single computer but too large to be easily processed by its memory as a whole. The last class of literature studies those minibatch gradient related optimization techniques, which have been extensively used for optimizing various deep learning models. △ Less

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2403.01131 [pdf, other]

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation

Authors: Zeyuan Ma, Hongshu Guo, Jiacheng Chen, Guojun Peng, Zhiguang Cao, Yining Ma, Yue-Jiao Gong

Abstract: Recent research explores optimization using large language models (LLMs) by either iteratively seeking next-step solutions from LLMs or directly prompting LLMs for an optimizer. However, these approaches exhibit inherent limitations, including low operational efficiency, high sensitivity to prompt design, and a lack of domain-specific knowledge. We introduce LLaMoCo, the first instruction-tuning f… ▽ More Recent research explores optimization using large language models (LLMs) by either iteratively seeking next-step solutions from LLMs or directly prompting LLMs for an optimizer. However, these approaches exhibit inherent limitations, including low operational efficiency, high sensitivity to prompt design, and a lack of domain-specific knowledge. We introduce LLaMoCo, the first instruction-tuning framework designed to adapt LLMs for solving optimization problems in a code-to-code manner. Specifically, we establish a comprehensive instruction set containing well-described problem prompts and effective optimization codes. We then develop a novel two-phase learning strategy that incorporates a contrastive learning-based warm-up procedure before the instruction-tuning phase to enhance the convergence behavior during model fine-tuning. The experiment results demonstrate that a CodeGen (350M) model fine-tuned by our LLaMoCo achieves superior optimization performance compared to GPT-4 Turbo and the other competitors across both synthetic and realistic problem sets. The fine-tuned model and the usage instructions are available at https://anonymous.4open.science/r/LLaMoCo-722A. △ Less

Submitted 5 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

arXiv:2403.00468 [pdf, ps, other]

Probabilistic central Bell polynomials

Authors: R. Xu, Y. Ma, T. Kim, D. S. Kim, S. Boulaars

Abstract: Let Y be a random variable whose moment generating function exists in a neighborhood of the origin. In this paper, we study the probabilistic central Bell polynomials associated with random variable Y, as probabilistic extension of the central Bell polynomials. In addition, we investigate the probabilistic central factorial numbers of the second kind associated with Y and the probabilistic central… ▽ More Let Y be a random variable whose moment generating function exists in a neighborhood of the origin. In this paper, we study the probabilistic central Bell polynomials associated with random variable Y, as probabilistic extension of the central Bell polynomials. In addition, we investigate the probabilistic central factorial numbers of the second kind associated with Y and the probabilistic central Fubini polynomials associated with Y. The aim of this paper is to derive some properties, explicit expressions, certain identities and recurrence relations for those polynomials and numbers. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 12 pages

MSC Class: 11B73; 11B83

arXiv:2402.10396 [pdf]

doi 10.1016/j.compchemeng.2024.108751

Improved SQP and SLSQP Algorithms for Feasible Path-based Process Optimisation

Authors: Yingjie Ma, Xi Gao, Chao Liu, Jie Li

Abstract: Feasible path algorithms have been widely used for process optimisation due to its good convergence. The sequential quadratic programming (SQP) algorithm is usually used to drive the feasible path algorithms towards optimality. However, existing SQP algorithms may suffer from inconsistent quadratic programming (QP) subproblems and numerical noise, especially for ill-conditioned process optimisatio… ▽ More Feasible path algorithms have been widely used for process optimisation due to its good convergence. The sequential quadratic programming (SQP) algorithm is usually used to drive the feasible path algorithms towards optimality. However, existing SQP algorithms may suffer from inconsistent quadratic programming (QP) subproblems and numerical noise, especially for ill-conditioned process optimisation problems, leading to a suboptimal or infeasible solution. In this work, we propose an improved SQP algorithm (I-SQP) and an improved sequential least squares programming algorithm (I-SLSQP) that solves a least squares (LSQ) subproblem at each major iteration. A hybrid method through the combination of two existing relaxations is proposed to solve the inconsistent subproblems for better convergence and higher efficiency. We find that a certain part of the dual LSQ algorithm suffers from serious cancellation errors, resulting in an inaccurate search direction or no viable search direction generated. Therefore, the QP solver is used to solve LSQ subproblems in such a situation. The computational results indicates that I-SLSQP is more robust than fmincon in MATLAB, IPOPT, Py-SLSQP and I-SQP. It is also shown that I-SLSQP and Py-SLSQP is superior to I-SQP for ill-conditioned process optimisation problems, whilst I-SQP is more computationally efficient than I-SLSQP and Py-SLSQP for well-conditioned problems. △ Less

Submitted 24 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: 43 pages, 10 figures, 12 tables

Journal ref: Computers & Chemical Engineering 2024, 188, 108751

arXiv:2401.08942 [pdf, ps, other]

Ramsey and Gallai-Ramsey numbers for linear forests and kipas

Authors: Ping Li, Yaping Mao, Ingo Schiermeyer, Yifan Yao

Abstract: For two graphs $G,H$, the \emph{Ramsey number} $r(G,H)$ is the minimum integer $n$ such that any red/blue edge-coloring of $K_n$ contains either a red copy of $G$ or a blue copy of $H$. For two graphs $G,H$, the \emph{Gallai-Ramsey number} $\operatorname{gr}_k(G:H)$ is defined as the minimum integer $n$ such that any $k$-edge-coloring of $K_n$ must contain either a rainbow copy of $G$ or a monochr… ▽ More For two graphs $G,H$, the \emph{Ramsey number} $r(G,H)$ is the minimum integer $n$ such that any red/blue edge-coloring of $K_n$ contains either a red copy of $G$ or a blue copy of $H$. For two graphs $G,H$, the \emph{Gallai-Ramsey number} $\operatorname{gr}_k(G:H)$ is defined as the minimum integer $n$ such that any $k$-edge-coloring of $K_n$ must contain either a rainbow copy of $G$ or a monochromatic copy of $H$. In this paper, the classical Ramsey numbers of linear forest versus kipas are obtained. We obtain the exact values of $\operatorname{gr}_k(G:H)$, where $H$ is either a path or a kipas and $G\in\{K_{1,3},P_4^+,P_5\}$ and $P_4^+$ is the graph consisting of $P_4$ with one extra edge incident with inner vertex. △ Less

Submitted 25 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

arXiv:2401.07228 [pdf, other]

A Lawson-time-splitting extended Fourier pseudospectral method for the Gross-Pitaevskii equation with time-dependent low regularity potential

Authors: Bo Lin, Ying Ma, Chushan Wang

Abstract: We propose a Lawson-time-splitting extended Fourier pseudospectral (LTSeFP) method for the numerical integration of the Gross-Pitaevskii equation with time-dependent potential that is of low regularity in space. For the spatial discretization of low regularity potential, we use an extended Fourier pseudospectral (eFP) method, i.e., we compute the discrete Fourier transform of the low regularity po… ▽ More We propose a Lawson-time-splitting extended Fourier pseudospectral (LTSeFP) method for the numerical integration of the Gross-Pitaevskii equation with time-dependent potential that is of low regularity in space. For the spatial discretization of low regularity potential, we use an extended Fourier pseudospectral (eFP) method, i.e., we compute the discrete Fourier transform of the low regularity potential in an extended window. For the temporal discretization, to efficiently implement the eFP method for time-dependent low regularity potential, we combine the standard time-splitting method with a Lawson-type exponential integrator to integrate potential and nonlinearity differently. The LTSeFP method is both accurate and efficient: it achieves first-order convergence in time and optimal-order convergence in space in $L^2$-norm under low regularity potential, while the computational cost is comparable to the standard time-splitting Fourier pseudospectral method. Theoretically, we also prove such convergence orders for a large class of spatially low regularity time-dependent potential. Extensive numerical results are reported to confirm the error estimates and to demonstrate the superiority of our method. △ Less

Submitted 14 January, 2024; originally announced January 2024.

Comments: 19 pages, 10 figures

MSC Class: 35Q55; 65M15; 65M70; 81Q05

arXiv:2401.06325 [pdf, other]

Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo

Authors: Xunpeng Huang, Difan Zou, Hanze Dong, Yian Ma, Tong Zhang

Abstract: To sample from a general target distribution $p_*\propto e^{-f_*}$ beyond the isoperimetric condition, Huang et al. (2023) proposed to perform sampling through reverse diffusion, giving rise to Diffusion-based Monte Carlo (DMC). Specifically, DMC follows the reverse SDE of a diffusion process that transforms the target distribution to the standard Gaussian, utilizing a non-parametric score estimat… ▽ More To sample from a general target distribution $p_*\propto e^{-f_*}$ beyond the isoperimetric condition, Huang et al. (2023) proposed to perform sampling through reverse diffusion, giving rise to Diffusion-based Monte Carlo (DMC). Specifically, DMC follows the reverse SDE of a diffusion process that transforms the target distribution to the standard Gaussian, utilizing a non-parametric score estimation. However, the original DMC algorithm encountered high gradient complexity, resulting in an exponential dependency on the error tolerance $ε$ of the obtained samples. In this paper, we demonstrate that the high complexity of DMC originates from its redundant design of score estimation, and proposed a more efficient algorithm, called RS-DMC, based on a novel recursive score estimation method. In particular, we first divide the entire diffusion process into multiple segments and then formulate the score estimation step (at any time step) as a series of interconnected mean estimation and sampling subproblems accordingly, which are correlated in a recursive manner. Importantly, we show that with a proper design of the segment decomposition, all sampling subproblems will only need to tackle a strongly log-concave distribution, which can be very efficient to solve using the Langevin-based samplers with a provably rapid convergence rate. As a result, we prove that the gradient complexity of RS-DMC only has a quasi-polynomial dependency on $ε$, which significantly improves exponential gradient complexity in Huang et al. (2023). Furthermore, under commonly used dissipative conditions, our algorithm is provably much faster than the popular Langevin-based algorithms. Our algorithm design and theoretical framework illuminate a novel direction for addressing sampling problems, which could be of broader applicability in the community. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 54 pages

arXiv:2401.02638 [pdf, ps, other]

Probabilistic degenerate Fubini polynomials associated with random variables

Authors: Rongrong Xu, Taekyun Kim, Dae San Kim, Yuankui Ma

Abstract: Let Y be a random variable such that the moment generating function of Y exists in a neighborhood of the origin. The aim of this paper is to study probabilistic versions of the degenerate Fubini polynomials and the degenerate Fubini polynomials of order $r$, namely the probabilisitc degenerate Fubini polynomials associated with Y and the probabilistic degenerate Fubini polynomials of order r assoc… ▽ More Let Y be a random variable such that the moment generating function of Y exists in a neighborhood of the origin. The aim of this paper is to study probabilistic versions of the degenerate Fubini polynomials and the degenerate Fubini polynomials of order $r$, namely the probabilisitc degenerate Fubini polynomials associated with Y and the probabilistic degenerate Fubini polynomials of order r associated with Y. We derive some properties, explicit expressions, certain identities and recurrence relations for those polynomials. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 15

MSC Class: 11B73; 11B83

arXiv:2312.17712 [pdf, ps, other]

The Euclidean-hyperboloidal foliation method. Application to f(R) modified gravity

Authors: Philippe G. LeFloch, Yue Ma

Abstract: This paper is a part of a series devoted to the Euclidean-hyperboloidal foliation method introduced by the authors for investigating the global existence problem associated with nonlinear systems of coupled wave-Klein-Gordon equations with small data. This method was developed especially for investigating the initial value problem for the Einstein-massive field system in wave gauge. Here, we study… ▽ More This paper is a part of a series devoted to the Euclidean-hyperboloidal foliation method introduced by the authors for investigating the global existence problem associated with nonlinear systems of coupled wave-Klein-Gordon equations with small data. This method was developed especially for investigating the initial value problem for the Einstein-massive field system in wave gauge. Here, we study the (fourth-order) field equations of f(R) modified gravity and investigate the global dynamical behavior of the gravitational field in the near-Minkowski regime. We establish the existence of a globally hyperbolic Cauchy development approaching Minkowski spacetime (in spacelike, null, and timelike directions), when the initial data set is sufficiently close to an asymptotically Euclidean and spacelike hypersurface in Minkowski spacetime. We cast the (fourth-order) f(R)-field equations in the form of a second-order wave-Klein-Gordon system, which has an analogous structure to the Einstein-massive field system but, in addition, involves a (possibly small) effective mass parameter. We establish the nonlinear stability of the Minkowski spacetime in the context of f(R) gravity, when the integrand f(R) in the action functional can be taken to be arbitrarily close to the integrand R of the standard Hilbert-Einstein action. △ Less

Submitted 7 May, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

Comments: 46 pages

arXiv:2312.14361 [pdf, ps, other]

A Gradient-Based Optimization Method Using the Koopman Operator

Authors: Mengqi Hu, Bian Li, Yi-An Ma, Yifei Lou, Xiu Yang

Abstract: In this paper, we propose a novel approach to solving optimization problems by reformulating the optimization problem into a dynamical system, followed by the adaptive spectral Koopman (ASK) method. The Koopman operator, employed in our approach, approximates the evolution of an ordinary differential equation (ODE) using a finite number of eigenfunctions and eigenvalues. We begin by providing a br… ▽ More In this paper, we propose a novel approach to solving optimization problems by reformulating the optimization problem into a dynamical system, followed by the adaptive spectral Koopman (ASK) method. The Koopman operator, employed in our approach, approximates the evolution of an ordinary differential equation (ODE) using a finite number of eigenfunctions and eigenvalues. We begin by providing a brief overview of the Koopman operator and the ASK method. Subsequently, we adapt the ASK method for solving a general optimization problem. Moreover, we provide an error bound to aid in understanding the performance of the proposed approach, marking the initial step in a more comprehensive numerical analysis. Experimentally, we demonstrate the applicability and accuracy of our method across a diverse range of optimization problems, including min-max problems. Our approach consistently yields smaller gradient norms and higher success rates in finding critical points compared to state-of-the-art gradient-based methods. We also observe the proposed method works particularly well when the dynamical properties of the system can be effectively modeled by the system's behaviors in a neighborhood of critical points. △ Less

Submitted 21 December, 2023; originally announced December 2023.

MSC Class: 37N30; 37N40; 37Mxx; 46N10; 47N10

arXiv:2312.01046 [pdf, other]

Bagged Regularized $k$-Distances for Anomaly Detection

Authors: Yuchao Cai, Yuheng Ma, Hanfang Yang, Hanyuan Hang

Abstract: We consider the paradigm of unsupervised anomaly detection, which involves the identification of anomalies within a dataset in the absence of labeled examples. Though distance-based methods are top-performing for unsupervised anomaly detection, they suffer heavily from the sensitivity to the choice of the number of the nearest neighbors. In this paper, we propose a new distance-based algorithm cal… ▽ More We consider the paradigm of unsupervised anomaly detection, which involves the identification of anomalies within a dataset in the absence of labeled examples. Though distance-based methods are top-performing for unsupervised anomaly detection, they suffer heavily from the sensitivity to the choice of the number of the nearest neighbors. In this paper, we propose a new distance-based algorithm called bagged regularized $k$-distances for anomaly detection (BRDAD) converting the unsupervised anomaly detection problem into a convex optimization problem. Our BRDAD algorithm selects the weights by minimizing the surrogate risk, i.e., the finite sample bound of the empirical risk of the bagged weighted $k$-distances for density estimation (BWDDE). This approach enables us to successfully address the sensitivity challenge of the hyperparameter choice in distance-based algorithms. Moreover, when dealing with large-scale datasets, the efficiency issues can be addressed by the incorporated bagging technique in our BRDAD algorithm. On the theoretical side, we establish fast convergence rates of the AUC regret of our algorithm and demonstrate that the bagging technique significantly reduces the computational complexity. On the practical side, we conduct numerical experiments on anomaly detection benchmarks to illustrate the insensitivity of parameter selection of our algorithm compared with other state-of-the-art distance-based methods. Moreover, promising improvements are brought by applying the bagging technique in our algorithm on real-world datasets. △ Less

Submitted 13 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

arXiv:2311.16899 [pdf, other]

On the saturation spectrum of the unions of disjoint cycles

Authors: Yue Ma

Abstract: Let $G$ be a graph and $\mathcal{H}$ be a family of graphs. We say $G$ is $\mathcal{H}$-saturated if $G$ does not contain a copy of $H$ with $H\in\mathcal{H}$, but the addition of any edge $e\notin E(G)$ creates at least one copy of some $H\in\mathcal{H}$ within $G+e$. The saturation number of $\mathcal{H}$ is the minimum size of an $\mathcal{H}$-saturated graph on $n$ vertices, and the saturation… ▽ More Let $G$ be a graph and $\mathcal{H}$ be a family of graphs. We say $G$ is $\mathcal{H}$-saturated if $G$ does not contain a copy of $H$ with $H\in\mathcal{H}$, but the addition of any edge $e\notin E(G)$ creates at least one copy of some $H\in\mathcal{H}$ within $G+e$. The saturation number of $\mathcal{H}$ is the minimum size of an $\mathcal{H}$-saturated graph on $n$ vertices, and the saturation spectrum of $\mathcal{H}$ is the set of all possible sizes of an $\mathcal{H}$-saturated graph on $n$ vertices. Let $k\mathcal{C}_{\ge 3}$ be the family of the unions of $k$ vertex-disjoint cycles. In this note, we completely determine the saturation number and the saturation spectrum of $k\mathcal{C}_{\ge 3}$ for $k=2$ and give some results for $k\ge 3$. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 24 pages, 4 figures

arXiv:2311.06960 [pdf, other]

Robust Regression over Averaged Uncertainty

Authors: Dimitris Bertsimas, Yu Ma

Abstract: We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least-squared regression problem. We show that this formulation surprisingly recovers ridge regression and establishes the missing link between robust optimization and the mean squared error approaches for existing… ▽ More We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least-squared regression problem. We show that this formulation surprisingly recovers ridge regression and establishes the missing link between robust optimization and the mean squared error approaches for existing regression problems. We first prove the equivalence for four uncertainty sets: ellipsoidal, box, diamond, and budget, and provide closed-form formulations of the penalty term as a function of the sample size, feature size, as well as perturbation protection strength. We then show in synthetic datasets with different levels of perturbations, a consistent improvement of the averaged formulation over the existing worst-case formulation in out-of-sample performance. Importantly, as the perturbation level increases, the improvement increases, confirming our method's advantage in high-noise environments. We report similar improvements in the out-of-sample datasets in real-world regression problems obtained from UCI datasets. △ Less

Submitted 12 November, 2023; originally announced November 2023.

arXiv:2311.01381 [pdf, ps, other]

Li-Yau Inequality and Liouville Property to a Semilinear Heat Equation on Riemannian Manifolds

Authors: Huan-Jie Chen, Shi-Zhong Du, Yue-Xiao Ma

Abstract: This work deals with the Entire solutions of a nonlinear equation. The first part of this paper is devoted to investigation of the Liouville property on compact manifolds, which extends a result by Castorina-Mantegazza [4] for positive f. Secondly, we will turn to non-compact manifolds and prove a Liouville theorem under the assumptions of boundedness of the Ricci curvature from below, diffeomorph… ▽ More This work deals with the Entire solutions of a nonlinear equation. The first part of this paper is devoted to investigation of the Liouville property on compact manifolds, which extends a result by Castorina-Mantegazza [4] for positive f. Secondly, we will turn to non-compact manifolds and prove a Liouville theorem under the assumptions of boundedness of the Ricci curvature from below, diffeomorphism of M with R^N and sub-criticality of p defined below. Finally, we also present simplified proofs of Yau's theorem for harmonic function and Gidas-Spruck's theorem for elliptic semilinear equation. Our proofs are based on Li-Yau type estimation for nonlinear equations. △ Less

Submitted 2 November, 2023; originally announced November 2023.

MSC Class: 35K58; 53B21; 35K05

arXiv:2310.20177 [pdf, other]

An extended Fourier pseudospectral method for the Gross-Pitaevskii equation with low regularity potential

Authors: Weizhu Bao, Bo Lin, Ying Ma, Chushan Wang

Abstract: We propose and analyze an extended Fourier pseudospectral (eFP) method for the spatial discretization of the Gross-Pitaevskii equation (GPE) with low regularity potential by treating the potential in an extended window for its discrete Fourier transform. The proposed eFP method maintains optimal convergence rates with respect to the regularity of the exact solution even if the potential is of low… ▽ More We propose and analyze an extended Fourier pseudospectral (eFP) method for the spatial discretization of the Gross-Pitaevskii equation (GPE) with low regularity potential by treating the potential in an extended window for its discrete Fourier transform. The proposed eFP method maintains optimal convergence rates with respect to the regularity of the exact solution even if the potential is of low regularity and enjoys similar computational cost as the standard Fourier pseudospectral method, and thus it is both efficient and accurate. Furthermore, similar to the Fourier spectral/pseudospectral methods, the eFP method can be easily coupled with different popular temporal integrators including finite difference methods, time-splitting methods and exponential-type integrators. Numerical results are presented to validate our optimal error estimates and to demonstrate that they are sharp as well as to show its efficiency in practical computations. △ Less

Submitted 31 October, 2023; originally announced October 2023.

Comments: 20 pages, 7 figures

MSC Class: 35Q55; 65M15; 65M70; 81Q05

arXiv:2310.16463 [pdf, other]

Constructing disjoint Steiner trees in Sierpiński graphs

Authors: Chenxu Yang, Ping Li, Yaping Mao, Eddie Cheng, Ralf Klasing

Abstract: Let $G$ be a graph and $S\subseteq V(G)$ with $|S|\geq 2$. Then the trees $T_1, T_2, \cdots, T_\ell$ in $G$ are \emph{internally disjoint Steiner trees} connecting $S$ (or $S$-Steiner trees) if $E(T_i) \cap E(T_j )=\emptyset$ and $V(T_i)\cap V(T_j)=S$ for every pair of distinct integers $i,j$, $1 \leq i, j \leq \ell$. Similarly, if we only have the condition $E(T_i) \cap E(T_j )=\emptyset$ but wit… ▽ More Let $G$ be a graph and $S\subseteq V(G)$ with $|S|\geq 2$. Then the trees $T_1, T_2, \cdots, T_\ell$ in $G$ are \emph{internally disjoint Steiner trees} connecting $S$ (or $S$-Steiner trees) if $E(T_i) \cap E(T_j )=\emptyset$ and $V(T_i)\cap V(T_j)=S$ for every pair of distinct integers $i,j$, $1 \leq i, j \leq \ell$. Similarly, if we only have the condition $E(T_i) \cap E(T_j )=\emptyset$ but without the condition $V(T_i)\cap V(T_j)=S$, then they are \emph{edge-disjoint Steiner trees}. The \emph{generalized $k$-connectivity}, denoted by $κ_k(G)$, of a graph $G$, is defined as $κ_k(G)=\min\{κ_G(S)|S \subseteq V(G) \ \textrm{and} \ |S|=k \}$, where $κ_G(S)$ is the maximum number of internally disjoint $S$-Steiner trees. The \emph{generalized local edge-connectivity} $λ_{G}(S)$ is the maximum number of edge-disjoint Steiner trees connecting $S$ in $G$. The {\it generalized $k$-edge-connectivity} $λ_k(G)$ of $G$ is defined as $λ_k(G)=\min\{λ_{G}(S)\,|\,S\subseteq V(G) \ and \ |S|=k\}$. These measures are generalizations of the concepts of connectivity and edge-connectivity, and they and can be used as measures of vulnerability of networks. It is, in general, difficult to compute these generalized connectivities. However, there are precise results for some special classes of graphs. In this paper, we obtain the exact value of $λ_{k}(S(n,\ell))$ for $3\leq k\leq \ell^n$, and the exact value of $κ_{k}(S(n,\ell))$ for $3\leq k\leq \ell$, where $S(n, \ell)$ is the Sierpiński graphs with order $\ell^n$. As a direct consequence, these graphs provide additional interesting examples when $λ_{k}(S(n,\ell))=κ_{k}(S(n,\ell))$. We also study the some network properties of Sierpiński graphs. △ Less

Submitted 28 August, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: Steiner Tree; Generalized Connectivity; Sierpiński Graph

arXiv:2310.08791 [pdf, other]

Refinements on vertical Sato-Tate

Authors: Zhao Yu Ma

Abstract: Vertical Sato-Tate states that the Frobenius trace of a randomly chosen elliptic curve over $\mathbb F_p$ tends to a semicircular distribution as $p\rightarrow \infty$. We go beyond this statement by considering the number of elliptic curves $N_{t,p}'$ with a given trace $t$ over $\mathbb F_p$ and characterizing the 2-dimensional distribution of $(t,N_{t,p}')$. In particular, this gives the distri… ▽ More Vertical Sato-Tate states that the Frobenius trace of a randomly chosen elliptic curve over $\mathbb F_p$ tends to a semicircular distribution as $p\rightarrow \infty$. We go beyond this statement by considering the number of elliptic curves $N_{t,p}'$ with a given trace $t$ over $\mathbb F_p$ and characterizing the 2-dimensional distribution of $(t,N_{t,p}')$. In particular, this gives the distribution of the size of isogeny classes of elliptic curves over $\mathbb F_p$. Furthermore, we show a notion of stronger convergence for vertical Sato-Tate which states that the number of elliptic curves with Frobenius trace in an interval of length $p^ε$ converges to the expected amount. The key step in the proof is to truncate Gekeler's infinite product formula, which relies crucially on an effective Chebotarev's density theorem that was recently developed by Pierce, Turnage-Butterbaugh and Wood. △ Less

Submitted 29 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

Comments: 27 pages, 4 figures. Minor edits for clarity

MSC Class: 11G07 (Primary) 14H52; 11G20 (Secondary)

arXiv:2309.02660 [pdf, ps, other]

A Bi-level Globalization Strategy for Non-convex Consensus ADMM and ALADIN

Authors: Xu Du, Jingzhe Wang, Xiaohua Zhou, Yijie Mao

Abstract: In this paper, we formally analyze global convergence in the realm of distributed consensus optimization. Current solutions have explored such analysis, particularly focusing on consensus alternating direction method of multipliers (CADMM), including convex and non-convex cases. While such efforts on non-convexity offer elegant theory guaranteeing global convergence, they entail strong assumptions… ▽ More In this paper, we formally analyze global convergence in the realm of distributed consensus optimization. Current solutions have explored such analysis, particularly focusing on consensus alternating direction method of multipliers (CADMM), including convex and non-convex cases. While such efforts on non-convexity offer elegant theory guaranteeing global convergence, they entail strong assumptions and complicated proof techniques that are increasingly pose challenges when adopted to real-world applications. To resolve such tension, we propose a novel bi-level globalization strategy that not only guarantees global convergence but also provides succinct proofs, all while requiring mild assumptions. We begin by adopting such a strategy to perform global convergence analysis for the non-convex cases in C-ADMM. Then, we employ our proposed strategy in consensus augmented Lagrangian based alternating direction inexact Newton method (C-ALADIN), a more recent and generalization of C-ADMM. Surprisingly, our analysis shows that C-ALADIN globally converges to local optimizer, complementary to the prior work on C-ALADIN, which had primarily focused on analyzing local convergence for non-convex cases. △ Less

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.15461 [pdf, other]

Canonical Factors for Hybrid Neural Fields

Authors: Brent Yi, Weijia Zeng, Sam Buchanan, Yi Ma

Abstract: Factored feature volumes offer a simple way to build more compact, efficient, and intepretable neural fields, but also introduce biases that are not necessarily beneficial for real-world data. In this work, we (1) characterize the undesirable biases that these architectures have for axis-aligned signals -- they can lead to radiance field reconstruction differences of as high as 2 PSNR -- and (2) e… ▽ More Factored feature volumes offer a simple way to build more compact, efficient, and intepretable neural fields, but also introduce biases that are not necessarily beneficial for real-world data. In this work, we (1) characterize the undesirable biases that these architectures have for axis-aligned signals -- they can lead to radiance field reconstruction differences of as high as 2 PSNR -- and (2) explore how learning a set of canonicalizing transformations can improve representations by removing these biases. We prove in a two-dimensional model problem that simultaneously learning these transformations together with scene appearance succeeds with drastically improved efficiency. We validate the resulting architectures, which we call TILTED, using image, signed distance, and radiance field reconstruction tasks, where we observe improvements across quality, robustness, compactness, and runtime. Results demonstrate that TILTED can enable capabilities comparable to baselines that are 2x larger, while highlighting weaknesses of neural field evaluation procedures. △ Less

Submitted 29 August, 2023; originally announced August 2023.

Comments: ICCV 2023. Project webpage: https://brentyi.github.io/tilted/

arXiv:2308.15089 [pdf, other]

doi 10.1142/S0218202524500155

Optimal error bounds on time-splitting methods for the nonlinear Schrödinger equation with low regularity potential and nonlinearity

Authors: Weizhu Bao, Ying Ma, Chushan Wang

Abstract: We establish optimal error bounds on time-splitting methods for the nonlinear Schrödinger equation with low regularity potential and typical power-type nonlinearity $ f(ρ) = ρ^σ$, where $ ρ:=|ψ|^2 $ is the density with $ ψ$ the wave function and $ σ> 0 $ the exponent of the nonlinearity. For the first-order Lie-Trotter time-splitting method, optimal $ L^2 $-norm error bound is proved for… ▽ More We establish optimal error bounds on time-splitting methods for the nonlinear Schrödinger equation with low regularity potential and typical power-type nonlinearity $ f(ρ) = ρ^σ$, where $ ρ:=|ψ|^2 $ is the density with $ ψ$ the wave function and $ σ> 0 $ the exponent of the nonlinearity. For the first-order Lie-Trotter time-splitting method, optimal $ L^2 $-norm error bound is proved for $L^\infty$-potential and $ σ> 0 $, and optimal $H^1$-norm error bound is obtained for $ W^{1, 4} $-potential and $ σ\geq 1/2 $. For the second-order Strang time-splitting method, optimal $ L^2 $-norm error bound is established for $H^2$-potential and $ σ\geq 1 $, and optimal $H^1$-norm error bound is proved for $H^3$-potential and $ σ\geq 3/2 $ (or $σ= 1$). Compared to those error estimates of time-splitting methods in the literature, our optimal error bounds either improve the convergence rates under the same regularity assumptions or significantly relax the regularity requirements on potential and nonlinearity for optimal convergence orders. A key ingredient in our proof is to adopt a new technique called \textit{regularity compensation oscillation} (RCO), where low frequency modes are analyzed by phase cancellation, and high frequency modes are estimated by regularity of the solution. Extensive numerical results are reported to confirm our error estimates and to demonstrate that they are sharp. △ Less

Submitted 7 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

Comments: 34 pages, 8 figures

MSC Class: 35Q55; 65M15; 65M70; 81Q05

Journal ref: Math. Models Methods Appl. Sci., Vol. 34 (2024), pp. 803-844

arXiv:2308.14743 [pdf, ps, other]

doi 10.1088/1751-8121/ad1622

Domain Walls and Vector Solitons in the Coupled Nonlinear Schrodinger Equation

Authors: David D. J. M. Snee, Yi-Ping Ma

Abstract: We outline a program to classify domain walls (DWs) and vector solitons in the 1D two-component coupled nonlinear Schrodinger (CNLS) equation with general coefficients. The CNLS equation is reduced first to a complex ordinary differential equation (ODE), and then to a real ODE after imposing a restriction. In the real ODE, we identify four possible equilibria including ZZ, ZN, NZ, and NN, with Z (… ▽ More We outline a program to classify domain walls (DWs) and vector solitons in the 1D two-component coupled nonlinear Schrodinger (CNLS) equation with general coefficients. The CNLS equation is reduced first to a complex ordinary differential equation (ODE), and then to a real ODE after imposing a restriction. In the real ODE, we identify four possible equilibria including ZZ, ZN, NZ, and NN, with Z (N) denoting a zero (nonzero) value in a component, and analyze their spatial stability. We identify two types of DWs including asymmetric DWs between ZZ and NN and symmetric DWs between ZN and NZ. We identify three codimension-1 mechanisms for generating vector solitons in the real ODE including heteroclinic cycles, local bifurcations, and exact solutions. Heteroclinic cycles are formed by assembling two DWs back-to-back and generate extended bright-bright (BB), dark-dark (DD), and dark-bright (DB) solitons. Local bifurcations include the Turing (Hamiltonian-Hopf) bifurcation that generates Turing solitons with oscillatory tails and the pitchfork bifurcation that generates DB, bright-antidark, DD, and dark-antidark solitons with monotonic tails. Exact solutions include scalar bright and dark solitons with vector amplitudes. Any codimension-1 real vector soliton can be numerically continued into a codimension-0 family. Complex vector solitons have two more parameters: a dark or antidark component can be numerically continued in the wavenumber, while a bright component can be multiplied by a constant phase factor (polarization). We introduce a numerical continuation method to find real and complex vector solitons and show that DWs and DB solitons in the immiscible regime can be related by varying bifurcation parameters. We show that collisions between two polarized DB solitons typically feature a mass exchange that changes the parameters of the two bright components and the two soliton velocities. △ Less

Submitted 8 January, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: 29 pages, 5 figures, 1 table; accepted version in Journal of Physics A after addressing referee comments

Journal ref: J. Phys. A: Math. Theor. 57 035702 (2024)

arXiv:2308.13031 [pdf, ps, other]

Initial data gluing in the asymptotically flat regime via solution operators with prescribed support properties

Authors: Yuchen Mao, Sung-Jin Oh, Zhongkai Tao

Abstract: We give new proofs of general relativistic initial data gluing results on unit-scale annuli based on explicit solution operators for the linearized constraint equation around the flat case with prescribed support properties. These results retrieve and optimize - in terms of positivity, regularity, size and/or spatial decay requirements - a number of known theorems concerning asymptotically flat in… ▽ More We give new proofs of general relativistic initial data gluing results on unit-scale annuli based on explicit solution operators for the linearized constraint equation around the flat case with prescribed support properties. These results retrieve and optimize - in terms of positivity, regularity, size and/or spatial decay requirements - a number of known theorems concerning asymptotically flat initial data, including Kerr exterior gluing by Corvino-Schoen and Chruściel-Delay, interior gluing (or "fill-in") by Bieri-Chruściel, and obstruction-free gluing by Czimek-Rodnianski. In particular, our proof of the strengthened obstruction-free gluing theorem relies on purely spacelike techniques, rather than null gluing as in the original approach. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: 30 pages, 1 figure. Comments are welcome!

arXiv:2308.10059 [pdf, other]

The degree threshold for covering with all the connected $3$-graphs with $3$ edges

Authors: Yue Ma, Xinmin Hou, Zhi Yin

Abstract: Given two $r$-uniform hypergraphs $F$ and $H$, we say that $H$ has an $F$-covering if every vertex in $H$ is contained in a copy of $F$. Let $c_{i}(n,F)$ be the least integer such that every $n$-vertex $r$-graph $H$ with $δ_{i}(H)>c_i(n,F)$ has an $F$-covering. Falgas-Ravry, Markstöm and Zhao (Combin. Probab. Comput., 2021) asymptotically determined $c_1(n,K_{4}^{(3)-})$, where $K_{4}^{(3)-}$ is o… ▽ More Given two $r$-uniform hypergraphs $F$ and $H$, we say that $H$ has an $F$-covering if every vertex in $H$ is contained in a copy of $F$. Let $c_{i}(n,F)$ be the least integer such that every $n$-vertex $r$-graph $H$ with $δ_{i}(H)>c_i(n,F)$ has an $F$-covering. Falgas-Ravry, Markstöm and Zhao (Combin. Probab. Comput., 2021) asymptotically determined $c_1(n,K_{4}^{(3)-})$, where $K_{4}^{(3)-}$ is obtained by deleting an edge from the complete $3$-graph on $4$ vertices. Later, Tang, Ma and Hou (arXiv, 2022) asymptotically determined $c_1(n,C_{6}^{(3)})$, where $C_{6}^{(3)}$ is the linear triangle, i.e. $C_{6}^{(3)}=([6],\{123,345,561\})$. In this paper, we determine $c_1(n,F_5)$ asymptotically, where $F_5$ is the generalized triangle, i.e. $F_5=([5],\{123,124,345\})$. We also determine the exact values of $c_1(n,F)$, where $F$ is any connected $3$-graphs with $3$ edges and $F\notin\{K_4^{(3)-}, C_{6}^{(3)}, F_5\}$. △ Less

Submitted 19 August, 2023; originally announced August 2023.

Comments: 17 pages, 10 figures

Showing 1–50 of 372 results for author: Ma, Y