Search | arXiv e-print repository

A rigidity result for ancient Ricci flows

Abstract: Using a size condition of the sharp log Sobolev functional (log entropy) near infinity only, we prove a rigidity result for ancient Ricci flows without sign condition on the curvatures. The result is also related to the problem of identifying type II ancient Ricci flows and their backward limits.

Submitted 24 June, 2024; originally announced June 2024.

MSC Class: 2020: 58J35

arXiv:2406.10849 [pdf, other]

A parallel framework for graphical optimal transport

Authors: Jiaojiao Fan, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen

Abstract: We study multi-marginal optimal transport (MOT) problems where the underlying cost has a graphical structure. These graphical multi-marginal optimal transport problems have found applications in several domains including traffic flow control and regression problems in the Wasserstein space. The MOT problem can be approached in two ways: as a single big MOT problem, or as coupled minor OT problems. In this paper, we focus on the latter approach and demonstrate that it gains efficiency from parallelization. For tree-structured MOT problems, we introduce a novel parallelizable algorithm that significantly reduces computational complexity. Additionally, we adapt this algorithm for general graphs, employing the modified junction trees to enable parallel updates. Our contributions, validated through numerical experiments, offer new avenues for MOT applications and establish benchmarks in computational efficiency.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.05697 [pdf, other]

Decision-Focused Surrogate Modeling for Mixed-Integer Linear Optimization

Authors: Shivi Dixit, Rishabh Gupta, Qi Zhang

Abstract: Mixed-integer optimization is at the core of many online decision-making systems that demand frequent updates of decisions in real time. However, due to their combinatorial nature, mixed-integer linear programs (MILPs) can be difficult to solve, rendering them often unsuitable for time-critical online applications. To address this challenge, we develop a data-driven approach for constructing surrogate optimization models in the form of linear programs (LPs) that can be solved much more efficiently than the corresponding MILPs. We train these surrogate LPs in a decision-focused manner such that for different model inputs, they achieve the same or close to the same optimal solutions as the original MILPs. One key advantage of the proposed method is that it allows the incorporation of all the original MILP's linear constraints, which significantly increases the likelihood of obtaining feasible predicted solutions. Results from two computational case studies indicate that this decision-focused surrogate modeling approach is highly data-efficient and provides very accurate predictions of the optimal solutions. In these examples, it outperforms more commonly used neural-network-based optimization proxies.

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.05138 [pdf]

doi 10.1016/j.compgeo.2024.106454

A Novel Coupled bES-FEM Formulation with SUPG stabilization for Thermo-Hydro-Mechanical Analysis in Saturated Porous Media

Authors: Zi-Qi Tang, Xi-Wen Zhou, Yin-Fu Jin, Zhen-Yu Yin, Qi Zhang

Abstract: Two primary types of numerical instabilities often occur in low-order finite element method (FEM) analyses of thermo-hydro-mechanical (THM) phenomena: (1) pressure oscillations arising from improper interpolation of pressure and displacement fields; and (2) spatial oscillations induced by nonlinear convection terms in convection-dominated scenarios. In response to these issues, this paper proposes a novel stabilized edge-based smoothed FEM with a bubble function (bES-FEM) for THM analysis within saturated porous media. In the proposed framework, a cubic bubble function is first incorporated into ES-FEM to efficiently mitigate pressure oscillations that breach the Inf-Sup condition, and then the Streamline Upwind Petrov-Galerkin (SUPG) scheme is adopted in bES-FEM to effectively reduce the spurious oscillations in convection-dominated heat transfer scenarios. The accuracy of the bES-FEM with SUPG formulation for THM coupled problems is validated through a series of five benchmark tests. Moreover, the simulations of open-loop ground source energy systems demonstrate the proposed method's exceptional capability in tackling complex THM challenges in real-world applications. All the obtained results showcase the superiority of the proposed bES-FEM with SUPG in eliminating the spatial and pressure oscillations, marking it as a promising tool for the exploration of coupled THM issues.

Submitted 20 May, 2024; originally announced June 2024.

Comments: 39 pages (include references), 18 figures, 7 tables

Journal ref: Comput. Geotech. 173 (2024) 106454

arXiv:2406.05052 [pdf, other]

Column generation for multistage stochastic mixed-integer nonlinear programs with discrete state variables

Authors: Tushar Rathi, Benjamin P. Riley, Angela Flores-Quiroz, Qi Zhang

Abstract: Stochastic programming provides a natural framework for modeling sequential optimization problems under uncertainty; however, the efficient solution of large-scale multistage stochastic programs remains a challenge, especially in the presence of discrete decisions and nonlinearities. In this work, we consider multistage stochastic mixed-integer nonlinear programs (MINLPs) with discrete state variables, which exhibit a decomposable structure that allows its solution using a column generation approach. Following a Dantzig-Wolfe reformulation, we apply column generation such that each pricing subproblem is an MINLP of much smaller size, making it more amenable to global MINLP solvers. We further propose a method for generating additional columns that satisfy the nonanticipativity constraints, leading to significantly improved convergence and optimal or near-optimal solutions for many large-scale instances in a reasonable computation time. The effectiveness of the tailored column generation algorithm is demonstrated via computational case studies on a multistage blending problem and a problem involving the routing of mobile generators in a power distribution network.

Submitted 7 June, 2024; originally announced June 2024.

Comments: 31 pages, 10 figures

arXiv:2405.19650 [pdf, other]

Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization

Authors: Xi Lin, Yilu Liu, Xiaoyuan Zhang, Fei Liu, Zhenkun Wang, Qingfu Zhang

Abstract: Multi-objective optimization can be found in many real-world applications where some conflicting objectives cannot be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially large with respect to the number of objectives, which makes these methods unsuitable for handling many optimization objectives. In this work, instead of finding a dense set of Pareto solutions, we propose a novel Tchebycheff set scalarization method to find a few representative solutions (e.g., 5) to cover a large number of objectives (e.g., $>100$) in a collaborative and complementary manner. In this way, each objective can be well addressed by at least one solution in the small solution set. In addition, we further develop a smooth Tchebycheff set scalarization approach for efficient optimization with good theoretical guarantees. Experimental studies on different problems with many optimization objectives demonstrate the effectiveness of our proposed method.

Submitted 29 May, 2024; originally announced May 2024.
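
Illustrative sketch (not taken from the paper): one plausible reading of the set scalarization described in the abstract above is to score a small solution set by its worst objective, where each objective is credited to whichever solution in the set handles it best. The function names, the `weights` and `ideal` arguments, and the log-sum-exp smoothing are assumptions made for illustration only; the abstract does not state the authors' exact formulation.

```python
import numpy as np


def _logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    a_max = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(a_max, axis=axis) + np.log(np.sum(np.exp(a - a_max), axis=axis))


def tch_set_value(F, weights=None, ideal=None):
    """Worst-case objective of a solution set (lower is better).

    F[k, i] is the value of objective i at solution k. Hypothetical
    formulation: max over objectives of the best (min) value over solutions.
    """
    F = np.asarray(F, dtype=float)
    m = F.shape[1]
    w = np.ones(m) if weights is None else np.asarray(weights, dtype=float)
    z = np.zeros(m) if ideal is None else np.asarray(ideal, dtype=float)
    best_per_objective = F.min(axis=0)  # each objective is covered by its best solution
    return float(np.max(w * (best_per_objective - z)))


def smooth_tch_set_value(F, mu=0.1, weights=None, ideal=None):
    """Differentiable surrogate: log-sum-exp replaces both max and min.

    The smoothing choice is an assumption, not the paper's stated method.
    """
    F = np.asarray(F, dtype=float)
    m = F.shape[1]
    w = np.ones(m) if weights is None else np.asarray(weights, dtype=float)
    z = np.zeros(m) if ideal is None else np.asarray(ideal, dtype=float)
    smooth_min = -mu * _logsumexp(-F / mu, axis=0)  # soft minimum over solutions
    scaled = w * (smooth_min - z)
    return float(mu * _logsumexp(scaled / mu, axis=0))  # soft maximum over objectives


# Two solutions, three objectives: solution 0 covers objective 0,
# solution 1 covers objectives 1 and 2.
F = [[1.0, 5.0, 4.0],
     [6.0, 2.0, 3.0]]
print(tch_set_value(F), smooth_tch_set_value(F, mu=1e-3))
```

As the smoothing parameter `mu` shrinks, the smooth value approaches the non-smooth one, at the cost of sharper gradients.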

arXiv:2405.19440 [pdf, other]

On the Convergence of Multi-objective Optimization under Generalized Smoothness

Authors: Qi Zhang, Peiyao Xiao, Kaiyi Ji, Shaofeng Zou

Abstract: Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we study a more general and realistic class of $\ell$-smooth loss functions, where $\ell$ is a general non-decreasing function of gradient norm. We develop two novel single-loop algorithms for $\ell$-smooth MOO problems, Generalized Smooth Multi-objective Gradient descent (GSMGrad) and its stochastic variant, Stochastic Generalized Smooth Multi-objective Gradient descent (SGSMGrad), which approximate the conflict-avoidant (CA) direction that maximizes the minimum improvement among objectives. We provide a comprehensive convergence analysis of both algorithms and show that they converge to an $ε$-accurate Pareto stationary point with a guaranteed $ε$-level average CA distance (i.e., the gap between the updating direction and the CA direction) over all iterations, where totally $\mathcal{O}(ε^{-2})$ and $\mathcal{O}(ε^{-4})$ samples are needed for deterministic and stochastic settings, respectively. Our algorithms can also guarantee a tighter $ε$-level CA distance in each iteration using more samples. Moreover, we propose a practical variant of GSMGrad named GSMGrad-FA using only constant-level time and space, while achieving the same performance guarantee as GSMGrad. Our experiments validate our theory and demonstrate the effectiveness of the proposed methods.

Submitted 12 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.17875 [pdf, other]

BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification

Authors: Yen-An Lu, Wei-Shou Hu, Joel A. Paulson, Qi Zhang

Abstract: This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem. The IO problem is commonly formulated as a large-scale bilevel program that is notoriously difficult to solve. Deviating from traditional exact solution methods, we propose a derivative-free optimization approach based on Bayesian optimization, which we call BO4IO, to solve general IO problems. We treat the IO loss function as a black box and approximate it with a Gaussian process model. Using the predicted posterior function, an acquisition function is minimized at each iteration to query new candidate solutions and sequentially converge to the optimal parameter estimates. The main advantages of using Bayesian optimization for IO are two-fold: (i) it circumvents the need of complex reformulations of the bilevel program or specialized algorithms and can hence enable computational tractability even when the underlying optimization problem is nonconvex or involves discrete variables, and (ii) it allows approximations of the profile likelihood, which provide uncertainty quantification on the IO parameter estimates. We apply the proposed method to three computational case studies, covering different classes of forward optimization problems ranging from convex nonlinear to nonconvex mixed-integer nonlinear programs. Our extensive computational results demonstrate the efficacy and robustness of BO4IO to accurately estimate unknown model parameters from small and noisy datasets. In addition, the proposed profile likelihood analysis has proven to be effective in providing good approximations of the confidence intervals on the parameter estimates and assessing the identifiability of the unknown parameters.

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.12824 [pdf, ps, other]

A note on the finitely generated fixed subgroup property

Authors: Jialin Lei, Jiming Ma, Qiang Zhang

Abstract: We study when a group of the form $G\times\mathbb{Z}^m (m\geq 1)$ has the finitely generated fixed subgroup property of automorphisms ($\rm{FGFP}_a$), by using the BNS-invariant, and provide some partial answers and non-trivial examples.

Submitted 30 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: 9 pages

arXiv:2405.09742 [pdf, other]

Random Scaling and Momentum for Non-smooth Non-convex Optimization

Authors: Qinzi Zhang, Ashok Cutkosky

Abstract: Training neural networks requires optimizing a loss function that may be highly irregular, and in particular neither convex nor smooth. Popular training algorithms are based on stochastic gradient descent with momentum (SGDM), for which classical analysis applies only if the loss is either convex or smooth. We show that a very small modification to SGDM closes this gap: simply scale the update at each time point by an exponentially distributed random scalar. The resulting algorithm achieves optimal convergence guarantees. Intriguingly, this result is not derived by a specific analysis of SGDM: instead, it falls naturally out of a more general framework for converting online convex optimization algorithms to non-convex optimization algorithms.

Submitted 15 May, 2024; originally announced May 2024.
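
Illustrative sketch (not the authors' code): the modification described in the abstract above, scaling each update by an exponentially distributed random scalar, could look roughly like the following. The function name, the exponential-moving-average form of the momentum, the unit mean of the exponential draw, and all hyperparameter values are assumptions chosen for illustration; the abstract does not specify them.

```python
import numpy as np


def sgdm_random_scaling(grad, x0, lr=0.1, beta=0.9, steps=500, seed=0):
    """SGD with momentum where every step is multiplied by s ~ Exp(1).

    `grad` is any callable returning a (possibly stochastic) gradient at x.
    The momentum form and the mean-1 exponential are hypothetical choices.
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    m = np.zeros_like(x)
    for _ in range(steps):
        g = grad(x)
        m = beta * m + (1.0 - beta) * g  # momentum as a moving average of gradients
        s = rng.exponential(1.0)         # random scalar, exponentially distributed
        x = x - lr * s * m               # scaled update
    return x


# Toy usage on f(x) = 0.5 * ||x||^2, whose gradient is x itself.
x0 = np.array([3.0, -2.0])
x_final = sgdm_random_scaling(lambda x: x, x0)
print(np.linalg.norm(x0), np.linalg.norm(x_final))
```

The toy objective here is smooth and convex, so it only exercises the update rule; it says nothing about the non-smooth, non-convex guarantees the paper claims.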

arXiv:2405.06813 [pdf, other]

A note on distance variance for categorical variables

Authors: Qingyang Zhang

Abstract: This study investigates the extension of distance variance, a validated spread metric for continuous and binary variables [Edelmann et al., 2020, Ann. Stat., 48(6)], to quantify the spread of general categorical variables. We provide both geometric and algebraic characterizations of distance variance, revealing its connections to some commonly used entropy measures, and the variance-covariance matrix of the one-hot encoded representation. However, we demonstrate that distance variance fails to satisfy the Schur-concavity axiom for categorical variables with more than two categories, leading to counterintuitive results. This limitation hinders its applicability as a universal measure of spread.

Submitted 10 May, 2024; originally announced May 2024.

Comments: 3 figures

arXiv:2405.05103 [pdf, ps, other]

Multistability of Bi-Reaction Networks

Authors: Yixuan Liang, Xiaoxian Tang, Qian Zhang

Abstract: We provide a sufficient and necessary condition in terms of the stoichiometric coefficients for a bi-reaction network to admit multistability. Also, this result completely characterizes the bi-reaction networks according to whether they admit multistability.

Submitted 8 May, 2024; originally announced May 2024.

Comments: 39 pages

arXiv:2405.01438 [pdf, other]

Solving the train-platforming problem via a two-level Lagrangian Relaxation approach

Authors: Qin Zhang, Richard Martin Lusby, Pan Shang, Chang Liu, Wenqian Liu

Abstract: High-speed railway stations are crucial junctions in high-speed railway networks. Compared to operations on the tracks between stations, trains have more routing possibilities within stations. As a result, track allocation at a station is relatively complicated. In this study, we aim to solve the train platforming problem for a busy high-speed railway station by considering comprehensive track resources and interlocking configurations. A two-level space-time network is constructed to capture infrastructure information at various levels of detail from both macroscopic and microscopic perspectives. Additionally, we propose a nonlinear programming model that minimizes a weighted sum of total travel time and total deviation time for trains at the station. We apply a Two-level Lagrangian Relaxation (2-L LR) to a linearized version of the model and demonstrate how this induces a decomposable train-specific path choice problem at the macroscopic level that is guided by Lagrange multipliers associated with microscopic resource capacity violation. As case studies, the proposed model and solution approach are applied to a small virtual railway station and a high-speed railway hub station located on the busiest high-speed railway line in China. Through a comparison of other approaches that include Logic-based Benders Decomposition (LBBD), we highlight the superiority of the proposed method; on realistic instances, the 2-L LR method finds solutions that are, on average, approximately 2% from optimality. Finally, we test algorithm performance at the operational level and obtain near-optimal solutions, with optimality gaps of approximately 1%, in a very short time.

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.13689 [pdf, other]

Stability of the Abstract Thermoelastic System with Singularity

Authors: Chenxi Deng, Zhong-Jie Han, Zhaobin Kuang, Qiong Zhang

Abstract: In this paper, we analyze an abstract thermoelastic system, where the heat conduction follows the Cattaneo law. Zero becomes a spectrum point of the system operator when the coupling and thermal damping parameters of the system satisfy specific conditions. We obtain the decay rates of solutions to the system with or without the inertial term. Furthermore, the decay rate of the system without inertial terms is shown to be optimal.

Submitted 21 April, 2024; originally announced April 2024.

Comments: 15 pages

MSC Class: 35Q74; 74F05

arXiv:2404.11134 [pdf, ps, other]

Co-existence of Type II blow-ups with multiple blow-up rates for five-dimensional heat equation with critical nonlinear boundary conditions

Authors: Juncheng Wei, Zikai Ye, Xiaoyu Zeng, Qidi Zhang

Abstract: We consider the following five-dimensional heat equation with critical boundary condition \begin{equation*} \partial_t u=Δu \mbox{ \ in \ } \mathbb{R}_+^5\times (0,T) , \quad -\partial_{x_5}u =|u|^\frac{2}{3}u \mbox{ \ on \ } \partial \mathbb{R}^5_+ \times (0,T) . \end{equation*} Given $\mathfrak{o}$ distinct boundary points $q^{[i]} \in \partial \mathbb{R}_+^5$, and $\mathfrak{o}$ integers $l_i\in \mathbb{N}$ (possibly duplicated), $i=1,2,\dots, \mathfrak{o}$, for $T>0$ sufficiently small, we construct a finite-time blow-up solution $u$ with a type II blow-up rate $(T-t)^{-3l_i -3}$ for $x$ near $q^{[i]}$. This seems to be the first result of the co-existence of type II blowups with different blow-up rates. To accommodate highly unstable blowups with different blowup rates, we first develop a unified linear theory for the inner problem with more time decay in the blow-up scheme through restriction on the spatial growth of the right-hand side, and then use vanishing adjustment functions for deriving multiple rates at distinct points. This paper is inspired by [25, 52, 60].

Submitted 17 April, 2024; originally announced April 2024.

Comments: 59 pages; comments welcome

arXiv:2404.09314 [pdf, ps, other]

Modular data of non-semisimple modular categories

Authors: Liang Chang, Quinn T. Kolt, Zhenghan Wang, Qing Zhang

Abstract: We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich modular data, which is obtained from the Lyubashenko-Majid modular representation restricted to the Higman ideal of a factorizable ribbon Hopf algebra. The Cohen-Westreich $S$-matrix diagonalizes the mixed fusion rules and reduces to the usual $S$-matrix for semisimple modular categories. The paper includes detailed studies on small quantum groups $U_qsl(2)$ and the Drinfeld doubles of Nichols Hopf algebras, especially the $\mathrm{SL}(2, \mathbb{Z})$-representation on their centers, Cohen-Westreich modular data, and the congruence kernel theorem's validity.

Submitted 6 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

Comments: 51 pages. Minor changes to fix typos

arXiv:2404.07463 [pdf, ps, other]

Generic representations, open parameters and ABV-packets for $p$-adic groups

Authors: Clifton Cunningham, Sarah Dijols, Andrew Fiori, Qing Zhang

Abstract: If $π$ is a representation of a $p$-adic group $G(F)$, and $φ$ is its Langlands parameter, can we use the moduli space of Langlands parameters to find a geometric property of $φ$ that will detect when $π$ is generic? In this paper we show that if $G$ is classical or if we assume the Kazhdan-Lusztig hypothesis for $G$, then the answer is yes, and the property is that the orbit of $φ$ is open. We also propose an adaptation of Shahidi's enhanced genericity conjecture to ABV-packets: for every Langlands parameter $φ$ for a $p$-adic group $G(F)$, the ABV-packet $Π^{\mathrm{ABV}}_φ(G(F))$ contains a generic representation if and only if the local adjoint L-function $L(s,φ,\mathop{\text{Ad}})$ is regular at $s=1$, and show that this condition is equivalent to the "open parameter" condition above. We show that this genericity conjecture for ABV-packets follows from other standard conjectures and we verify its validity with the same conditions on $G$. We show that, in this case, the ABV-packet for $φ$ coincides with its $L$-packet. Finally, we prove Vogan's conjecture on $A$-packets for tempered parameters.

Submitted 10 April, 2024; originally announced April 2024.

MSC Class: 11F70; 32S60

arXiv:2404.06735 [pdf, other]

A Copula Graphical Model for Multi-Attribute Data using Optimal Transport

Authors: Qi Zhang, Bing Li, Lingzhou Xue

Abstract: Motivated by modern data forms such as images and multi-view data, the multi-attribute graphical model aims to explore the conditional independence structure among vectors. Under the Gaussian assumption, the conditional independence between vectors is characterized by blockwise zeros in the precision matrix. To relax the restrictive Gaussian assumption, in this paper, we introduce a novel semiparametric multi-attribute graphical model based on a new copula named Cyclically Monotone Copula. This new copula treats the distribution of the node vectors as multivariate marginals and transforms them into Gaussian distributions based on the optimal transport theory. Since the model allows the node vectors to have arbitrary continuous distributions, it is more flexible than the classical Gaussian copula method that performs coordinatewise Gaussianization. We establish the concentration inequalities of the estimated covariance matrices and provide sufficient conditions for selection consistency of the group graphical lasso estimator. For the setting with high-dimensional attributes, a Projected Cyclically Monotone Copula model is proposed to address the curse of dimensionality issue that arises from solving high-dimensional optimal transport problems. Numerical results based on synthetic and real data show the efficiency and flexibility of our methods.

Submitted 10 April, 2024; originally announced April 2024.

Comments: 37 pages

arXiv:2404.04636 [pdf, ps, other]

Global solvability for the Boussinesq system with fractional Laplacian

Authors: Huiyang Zhang, Shuokai Yan, Qinghua Zhang

Abstract: This paper focuses on the global solvability for the Boussinesq system with fractional Laplacian $(-Δ)^α$ in $\mathbb{R}^{n}$ for $n\geq3$. It proves the existence of a small positive number $\varepsilon=\varepsilon(n,α)$ such that for each $0<T<\infty$, if $\frac{1}{2}<α<\frac{2+n}{4}$ and $\|u_{0}\|_{\dot{H}^{s_{0}}}+T^{1/2}\|θ_{0}\|_{\dot{H}^{s_{0}-α}}\leq \varepsilon$, then the fractional Boussinesq system has a unique strong solution on the bounded interval $[0,T]$. If $\frac{1}{2}<α<\frac{2+n}{6}$ and $\|u_{0}\|_{\dot{H}^{s_{0}}}+\|θ_{0}\|_{\dot{H}^{s_{0}-2α}}\leq \varepsilon$, then the fractional Boussinesq system has a unique strong solution on the whole interval $[0,\infty)$.

Submitted 6 April, 2024; originally announced April 2024.

MSC Class: 35Q30; 76D05

arXiv:2404.01436 [pdf, ps, other]

Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance

Authors: Qi Zhang, Yi Zhou, Shaofeng Zou

Abstract: This paper provides the first tight convergence analyses for RMSProp and Adam in non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, which is a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to solve the challenges due to dependence among adaptive update, unbounded gradient estimate and Lipschitz constant, we demonstrate that the first-order term in the descent lemma converges and its denominator is upper bounded by a function of gradient norm. Based on this result, we show that RMSProp with proper hyperparameters converges to an $ε$-stationary point with an iteration complexity of $\mathcal O(ε^{-4})$. We then generalize our analysis to Adam, where the additional challenge is due to a mismatch between the gradient and first-order momentum. We develop a new upper bound on the first-order term in the descent lemma, which is also a function of the gradient norm. We show that Adam with proper hyperparameters converges to an $ε$-stationary point with an iteration complexity of $\mathcal O(ε^{-4})$. Our complexity results for both RMSProp and Adam match with the complexity lower bound established in \cite{arjevani2023lower}.

Submitted 3 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.
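
The abstract above describes RMSProp as a special case of Adam without first-order momentum. A minimal sketch of that relationship follows, assuming the textbook coordinate-wise updates with bias correction omitted for simplicity; the function names, default hyperparameters, and the toy objective are illustrative assumptions and are not taken from the paper.

```python
import math

def adam_step(x, g, m, v, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One coordinate-wise Adam update (bias correction omitted: an assumption)."""
    m = [beta1 * mi + (1 - beta1) * gi for mi, gi in zip(m, g)]
    v = [beta2 * vi + (1 - beta2) * gi * gi for vi, gi in zip(v, g)]
    x = [xi - lr * mi / (math.sqrt(vi) + eps) for xi, mi, vi in zip(x, m, v)]
    return x, m, v

def rmsprop_step(x, g, v, lr=1e-3, beta2=0.999, eps=1e-8):
    """One coordinate-wise RMSProp update: same second-moment scaling, raw gradient."""
    v = [beta2 * vi + (1 - beta2) * gi * gi for vi, gi in zip(v, g)]
    x = [xi - lr * gi / (math.sqrt(vi) + eps) for xi, gi, vi in zip(x, g, v)]
    return x, v

def grad(x):
    """Gradient of the toy objective f(x) = sum(x_i^2) (hypothetical test problem)."""
    return [2.0 * xi for xi in x]

# Run both methods side by side; with beta1 = 0 the momentum buffer equals the gradient.
x_adam = x_rms = [1.0, -2.0, 0.5]
m = [0.0] * 3
v_adam = v_rms = [0.0] * 3
for _ in range(50):
    x_adam, m, v_adam = adam_step(x_adam, grad(x_adam), m, v_adam, beta1=0.0)
    x_rms, v_rms = rmsprop_step(x_rms, grad(x_rms), v_rms)
```

Setting `beta1=0.0` makes the momentum buffer equal to the current gradient at every step, so the two iterate sequences coincide in this simplified setting.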

arXiv:2404.00630 [pdf, other]

Sobolev Calibration of Imperfect Computer Models

Authors: Qingwen Zhang, Wenjia Wang

Abstract: Calibration refers to the statistical estimation of unknown model parameters in computer experiments, such that computer experiments can match underlying physical systems. This work develops a new calibration method for imperfect computer models, Sobolev calibration, which can rule out calibration parameters that generate overfitting calibrated functions. We prove that the Sobolev calibration enjoys desired theoretical properties including fast convergence rate, asymptotic normality and semiparametric efficiency. We also demonstrate an interesting property that the Sobolev calibration can bridge the gap between two influential methods: $L_2$ calibration and Kennedy and O'Hagan's calibration. In addition to exploring the deterministic physical experiments, we theoretically justify that our method can transfer to the case when the physical process is indeed a Gaussian process, which follows the original idea of Kennedy and O'Hagan's. Numerical simulations as well as a real-world example illustrate the competitive performance of the proposed method.

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2404.00608 [pdf, other]

Sample Complexity of Chance Constrained Optimization in Dynamic Environment

Authors: Apurv Shukla, Qian Zhang, Le Xie

Abstract: We study the scenario approach for solving chance-constrained optimization in time-coupled dynamic environments. Scenario generation methods approximate the true feasible region from scenarios generated independently and identically from the actual distribution. In this paper, we consider this problem in a dynamic environment, where the scenarios are assumed to be drawn sequentially from an unknown and time-varying distribution. Such dynamic environments are driven by changing environmental conditions that could be found in many real-world applications such as energy systems. We couple the time-varying distributions using the Wasserstein metric between the sequence of scenario-generating distributions and the actual chance-constrained distribution. Our main results are bounds on the number of samples essential for ensuring the ex-post risk in chance-constrained optimization problems when the underlying feasible set is convex or non-convex. Finally, our results are illustrated on multiple numerical experiments for both types of feasible sets.

Submitted 31 March, 2024; originally announced April 2024.

Comments: To appear in American Control Conference 2024

arXiv:2403.05008 [pdf, ps, other]

Clunie lemma in several complex variables and application in PDEs

Authors: Wenjie Hao, Qingcai Zhang

Abstract: Two purposes will be shown in this paper. The first one is to extend the classic Tumura-Clunie type theorem for meromorphic functions of one complex variable to meromorphic functions of several complex variables by using Clunie lemma. The second one is to characterize entire solutions of certain partial differential equations in $\mathbb{C}^{m}$. Our results are extensions and generalizations of the previous theorems by Liao-Ye \cite{Liao-Ye} and Li \cite{Li11}.

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.19078 [pdf, other]

Smooth Tchebycheff Scalarization for Multi-Objective Optimization

Authors: Xi Lin, Xiaoyuan Zhang, Zhiyuan Yang, Fei Liu, Zhenkun Wang, Qingfu Zhang

Abstract: Multi-objective optimization problems can be found in many real-world applications, where the objectives often conflict with each other and cannot be optimized by a single solution. In the past few decades, numerous methods have been proposed to find Pareto solutions that represent different optimal trade-offs among the objectives for a given problem. However, these existing methods could have high computational complexity or may not have good theoretical properties for solving a general differentiable multi-objective optimization problem. In this work, by leveraging the smooth optimization technique, we propose a novel and lightweight smooth Tchebycheff scalarization approach for gradient-based multi-objective optimization. It has good theoretical properties for finding all Pareto solutions with valid trade-off preferences, while enjoying significantly lower computational complexity compared to other methods. Experimental results on various real-world application problems fully demonstrate the effectiveness of our proposed method.

Submitted 14 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

Comments: fix some typos
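
The abstract above does not state the scalarization formula. As a rough illustration only, the sketch below contrasts the classical (non-smooth) Tchebycheff scalarization with a log-sum-exp smoothing of the max; the log-sum-exp choice, the smoothing parameter `mu`, the ideal point `z`, and all function names are assumptions made for this example.

```python
import math

def tchebycheff(f, lam, z):
    """Classical Tchebycheff scalarization: max_i lam_i * (f_i - z_i)."""
    return max(l * (fi - zi) for fi, l, zi in zip(f, lam, z))

def smooth_tchebycheff(f, lam, z, mu=0.1):
    """Log-sum-exp smoothing of the max (assumed smoothing, hypothetical parameter mu).

    Computes mu * log(sum_i exp(lam_i * (f_i - z_i) / mu)), with the largest
    term shifted out of the exponent for numerical stability.
    """
    terms = [l * (fi - zi) / mu for fi, l, zi in zip(f, lam, z)]
    t_max = max(terms)
    return mu * (t_max + math.log(sum(math.exp(t - t_max) for t in terms)))

# Hypothetical objective values, preference weights, and ideal point.
f_vals = [1.0, 2.0]
weights = [0.5, 0.5]
ideal = [0.0, 0.0]
hard = tchebycheff(f_vals, weights, ideal)
soft = smooth_tchebycheff(f_vals, weights, ideal, mu=0.1)
```

The smoothed value upper-bounds the max and exceeds it by at most `mu * log(m)` for `m` objectives, so a smaller `mu` tracks the non-smooth scalarization more closely while remaining differentiable.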

arXiv:2401.15941 [pdf, ps, other]

High order conservative LDG-IMEX methods for the degenerate nonlinear non-equilibrium radiation diffusion problems

Authors: Shaoqin Zheng, Min Tang, Qiang Zhang, Tao Xiong

Abstract: In this paper, we develop a class of high-order conservative methods for simulating non-equilibrium radiation diffusion problems. Numerically, this system poses significant challenges due to strong nonlinearity within the stiff source terms and the degeneracy of nonlinear diffusion terms. Explicit methods require impractically small time steps, while implicit methods, which offer stability, come with the challenge to guarantee the convergence of nonlinear iterative solvers. To overcome these challenges, we propose a predictor-corrector approach and design proper implicit-explicit time discretizations. In the predictor step, the system is reformulated into a nonconservative form and linear diffusion terms are introduced as a penalization to mitigate strong nonlinearities. We then employ a Picard iteration to secure convergence in handling the nonlinear aspects. The corrector step guarantees the conservation of total energy, which is vital for accurately simulating the speeds of propagating sharp fronts in this system. For spatial approximations, we utilize local discontinuous Galerkin finite element methods, coupled with positive-preserving and TVB limiters. We validate the orders of accuracy, conservation properties, and suitability of using large time steps for our proposed methods, through numerical experiments conducted on one- and two-dimensional spatial problems. In both homogeneous and heterogeneous non-equilibrium radiation diffusion problems, we attain a time stability condition comparable to that of a fully implicit time discretization. Such an approach is also applicable to many other reaction-diffusion systems.

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.08330 [pdf, other]

Boosting Gradient Ascent for Continuous DR-submodular Maximization

Authors: Qixin Zhang, Zongqi Wan, Zengde Deng, Zaiyi Chen, Xiaoming Sun, Jialin Zhang, Yu Yang

Abstract: Projected Gradient Ascent (PGA) is the most commonly used optimization scheme in machine learning and operations research areas. Nevertheless, numerous studies and examples have shown that the PGA methods may fail to achieve the tight approximation ratio for continuous DR-submodular maximization problems. To address this challenge, we present a boosting technique in this paper, which can efficiently improve the approximation guarantee of the standard PGA to \emph{optimal} with only small modifications on the objective function. The fundamental idea of our boosting technique is to exploit non-oblivious search to derive a novel auxiliary function $F$, whose stationary points are excellent approximations to the global maximum of the original DR-submodular objective $f$. Specifically, when $f$ is monotone and $γ$-weakly DR-submodular, we propose an auxiliary function $F$ whose stationary points can provide a better $(1-e^{-γ})$-approximation than the $(γ^2/(1+γ^2))$-approximation guaranteed by the stationary points of $f$ itself. Similarly, for the non-monotone case, we devise another auxiliary function $F$ whose stationary points can achieve an optimal $\frac{1-\min_{\boldsymbol{x}\in\mathcal{C}}\|\boldsymbol{x}\|_{\infty}}{4}$-approximation guarantee where $\mathcal{C}$ is a convex constraint set. In contrast, the stationary points of the original non-monotone DR-submodular function can be arbitrarily bad~\citep{chen2023continuous}. Furthermore, we demonstrate the scalability of our boosting technique on four problems. In all of these four problems, our resulting variants of boosting PGA algorithm beat the previous standard PGA in several aspects such as approximation ratio and efficiency. Finally, we corroborate our theoretical findings with numerical experiments, which demonstrate the effectiveness of our boosting PGA methods.

Submitted 16 January, 2024; originally announced January 2024.

Comments: 74 pages, 6 figures and 9 tables. An extended version of Stochastic Continuous Submodular Maximization: Boosting via Non-oblivious Function (ICML 2022)

arXiv:2401.07029 [pdf, ps, other]

A local maximum principle for robust optimal control problems of quadratic BSDEs

Authors: Tao Hao, Jiaqiang Wen, Qi Zhang

Abstract: The paper concerns the necessary maximum principle for robust optimal control problems of quadratic BSDEs. The coefficient of the systems depends on the parameter $θ$, and the generator of BSDEs is of quadratic growth in $z$. Since the model is uncertain, the variational inequality is proved by weak convergence technique. In addition, due to the generator being quadratic with respect to $z$, the forward adjoint equations are SDEs with unbounded coefficient involving mean oscillation martingales. Using reverse Hölder inequality and John-Nirenberg inequality, we show that its solutions are continuous with respect to the parameter $θ$. The necessary and sufficient conditions for robust optimal control are proved by linearization method.

Submitted 13 January, 2024; originally announced January 2024.

Comments: 35 pages

MSC Class: 93E20; 60H10

arXiv:2401.06098 [pdf, other]

Proximal observers for secure state estimation

Authors: Laurent Bako, Madiha Nadri, Vincent Andrieu, Qinghua Zhang

Abstract: This paper discusses a general framework for designing robust state estimators for a class of discrete-time nonlinear systems. We consider systems that may be impacted by impulsive (sparse but otherwise arbitrary) measurement noise sequences. We show that a family of state estimators, robust to this type of undesired signal, can be obtained by minimizing a class of nonsmooth convex functions at each time step. The resulting state observers are defined through proximal operators. We obtain a nonlinear implicit dynamical system in terms of the estimation error and prove, in the noise-free setting, that it vanishes asymptotically when the minimized loss function and the to-be-observed system enjoy appropriate properties. From a computational perspective, even though the proposed observers can be implemented via efficient numerical procedures, they do not admit closed-form expressions. The paper argues that by adopting appropriate relaxations, simple and fast analytic expressions can be derived.

Submitted 11 January, 2024; originally announced January 2024.

Comments: 15 pages, 5 figures

arXiv:2401.03237 [pdf, other]

A New Parallel Cooperative Landscape Smoothing Algorithm and Its Applications on TSP and UBQP

Authors: Wei Wang, Jialong Shi, Jianyong Sun, Arnaud Liefooghe, Qingfu Zhang

Abstract: Combinatorial optimization problems (COPs) are difficult to solve because of the massive number of local optimal solutions in their solution spaces. Various methods have been put forward to smooth the solution space of COPs, including homotopic convex (HC) transformation for the traveling salesman problem (TSP). This paper first extends the HC transformation approach to the unconstrained binary quadratic programming (UBQP). We theoretically prove the effectiveness of the proposed HC transformation method on smoothing the landscape of the UBQP. Subsequently, we introduce an iterative algorithmic framework incorporating HC transformation, referred to as landscape smoothing iterated local search (LSILS). Our experimental analyses, conducted on various UBQP instances, show the effectiveness of LSILS. Furthermore, this paper proposes a parallel cooperative variant of LSILS, denoted as PC-LSILS, and applies it to both the UBQP and the TSP. Our experimental findings highlight that PC-LSILS improves the smoothing performance of the HC transformation, and further improves the overall performance of the algorithm.

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2401.02210 [pdf, ps, other]

Roth-type Theorem for high-power system in Piatetski-Shapiro primes (II)

Authors: Xiumin Ren, Yu-chen Sun, Qingqing Zhang, Rui Zhang

Abstract: We consider the nonlinear system $c_1p_1^d +c_2p_2^d + \dots + c_s p_s^d = 0$ with $c_1, c_2,\dots, c_s\in\mathbb Z$ being nonzero and satisfying $c_1 +c_2 + \dots + c_s = 0$. We show that for $s\ge 2\lfloor \frac{d^2}2\rfloor+1$ and $c\in\left(1, 1+c(d,s)\right)$, if the system has only $K$-trivial solutions in subset $\mathcal{A}$ of Piatetski-Shapiro primes up to $x$ and corresponding to $c$, then $|\mathcal{A}| \ll \frac{x^{\frac1c}}{\log x}\left(\log \log \log \log x\right)^{\frac{2-s}{dc}+\varepsilon}$.

Submitted 4 January, 2024; originally announced January 2024.

Comments: 14 pages

MSC Class: 11B30; 11P32; 11L20

arXiv:2312.15419 [pdf, ps, other]

Gradient estimates on graphs with the $CDψ(n,-K)$ condition

Authors: Yi Li, Qianwei Zhang

Abstract: This paper investigates gradient estimates on graphs satisfying the $CDψ(n,-K)$ condition with positive constants $n,K$, and concave $C^{1}$ functions $ψ:(0,+\infty)\rightarrow\mathbb{R}$. Our study focuses on gradient estimates for positive solutions of the heat equation $\partial_{t}u=Δu$. Additionally, the estimate is extended to a heat-type equation $\partial_{t}u=Δu+cu^σ$, where $σ$ is a constant and $c$ is a continuous function defined on $[0,+\infty)$. Furthermore, we utilize these estimates to derive heat kernel bounds and Harnack inequalities.

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.15158 [pdf, other]

Map-Reduce for Multiprocessing Large Data and Multi-threading for Data Scraping

Authors: Zefeng Qiu, Prashanth Umapathy, Qingquan Zhang, Guanqun Song, Ting Zhu

Abstract: This document is the final project report for our advanced operating system class. During this project, we mainly focused on applying multiprocessing and multi-threading technology to our whole project and utilized the map-reduce algorithm in our data cleaning and data analysis process. In general, our project can be divided into two components: data scraping and data processing. The former consisted mostly of web wrangling, where we employed multiprocessing or multi-threading to speed up the whole process. After collecting and scraping a large amount of data, we use it as input for data cleaning and data analysis, where we take advantage of the map-reduce algorithm to increase efficiency.

Submitted 22 December, 2023; originally announced December 2023.
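
The map-reduce pattern mentioned in the abstract above can be illustrated with a small word-count sketch. This is not the authors' pipeline: the task, the function names, and the use of a thread pool (chosen here so the example runs as a plain script; a process pool from the same module would be the multiprocessing counterpart) are all assumptions for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce

def map_count(chunk):
    """Map phase: count the words in one chunk of text independently."""
    return Counter(chunk.split())

def reduce_counts(left, right):
    """Reduce phase: merge two partial word counts into one."""
    return left + right

def word_count(chunks, workers=4):
    """Run the map phase concurrently, then fold the partial results together."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(map_count, chunks))
    return reduce(reduce_counts, partials, Counter())

# Hypothetical input: each string stands in for one scraped document.
totals = word_count(["a b a", "b c", "c c a"])
```

Because each chunk is counted independently, the map phase parallelizes without coordination, and the associative merge in the reduce phase means the partial results can be combined in any grouping.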

arXiv:2312.13425 [pdf, other]

Quadratic and cubic Lagrange finite elements for mixed Laplace eigenvalue problems on criss-cross meshes

Authors: Kaibo Hu, Jiguang Sun, Qian Zhang

Abstract: In [6], it was shown that the linear Lagrange element space on criss-cross meshes and its divergence exhibit spurious eigenvalues when applied in the mixed formulation of the Laplace eigenvalue problem, despite satisfying both the inf-sup condition and ellipticity on the discrete kernel. The lack of a Fortin interpolation is responsible for the spurious eigenvalues produced by the linear Lagrange space. In contrast, results in [8] confirm that quartic and higher-order Lagrange elements do not yield spurious eigenvalues on general meshes without nearly singular vertices, including criss-cross meshes as a special case. In this paper, we investigate quadratic and cubic Lagrange elements on criss-cross meshes. We prove the convergence of discrete eigenvalues by fitting the Lagrange elements on criss-cross meshes into a complex and constructing a Fortin interpolation. As a by-product, we construct bounded commuting projections for the finite element Stokes complex, which induces isomorphisms between cohomologies of the continuous and discrete complexes. We provide numerical examples to validate the theoretical results.

Submitted 20 December, 2023; originally announced December 2023.

Comments: 19 pages, 8 figures

arXiv:2312.03287 [pdf, ps, other]

Global solutions to quasilinear wave-Klein-Gordon systems in two space dimensions

Authors: Qian Zhang

Abstract: In this paper we prove global existence and global behavior of solutions to quasilinear wave-Klein-Gordon systems in $\mathbb{R}^{1+2}$ with quadratic nonlinearities satisfying the null condition. We consider small, regular and compactly supported initial data, and prove global existence, pointwise decay estimates and linear scattering for the solutions.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 28 pages

arXiv:2311.15526 [pdf, other]

Unfitted finite element method for the quad-curl interface problem

Authors: Hailong Guo, Mingyan Zhang, Qian Zhang, Zhimin Zhang

Abstract: In this paper, we introduce a novel unfitted finite element method to solve the quad-curl interface problem. We adapt Nitsche's method for curlcurl-conforming elements and double the degrees of freedom on interface elements. To ensure stability, we incorporate ghost penalty terms and a discrete divergence-free term. We establish the well-posedness of our method and demonstrate an optimal error bound in the discrete energy norm. We also analyze the stiffness matrix's condition number. Our numerical tests back up our theory on convergence rates and condition numbers.

Submitted 26 November, 2023; originally announced November 2023.

arXiv:2311.15482 [pdf, other]

Distributional Hessian and divdiv complexes on triangulation and cohomology

Authors: Kaibo Hu, Ting Lin, Qian Zhang

Abstract: In this paper, we construct discrete versions of some Bernstein-Gelfand-Gelfand (BGG) complexes, i.e., the Hessian and the divdiv complexes, on triangulations in 2D and 3D. The sequences consist of finite elements with local polynomial shape functions and various types of Dirac measure on subsimplices. The construction generalizes Whitney forms (canonical conforming finite elements) for the de Rham complex and Regge calculus/finite elements for the elasticity (Riemannian deformation) complex from discrete topological and Discrete Exterior Calculus perspectives. We show that the cohomology of the resulting complexes is isomorphic to the continuous versions, and thus isomorphic to the de Rham cohomology with coefficients.

Submitted 26 November, 2023; originally announced November 2023.

Comments: keywords: Bernstein-Gelfand-Gelfand sequences, cohomology, finite element exterior calculus, discrete exterior calculus, Regge calculus

arXiv:2311.13418 [pdf, ps, other]

Liouville theorem for quasilinear elliptic equations in $\mathbb R^N$

Authors: Wangzhe Wu, Qiqi Zhang

Abstract: We prove Liouville theorem for the equation $Δ_m v + v^p + M |\nabla v|^{q}= 0$ in a domain $Ω\subset\mathbb R^n$, with $M\in \mathbb{R}$ in the critical and subcritical case. As a natural extension of our recent work \cite{MWZ}, the proof is based on an integral identity and Young's inequality.

Submitted 22 November, 2023; originally announced November 2023.

arXiv:2311.12306 [pdf, ps, other]

A blow up solution of the Navier-Stokes equations with a super critical forcing term

Authors: Qi S. Zhang

Abstract: A forced solution $v$ of the axially symmetric Navier-Stokes equation in a finite cylinder $D$ with suitable boundary condition is constructed. The forcing term is in the super critical space $L^q_t L^1_x$ for all $q>1$. The velocity is in the energy space at the final moment when it blows up.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 8 pages

MSC Class: 35Q30; 76D03

arXiv:2311.04641 [pdf, ps, other]

Liouville theorem for elliptic equations involving the sum of the function and its gradient in $\mathbb R^n$

Authors: Xi-nan Ma, Wangzhe Wu, Qiqi Zhang

Abstract: We prove Liouville theorem for the equation $Δv + N v^p + M |\nabla v|^{q}= 0$ in $\mathbb R^n$, with $M, N > 0, q = \frac{2p}{p + 1}$ in the critical and subcritical case. The proof is based on an integral identity and Young inequality.

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2311.02250 [pdf, other]

Efficient Scenario Generation for Chance-constrained Economic Dispatch Considering Ambient Wind Conditions

Authors: Qian Zhang, Apurv Shukla, Le Xie

Abstract: Scenario generation is an effective data-driven method for solving chance-constrained optimization while ensuring desired risk guarantees with a finite number of samples. Crucial challenges in deploying this technique in the real world arise due to the absence of appropriate risk-tuning models tailored for the desired application. In this paper, we focus on designing efficient scenario generation schemes for economic dispatch in power systems. We propose a novel scenario generation method based on filtering scenarios using ambient wind conditions. These filtered scenarios are deployed incrementally in order to meet desired risk levels while using minimum resources. In order to study the performance of the proposed scheme, we illustrate the procedure on case studies performed for both 24-bus and 118-bus systems with real-world wind power forecasting data. Numerical results suggest that the proposed filter-and-increment scenario generation model leads to a precise and efficient solution for the chance-constrained economic dispatch problem.

Submitted 2 January, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

Comments: 12 pages

arXiv:2311.01126 [pdf, other]

Two improved algorithms for sparse generalized canonical correlation analysis

Authors: Kuo-Yue Li, Qi-Ye Zhang, Yong-Han Sun

Abstract: Regularized generalized canonical correlation analysis (RGCCA) is a generalization of regularized canonical correlation analysis to three or more sets of variables, which is a component-based approach aiming to study the relationships between several sets of variables. Sparse generalized canonical correlation analysis (SGCCA) (proposed in Tenenhaus et al. (2014)), combines RGCCA with an $\ell_1$-penalty, in which blocks are not necessarily fully connected, makes SGCCA a flexible method for analyzing a wide variety of practical problems, such as biology, chemistry, sensory analysis, marketing, food research, etc. In Tenenhaus et al. (2014), an iterative algorithm for SGCCA was designed based on the solution to the subproblem (LM-P1 for short) of maximizing a linear function on the intersection of an $\ell_1$-norm ball and a unit $\ell_2$-norm sphere proposed in Witten et al. (2009). However, the solution to the subproblem (LM-P1) proposed in Witten et al. (2009) is not correct, which may become the reason that the iterative algorithm for SGCCA is slow and not always convergent. For this, we first characterize the solution to the subproblem LM-P1, and the subproblems LM-P2 and LM-P3, which maximize a linear function on the intersection of an $\ell_1$-norm sphere and a unit $\ell_2$-norm sphere, and an $\ell_1$-norm ball and a unit $\ell_2$-norm sphere, respectively. Then we provide more efficient block coordinate descent (BCD) algorithms for SGCCA and its two variants, called SGCCA-BCD1, SGCCA-BCD2 and SGCCA-BCD3, corresponding to the subproblems LM-P1, LM-P2 and LM-P3, respectively, prove that they all globally converge to their stationary points. We further propose gradient projected (GP) methods for SGCCA and its two variants when using the Horst scheme, called SGCCA-GP1, SGCCA-GP2 and SGCCA-GP3, corresponding to the subproblems LM-P1, LM-P2 and LM-P3, respectively, and prove that they all

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.15803 [pdf, other]

Optimal Strategies for Round-Trip Pairs Trading Under Geometric Brownian Motions

Authors: Emily Crawford Das, Jingzhi Tie, Qing Zhang

Abstract: This paper is concerned with an optimal strategy for simultaneously trading a pair of stocks. The idea of pairs trading is to monitor their price movements and compare their relative strength over time. A pairs trade is triggered by the divergence of their prices and consists of a pair of positions to short the strong stock and to long the weak one. Such a strategy bets on the reversal of their price strengths. A round-trip trading strategy refers to opening and closing such a pair of security positions. Typical pairs-trading models usually assume a difference of the stock prices satisfies a mean-reversion equation. However, we consider the optimal pairs-trading problem by allowing the stock prices to follow general geometric Brownian motions. The objective is to trade the pairs over time to maximize an overall return with a fixed commission cost for each transaction. Initially, we allow the initial pairs position to be either long or flat. We then consider the problem when the initial pairs position may be long, flat, or short. In each case, the optimal policy is characterized by threshold curves obtained by solving the associated HJB equations.

Submitted 24 October, 2023; originally announced October 2023.

Comments: 47 pages, 5 figures

arXiv:2310.13186 [pdf]

A quantitative pairwise comparison-based constraint handling technique for constrained optimization

Authors: Ting Huang, Qiang Zhang, Witold Pedrycz, Shanlin Yang

Abstract: This study proposes a new constraint handling technique for assisting metaheuristic optimization algorithms to solve constrained optimization problems more effectively and efficiently. Given any two solutions of any constrained optimization problem, they are first mapped into a two-dimensional Cartesian coordinate system with their objective function value differences and constraint violation differences as the two axes. To the best of our knowledge, we are the first to deal with constraints by building such a Cartesian coordinate system. Then, the Cartesian coordinate system is divided into a series of grids by assigning ranks to different intervals of differences. In this way, a pairwise comparison criterion is derived with the use of the fused ranks, which achieves non-hierarchical comparison, preferring neither objective function values nor constraint violations, resulting in more accurate evaluation compared with existing techniques. Moreover, an evaluation function that is equivalent to the pairwise comparison criterion is proposed, which further improves computational efficiency. The effectiveness and efficiency of the proposed constraint handling technique are verified on two well-known public datasets, that is, CEC 2006 and CEC 2017. The results demonstrate that metaheuristic optimization algorithms using the proposed constraint handling technique can converge to a feasible optimal solution faster and more reliably. Experimental analysis of the parameters involved reveals guidance for their optimal settings.

Submitted 19 October, 2023; originally announced October 2023.

Comments: 58 pages in total (33 pages of main body and 25 pages of supplementary material), 38 figures in total (14 figures in the main body and 24 figures in the supplementary material)

MSC Class: 90-08 (Primary)

arXiv:2310.07985 [pdf, other]

Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization

Authors: Fu Luo, Xi Lin, Fei Liu, Qingfu Zhang, Zhenkun Wang

Abstract: Neural combinatorial optimization (NCO) is a promising learning-based approach for solving challenging combinatorial optimization problems without specialized algorithm design by experts. However, most constructive NCO methods cannot solve problems with large-scale instance sizes, which significantly diminishes their usefulness for real-world applications. In this work, we propose a novel Light Encoder and Heavy Decoder (LEHD) model with a strong generalization ability to address this critical issue. The LEHD model can learn to dynamically capture the relationships between all available nodes of varying sizes, which is beneficial for model generalization to problems of various scales. Moreover, we develop a data-efficient training scheme and a flexible solution construction mechanism for the proposed LEHD model. By training on small-scale problem instances, the LEHD model can generate nearly optimal solutions for the Travelling Salesman Problem (TSP) and the Capacitated Vehicle Routing Problem (CVRP) with up to 1000 nodes, and also generalizes well to solve real-world TSPLib and CVRPLib problems. These results confirm our proposed LEHD model can significantly improve the state-of-the-art performance for constructive NCO. The code is available at https://github.com/CIAM-Group/NCO_code/tree/main/single_objective/LEHD.

Submitted 12 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2310.00988 [pdf, other]

Stability and Optimal Decay Rates for Abstract Systems with Thermal Damping of Cattaneo's Type

Authors: Chenxi Deng, Zhong-Jie Han, Zhaobin Kuang, Qiong Zhang

Abstract: This paper studies the stability of an abstract thermoelastic system with Cattaneo's law, which describes finite heat propagation speed in a medium. We introduce a parameters region containing coupling, thermal dissipation, and possible inertial characteristics. The parameters region is partitioned into distinct subregions based on the spectral properties of the infinitesimal generator of the corresponding semigroup. By a careful estimation of the resolvent, we obtain distinct polynomial decay rates for systems with parameters located in different subregions. Furthermore, the optimality of these decay rates is proved. Finally, we apply our results to several coupled systems of partial differential equations.

Submitted 27 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: typos corrected

MSC Class: 35Q74(Primary) 74F05(Secondary)

arXiv:2309.13540 [pdf, ps, other]

Classification of aut-fixed subgroups in free-abelian times surface groups

Authors: Jialin Lei, Peng Wang, Qiang Zhang

Abstract: In this paper, we are concerned with free-abelian times surface groups, and show that they contain, up to isomorphism, infinitely many fixed subgroups of automorphisms. Moreover, we give a complete classification of their aut-fixed subgroups.

Submitted 25 May, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

Comments: 18 pages

MSC Class: 20F65; 20F34; 57M07

arXiv:2309.12372 [pdf, ps, other]

The Furstenberg property in Puiseux monoids

Authors: Andrew Lin, Henrick Rabinovitz, Qiao Zhang

Abstract: Let $M$ be a commutative monoid. The monoid $M$ is called atomic if every non-invertible element of $M$ factors into atoms (i.e., irreducible elements), while $M$ is called a Furstenberg monoid if every non-invertible element of $M$ is divisible by an atom. Additive submonoids of $\mathbb{Q}$ consisting of nonnegative rationals are called Puiseux monoids, and their atomic structure has been actively studied during the past few years. The primary purpose of this paper is to investigate the property of being Furstenberg in the context of Puiseux monoids. In this direction, we consider some properties weaker than being Furstenberg, and then we connect these properties with some atomic results which have been already established for Puiseux monoids.

Submitted 20 September, 2023; originally announced September 2023.

Comments: 17 pages

MSC Class: Primary: 20M13; 11Y05; Secondary: 20M14; 06F05

arXiv:2309.12271 [pdf, ps, other]

A generalization of the Witten conjecture through spectral curve

Authors: Shuai Guo, Ce Ji, Qingsheng Zhang

Abstract: We propose a generalization of the Witten conjecture, which connects a descendent enumerative theory with a specific reduction of KP integrable hierarchy. Our conjecture is realized by two parts: Part I (Geometry) establishes a correspondence between the descendent potential function (apart from ancestors) and the topological recursion of specific spectral curve data $(Σ, x,y,B)$; Part II (Integrability) claims that the TR descendent potential, defined at the boundary points of the spectral curve (where $dx$ has poles), is a tau function of a certain reduction of the multi-component KP hierarchy. In this paper, we show the geometry part for any formal descendent theory by using a generalized Laplace transform, and show the integrability part for the one-boundary cases. As applications, we generalize and prove the $r$KdV integrability of negative $r$-spin theory conjectured by Chidambaram, Garcia-Failde and Giacchetto [6], and prove the KdV integrability for the theory associated with the Weierstrass curve introduced by Dubrovin.

Submitted 21 September, 2023; originally announced September 2023.

Comments: 45 pages, 1 figure

MSC Class: 14N35; 37K10; 14J33

arXiv:2309.10445 [pdf, other]

Product of Rankin-Selberg convolutions and a new proof of Jacquet's local converse conjecture

Authors: Pan Yan, Qing Zhang

Abstract: In this article, we construct a family of integrals which represent the product of Rankin-Selberg $L$-functions of $\mathrm{GL}_{l}\times \mathrm{GL}_m$ and of $\mathrm{GL}_{l}\times \mathrm{GL}_n $ when $m+n<l$. When $n=0$, these integrals are those defined by Jacquet--Piatetski-Shapiro--Shalika up to a shift. In this sense, these new integrals generalize Jacquet--Piatetski-Shapiro--Shalika's Rankin-Selberg convolution integrals. We study basic properties of these integrals. In particular, we define local gamma factors using this new family of integrals. As an application, we obtain a new proof of Jacquet's local converse conjecture using these new integrals.

Submitted 19 September, 2023; originally announced September 2023.

MSC Class: 11F70; 22E50

arXiv:2309.01341 [pdf, ps, other]

Decentralized Control for Discrete-time Mean-Field Systems with Multiple Controllers of Delayed Information

Authors: Qingyuan Qi, Zhiqiang Liu, Qianqian Zhang, Xinbei Lv

Abstract: In this paper, the finite horizon asymmetric information linear quadratic (LQ) control problem is investigated for a discrete-time mean field system. Different from previous works, multiple controllers with different information sets are involved in the mean field system dynamics. The coupling of different controllers makes it quite difficult to find the optimal control strategy. Fortunately, by applying Pontryagin's maximum principle, the corresponding decentralized control problem of the finite horizon is investigated. The contributions of this paper can be summarized as follows: For the first time, based on the solution of a group of mean-field forward and backward stochastic difference equations (MF-FBSDEs), the necessary and sufficient solvability conditions are derived for the asymmetric information LQ control for the mean field system with multiple controllers. Furthermore, by the use of an innovative orthogonal decomposition approach, the optimal decentralized control strategy is derived, which is based on the solution to a non-symmetric Riccati-type equation.

Submitted 3 September, 2023; originally announced September 2023.

Showing 1–50 of 414 results for author: Zhang, Q