-
A rigidity result for ancient Ricci flows
Authors:
Qi S. Zhang
Abstract:
Using a size condition of the sharp log Sobolev functional (log entropy) near infinity only, we prove a rigidity result for ancient Ricci flows without sign condition on the curvatures. The result is also related to the problem of identifying type II ancient Ricci flows and their backward limits.
Using a size condition of the sharp log Sobolev functional (log entropy) near infinity only, we prove a rigidity result for ancient Ricci flows without sign condition on the curvatures. The result is also related to the problem of identifying type II ancient Ricci flows and their backward limits.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
A parallel framework for graphical optimal transport
Authors:
Jiaojiao Fan,
Isabel Haasler,
Qinsheng Zhang,
Johan Karlsson,
Yongxin Chen
Abstract:
We study multi-marginal optimal transport (MOT) problems where the underlying cost has a graphical structure. These graphical multi-marginal optimal transport problems have found applications in several domains including traffic flow control and regression problems in the Wasserstein space. MOT problem can be approached through two aspects: a single big MOT problem, or coupled minor OT problems. I…
▽ More
We study multi-marginal optimal transport (MOT) problems where the underlying cost has a graphical structure. These graphical multi-marginal optimal transport problems have found applications in several domains including traffic flow control and regression problems in the Wasserstein space. MOT problem can be approached through two aspects: a single big MOT problem, or coupled minor OT problems. In this paper, we focus on the latter approach and demonstrate it has efficiency gain from the parallelization. For tree-structured MOT problems, we introduce a novel parallelizable algorithm that significantly reduces computational complexity. Additionally, we adapt this algorithm for general graphs, employing the modified junction trees to enable parallel updates. Our contributions, validated through numerical experiments, offer new avenues for MOT applications and establish benchmarks in computational efficiency.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Decision-Focused Surrogate Modeling for Mixed-Integer Linear Optimization
Authors:
Shivi Dixit,
Rishabh Gupta,
Qi Zhang
Abstract:
Mixed-integer optimization is at the core of many online decision-making systems that demand frequent updates of decisions in real time. However, due to their combinatorial nature, mixed-integer linear programs (MILPs) can be difficult to solve, rendering them often unsuitable for time-critical online applications. To address this challenge, we develop a data-driven approach for constructing surro…
▽ More
Mixed-integer optimization is at the core of many online decision-making systems that demand frequent updates of decisions in real time. However, due to their combinatorial nature, mixed-integer linear programs (MILPs) can be difficult to solve, rendering them often unsuitable for time-critical online applications. To address this challenge, we develop a data-driven approach for constructing surrogate optimization models in the form of linear programs (LPs) that can be solved much more efficiently than the corresponding MILPs. We train these surrogate LPs in a decision-focused manner such that for different model inputs, they achieve the same or close to the same optimal solutions as the original MILPs. One key advantage of the proposed method is that it allows the incorporation of all the original MILP's linear constraints, which significantly increases the likelihood of obtaining feasible predicted solutions. Results from two computational case studies indicate that this decision-focused surrogate modeling approach is highly data-efficient and provides very accurate predictions of the optimal solutions. In these examples, it outperforms more commonly used neural-network-based optimization proxies.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
A Novel Coupled bES-FEM Formulation with SUPG stabilization for Thermo-Hydro-Mechanical Analysis in Saturated Porous Media
Authors:
Zi-Qi Tang,
Xi-Wen Zhou,
Yin-Fu Jin,
Zhen-Yu Yin,
Qi Zhang
Abstract:
Two primary types of numerical instabilities often occur in low-order finite element method (FEM) analyses of thermo-hydro-mechanical (THM) phenomena: (1) pressure oscillations arising improper interpolation of pressure and displacement fields; and (2) spatial oscillations induced by nonlinear convection terms in convection-dominated scenarios. In response to these issues, this paper proposes a no…
▽ More
Two primary types of numerical instabilities often occur in low-order finite element method (FEM) analyses of thermo-hydro-mechanical (THM) phenomena: (1) pressure oscillations arising improper interpolation of pressure and displacement fields; and (2) spatial oscillations induced by nonlinear convection terms in convection-dominated scenarios. In response to these issues, this paper proposes a novel stabilized edge-based smoothed FEM with a bubble function (bES-FEM) for THM analysis within saturated porous media. In the proposed framework, a cubic bubble function is first incorporated into ES-FEM to efficiently mitigate pressure oscillations that breach the Inf-Sup condition, and then the Streamline Upwind Petrov-Galerkin (SUPG) scheme is adopted in bES-FEM to effectively reduce the spurious oscillations in convection-dominated heat transfer scenarios. The accuracy of the bES-FEM with SUPG formulation for THM coupled problems is validated through a series of five benchmark tests. Moreover, the simulations of open-loop ground source energy systems demonstrate the proposed method's exceptional capability in tackling complex THM challenges in real-world applications. All the obtained results showcase the superiority of proposed bES-FEM with SUPG in eliminating the spatial and pressure oscillations, marking it as a promising tool for the exploration of coupled THM issues.
△ Less
Submitted 20 May, 2024;
originally announced June 2024.
-
Column generation for multistage stochastic mixed-integer nonlinear programs with discrete state variables
Authors:
Tushar Rathi,
Benjamin P. Riley,
Angela Flores-Quiroz,
Qi Zhang
Abstract:
Stochastic programming provides a natural framework for modeling sequential optimization problems under uncertainty; however, the efficient solution of large-scale multistage stochastic programs remains a challenge, especially in the presence of discrete decisions and nonlinearities. In this work, we consider multistage stochastic mixed-integer nonlinear programs (MINLPs) with discrete state varia…
▽ More
Stochastic programming provides a natural framework for modeling sequential optimization problems under uncertainty; however, the efficient solution of large-scale multistage stochastic programs remains a challenge, especially in the presence of discrete decisions and nonlinearities. In this work, we consider multistage stochastic mixed-integer nonlinear programs (MINLPs) with discrete state variables, which exhibit a decomposable structure that allows its solution using a column generation approach. Following a Dantzig-Wolfe reformulation, we apply column generation such that each pricing subproblem is an MINLP of much smaller size, making it more amenable to global MINLP solvers. We further propose a method for generating additional columns that satisfy the nonanticipativity constraints, leading to significantly improved convergence and optimal or near-optimal solutions for many large-scale instances in a reasonable computation time. The effectiveness of the tailored column generation algorithm is demonstrated via computational case studies on a multistage blending problem and a problem involving the routing of mobile generators in a power distribution network.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization
Authors:
Xi Lin,
Yilu Liu,
Xiaoyuan Zhang,
Fei Liu,
Zhenkun Wang,
Qingfu Zhang
Abstract:
Multi-objective optimization can be found in many real-world applications where some conflicting objectives can not be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially…
▽ More
Multi-objective optimization can be found in many real-world applications where some conflicting objectives can not be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially large with respect to the number of objectives, which makes these methods unsuitable for handling many optimization objectives. In this work, instead of finding a dense set of Pareto solutions, we propose a novel Tchebycheff set scalarization method to find a few representative solutions (e.g., 5) to cover a large number of objectives (e.g., $>100$) in a collaborative and complementary manner. In this way, each objective can be well addressed by at least one solution in the small solution set. In addition, we further develop a smooth Tchebycheff set scalarization approach for efficient optimization with good theoretical guarantees. Experimental studies on different problems with many optimization objectives demonstrate the effectiveness of our proposed method.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
On the Convergence of Multi-objective Optimization under Generalized Smoothness
Authors:
Qi Zhang,
Peiyao Xiao,
Kaiyi Ji,
Shaofeng Zou
Abstract:
Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we stu…
▽ More
Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we study a more general and realistic class of $\ell$-smooth loss functions, where $\ell$ is a general non-decreasing function of gradient norm. We develop two novel single-loop algorithms for $\ell$-smooth MOO problems, Generalized Smooth Multi-objective Gradient descent (GSMGrad) and its stochastic variant, Stochastic Generalized Smooth Multi-objective Gradient descent (SGSMGrad), which approximate the conflict-avoidant (CA) direction that maximizes the minimum improvement among objectives. We provide a comprehensive convergence analysis of both algorithms and show that they converge to an $ε$-accurate Pareto stationary point with a guaranteed $ε$-level average CA distance (i.e., the gap between the updating direction and the CA direction) over all iterations, where totally $\mathcal{O}(ε^{-2})$ and $\mathcal{O}(ε^{-4})$ samples are needed for deterministic and stochastic settings, respectively. Our algorithms can also guarantee a tighter $ε$-level CA distance in each iteration using more samples. Moreover, we propose a practical variant of GSMGrad named GSMGrad-FA using only constant-level time and space, while achieving the same performance guarantee as GSMGrad. Our experiments validate our theory and demonstrate the effectiveness of the proposed methods.
△ Less
Submitted 12 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification
Authors:
Yen-An Lu,
Wei-Shou Hu,
Joel A. Paulson,
Qi Zhang
Abstract:
This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem. The IO problem is commonly formulated as a large-scale bilevel program that is notoriously difficult to solve. Deviating from traditional exact solution m…
▽ More
This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem. The IO problem is commonly formulated as a large-scale bilevel program that is notoriously difficult to solve. Deviating from traditional exact solution methods, we propose a derivative-free optimization approach based on Bayesian optimization, which we call BO4IO, to solve general IO problems. We treat the IO loss function as a black box and approximate it with a Gaussian process model. Using the predicted posterior function, an acquisition function is minimized at each iteration to query new candidate solutions and sequentially converge to the optimal parameter estimates. The main advantages of using Bayesian optimization for IO are two-fold: (i) it circumvents the need of complex reformulations of the bilevel program or specialized algorithms and can hence enable computational tractability even when the underlying optimization problem is nonconvex or involves discrete variables, and (ii) it allows approximations of the profile likelihood, which provide uncertainty quantification on the IO parameter estimates. We apply the proposed method to three computational case studies, covering different classes of forward optimization problems ranging from convex nonlinear to nonconvex mixed-integer nonlinear programs. Our extensive computational results demonstrate the efficacy and robustness of BO4IO to accurately estimate unknown model parameters from small and noisy datasets. In addition, the proposed profile likelihood analysis has proven to be effective in providing good approximations of the confidence intervals on the parameter estimates and assessing the identifiability of the unknown parameters.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
A note on the finitely generated fixed subgroup property
Authors:
Jialin Lei,
Jiming Ma,
Qiang Zhang
Abstract:
We study when a group of form $G\times\mathbb{Z}^m (m\geq 1)$ has the finitely generated fixed subgroup property of automorphisms ($\rm{FGFP}_a$), by using the BNS-invariant, and provide some partial answers and non-trivial examples.
We study when a group of form $G\times\mathbb{Z}^m (m\geq 1)$ has the finitely generated fixed subgroup property of automorphisms ($\rm{FGFP}_a$), by using the BNS-invariant, and provide some partial answers and non-trivial examples.
△ Less
Submitted 30 May, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Random Scaling and Momentum for Non-smooth Non-convex Optimization
Authors:
Qinzi Zhang,
Ashok Cutkosky
Abstract:
Training neural networks requires optimizing a loss function that may be highly irregular, and in particular neither convex nor smooth. Popular training algorithms are based on stochastic gradient descent with momentum (SGDM), for which classical analysis applies only if the loss is either convex or smooth. We show that a very small modification to SGDM closes this gap: simply scale the update at…
▽ More
Training neural networks requires optimizing a loss function that may be highly irregular, and in particular neither convex nor smooth. Popular training algorithms are based on stochastic gradient descent with momentum (SGDM), for which classical analysis applies only if the loss is either convex or smooth. We show that a very small modification to SGDM closes this gap: simply scale the update at each time point by an exponentially distributed random scalar. The resulting algorithm achieves optimal convergence guarantees. Intriguingly, this result is not derived by a specific analysis of SGDM: instead, it falls naturally out of a more general framework for converting online convex optimization algorithms to non-convex optimization algorithms.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
A note on distance variance for categorical variables
Authors:
Qingyang Zhang
Abstract:
This study investigates the extension of distance variance, a validated spread metric for continuous and binary variables [Edelmann et al., 2020, Ann. Stat., 48(6)], to quantify the spread of general categorical variables. We provide both geometric and algebraic characterizations of distance variance, revealing its connections to some commonly used entropy measures, and the variance-covariance mat…
▽ More
This study investigates the extension of distance variance, a validated spread metric for continuous and binary variables [Edelmann et al., 2020, Ann. Stat., 48(6)], to quantify the spread of general categorical variables. We provide both geometric and algebraic characterizations of distance variance, revealing its connections to some commonly used entropy measures, and the variance-covariance matrix of the one-hot encoded representation. However, we demonstrate that distance variance fails to satisfy the Schur-concavity axiom for categorical variables with more than two categories, leading to counterintuitive results. This limitation hinders its applicability as a universal measure of spread.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Multistability of Bi-Reaction Networks
Authors:
Yixuan Liang,
Xiaoxian Tang,
Qian Zhang
Abstract:
We provide a sufficient and necessary condition in terms of the stoichiometric coefficients for a bi-reaction network to admit multistability. Also, this result completely characterizes the bi-reaction networks according to if they admit multistability.
We provide a sufficient and necessary condition in terms of the stoichiometric coefficients for a bi-reaction network to admit multistability. Also, this result completely characterizes the bi-reaction networks according to if they admit multistability.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Solving the train-platforming problem via a two-level Lagrangian Relaxation approach
Authors:
Qin Zhang,
Richard Martin Lusby,
Pan Shang,
Chang Liu,
Wenqian Liu
Abstract:
High-speed railway stations are crucial junctions in high-speed railway networks. Compared to operations on the tracks between stations, trains have more routing possibilities within stations. As a result, track allocation at a station is relatively complicated. In this study, we aim to solve the train platforming problem for a busy high-speed railway station by considering comprehensive track res…
▽ More
High-speed railway stations are crucial junctions in high-speed railway networks. Compared to operations on the tracks between stations, trains have more routing possibilities within stations. As a result, track allocation at a station is relatively complicated. In this study, we aim to solve the train platforming problem for a busy high-speed railway station by considering comprehensive track resources and interlocking configurations. A two-level space-time network is constructed to capture infrastructure information at various levels of detail from both macroscopic and microscopic perspectives. Additionally, we propose a nonlinear programming model that minimizes a weighted sum of total travel time and total deviation time for trains at the station. We apply a Two-level Lagrangian Relaxation (2-L LR) to a linearized version of the model and demonstrate how this induces a decomposable train-specific path choice problem at the macroscopic level that is guided by Lagrange multipliers associated with microscopic resource capacity violation. As case studies, the proposed model and solution approach are applied to a small virtual railway station and a high-speed railway hub station located on the busiest high-speed railway line in China. Through a comparison of other approaches that include Logic-based Benders Decomposition (LBBD), we highlight the superiority of the proposed method; on realistic instances, the 2-L LR method finds solution that are, on average, approximately 2% from optimality. Finally, we test algorithm performance at the operational level and obtain near-optimal solutions, with optimality gaps of approximately 1%, in a very short time.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Stability of the Abstract Thermoelastic System with Singularity
Authors:
Chenxi Deng,
Zhong-Jie Han,
Zhaobin Kuang,
Qiong Zhang
Abstract:
In this paper, we analyze an abstract thermoelastic system, where the heat conduction follows the Cattaneo law. Zero becomes a spectrum point of the system operator when the coupling and thermal damping parameters of the system satisfy specific conditions. We obtain the decay rates of solutions to the system with or without the inertial term. Furthermore, the decay rate of the system without inert…
▽ More
In this paper, we analyze an abstract thermoelastic system, where the heat conduction follows the Cattaneo law. Zero becomes a spectrum point of the system operator when the coupling and thermal damping parameters of the system satisfy specific conditions. We obtain the decay rates of solutions to the system with or without the inertial term. Furthermore, the decay rate of the system without inertial terms is shown to be optimal.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Co-existence of Type II blow-ups with multiple blow-up rates for five-dimensional heat equation with critical nonlinear boundary conditions
Authors:
Juncheng Wei,
Zikai Ye,
Xiaoyu Zeng,
Qidi Zhang
Abstract:
We consider the following five-dimensional heat equation with critical boundary condition \begin{equation*}
\partial_t u=Δu
\mbox{ \ in \ } \mathbb{R}_+^5\times (0,T) ,
\quad
-\partial_{x_5}u =|u|^\frac{2}{3}u \mbox{ \ on \ } \pp \mathbb{R}^5_+ \times (0,T) . \end{equation*} Given $\mathfrak{o}$ distinct boundary points $q^{[i]} \in \partial \mathbb{R}_+^5$, and $\mathfrak{o}$ integers…
▽ More
We consider the following five-dimensional heat equation with critical boundary condition \begin{equation*}
\partial_t u=Δu
\mbox{ \ in \ } \mathbb{R}_+^5\times (0,T) ,
\quad
-\partial_{x_5}u =|u|^\frac{2}{3}u \mbox{ \ on \ } \pp \mathbb{R}^5_+ \times (0,T) . \end{equation*} Given $\mathfrak{o}$ distinct boundary points $q^{[i]} \in \partial \mathbb{R}_+^5$, and $\mathfrak{o}$ integers $l_i\in \mathbb{N}$ (possibly duplicated), $i=1,2,\dots, \mathfrak{o}$, for $T>0$ sufficiently small, we construct a finite-time blow-up solution $u$ with a type II blow-up rate $(T-t)^{-3l_i -3}$ for $x$ near $q^{[i]}$. This seems to be the first result of the co-existence of type II blowups with different blow-up rates. To accommodate highly unstable blowups with different blowup rates, we first develop a unified linear theory for the inner problem with more time decay in the blow-up scheme through restriction on the spatial growth of the right-hand side, and then use vanishing adjustment functions for deriving multiple rates at distinct points. This paper is inspired by [25, 52, 60].
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Modular data of non-semisimple modular categories
Authors:
Liang Chang,
Quinn T. Kolt,
Zhenghan Wang,
Qing Zhang
Abstract:
We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich m…
▽ More
We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich modular data, which is obtained from the Lyubashenko-Majid modular representation restricted to the Higman ideal of a factorizable ribbon Hopf algebra. The Cohen-Westreich $S$-matrix diagonalizes the mixed fusion rules and reduces to the usual $S$-matrix for semisimple modular categories. The paper includes detailed studies on small quantum groups $U_qsl(2)$ and the Drinfeld doubles of Nichols Hopf algebras, especially the $\mathrm{SL}(2, \mathbb{Z})$-representation on their centers, Cohen-Westreich modular data, and the congruence kernel theorem's validity.
△ Less
Submitted 6 May, 2024; v1 submitted 14 April, 2024;
originally announced April 2024.
-
Generic representations, open parameters and ABV-packets for $p$-adic groups
Authors:
Clifton Cunningham,
Sarah Dijols,
Andrew Fiori,
Qing Zhang
Abstract:
If $π$ is a representation of a $p$-adic group $G(F)$, and $φ$ is its Langlands parameter, can we use the moduli space of Langlands parameters to find a geometric property of $φ$ that will detect when $π$ is generic? In this paper we show that if $G$ is classical or if we assume the Kazhdan-Lusztig hypothesis for $G$, then the answer is yes, and the property is that the orbit of $φ$ is open. We al…
▽ More
If $π$ is a representation of a $p$-adic group $G(F)$, and $φ$ is its Langlands parameter, can we use the moduli space of Langlands parameters to find a geometric property of $φ$ that will detect when $π$ is generic? In this paper we show that if $G$ is classical or if we assume the Kazhdan-Lusztig hypothesis for $G$, then the answer is yes, and the property is that the orbit of $φ$ is open. We also propose an adaptation of Shahidi's enhanced genericity conjecture to ABV-packets: for every Langlands parameter $φ$ for a $p$-adic group $G(F)$, the ABV-packet $Π^{\mathrm{ABV}}_φ(G(F))$ contains a generic representation if and only if the local adjoint L-function $L(s,φ,\mathop{\text{Ad}})$ is regular at $s=1$, and show that this condition is equivalent to the "open parameter" condition above. We show that this genericity conjecture for ABV-packets follows from other standard conjectures and we verify its validity with the same conditions on $G$. We show that, in this case, the ABV-packet for $φ$ coincides with its $L$-packet. Finally, we prove Vogan's conjecture on $A$-packets for tempered parameters.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
A Copula Graphical Model for Multi-Attribute Data using Optimal Transport
Authors:
Qi Zhang,
Bing Li,
Lingzhou Xue
Abstract:
Motivated by modern data forms such as images and multi-view data, the multi-attribute graphical model aims to explore the conditional independence structure among vectors. Under the Gaussian assumption, the conditional independence between vectors is characterized by blockwise zeros in the precision matrix. To relax the restrictive Gaussian assumption, in this paper, we introduce a novel semipara…
▽ More
Motivated by modern data forms such as images and multi-view data, the multi-attribute graphical model aims to explore the conditional independence structure among vectors. Under the Gaussian assumption, the conditional independence between vectors is characterized by blockwise zeros in the precision matrix. To relax the restrictive Gaussian assumption, in this paper, we introduce a novel semiparametric multi-attribute graphical model based on a new copula named Cyclically Monotone Copula. This new copula treats the distribution of the node vectors as multivariate marginals and transforms them into Gaussian distributions based on the optimal transport theory. Since the model allows the node vectors to have arbitrary continuous distributions, it is more flexible than the classical Gaussian copula method that performs coordinatewise Gaussianization. We establish the concentration inequalities of the estimated covariance matrices and provide sufficient conditions for selection consistency of the group graphical lasso estimator. For the setting with high-dimensional attributes, a {Projected Cyclically Monotone Copula} model is proposed to address the curse of dimensionality issue that arises from solving high-dimensional optimal transport problems. Numerical results based on synthetic and real data show the efficiency and flexibility of our methods.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Global solvability for the Boussinesq system with fractional Laplacian
Authors:
Huiyang Zhang,
Shuokai Yan,
Qinghua Zhang
Abstract:
This paper focuses on the global solvability for the Boussinesq system with fractional Laplacian $(-Δ)^α$ in $\mathbb{R}^{n}$ for $n\geq3$. It proves the existence of a small positive number $\varepsilon=\varepsilon(n,α)$ such that for each $0<T<\infty$, if $\frac{1}{2}<α<\frac{2+n}{4}$ and $\|u_{0}\|_{\dot{H}^{s_{0}}}+T^{1/2}\|θ_{0}\|_{\dot{H}^{s_{0}-α}}\leq \varepsilon$, then the fractional Bous…
▽ More
This paper focuses on the global solvability for the Boussinesq system with fractional Laplacian $(-Δ)^α$ in $\mathbb{R}^{n}$ for $n\geq3$. It proves the existence of a small positive number $\varepsilon=\varepsilon(n,α)$ such that for each $0<T<\infty$, if $\frac{1}{2}<α<\frac{2+n}{4}$ and $\|u_{0}\|_{\dot{H}^{s_{0}}}+T^{1/2}\|θ_{0}\|_{\dot{H}^{s_{0}-α}}\leq \varepsilon$, then the fractional Boussinesq system has a unique strong solution on the bounded interval $[0,T]$. If $\frac{1}{2}<α<\frac{2+n}{6}$ and $\|u_{0}\|_{\dot{H}^{s_{0}}}+\|θ_{0}\|_{\dot{H}^{s_{0}-2α}}\leq \varepsilon$, then the fractional Boussinesq system has a unique strong solution on the whole interval $[0,\infty)$.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance
Authors:
Qi Zhang,
Yi Zhou,
Shaofeng Zou
Abstract:
This paper provides the first tight convergence analyses for RMSProp and Adam in non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, which is a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to solve the challenges due to dependence among adaptive…
▽ More
This paper provides the first tight convergence analyses for RMSProp and Adam in non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, which is a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to solve the challenges due to dependence among adaptive update, unbounded gradient estimate and Lipschitz constant, we demonstrate that the first-order term in the descent lemma converges and its denominator is upper bounded by a function of gradient norm. Based on this result, we show that RMSProp with proper hyperparameters converges to an $ε$-stationary point with an iteration complexity of $\mathcal O(ε^{-4})$. We then generalize our analysis to Adam, where the additional challenge is due to a mismatch between the gradient and first-order momentum. We develop a new upper bound on the first-order term in the descent lemma, which is also a function of the gradient norm. We show that Adam with proper hyperparameters converges to an $ε$-stationary point with an iteration complexity of $\mathcal O(ε^{-4})$. Our complexity results for both RMSProp and Adam match with the complexity lower bound established in \cite{arjevani2023lower}.
△ Less
Submitted 3 April, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Sobolev Calibration of Imperfect Computer Models
Authors:
Qingwen Zhang,
Wenjia Wang
Abstract:
Calibration refers to the statistical estimation of unknown model parameters in computer experiments, such that computer experiments can match underlying physical systems. This work develops a new calibration method for imperfect computer models, Sobolev calibration, which can rule out calibration parameters that generate overfitting calibrated functions. We prove that the Sobolev calibration enjo…
▽ More
Calibration refers to the statistical estimation of unknown model parameters in computer experiments, such that computer experiments can match underlying physical systems. This work develops a new calibration method for imperfect computer models, Sobolev calibration, which can rule out calibration parameters that generate overfitting calibrated functions. We prove that the Sobolev calibration enjoys desired theoretical properties including fast convergence rate, asymptotic normality and semiparametric efficiency. We also demonstrate an interesting property that the Sobolev calibration can bridge the gap between two influential methods: $L_2$ calibration and Kennedy and O'Hagan's calibration. In addition to exploring the deterministic physical experiments, we theoretically justify that our method can transfer to the case when the physical process is indeed a Gaussian process, which follows the original idea of Kennedy and O'Hagan's. Numerical simulations as well as a real-world example illustrate the competitive performance of the proposed method.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Sample Complexity of Chance Constrained Optimization in Dynamic Environment
Authors:
Apurv Shukla,
Qian Zhang,
Le Xie
Abstract:
We study the scenario approach for solving chance-constrained optimization in time-coupled dynamic environments. Scenario generation methods approximate the true feasible region from scenarios generated independently and identically from the actual distribution. In this paper, we consider this problem in a dynamic environment, where the scenarios are assumed to be drawn sequentially from an unknow…
▽ More
We study the scenario approach for solving chance-constrained optimization in time-coupled dynamic environments. Scenario generation methods approximate the true feasible region from scenarios generated independently and identically from the actual distribution. In this paper, we consider this problem in a dynamic environment, where the scenarios are assumed to be drawn sequentially from an unknown and time-varying distribution. Such dynamic environments are driven by changing environmental conditions that could be found in many real-world applications such as energy systems. We couple the time-varying distributions using the Wasserstein metric between the sequence of scenario-generating distributions and the actual chance-constrained distribution. Our main results are bounds on the number of samples essential for ensuring the ex-post risk in chance-constrained optimization problems when the underlying feasible set is convex or non-convex. Finally, our results are illustrated on multiple numerical experiments for both types of feasible sets.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Clunie lemma in several complex variables and application in PDEs
Authors:
Wenjie Hao,
Qingcai Zhang
Abstract:
Two purposes will be shown in this paper. The first one is to extend the classic Tumura-Clunie type theorem for meromorphic functions of one complex variable to meromorphic functions of several complex variables by using Clunie lemma. The second one is to characterize entire solutions of certain partial differential equations in $\mathbb{C}^{m}$. Our results are extensions and generalizations of t…
▽ More
Two purposes will be shown in this paper. The first one is to extend the classic Tumura-Clunie type theorem for meromorphic functions of one complex variable to meromorphic functions of several complex variables by using Clunie lemma. The second one is to characterize entire solutions of certain partial differential equations in $\mathbb{C}^{m}$. Our results are extensions and generalizations of the previous theorems by Liao-Ye \cite{Liao-Ye} and Li \cite{Li11}.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Smooth Tchebycheff Scalarization for Multi-Objective Optimization
Authors:
Xi Lin,
Xiaoyuan Zhang,
Zhiyuan Yang,
Fei Liu,
Zhenkun Wang,
Qingfu Zhang
Abstract:
Multi-objective optimization problems can be found in many real-world applications, where the objectives often conflict each other and cannot be optimized by a single solution. In the past few decades, numerous methods have been proposed to find Pareto solutions that represent different optimal trade-offs among the objectives for a given problem. However, these existing methods could have high com…
▽ More
Multi-objective optimization problems can be found in many real-world applications, where the objectives often conflict each other and cannot be optimized by a single solution. In the past few decades, numerous methods have been proposed to find Pareto solutions that represent different optimal trade-offs among the objectives for a given problem. However, these existing methods could have high computational complexity or may not have good theoretical properties for solving a general differentiable multi-objective optimization problem. In this work, by leveraging the smooth optimization technique, we propose a novel and lightweight smooth Tchebycheff scalarization approach for gradient-based multi-objective optimization. It has good theoretical properties for finding all Pareto solutions with valid trade-off preferences, while enjoying significantly lower computational complexity compared to other methods. Experimental results on various real-world application problems fully demonstrate the effectiveness of our proposed method.
△ Less
Submitted 14 March, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
High order conservative LDG-IMEX methods for the degenerate nonlinear non-equilibrium radiation diffusion problems
Authors:
Shaoqin Zheng,
Min Tang,
Qiang Zhang,
Tao Xiong
Abstract:
In this paper, we develop a class of high-order conservative methods for simulating non-equilibrium radiation diffusion problems. Numerically, this system poses significant challenges due to strong nonlinearity within the stiff source terms and the degeneracy of nonlinear diffusion terms. Explicit methods require impractically small time steps, while implicit methods, which offer stability, come w…
▽ More
In this paper, we develop a class of high-order conservative methods for simulating non-equilibrium radiation diffusion problems. Numerically, this system poses significant challenges due to strong nonlinearity within the stiff source terms and the degeneracy of nonlinear diffusion terms. Explicit methods require impractically small time steps, while implicit methods, which offer stability, come with the challenge to guarantee the convergence of nonlinear iterative solvers. To overcome these challenges, we propose a predictor-corrector approach and design proper implicit-explicit time discretizations. In the predictor step, the system is reformulated into a nonconservative form and linear diffusion terms are introduced as a penalization to mitigate strong nonlinearities. We then employ a Picard iteration to secure convergence in handling the nonlinear aspects. The corrector step guarantees the conservation of total energy, which is vital for accurately simulating the speeds of propagating sharp fronts in this system.
For spatial approximations, we utilize local discontinuous Galerkin finite element methods, coupled with positive-preserving and TVB limiters. We validate the orders of accuracy, conservation properties, and suitability of using large time steps for our proposed methods, through numerical experiments conducted on one- and two-dimensional spatial problems. In both homogeneous and heterogeneous non-equilibrium radiation diffusion problems, we attain a time stability condition comparable to that of a fully implicit time discretization. Such an approach is also applicable to many other reaction-diffusion systems.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Boosting Gradient Ascent for Continuous DR-submodular Maximization
Authors:
Qixin Zhang,
Zongqi Wan,
Zengde Deng,
Zaiyi Chen,
Xiaoming Sun,
Jialin Zhang,
Yu Yang
Abstract:
Projected Gradient Ascent (PGA) is the most commonly used optimization scheme in machine learning and operations research areas. Nevertheless, numerous studies and examples have shown that the PGA methods may fail to achieve the tight approximation ratio for continuous DR-submodular maximization problems. To address this challenge, we present a boosting technique in this paper, which can efficient…
▽ More
Projected Gradient Ascent (PGA) is the most commonly used optimization scheme in machine learning and operations research areas. Nevertheless, numerous studies and examples have shown that the PGA methods may fail to achieve the tight approximation ratio for continuous DR-submodular maximization problems. To address this challenge, we present a boosting technique in this paper, which can efficiently improve the approximation guarantee of the standard PGA to \emph{optimal} with only small modifications on the objective function. The fundamental idea of our boosting technique is to exploit non-oblivious search to derive a novel auxiliary function $F$, whose stationary points are excellent approximations to the global maximum of the original DR-submodular objective $f$. Specifically, when $f$ is monotone and $γ$-weakly DR-submodular, we propose an auxiliary function $F$ whose stationary points can provide a better $(1-e^{-γ})$-approximation than the $(γ^2/(1+γ^2))$-approximation guaranteed by the stationary points of $f$ itself. Similarly, for the non-monotone case, we devise another auxiliary function $F$ whose stationary points can achieve an optimal $\frac{1-\min_{\boldsymbol{x}\in\mathcal{C}}\|\boldsymbol{x}\|_{\infty}}{4}$-approximation guarantee where $\mathcal{C}$ is a convex constraint set. In contrast, the stationary points of the original non-monotone DR-submodular function can be arbitrarily bad~\citep{chen2023continuous}. Furthermore, we demonstrate the scalability of our boosting technique on four problems. In all of these four problems, our resulting variants of boosting PGA algorithm beat the previous standard PGA in several aspects such as approximation ratio and efficiency. Finally, we corroborate our theoretical findings with numerical experiments, which demonstrate the effectiveness of our boosting PGA methods.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
A local maximum principle for robust optimal control problems of quadratic BSDEs
Authors:
Tao Hao,
Jiaqiang Wen,
Qi Zhang
Abstract:
The paper concerns the necessary maximum principle for robust optimal control problems of quadratic BSDEs. The coefficient of the systems depends on the parameter $θ$, and the generator of BSDEs is of quadratic growth in $z$. Since the model is uncertain, the variational inequality is proved by weak convergence technique. In addition, due to the generator being quadratic with respect to $z$, the f…
▽ More
The paper concerns the necessary maximum principle for robust optimal control problems of quadratic BSDEs. The coefficient of the systems depends on the parameter $θ$, and the generator of BSDEs is of quadratic growth in $z$. Since the model is uncertain, the variational inequality is proved by weak convergence technique. In addition, due to the generator being quadratic with respect to $z$, the forward adjoint equations are SDEs with unbounded coefficient involving mean oscillation martingales. Using reverse Hölder inequality and John-Nirenberg inequality, we show that its solutions are continuous with respect to the parameter $θ$. The necessary and sufficient conditions for robust optimal control are proved by linearization method.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
Proximal observers for secure state estimation
Authors:
Laurent Bako,
Madiha Nadri,
Vincent Andrieu,
Qinghua Zhang
Abstract:
This paper discusses a general framework for designing robust state estimators for a class of discrete-time nonlinear systems. We consider systems that may be impacted by impulsive (sparse but otherwise arbitrary) measurement noise sequences. We show that a family of state estimators, robust to this type of undesired signal, can be obtained by minimizing a class of nonsmooth convex functions at ea…
▽ More
This paper discusses a general framework for designing robust state estimators for a class of discrete-time nonlinear systems. We consider systems that may be impacted by impulsive (sparse but otherwise arbitrary) measurement noise sequences. We show that a family of state estimators, robust to this type of undesired signal, can be obtained by minimizing a class of nonsmooth convex functions at each time step. The resulting state observers are defined through proximal operators. We obtain a nonlinear implicit dynamical system in term of estimation error and prove, in the noise-free setting, that it vanishes asymptotically when the minimized loss function and the to-beobserved system enjoy appropriate properties. From a computational perspective, even though the proposed observers can be implemented via efficient numerical procedures, they do not admit closed-form expressions. The paper argues that by adopting appropriate relaxations, simple and fast analytic expressions can be derived.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
A New Parallel Cooperative Landscape Smoothing Algorithm and Its Applications on TSP and UBQP
Authors:
Wei Wang,
Jialong Shi,
Jianyong Sun,
Arnaud Liefooghe,
Qingfu Zhang
Abstract:
Combinatorial optimization problem (COP) is difficult to solve because of the massive local optimal solutions in his solution space. Various methods have been put forward to smooth the solution space of COPs, including homotopic convex (HC) transformation for the traveling salesman problem (TSP). This paper first extends the HC transformation approach to the unconstrained binary quadratic programm…
▽ More
Combinatorial optimization problem (COP) is difficult to solve because of the massive local optimal solutions in his solution space. Various methods have been put forward to smooth the solution space of COPs, including homotopic convex (HC) transformation for the traveling salesman problem (TSP). This paper first extends the HC transformation approach to the unconstrained binary quadratic programming (UBQP). We theoretically prove the effectiveness of the proposed HC transformation method on smoothing the landscape of the UBQP. Subsequently, we introduce an iterative algorithmic framework incorporating HC transformation, referred as landscape smoothing iterated local search (LSILS). Our experimental analyses, conducted on various UBQP instances show the effectiveness of LSILS. Furthermore, this paper proposes a parallel cooperative variant of LSILS, denoted as PC-LSILS and apply it to both the UBQP and the TSP. Our experimental findings highlight that PC-LSILS improves the smoothing performance of the HC transformation, and further improves the overall performance of the algorithm.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
Roth-type Theorem for high-power system in Piatetski-Shapiro primes (II)
Authors:
Xiumin Ren,
Yu-chen Sun,
Qingqing Zhang,
Rui Zhang
Abstract:
We consider the nonlinear system $c_1p_1^d +c_2p_2^d + \dots + c_s p_s^d = 0$ with $c_1, c_2,\dots, c_s\in\mathbb Z$ being nonzero and satisfying $c_1 +c_2 + \dots + c_s = 0$. We show that for $s\ge 2\lfloor \frac{d^2}2\rfloor+1$ and $c\in\left(1, 1+c(d,s)\right)$, if the system has only $K$-trivial solutions in subset $\mathcal{A}$ of Piatetski-Shapiro primes up to $x$ and corresponding to $c$, t…
▽ More
We consider the nonlinear system $c_1p_1^d +c_2p_2^d + \dots + c_s p_s^d = 0$ with $c_1, c_2,\dots, c_s\in\mathbb Z$ being nonzero and satisfying $c_1 +c_2 + \dots + c_s = 0$. We show that for $s\ge 2\lfloor \frac{d^2}2\rfloor+1$ and $c\in\left(1, 1+c(d,s)\right)$, if the system has only $K$-trivial solutions in subset $\mathcal{A}$ of Piatetski-Shapiro primes up to $x$ and corresponding to $c$, then $|\mathcal{A}| \ll \frac{x^{\frac1c}}{\log x} $$\left(\log \log \log \log x\right)^{\frac{2-s}{dc}+\varepsilon}$.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Gradient estimates on graphs with the $CDψ(n,-K)$condition
Authors:
Yi Li,
Qianwei Zhang
Abstract:
This paper investigates gradient estimates on graphs satisfying the $CDψ(n,-K)$ condition with positive constants $n,K$, and concave $C^{1}$ functions $ψ:(0,+\infty)\rightarrow\mathbb{R}$. Our study focuses on gradient estimates for positive solutions of the heat equation $\partial_{t}u=Δu$. Additionally, the estimate is extended to a heat-type equation $\partial_{t}u=Δu+cu^σ$, where $σ$ is a cons…
▽ More
This paper investigates gradient estimates on graphs satisfying the $CDψ(n,-K)$ condition with positive constants $n,K$, and concave $C^{1}$ functions $ψ:(0,+\infty)\rightarrow\mathbb{R}$. Our study focuses on gradient estimates for positive solutions of the heat equation $\partial_{t}u=Δu$. Additionally, the estimate is extended to a heat-type equation $\partial_{t}u=Δu+cu^σ$, where $σ$ is a constant and $c$ is a continuous function defined on $[0,+\infty)$. Furthermore, we utilize these estimates to derive heat kernel bounds and Harnack inequalities.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Map-Reduce for Multiprocessing Large Data and Multi-threading for Data Scraping
Authors:
Zefeng Qiu,
Prashanth Umapathy,
Qingquan Zhang,
Guanqun Song,
Ting Zhu
Abstract:
This document is the final project report for our advanced operating system class. During this project, we mainly focused on applying multiprocessing and multi-threading technology to our whole project and utilized the map-reduce algorithm in our data cleaning and data analysis process. In general, our project can be divided into two components: data scraping and data processing, where the previou…
▽ More
This document is the final project report for our advanced operating system class. During this project, we mainly focused on applying multiprocessing and multi-threading technology to our whole project and utilized the map-reduce algorithm in our data cleaning and data analysis process. In general, our project can be divided into two components: data scraping and data processing, where the previous part was almost web wrangling with employing potential multiprocessing or multi-threading technology to speed up the whole process. And after we collect and scrape a large amount value of data as mentioned above, we can use them as input to implement data cleaning and data analysis, during this period, we take advantage of the map-reduce algorithm to increase efficiency.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Quadratic and cubic Lagrange finite elements for mixed Laplace eigenvalue problems on criss-cross meshes
Authors:
Kaibo Hu,
Jiguang Sun,
Qian Zhang
Abstract:
In [6], it was shown that the linear Lagrange element space on criss-cross meshes and its divergence exhibit spurious eigenvalues when applied in the mixed formulation of the Laplace eigenvalue problem, despite satisfying both the inf-sup condition and ellipticity on the discrete kernel. The lack of a Fortin interpolation is responsible for the spurious eigenvalues produced by the linear Lagrange…
▽ More
In [6], it was shown that the linear Lagrange element space on criss-cross meshes and its divergence exhibit spurious eigenvalues when applied in the mixed formulation of the Laplace eigenvalue problem, despite satisfying both the inf-sup condition and ellipticity on the discrete kernel. The lack of a Fortin interpolation is responsible for the spurious eigenvalues produced by the linear Lagrange space. In contrast, results in [8] confirm that quartic and higher-order Lagrange elements do not yield spurious eigenvalues on general meshes without nearly singular vertices, including criss-cross meshes as a special case. In this paper, we investigate quadratic and cubic Lagrange elements on criss-cross meshes. We prove the convergence of discrete eigenvalues by fitting the Lagrange elements on criss-cross meshes into a complex and constructing a Fortin interpolation. As a by-product, we construct bounded commuting projections for the finite element Stokes complex, which induces isomorphisms between cohomologies of the continuous and discrete complexes. We provide numerical examples to validate the theoretical results.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Global solutions to quasilinear wave-Klein-Gordon systems in two space dimensions
Authors:
Qian Zhang
Abstract:
In this paper we prove global existence and global behavior of solutions to quasilinear wave-Klein-Gordon systems in $\mathbb{R}^{1+2}$ with quadratic nonlinearities satisfying the null condition. We consider small, regular and compactly supported initial data, and prove global existence, pointwise decay estimates and linear scattering for the solutions.
In this paper we prove global existence and global behavior of solutions to quasilinear wave-Klein-Gordon systems in $\mathbb{R}^{1+2}$ with quadratic nonlinearities satisfying the null condition. We consider small, regular and compactly supported initial data, and prove global existence, pointwise decay estimates and linear scattering for the solutions.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Unfitted finite element method for the quad-curl interface problem
Authors:
Hailong Guo,
Mingyan Zhang,
Qian Zhang,
Zhimin Zhang
Abstract:
In this paper, we introduce a novel unfitted finite element method to solve the quad-curl interface problem. We adapt Nitsche's method for curlcurl-conforming elements and double the degrees of freedom on interface elements. To ensure stability, we incorporate ghost penalty terms and a discrete divergence-free term. We establish the well-posedness of our method and demonstrate an optimal error bou…
▽ More
In this paper, we introduce a novel unfitted finite element method to solve the quad-curl interface problem. We adapt Nitsche's method for curlcurl-conforming elements and double the degrees of freedom on interface elements. To ensure stability, we incorporate ghost penalty terms and a discrete divergence-free term. We establish the well-posedness of our method and demonstrate an optimal error bound in the discrete energy norm. We also analyze the stiffness matrix's condition number. Our numerical tests back up our theory on convergence rates and condition numbers.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Distributional Hessian and divdiv complexes on triangulation and cohomology
Authors:
Kaibo Hu,
Ting Lin,
Qian Zhang
Abstract:
In this paper, we construct discrete versions of some Bernstein-Gelfand-Gelfand (BGG) complexes, i.e., the Hessian and the divdiv complexes, on triangulations in 2D and 3D. The sequences consist of finite elements with local polynomial shape functions and various types of Dirac measure on subsimplices. The construction generalizes Whitney forms (canonical conforming finite elements) for the de Rha…
▽ More
In this paper, we construct discrete versions of some Bernstein-Gelfand-Gelfand (BGG) complexes, i.e., the Hessian and the divdiv complexes, on triangulations in 2D and 3D. The sequences consist of finite elements with local polynomial shape functions and various types of Dirac measure on subsimplices. The construction generalizes Whitney forms (canonical conforming finite elements) for the de Rham complex and Regge calculus/finite elements for the elasticity (Riemannian deformation) complex from discrete topological and Discrete Exterior Calculus perspectives. We show that the cohomology of the resulting complexes is isomorphic to the continuous versions, and thus isomorphic to the de~Rham cohomology with coefficients.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Liouville theorem for quasilinear elliptic equations in $\mathbb R^N$
Authors:
Wangzhe Wu,
Qiqi Zhang
Abstract:
We prove Liouville theorem for the equation $Δ_m v + v^p + M |\nabla v|^{q}= 0$ in a domain $Ω\subset\mathbb R^n$, with $M\in \mathbb{R}$ in the critical and subcritical case. As a natural extension of our recent work \cite{MWZ}, the proof is based on an integral identity and Young's inequality.
We prove Liouville theorem for the equation $Δ_m v + v^p + M |\nabla v|^{q}= 0$ in a domain $Ω\subset\mathbb R^n$, with $M\in \mathbb{R}$ in the critical and subcritical case. As a natural extension of our recent work \cite{MWZ}, the proof is based on an integral identity and Young's inequality.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
A blow up solution of the Navier-Stokes equations with a super critical forcing term
Authors:
Qi S. Zhang
Abstract:
A forced solution $v$ of the axially symmetric Navier-Stokes equation in a finite cylinder $D$ with suitable boundary condition is constructed. The forcing term is in the super critical space $L^q_t L^1_x$ for all $q>1$. The velocity is in the energy space at the final moment when it blows up.
A forced solution $v$ of the axially symmetric Navier-Stokes equation in a finite cylinder $D$ with suitable boundary condition is constructed. The forcing term is in the super critical space $L^q_t L^1_x$ for all $q>1$. The velocity is in the energy space at the final moment when it blows up.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Liouville theorem for elliptic equations involving the sum of the function and its gradient in $\mathbb R^n$
Authors:
Xi-nan Ma,
Wangzhe Wu,
Qiqi Zhang
Abstract:
We prove Liouville theorem for the equation $Δv + N v^p + M |\nabla v|^{q}= 0$ in $\mathbb R^n$, with $M, N > 0, q = \frac{2p}{p + 1}$ in the critical and subcritical case. The proof is based on an integral identity and Young inequality.
We prove Liouville theorem for the equation $Δv + N v^p + M |\nabla v|^{q}= 0$ in $\mathbb R^n$, with $M, N > 0, q = \frac{2p}{p + 1}$ in the critical and subcritical case. The proof is based on an integral identity and Young inequality.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Efficient Scenario Generation for Chance-constrained Economic Dispatch Considering Ambient Wind Conditions
Authors:
Qian Zhang,
Apurv Shukla,
Le Xie
Abstract:
Scenario generation is an effective data-driven method for solving chance-constrained optimization while ensuring desired risk guarantees with a finite number of samples. Crucial challenges in deploying this technique in the real world arise due to the absence of appropriate risk-tuning models tailored for the desired application. In this paper, we focus on designing efficient scenario generation…
▽ More
Scenario generation is an effective data-driven method for solving chance-constrained optimization while ensuring desired risk guarantees with a finite number of samples. Crucial challenges in deploying this technique in the real world arise due to the absence of appropriate risk-tuning models tailored for the desired application. In this paper, we focus on designing efficient scenario generation schemes for economic dispatch in power systems. We propose a novel scenario generation method based on filtering scenarios using ambient wind conditions. These filtered scenarios are deployed incrementally in order to meet desired risk levels while using minimum resources. In order to study the performance of the proposed scheme, we illustrate the procedure on case studies performed for both 24-bus and 118-bus systems with real-world wind power forecasting data. Numerical results suggest that the proposed filter-and-increment scenario generation model leads to a precise and efficient solution for the chance-constrained economic dispatch problem.
△ Less
Submitted 2 January, 2024; v1 submitted 3 November, 2023;
originally announced November 2023.
-
Two improved algorithms for sparse generalized canonical correlation analysis
Authors:
Kuo-Yue Li,
Qi-Ye Zhang,
Yong-Han Sun
Abstract:
Regularized generalized canonical correlation analysis (RGCCA) is a generalization of regularized canonical correlation analysis to three or more sets of variables, which is a component-based approach aiming to study the relationships between several sets of variables. Sparse generalized canonical correlation analysis (SGCCA) (proposed in Tenenhaus et al. (2014)), combines RGCCA with an `1-penalty…
▽ More
Regularized generalized canonical correlation analysis (RGCCA) is a generalization of regularized canonical correlation analysis to three or more sets of variables, which is a component-based approach aiming to study the relationships between several sets of variables. Sparse generalized canonical correlation analysis (SGCCA) (proposed in Tenenhaus et al. (2014)), combines RGCCA with an `1-penalty, in which blocks are not necessarily fully connected, makes SGCCA a flexible method for analyzing a wide variety of practical problems, such as biology, chemistry, sensory analysis, marketing, food research, etc. In Tenenhaus et al. (2014), an iterative algorithm for SGCCA was designed based on the solution to the subproblem (LM-P1 for short) of maximizing a linear function on the intersection of an `1-norm ball and a unit `2-norm sphere proposed in Witten et al. (2009). However, the solution to the subproblem (LM-P1) proposed in Witten et al. (2009) is not correct, which may become the reason that the iterative algorithm for SGCCA is slow and not always convergent. For this, we first characterize the solution to the subproblem LM-P1, and the subproblems LM-P2 and LM-P3, which maximize a linear function on the intersection of an `1-norm sphere and a unit `2-norm sphere, and an `1-norm ball and a unit `2-norm sphere, respectively. Then we provide more efficient block coordinate descent (BCD) algorithms for SGCCA and its two variants, called SGCCA-BCD1, SGCCA-BCD2 and SGCCA-BCD3, corresponding to the subproblems LM-P1, LM-P2 and LM-P3, respectively, prove that they all globally converge to their stationary points. We further propose gradient projected (GP) methods for SGCCA and its two variants when using the Horst scheme, called SGCCA-GP1, SGCCA-GP2 and SGCCA-GP3, corresponding to the subproblems LM-P1, LM-P2 and LM-P3, respectively, and prove that they all
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Optimal Strategies for Round-Trip Pairs Trading Under Geometric Brownian Motions
Authors:
Emily Crawford Das,
Jingzhi Tie,
Qing Zhang
Abstract:
This paper is concerned with an optimal strategy for simultaneously trading a pair of stocks. The idea of pairs trading is to monitor their price movements and compare their relative strength over time. A pairs trade is triggered by the divergence of their prices and consists of a pair of positions to short the strong stock and to long the weak one. Such a strategy bets on the reversal of their pr…
▽ More
This paper is concerned with an optimal strategy for simultaneously trading a pair of stocks. The idea of pairs trading is to monitor their price movements and compare their relative strength over time. A pairs trade is triggered by the divergence of their prices and consists of a pair of positions to short the strong stock and to long the weak one. Such a strategy bets on the reversal of their price strengths. A round-trip trading strategy refers to opening and closing such a pair of security positions. Typical pairs-trading models usually assume a difference of the stock prices satisfies a mean-reversion equation. However, we consider the optimal pairs-trading problem by allowing the stock prices to follow general geometric Brownian motions. The objective is to trade the pairs over time to maximize an overall return with a fixed commission cost for each transaction. Initially, we allow the initial pairs position to be either long or flat. We then consider the problem when the initial pairs position may be long, flat, or short. In each case, the optimal policy is characterized by threshold curves obtained by solving the associated HJB equations.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
A quantitative pairwise comparison-based constraint handling technique for constrained optimization
Authors:
Ting Huang,
Qiang Zhang,
Witold Pedrycz,
Shanlin Yang
Abstract:
This study proposes a new constraint handling technique for assisting metaheuristic optimization algorithms to solve constrained optimization problems more effectively and efficiently. Given any two solutions of any constrained optimization problems, they are first mapped into a two-dimensional Cartesian coordinate system with their objective function value differences and constraint violation dif…
▽ More
This study proposes a new constraint handling technique for assisting metaheuristic optimization algorithms to solve constrained optimization problems more effectively and efficiently. Given any two solutions of any constrained optimization problems, they are first mapped into a two-dimensional Cartesian coordinate system with their objective function value differences and constraint violation differences as the two axes. To the best of our knowledge, we are the first to deal with constraints by building such a Cartesian coordinate system. Then, the Cartesian coordinate system is divided into a series of grids by assigning ranks to different intervals of differences. In this way, a pairwise comparison criterion is derived with the use of the fused ranks, which achieves non-hierarchical comparison neither preferring objective function values nor constraint violations, resulting in more accurate evaluation compared with existing techniques. Moreover, an evaluation function that is equivalent to the pairwise comparison criterion is proposed, which further improves computational efficiency. The effectiveness and efficiency of the proposed constraint handling technique are verified on two well-known public datasets, that is, CEC 2006 and CEC 2017. The results demonstrate that metaheuristic optimization algorithms with using the proposed constraint handling technique can converge to a feasible optimal solution faster and more reliably. Experimental analysis on the parameters involved reveal guidance for their optimal settings.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization
Authors:
Fu Luo,
Xi Lin,
Fei Liu,
Qingfu Zhang,
Zhenkun Wang
Abstract:
Neural combinatorial optimization (NCO) is a promising learning-based approach for solving challenging combinatorial optimization problems without specialized algorithm design by experts. However, most constructive NCO methods cannot solve problems with large-scale instance sizes, which significantly diminishes their usefulness for real-world applications. In this work, we propose a novel Light En…
▽ More
Neural combinatorial optimization (NCO) is a promising learning-based approach for solving challenging combinatorial optimization problems without specialized algorithm design by experts. However, most constructive NCO methods cannot solve problems with large-scale instance sizes, which significantly diminishes their usefulness for real-world applications. In this work, we propose a novel Light Encoder and Heavy Decoder (LEHD) model with a strong generalization ability to address this critical issue. The LEHD model can learn to dynamically capture the relationships between all available nodes of varying sizes, which is beneficial for model generalization to problems of various scales. Moreover, we develop a data-efficient training scheme and a flexible solution construction mechanism for the proposed LEHD model. By training on small-scale problem instances, the LEHD model can generate nearly optimal solutions for the Travelling Salesman Problem (TSP) and the Capacitated Vehicle Routing Problem (CVRP) with up to 1000 nodes, and also generalizes well to solve real-world TSPLib and CVRPLib problems. These results confirm our proposed LEHD model can significantly improve the state-of-the-art performance for constructive NCO. The code is available at https://github.com/CIAM-Group/NCO_code/tree/main/single_objective/LEHD.
△ Less
Submitted 12 January, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Stability and Optimal Decay Rates for Abstract Systems with Thermal Damping of Cattaneo's Type
Authors:
Chenxi Deng,
Zhong-Jie Han,
Zhaobin Kuang,
Qiong Zhang
Abstract:
This paper studies the stability of an abstract thermoelastic system with Cattaneo's law, which describes finite heat propagation speed in a medium. We introduce a parameters region containing coupling, thermal dissipation, and possible inertial characteristics. The parameters region is partitioned into distinct subregions based on the spectral properties of the infinitesimal generator of the corr…
▽ More
This paper studies the stability of an abstract thermoelastic system with Cattaneo's law, which describes finite heat propagation speed in a medium. We introduce a parameters region containing coupling, thermal dissipation, and possible inertial characteristics. The parameters region is partitioned into distinct subregions based on the spectral properties of the infinitesimal generator of the corresponding semigroup. By a careful estimation of the resolvent, we obtain distinct polynomial decay rates for systems with parameters located in different subregions. Furthermore, the optimality of these decay rates is proved. Finally, we apply our results to several coupled systems of partial differential equations.
△ Less
Submitted 27 April, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Classification of aut-fixed subgroups in free-abelian times surface groups
Authors:
Jialin Lei,
Peng Wang,
Qiang Zhang
Abstract:
In this paper, we are concerned with free-abelian times surface groups, and show that they contain, up to isomorphism, infinitely many fixed subgroups of automorphisms. Moreover, we give a complete classification of their aut-fixed subgroups.
In this paper, we are concerned with free-abelian times surface groups, and show that they contain, up to isomorphism, infinitely many fixed subgroups of automorphisms. Moreover, we give a complete classification of their aut-fixed subgroups.
△ Less
Submitted 25 May, 2024; v1 submitted 23 September, 2023;
originally announced September 2023.
-
The Furstenberg property in Puiseux monoids
Authors:
Andrew Lin,
Henrick Rabinovitz,
Qiao Zhang
Abstract:
Let $M$ be a commutative monoid. The monoid $M$ is called atomic if every non-invertible element of $M$ factors into atoms (i.e., irreducible elements), while $M$ is called a Furstenberg monoid if every non-invertible element of $M$ is divisible by an atom. Additive submonoids of $\mathbb{Q}$ consisting of nonnegative rationals are called Puiseux monoids, and their atomic structure has been active…
▽ More
Let $M$ be a commutative monoid. The monoid $M$ is called atomic if every non-invertible element of $M$ factors into atoms (i.e., irreducible elements), while $M$ is called a Furstenberg monoid if every non-invertible element of $M$ is divisible by an atom. Additive submonoids of $\mathbb{Q}$ consisting of nonnegative rationals are called Puiseux monoids, and their atomic structure has been actively studied during the past few years. The primary purpose of this paper is to investigate the property of being Furstenberg in the context of Puiseux monoids. In this direction, we consider some properties weaker than being Furstenberg, and then we connect these properties with some atomic results which have been already established for Puiseux monoids.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
A generalization of the Witten conjecture through spectral curve
Authors:
Shuai Guo,
Ce Ji,
Qingsheng Zhang
Abstract:
We propose a generalization of the Witten conjecture, which connects a descendent enumerative theory with a specific reduction of KP integrable hierarchy. Our conjecture is realized by two parts: Part I (Geometry) establishes a correspondence between the descendent potential function (apart from ancestors) and the topological recursion of specific spectral curve data $(Σ, x,y,B)$; Part II (Integra…
▽ More
We propose a generalization of the Witten conjecture, which connects a descendent enumerative theory with a specific reduction of KP integrable hierarchy. Our conjecture is realized by two parts: Part I (Geometry) establishes a correspondence between the descendent potential function (apart from ancestors) and the topological recursion of specific spectral curve data $(Σ, x,y,B)$; Part II (Integrability) claims that the TR descendent potential, defined at the boundary points of the spectral curve (where $dx$ has poles), is a tau function of a certain reduction of the multi-component KP hierarchy.
In this paper, we show the geometry part for any formal descendent theory by using a generalized Laplace transform, and show the integrability part for the one-boundary cases. As applications, we generalize and prove the $r$KdV integrability of negative $r$-spin theory conjectured by Chidambaram, Garcia-Falide and Giacchetto [6], and prove the KdV integrability for the theory associated with the Weierstrass curve introduced by Dubrovin.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Product of Rankin-Selberg convolutions and a new proof of Jacquet's local converse conjecture
Authors:
Pan Yan,
Qing Zhang
Abstract:
In this article, we construct a family of integrals which represent the product of Rankin-Selberg $L$-functions of $\mathrm{GL}_{l}\times \mathrm{GL}_m$ and of $\mathrm{GL}_{l}\times \mathrm{GL}_n $ when $m+n<l$. When $n=0$, these integrals are those defined by Jacquet--Piatetski-Shapiro--Shalika up to a shift. In this sense, these new integrals generalize Jacquet--Piatetski-Shapiro--Shalika's Ran…
▽ More
In this article, we construct a family of integrals which represent the product of Rankin-Selberg $L$-functions of $\mathrm{GL}_{l}\times \mathrm{GL}_m$ and of $\mathrm{GL}_{l}\times \mathrm{GL}_n $ when $m+n<l$. When $n=0$, these integrals are those defined by Jacquet--Piatetski-Shapiro--Shalika up to a shift. In this sense, these new integrals generalize Jacquet--Piatetski-Shapiro--Shalika's Rankin-Selberg convolution integrals. We study basic properties of these integrals. In particular, we define local gamma factors using this new family of integrals. As an application, we obtain a new proof of Jacquet's local converse conjecture using these new integrals.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Decentralized Control for Discrete-time Mean-Field Systems with Multiple Controllers of Delayed Information
Authors:
Qingyuan Qi,
Zhiqiang Liu,
Qianqian Zhang,
Xinbei Lv
Abstract:
In this paper, the finite horizon asymmetric information linear quadratic (LQ) control problem is investigated for a discrete-time mean field system. Different from previous works, multiple controllers with different information sets are involved in the mean field system dynamics. The coupling of different controllers makes it quite difficult in finding the optimal control strategy. Fortunately, b…
▽ More
In this paper, the finite horizon asymmetric information linear quadratic (LQ) control problem is investigated for a discrete-time mean field system. Different from previous works, multiple controllers with different information sets are involved in the mean field system dynamics. The coupling of different controllers makes it quite difficult in finding the optimal control strategy. Fortunately, by applying the Pontryagin's maximum principle, the corresponding decentralized control problem of the finite horizon is investigated. The contributions of this paper can be concluded as: For the first time, based on the solution of a group of mean-field forward and backward stochastic difference equations (MF-FBSDEs), the necessary and sufficient solvability conditions are derived for the asymmetric information LQ control for the mean field system with multiple controllers. Furthermore, by the use of an innovative orthogonal decomposition approach, the optimal decentralized control strategy is derived, which is based on the solution to a non-symmetric Riccati-type equation.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.