Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Spanning and Splitting: Integer Semidefinite Programming for the Quadratic Minimum Spanning Tree Problem

Frank de Meijer111Delft Institute of Applied Mathematics, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands, f.j.j.demeijer@tudelft.nl 222Corresponding Author:f.j.j.demeijer@tudelft.nl \scalerel*    Melanie Siebenhofer333Institut für Mathematik, Alpen-Adria-Universität Klagenfurt, Universitätstraße 65-67, 9020 Klagenfurt, melanie.siebenhofer@aau.at, angelika.wiegele@aau.at 444This research was funded in part by the Austrian Science Fund (FWF) [10.55776/DOC78]. For open access purposes, the authors have applied a CC BY public copyright license to any author-accepted manuscript version arising from this submission. \scalerel*    Renata Sotirov555Tilburg University, Department of Econometrics & Operations Research, CentER, 5000 LE Tilburg, r.sotirov@tilburguniversity.edu \scalerel*    Angelika Wiegele‡§666Universität zu Köln, Weyertal 86–90, 50931 Köln, Germany \scalerel*
Abstract

In the quadratic minimum spanning tree problem (QMSTP) one wants to find the minimizer of a quadratic function over all possible spanning trees of a graph. We give two formulations of the QMSTP as mixed-integer semidefinite programs exploiting the algebraic connectivity of a graph. Based on these formulations, we derive a doubly nonnegative relaxation for the QMSTP and investigate classes of valid inequalities to strengthen the relaxation using the Chvátal-Gomory procedure for mixed-integer conic programming.

Solving the resulting relaxations is out of reach for off-the-shelf software. We therefore develop and implement a version of the Peaceman-Rachford splitting method that allows to compute the new bounds for graphs from the literature. The numerical results demonstrate that our bounds significantly improve over existing bounds from the literature in both quality and computation time, in particular for graphs with more than 30 vertices.

This work is further evidence that semidefinite programming is a valuable tool to obtain high-quality bounds for problems in combinatorial optimization, in particular for those that can be modelled as a quadratic problem.

Keywords: Combinatorial Optimization, Spanning Trees, Integer Semidefinite Programming, Algebraic Connectivity, Projection Methods

1 Introduction

The quadratic minimum spanning tree problem (QMSTP) is the problem of finding a spanning tree of a connected, undirected graph such that the sum of interaction costs over all pairs of edges in the tree is minimized. The QMSTP was introduced by Assad and Xu [1] in 1992. The adjacent-only quadratic minimum spanning tree problem (AQMSTP), that is, the QMSTP where the interaction costs of all non-adjacent edge pairs are assumed to be zero, is also introduced in [1]. Assad and Xu proved that both the QMSTP and AQMSTP are strongly 𝒩P𝒩𝑃{\mathcal{N}P}caligraphic_N italic_P-hard problems. Interestingly, the QMSTP remains 𝒩P𝒩𝑃{\mathcal{N}P}caligraphic_N italic_P-hard even when the cost matrix is of rank one [36].

There are many existing variants of the QMSTP problem, such as the minimum spanning tree problem with conflict pairs, the quadratic bottleneck spanning tree problem, and the bottleneck spanning tree problem with conflict pairs. For a description of those problems, see e.g., Ćustić et al. [12]. The QMSTP has various applications in telecommunication, transportation, energy and hydraulic networks, see e.g., [1, 7, 8].

There is a lot of research on lower-bounding approaches and exact algorithms for the QMSTP. The majority of lower bounding approaches for the QMSTP may be classified into Gilmore-Lawler (GL) type bounds [1, 11, 30, 37] and reformulation linearization technique (RLT) based bounds [34, 37]. The GL procedure is a well-known approach to construct lower bounds for quadratic binary optimization problems, see e.g., [18, 25]. The RLT is a method to derive a hierarchy of convex approximations of mixed-integer programming problems [38] where integer variables are binary. Lower bounding approaches based on an extended formulation of the minimum spanning tree problem are derived in [39]. For an overview of the above-mentioned lower bounding approaches and their comparison, see e.g., [39]. Semidefinite programming (SDP) lower bounds for the QMSTP are considered in [20]. SDP bounds incorporated in a branch-and-bound algorithm provide the best exact solution approach for the problem up to date [20]. Different exact approaches for solving the QMSTP are considered in [1, 11, 34, 33]. For a comparison of various heuristic approaches for solving the QMSTP, see Palubeckis et al. [31].

In this paper, we derive two mixed-integer semidefinite (MISDP) formulations for the QMSTP by exploiting the algebraic connectivity of a tree. Algebraic connectivity was also exploited in [13] and [14] to derive ISDP formulations for the traveling salesman problem (TSP) and the quadratic TSP, respectively. We prove that the continuous relaxation of the cut-set QMSTP formulation of the QMSTP is at least as strong as the continuous relaxations of MISDP formulations of the QMSTP. Further, we derive several classes of valid inequalities for our MISDPs by exploiting the Chvátal-Gomory (CG) procedure for mixed-integer conic programming [6, 14]. In particular, we show that the classical cut-set constraints and the first level RLT constraints are CG cuts. The cut-set constraints are derived from the linear matrix inequality (LMI) that is related to the algebraic connectivity of a tree. The RLT-type constraints are derived using two LMIs from the MISDP formulation of the QMSTP.

Our preliminary computational results show that the cut-set constraints have a small impact on the quality of our doubly nonnegative (DNN) relaxation of the QMSTP, but the RLT-type constraints improve the DNN bound. Therefore, we add RLT-type constraints to the DNN relaxation of the QMSTP. The resulting relaxation has a large number of constraints, and it is difficult to solve using state-of-the-art interior point methods. Therefore, we design a version of the Peaceman-Rachford splitting method (PRSM) that is able to handle a large number of cutting-planes efficiently. In particular, the PRSM algorithm is adding violated RLT-type inequalities iteratively while using warm-starts. The numerical results show that our bounds for the QMSTP outperform bounds from the literature in quality, as well as in computational time required to obtain them. Our approach shows significant improvement over other methods from the literature, particularly for larger instances, specifically, for graphs with more than 30 vertices.

Notation

The set of n×n𝑛𝑛n\times nitalic_n × italic_n real symmetric matrices is denoted by 𝒮nsuperscript𝒮𝑛{\mathcal{S}}^{n}caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. The space of symmetric matrices is considered with the trace inner product, which for any X,Y𝒮n𝑋𝑌superscript𝒮𝑛X,Y\in{\mathcal{S}}^{n}italic_X , italic_Y ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is defined as X,Ytr(XY)𝑋𝑌tr𝑋𝑌\langle X,Y\rangle\coloneqq\text{tr}(XY)⟨ italic_X , italic_Y ⟩ ≔ tr ( italic_X italic_Y ). The associated norm is the Frobenius norm XFtr(XX)subscriptnorm𝑋𝐹tr𝑋𝑋\|X\|_{F}\coloneqq\sqrt{\text{tr}(XX)}∥ italic_X ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≔ square-root start_ARG tr ( italic_X italic_X ) end_ARG. The cone of symmetric positive semidefinite matrices of order n𝑛nitalic_n is defined as 𝒮+n{X𝒮n:X𝟎}superscriptsubscript𝒮𝑛conditional-set𝑋superscript𝒮𝑛succeeds-or-equals𝑋0{\mathcal{S}}_{+}^{n}\coloneqq\{X\in{\mathcal{S}}^{n}:X\succeq\mathbf{0}\}caligraphic_S start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ≔ { italic_X ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_X ⪰ bold_0 }. We order the eigenvalues of X𝒮n𝑋superscript𝒮𝑛X\in{\mathcal{S}}^{n}italic_X ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT as follows λ1(X)λn(X)subscript𝜆1𝑋subscript𝜆𝑛𝑋\lambda_{1}(X)\leq\cdots\leq\lambda_{n}(X)italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_X ) ≤ ⋯ ≤ italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ). If it is clear from the context to which matrix the eigenvalues relate, we denote eigenvalues by λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The Hadamard product of two matrices X=(xij)𝑋subscript𝑥𝑖𝑗X=(x_{ij})italic_X = ( italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) and Y=(yij)𝑌subscript𝑦𝑖𝑗Y=(y_{ij})italic_Y = ( italic_y start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) of the same size is denoted by \circ, and defined as follows (XY)ijxijyij.subscript𝑋𝑌𝑖𝑗subscript𝑥𝑖𝑗subscript𝑦𝑖𝑗(X\circ Y)_{ij}\coloneqq x_{ij}y_{ij}.( italic_X ∘ italic_Y ) start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≔ italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT . The operator diag:n×nn:diagsuperscript𝑛𝑛superscript𝑛\text{diag}\colon\mathbb{R}^{n\times n}\rightarrow\mathbb{R}^{n}diag : roman_ℝ start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT → roman_ℝ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT maps a square matrix to a vector consisting of its diagonal elements. The adjoint operator of diag is denoted by Diag:nn×n:Diagsuperscript𝑛superscript𝑛𝑛\text{Diag}\colon\mathbb{R}^{n}\rightarrow\mathbb{R}^{n\times n}Diag : roman_ℝ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → roman_ℝ start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT.

We denote by 𝟏nsubscript1𝑛{\mathbf{1}}_{n}bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT the vector of all ones of length n𝑛nitalic_n, and define 𝐉n𝟏n𝟏nsubscript𝐉𝑛subscript1𝑛superscriptsubscript1𝑛top{\mathbf{J}}_{n}\coloneqq{\mathbf{1}}_{n}{\mathbf{1}}_{n}^{\top}bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≔ bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. The indicator vector of SV𝑆𝑉S\subseteq Vitalic_S ⊆ italic_V is denoted by 𝟙S.subscriptdouble-struck-𝟙𝑆\mathbb{1}_{S}.blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT . The all-zero matrix of order n𝑛nitalic_n is denoted by 𝟎nsubscript0𝑛{\mathbf{0}}_{n}bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. We use 𝐈nsubscript𝐈𝑛{\mathbf{I}}_{n}bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to denote the identity matrix of order n𝑛nitalic_n, while its i𝑖iitalic_i-th column is given by 𝐮isubscript𝐮𝑖{\mathbf{u}}_{i}bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In case the dimension of 𝟏nsubscript1𝑛{\mathbf{1}}_{n}bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, 𝟎nsubscript0𝑛{\mathbf{0}}_{n}bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, 𝐉nsubscript𝐉𝑛{\mathbf{J}}_{n}bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and 𝐈nsubscript𝐈𝑛\mathbf{I}_{n}bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is clear from the context, we omit the subscript.

We define the n𝑛nitalic_n-simplex as Δn{xp:x𝟎,i=1pxi=n}subscriptΔ𝑛conditional-set𝑥superscript𝑝formulae-sequence𝑥0superscriptsubscript𝑖1𝑝subscript𝑥𝑖𝑛\Delta_{n}\coloneqq\{x\in\mathbb{R}^{p}:x\geq\mathbf{0},\ \sum_{i=1}^{p}x_{i}=n\}roman_Δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≔ { italic_x ∈ roman_ℝ start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT : italic_x ≥ bold_0 , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_n } and the capped n𝑛nitalic_n-simplex as Δ¯n{xp:𝟎x𝟏,i=1pxi=n}subscript¯Δ𝑛conditional-set𝑥superscript𝑝formulae-sequence0𝑥1superscriptsubscript𝑖1𝑝subscript𝑥𝑖𝑛\bar{\Delta}_{n}\coloneqq\{x\in\mathbb{R}^{p}:\mathbf{0}\leq x\leq\mathbf{1},% \ \sum_{i=1}^{p}x_{i}=n\}over¯ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≔ { italic_x ∈ roman_ℝ start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT : bold_0 ≤ italic_x ≤ bold_1 , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_n }. By 𝒫subscript𝒫\mathcal{P}_{\mathcal{M}}caligraphic_P start_POSTSUBSCRIPT caligraphic_M end_POSTSUBSCRIPT we denote the projection operator onto the set \mathcal{M}caligraphic_M. We use [n]delimited-[]𝑛[n][ italic_n ] to denote the set of integers {1,,n}.1𝑛\{1,\ldots,n\}.{ 1 , … , italic_n } .

Given a subset SV𝑆𝑉S\subseteq Vitalic_S ⊆ italic_V of vertices in a graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ), we denote the set of edges with both endpoints in S𝑆Sitalic_S by E(S){{i,j}E:i,jS}𝐸𝑆conditional-set𝑖𝑗𝐸𝑖𝑗𝑆E(S)\coloneqq\{\{i,j\}\in E~{}:~{}i,j\in S\}italic_E ( italic_S ) ≔ { { italic_i , italic_j } ∈ italic_E : italic_i , italic_j ∈ italic_S } and the cut induced by S𝑆Sitalic_S by S{{i,j}E:iS,jS}𝑆conditional-set𝑖𝑗𝐸formulae-sequence𝑖𝑆𝑗𝑆\partial S\coloneqq\big{\{}\{i,j\}\in E~{}:~{}i\in S,j\notin S\big{\}}∂ italic_S ≔ { { italic_i , italic_j } ∈ italic_E : italic_i ∈ italic_S , italic_j ∉ italic_S }. However, when S={i}𝑆𝑖S=\{i\}italic_S = { italic_i } we define δ(i)S𝛿𝑖𝑆\delta(i)\coloneqq\partial Sitalic_δ ( italic_i ) ≔ ∂ italic_S.

2 The Quadratic Minimum Spanning Tree Problem

In this section, we formally introduce the QMSTP. Let G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) be a connected, undirected graph with n=|V|𝑛𝑉n=|V|italic_n = | italic_V | vertices and m=|E|𝑚𝐸m=|E|italic_m = | italic_E | edges. Let Q=(qef)𝒮m𝑄subscript𝑞𝑒𝑓superscript𝒮𝑚Q=(q_{ef})\in{\mathcal{S}}^{m}italic_Q = ( italic_q start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT ) ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT be a matrix of interaction costs between edges of G𝐺Gitalic_G, where qeesubscript𝑞𝑒𝑒q_{ee}italic_q start_POSTSUBSCRIPT italic_e italic_e end_POSTSUBSCRIPT represents the cost of edge e𝑒eitalic_e.

The QMSTP can be formulated as the following binary quadratic programming problem:

minx𝒯eEfEqefxexf,subscript𝑥𝒯subscript𝑒𝐸subscript𝑓𝐸subscript𝑞𝑒𝑓subscript𝑥𝑒subscript𝑥𝑓\displaystyle\min\limits_{x\in\mathcal{T}}~{}\sum_{e\in E}\sum_{f\in E}q_{ef}x% _{e}x_{f},roman_min start_POSTSUBSCRIPT italic_x ∈ caligraphic_T end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_f ∈ italic_E end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ,

where 𝒯𝒯\mathcal{T}caligraphic_T denotes the set of all spanning trees in G𝐺Gitalic_G. Each spanning tree in 𝒯𝒯\mathcal{T}caligraphic_T is represented by its incidence vector of length m𝑚mitalic_m, and therefore

𝒯{x{0,1}m:eExe=n1,eSxe1,SV,S}.𝒯conditional-set𝑥superscript01𝑚formulae-sequencesubscript𝑒𝐸subscript𝑥𝑒𝑛1formulae-sequencesubscript𝑒𝑆subscript𝑥𝑒1formulae-sequencefor-all𝑆𝑉𝑆\displaystyle{\mathcal{T}}\coloneqq\bigg{\{}x\in\{0,1\}^{m}~{}:~{}\sum_{e\in E% }x_{e}=n-1,~{}\sum_{e\in\partial S}x_{e}\geq 1,~{}\forall S\subsetneq V,~{}S% \not=\emptyset\bigg{\}}.caligraphic_T ≔ { italic_x ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : ∑ start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = italic_n - 1 , ∑ start_POSTSUBSCRIPT italic_e ∈ ∂ italic_S end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ 1 , ∀ italic_S ⊊ italic_V , italic_S ≠ ∅ } . (1)

The constraints of the type

eSxe1subscript𝑒𝑆subscript𝑥𝑒1\displaystyle\sum_{e\in\partial S}x_{e}\geq 1∑ start_POSTSUBSCRIPT italic_e ∈ ∂ italic_S end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ 1 (2)

are known as the cut-set constraints, and they ensure connectivity of a subgraph from 𝒯𝒯\mathcal{T}caligraphic_T. If Q𝑄Qitalic_Q is a diagonal matrix then the QMSTP reduces to the minimum spanning tree problem that is solvable in polynomial time [24, 35].

Let us now fix an ordering for the edges E={e1,,em}𝐸subscript𝑒1subscript𝑒𝑚E=\{e_{1},\ldots,e_{m}\}italic_E = { italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }. For x𝒯𝑥𝒯x\in{\mathcal{T}}italic_x ∈ caligraphic_T define Y(yef)𝒮m𝑌subscript𝑦𝑒𝑓superscript𝒮𝑚Y\coloneqq(y_{ef})\in{\mathcal{S}}^{m}italic_Y ≔ ( italic_y start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT ) ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT such that yef=1subscript𝑦𝑒𝑓1y_{ef}=1italic_y start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT = 1 if xe=1subscript𝑥𝑒1x_{e}=1italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = 1 and xf=1subscript𝑥𝑓1x_{f}=1italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = 1, and yef=0subscript𝑦𝑒𝑓0y_{ef}=0italic_y start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT = 0 otherwise. Then the QMSTP can be formulated as the following mixed-integer programming problem, see e.g., [1]:

minQ,Ys.t. diag(Y)=x,Y𝟏m=(n1)x𝟎Y𝐉m,Y𝒮m,x𝒯.𝑄𝑌s.t. formulae-sequencediag𝑌𝑥𝑌subscript1𝑚𝑛1𝑥missing-subexpressionmissing-subexpressionmissing-subexpressionformulae-sequence0𝑌subscript𝐉𝑚formulae-sequence𝑌superscript𝒮𝑚𝑥𝒯\displaystyle\begin{aligned} \min\ &\langle Q,Y\rangle\\ \text{s.t. }&\text{diag}(Y)=x,\,\,Y\mathbf{1}_{m}=(n-1)x&&\\ &\mathbf{0}\leq Y\leq\mathbf{J}_{m},\,\,Y\in{\mathcal{S}}^{m},\,\,x\in\mathcal% {T}.\end{aligned}start_ROW start_CELL roman_min end_CELL start_CELL ⟨ italic_Q , italic_Y ⟩ end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL diag ( italic_Y ) = italic_x , italic_Y bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_n - 1 ) italic_x end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL bold_0 ≤ italic_Y ≤ bold_J start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_Y ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_x ∈ caligraphic_T . end_CELL end_ROW (3)

One can verify that the constraints, in combination with the binarity of x𝑥xitalic_x, are sufficient to obtain the coupling between Y𝑌Yitalic_Y and x𝑥xitalic_x. Note that each row in Y𝑌Yitalic_Y is an incidence vector of a tree. The above model was introduced by Assad and Xu [1]. We refer to the above program the cut-set formulation of the QMSTP.

3 MISDP formulations for the QMSTP

In 1973, Fiedler [17] defined the algebraic connectivity, a(G)𝑎𝐺a(G)italic_a ( italic_G ), of a graph G𝐺Gitalic_G as the second smallest eigenvalue of the Laplacian matrix of the graph. It is well-known that the algebraic connectivity is greater than zero if and only if G𝐺Gitalic_G is a connected graph. In this section, we will exploit the algebraic connectivity of a tree to derive two MISDP formulations of the QMSTP. We also prove that the continuous relaxations of our MISDP formulations are at least as strong as the continuous relaxation of the cut-set QMSTP formulation (3).

It is known that the algebraic connectivity for the graph class of trees with n3𝑛3n\geq 3italic_n ≥ 3 vertices lies in the interval between 2(1cos(πn))21𝜋𝑛2\left(1-\cos\left(\frac{\pi}{n}\right)\right)2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ) and 1111, see e.g., [19]. Here, 2(1cos(πn))21𝜋𝑛2\left(1-\cos\left(\frac{\pi}{n}\right)\right)2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ) is the algebraic connectivity of the path graph, and 1111 is the algebraic connectivity of the star graph. It is also known that a tree on n𝑛nitalic_n vertices has exactly n1𝑛1n-1italic_n - 1 edges. Hence, a tree can be characterized as a connected graph with exactly n1𝑛1n-1italic_n - 1 edges, see also (1). We use those facts to characterize trees by means of positive semidefiniteness.

Proposition 1.

Let G𝐺Gitalic_G be a simple graph on n3𝑛3n\geq 3italic_n ≥ 3 vertices with n1𝑛1n-1italic_n - 1 edges. Let L𝐿{L}italic_L be its Laplacian matrix and let α,β𝛼𝛽\alpha,\beta\in\mathbb{R}italic_α , italic_β ∈ roman_ℝ with αβn𝛼𝛽𝑛\alpha\geq\frac{\beta}{n}italic_α ≥ divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG and 0<β2(1cos(πn))0𝛽21𝜋𝑛0<\beta\leq 2\left(1-\cos\left(\frac{\pi}{n}\right)\right)0 < italic_β ≤ 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ). Then, G𝐺Gitalic_G is a tree if and only if Z=L+α𝐉nβ𝐈n𝟎.𝑍𝐿𝛼subscript𝐉𝑛𝛽subscript𝐈𝑛succeeds-or-equals0{Z}={L}+\alpha\mathbf{J}_{n}-\beta\mathbf{I}_{n}\succeq\mathbf{0}.italic_Z = italic_L + italic_α bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_β bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⪰ bold_0 .

Proof.

Let 0=λ1λ2λn0subscript𝜆1subscript𝜆2subscript𝜆𝑛0=\lambda_{1}\leq\lambda_{2}\leq\dots\leq\lambda_{n}0 = italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ⋯ ≤ italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be the eigenvalues of the Laplacian matrix L𝐿Litalic_L. We denote by v1=𝟏superscript𝑣11v^{1}=\mathbf{1}italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = bold_1 and visuperscript𝑣𝑖v^{i}italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT for i{2,,n}𝑖2𝑛i\in\{2,\dots,n\}italic_i ∈ { 2 , … , italic_n } the eigenvectors of L𝐿Litalic_L such that they form a basis of nsuperscript𝑛\mathbb{R}^{n}roman_ℝ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. The matrix 𝐉𝐉\mathbf{J}bold_J has eigenvalue n𝑛nitalic_n whose corresponding eigenvector is 𝟏1\mathbf{1}bold_1, and eigenvalue 00 of multiplicity n1𝑛1n-1italic_n - 1 with eigenvectors visuperscript𝑣𝑖v^{i}italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT for i{2,,n}𝑖2𝑛i\in\{2,\dots,n\}italic_i ∈ { 2 , … , italic_n }. Therefore, Z𝟏=(L+α𝐉β𝐈)𝟏=(αnβ)𝟏𝑍1𝐿𝛼𝐉𝛽𝐈1𝛼𝑛𝛽1Z\mathbf{1}=(L+\alpha\mathbf{J}-\beta\mathbf{I})\mathbf{1}=(\alpha n-\beta)% \mathbf{1}italic_Z bold_1 = ( italic_L + italic_α bold_J - italic_β bold_I ) bold_1 = ( italic_α italic_n - italic_β ) bold_1 and Zvi=(λiβ)vi,𝑍superscript𝑣𝑖subscript𝜆𝑖𝛽superscript𝑣𝑖Zv^{i}=(\lambda_{i}-\beta)v^{i},italic_Z italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT = ( italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_β ) italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , from where it follows that the eigenvalues of Z𝑍Zitalic_Z are αnβ𝛼𝑛𝛽\alpha n-\betaitalic_α italic_n - italic_β and λiβsubscript𝜆𝑖𝛽\lambda_{i}-\betaitalic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_β for i{2,,n}𝑖2𝑛i\in\{2,\dots,n\}italic_i ∈ { 2 , … , italic_n }. Using the fact that αβn𝛼𝛽𝑛\alpha\geq\frac{\beta}{n}italic_α ≥ divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG we have αnβ0,𝛼𝑛𝛽0\alpha n-\beta\geq 0,italic_α italic_n - italic_β ≥ 0 , and thus Z𝑍Zitalic_Z is positive semidefinite if and only if its eigenvalue λ2βsubscript𝜆2𝛽\lambda_{2}-\betaitalic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_β is nonnegative.

Now suppose that G𝐺Gitalic_G is a tree. In this case, we know that a(G)=λ22(1cos(πn))𝑎𝐺subscript𝜆221𝜋𝑛a(G)=\lambda_{2}\geq 2\left(1-\cos\left(\frac{\pi}{n}\right)\right)italic_a ( italic_G ) = italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ) holds. Therefore, we have that λ2β2(1cos(πn))β0,subscript𝜆2𝛽21𝜋𝑛𝛽0\lambda_{2}-\beta\geq 2\left(1-\cos\left(\frac{\pi}{n}\right)\right)-\beta\geq 0,italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_β ≥ 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ) - italic_β ≥ 0 , and thus Z𝟎succeeds-or-equals𝑍0Z\succeq\mathbf{0}italic_Z ⪰ bold_0.

On the other hand, if Z𝟎succeeds-or-equals𝑍0Z\succeq\mathbf{0}italic_Z ⪰ bold_0 then λ2β0subscript𝜆2𝛽0\lambda_{2}-\beta\geq 0italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_β ≥ 0. Since β>0𝛽0\beta>0italic_β > 0, it follows that a(G)=λ2>0𝑎𝐺subscript𝜆20a(G)=\lambda_{2}>0italic_a ( italic_G ) = italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 and, thus G𝐺Gitalic_G is connected. As G𝐺Gitalic_G has n𝑛nitalic_n vertices and n1𝑛1n-1italic_n - 1 edges, it is a tree. ∎

The previous result can be generalized for any graph as follows.

Proposition 2.

Let G𝐺Gitalic_G be a simple graph on n3𝑛3n\geq 3italic_n ≥ 3 vertices and L𝐿Litalic_L be the Laplacian matrix of G𝐺Gitalic_G. Then a(G)β𝑎𝐺𝛽a(G)\geq\betaitalic_a ( italic_G ) ≥ italic_β if and only if L+βn𝐉nβ𝐈n𝟎succeeds-or-equals𝐿𝛽𝑛subscript𝐉𝑛𝛽subscript𝐈𝑛0L+\frac{\beta}{n}\mathbf{J}_{n}-\beta\mathbf{I}_{n}\succeq\mathbf{0}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_β bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⪰ bold_0.

Proof.

Let 0=λ1λ2λn0subscript𝜆1subscript𝜆2subscript𝜆𝑛0=\lambda_{1}\leq\lambda_{2}\leq\dots\leq\lambda_{n}0 = italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ⋯ ≤ italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be the eigenvalues of L𝐿Litalic_L, the Laplacian matrix of G𝐺Gitalic_G. The eigenvalues of L+βn𝐉𝐿𝛽𝑛𝐉L+\frac{\beta}{n}\mathbf{J}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J are β𝛽\betaitalic_β and a(G)=λ2λn𝑎𝐺subscript𝜆2subscript𝜆𝑛a(G)=\lambda_{2}\leq\dots\leq\lambda_{n}italic_a ( italic_G ) = italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ⋯ ≤ italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. If a(G)β𝑎𝐺𝛽a(G)\geq\betaitalic_a ( italic_G ) ≥ italic_β, then all eigenvalues of L+βn𝐉𝐿𝛽𝑛𝐉L+\frac{\beta}{n}\mathbf{J}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J are greater or equal than β𝛽\betaitalic_β and therefore L+βn𝐉β𝐈𝟎succeeds-or-equals𝐿𝛽𝑛𝐉𝛽𝐈0L+\frac{\beta}{n}\mathbf{J}-\beta\mathbf{I}\succeq\mathbf{0}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I ⪰ bold_0. Conversely, if L+βn𝐉β𝐈𝟎succeeds-or-equals𝐿𝛽𝑛𝐉𝛽𝐈0L+\frac{\beta}{n}\mathbf{J}-\beta\mathbf{I}\succeq\mathbf{0}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I ⪰ bold_0, then all eigenvalues of L+βn𝐉𝐿𝛽𝑛𝐉L+\frac{\beta}{n}\mathbf{J}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J greater or equal to β𝛽\betaitalic_β and therefore a(G)β𝑎𝐺𝛽a(G)\geq\betaitalic_a ( italic_G ) ≥ italic_β. ∎

In the sequel, we exploit Proposition 1 to derive MISDP formulations for the QMSTP. Let us first define the set of adjacency matrices of induced subgraphs of G𝐺Gitalic_G with n𝑛nitalic_n vertices and n1𝑛1n-1italic_n - 1 edges:

{X{0,1}n×n𝒮n:X,𝐉n=2(n1),xij=0 if {i,j}E}.conditional-set𝑋superscript01𝑛𝑛superscript𝒮𝑛formulae-sequence𝑋subscript𝐉𝑛2𝑛1subscript𝑥𝑖𝑗0 if 𝑖𝑗𝐸\displaystyle\mathcal{F}\coloneqq\left\{X\in\{0,1\}^{n\times n}\cap\mathcal{S}% ^{n}~{}:~{}\langle X,\mathbf{J}_{n}\rangle=2(n-1),\ x_{ij}=0\text{ if }\{i,j\}% \notin E\right\}.caligraphic_F ≔ { italic_X ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT ∩ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : ⟨ italic_X , bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟩ = 2 ( italic_n - 1 ) , italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 0 if { italic_i , italic_j } ∉ italic_E } . (4)

The set of all adjacency matrices of spanning trees on n𝑛nitalic_n vertices is:

𝒯M={X𝒮n:Diag(X𝟏)X+α𝐉β𝐈𝟎},subscript𝒯𝑀conditional-set𝑋superscript𝒮𝑛succeeds-or-equalsDiag𝑋1𝑋𝛼𝐉𝛽𝐈0\displaystyle\mathcal{T}_{M}=\mathcal{F}\cap\left\{X\in\mathcal{S}^{n}~{}:~{}% \text{Diag}(X{\bf 1})-X+\alpha\mathbf{J}-\beta\mathbf{I}\succeq\mathbf{0}% \right\},caligraphic_T start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = caligraphic_F ∩ { italic_X ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : Diag ( italic_X bold_1 ) - italic_X + italic_α bold_J - italic_β bold_I ⪰ bold_0 } , (5)

where αβn𝛼𝛽𝑛\alpha\geq\frac{\beta}{n}italic_α ≥ divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG and 0<β2(1cos(πn))0𝛽21𝜋𝑛0<\beta\leq 2\left(1-\cos\left(\frac{\pi}{n}\right)\right)0 < italic_β ≤ 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ). There is a bijection :𝒯M𝒯:subscript𝒯𝑀𝒯{\mathcal{B}}:{\mathcal{T}}_{M}\to{\mathcal{T}}caligraphic_B : caligraphic_T start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT → caligraphic_T, see (1), where (X)𝑋{\mathcal{B}}(X)caligraphic_B ( italic_X ) maps X𝑋Xitalic_X to a column vector containing the entries of X corresponding to E𝐸Eitalic_E with respect to the fixed ordering of the edge set. Hence, the QMSTP can be written as the following MISDP problem:

min\displaystyle\min\ roman_min Q,Y𝑄𝑌\displaystyle\langle Q,Y\rangle⟨ italic_Q , italic_Y ⟩ (6a)
s.t. diag(Y)=(X),Y𝟏m=(n1)(X)formulae-sequencediag𝑌𝑋𝑌subscript1𝑚𝑛1𝑋\displaystyle\text{diag}(Y)={\mathcal{B}}(X),\,\,Y\mathbf{1}_{m}=(n-1){% \mathcal{B}}(X)diag ( italic_Y ) = caligraphic_B ( italic_X ) , italic_Y bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_n - 1 ) caligraphic_B ( italic_X ) (6b)
Diag(X𝟏)X+α𝐉nβ𝐈n𝟎succeeds-or-equalsDiag𝑋1𝑋𝛼subscript𝐉𝑛𝛽subscript𝐈𝑛0\displaystyle\text{Diag}(X\mathbf{1})-X+\alpha\mathbf{J}_{n}-\beta\mathbf{I}_{% n}\succeq\mathbf{0}Diag ( italic_X bold_1 ) - italic_X + italic_α bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_β bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⪰ bold_0 (6c)
𝟎Y𝐉m,Y𝒮m,X.formulae-sequence0𝑌subscript𝐉𝑚formulae-sequence𝑌superscript𝒮𝑚𝑋\displaystyle\mathbf{0}\leq Y\leq\mathbf{J}_{m},\,\,Y\in{\mathcal{S}}^{m},\,\,% X\in\mathcal{F}.bold_0 ≤ italic_Y ≤ bold_J start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_Y ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_X ∈ caligraphic_F . (6d)

One can verify that the integrality of the matrix variable Y𝑌Yitalic_Y in (6) follows from the integrality of the matrix variable X𝑋Xitalic_X. Let us compare the continuous relaxations of (6) and the continuous relaxation of the cut-set QMSTP formulation (3). We first show the following result.

Proposition 3.

Let X𝒮n𝑋superscript𝒮𝑛X\in\mathcal{S}^{n}italic_X ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT be a matrix such that 𝟎X𝐉0𝑋𝐉\mathbf{0}\leq X\leq{\mathbf{J}}bold_0 ≤ italic_X ≤ bold_J, diag(X)=𝟎diag𝑋0\text{diag}(X)=\mathbf{0}diag ( italic_X ) = bold_0, X,𝐉=2(n1)𝑋𝐉2𝑛1\langle X,{\mathbf{J}}\rangle=2(n-1)⟨ italic_X , bold_J ⟩ = 2 ( italic_n - 1 ), and minSViSjSxij=1subscript𝑆𝑉subscript𝑖𝑆subscript𝑗𝑆subscript𝑥𝑖𝑗1\min_{\emptyset\neq S\subsetneq V}\sum_{i\in S}\sum_{j\notin S}x_{ij}=1roman_min start_POSTSUBSCRIPT ∅ ≠ italic_S ⊊ italic_V end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∉ italic_S end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1. Then λ2(Diag(X𝟏)X)2(1cosπn)subscript𝜆2Diag𝑋1𝑋21𝜋𝑛\lambda_{2}(\text{Diag}(X\mathbf{1})-X)\geq 2(1-\cos\frac{\pi}{n})italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( Diag ( italic_X bold_1 ) - italic_X ) ≥ 2 ( 1 - roman_cos divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ).

Proof.

The proof is similar to the proof of Statement 4.3. in [17]. ∎

It is not difficult to show that for a feasible (x,Y)𝑥𝑌(x,Y)( italic_x , italic_Y ) for the continuous relaxation of the cut-set QMSTP formulation (3) one can construct a feasible pair (X,Y)𝑋𝑌(X,Y)( italic_X , italic_Y ) for the continuous relaxation of (6). This leads us to the following result.

Corollary 1.

The continuous relaxation of the cut-set QMSTP formulation is at least as strong as the continuous relaxations of (6).

This result is not very surprising. Namely, Goemans and Rendl [40] show a similar result that relates the subtour elimination relaxation and an algebraic connectivity based SDP relaxation for the traveling salesman problem.

One can also formulate the QMSTP by exploiting theory on discrete PSD matrices from [15], i.e., the following result.

Theorem 1 ([15]).

Let Z=(Xxx1)𝟎𝑍matrix𝑋𝑥superscript𝑥top1succeeds-or-equals0Z=\begin{pmatrix}X&x\\ x^{\top}&1\\ \end{pmatrix}\succeq\mathbf{0}italic_Z = ( start_ARG start_ROW start_CELL italic_X end_CELL start_CELL italic_x end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ⪰ bold_0 with diag(X)=xdiag𝑋𝑥\text{diag}({X})={x}diag ( italic_X ) = italic_x. Then, rank(Z)=1rank𝑍1{\text{rank}(Z)}=1rank ( italic_Z ) = 1 if and only if X{0,1}n×n𝑋superscript01𝑛𝑛{{X}\in\{0,1\}^{n\times n}}italic_X ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT.

Now, by using the previous result, we formulate the QMSTP as the following MISDP:

min\displaystyle\min\ roman_min Q,Y𝑄𝑌\displaystyle\langle Q,Y\rangle⟨ italic_Q , italic_Y ⟩ (7a)
s.t. diag(Y)=(X)diag𝑌𝑋\displaystyle\text{diag}(Y)={\mathcal{B}}(X)diag ( italic_Y ) = caligraphic_B ( italic_X ) (7b)
(Y(X)(X)1)𝟎succeeds-or-equalsmatrix𝑌𝑋superscript𝑋top10\displaystyle\begin{pmatrix}Y&{\mathcal{B}}(X)\\ {\mathcal{B}}(X)^{\top}&1\end{pmatrix}\succeq\mathbf{0}( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL caligraphic_B ( italic_X ) end_CELL end_ROW start_ROW start_CELL caligraphic_B ( italic_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ⪰ bold_0 (7c)
Diag(X𝟏)X+α𝐉nβ𝐈n𝟎succeeds-or-equalsDiag𝑋1𝑋𝛼subscript𝐉𝑛𝛽subscript𝐈𝑛0\displaystyle\text{Diag}(X\mathbf{1})-X+\alpha\mathbf{J}_{n}-\beta\mathbf{I}_{% n}\succeq\mathbf{0}Diag ( italic_X bold_1 ) - italic_X + italic_α bold_J start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_β bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⪰ bold_0 (7d)
Y𝒮m,X,formulae-sequence𝑌superscript𝒮𝑚𝑋\displaystyle Y\in{\mathcal{S}}^{m},\,\,X\in\mathcal{F},italic_Y ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_X ∈ caligraphic_F , (7e)

where \mathcal{F}caligraphic_F is given in (4). In (7), we do not impose integrality on the off-diagonal elements of Y𝑌Yitalic_Y as those follow by the integrality of (X)𝑋{\mathcal{B}}(X)caligraphic_B ( italic_X ), see [15]. Due to the integrality of Y𝑌Yitalic_Y, the constraints (7b) and (7c) ensure that Y=(X)(X)𝑌𝑋superscript𝑋topY={\mathcal{B}}(X){\mathcal{B}}(X)^{\top}italic_Y = caligraphic_B ( italic_X ) caligraphic_B ( italic_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Now, by using the same arguments as earlier, one can show the following result.

Corollary 2.

The continuous relaxation of the cut-set QMSTP formulation is at least as strong as the continuous relaxations of (7).

It is difficult to directly compare the continuous relaxations of (6) and (7). However, by adding the constraint Y𝟏m=(n1)(X)𝑌subscript1𝑚𝑛1𝑋Y\mathbf{1}_{m}=(n-1){\mathcal{B}}(X)italic_Y bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_n - 1 ) caligraphic_B ( italic_X ) to (7) (which is redundant in the presence of integrality), we have the following result.

Corollary 3.

The continuous relaxation of (7) with additional constraint Y𝟏m=(n1)(X)𝑌subscript1𝑚𝑛1𝑋Y\mathbf{1}_{m}=(n-1){\mathcal{B}}(X)italic_Y bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_n - 1 ) caligraphic_B ( italic_X ) dominates the continuous relaxation of (6).

3.1 Valid inequalities

In this section, we derive Chvátal-Gomory cuts from the MISDP formulations of the QMSTP from the previous section. Some of those cuts coincide with well-known cuts from the literature. In particular, we show that the cut-set constraints (2) and some of the first level RLT constraints are CG cuts.

Let us first present a result that applies to any graph having several connected components.

Proposition 4.

Let L𝐿Litalic_L be the Laplacian matrix of a graph on n3𝑛3n\geq 3italic_n ≥ 3 vertices, consisting of exactly k2𝑘2k\geq 2italic_k ≥ 2 connected components. Let {S1,,Sk}subscript𝑆1subscript𝑆𝑘\{S_{1},\dots,S_{k}\}{ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } be the partition of the vertices implied by these components. For each [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ], let vsubscript𝑣{v}_{\ell}italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT be the vector defined as

(v)i{n|S|if iS|S|if iS.subscriptsubscript𝑣𝑖cases𝑛subscript𝑆if iSsubscript𝑆if iS.({v}_{\ell})_{i}\coloneqq\begin{dcases*}n-\lvert S_{\ell}\rvert&if $i\in S_{% \ell}$\\ -\lvert S_{\ell}\rvert&if $i\notin S_{\ell}$.\end{dcases*}( italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≔ { start_ROW start_CELL italic_n - | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | end_CELL start_CELL if italic_i ∈ italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | end_CELL start_CELL if italic_i ∉ italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT . end_CELL end_ROW

Then vv,L+βn𝐉β𝐈<0subscript𝑣subscriptsuperscript𝑣top𝐿𝛽𝑛𝐉𝛽𝐈0\langle{v}_{\ell}{v}^{\top}_{\ell},L+\frac{\beta}{n}\mathbf{J}-\beta\mathbf{I}% \rangle<0⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I ⟩ < 0 for all [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ] and β>0𝛽0\beta>0italic_β > 0.

Proof.

Recall the LMI from Proposition 2. We can write v=n𝟙S|S|𝟏subscript𝑣𝑛subscriptdouble-struck-𝟙subscript𝑆subscript𝑆1{v}_{\ell}=n\mathbb{1}_{S_{\ell}}-\lvert S_{\ell}\rvert\mathbf{1}italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_n blackboard_𝟙 start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT - | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | bold_1, [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ]. It is not difficult to verify that 𝟏1\mathbf{1}bold_1 and 𝟙Ssubscriptdouble-struck-𝟙subscript𝑆\mathbb{1}_{S_{\ell}}blackboard_𝟙 start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT for [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ] are eigenvectors of L𝐿Litalic_L corresponding to the zero eigenvalue. It further holds that 𝐉𝟙S=|S|𝟏𝐉subscriptdouble-struck-𝟙subscript𝑆subscript𝑆1\mathbf{J}\mathbb{1}_{S_{\ell}}=\lvert S_{\ell}\rvert\mathbf{1}bold_J blackboard_𝟙 start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT = | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | bold_1 for all [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ], and therefore, 𝐉v=n|S|𝟏|S|n𝟏=𝟎𝐉subscript𝑣𝑛subscript𝑆1subscript𝑆𝑛10\mathbf{J}{v}_{\ell}=n\lvert S_{\ell}\rvert\mathbf{1}-\lvert S_{\ell}\rvert n% \mathbf{1}=\mathbf{0}bold_J italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_n | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | bold_1 - | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | italic_n bold_1 = bold_0. This implies that vsubscript𝑣v_{\ell}italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is an eigenvector of L+βn𝐉β𝐈𝐿𝛽𝑛𝐉𝛽𝐈L+\frac{\beta}{n}\mathbf{J}-\beta\mathbf{I}italic_L + divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I corresponding to the eigenvalue β𝛽-\beta- italic_β, and therefore the above inequality holds. ∎

A similar result was obtained in [14] in the context of a directed node-disjoint cycle cover. Let us restate Proposition 4 in terms of the adjacency matrix of a graph.

Corollary 4.

Let G𝐺Gitalic_G be a graph with n3𝑛3n\geq 3italic_n ≥ 3 vertices consisting of k2𝑘2k\geq 2italic_k ≥ 2 connected components. Denote by Ssubscript𝑆S_{\ell}italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT the set of vertices in component [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ]. Let X𝑋Xitalic_X be the adjacency matrix of G𝐺Gitalic_G. Further, let v=n𝟙S|S|𝟏subscript𝑣𝑛subscriptdouble-struck-𝟙subscript𝑆subscript𝑆1v_{\ell}=n\mathbb{1}_{S_{\ell}}-\lvert S_{\ell}\rvert\mathbf{1}italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_n blackboard_𝟙 start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT - | italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | bold_1 for all [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ] and let v(2)=vvsubscriptsuperscript𝑣2subscript𝑣subscript𝑣{v}^{(2)}_{\ell}={v}_{\ell}\circ{v}_{\ell}italic_v start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∘ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. Then

vvv(2)𝟏,X>vv,βn𝐉β𝐈subscript𝑣subscriptsuperscript𝑣topsubscriptsuperscript𝑣2superscript1top𝑋subscript𝑣subscriptsuperscript𝑣top𝛽𝑛𝐉𝛽𝐈\langle{v}_{\ell}{v}^{\top}_{\ell}-{v}^{(2)}_{\ell}\mathbf{1}^{\top},X\rangle>% \Big{\langle}{v}_{\ell}{v}^{\top}_{\ell},\frac{\beta}{n}\mathbf{J}-\beta% \mathbf{I}\Big{\rangle}⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT - italic_v start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩ > ⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I ⟩

for all [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ] and β>0𝛽0\beta>0italic_β > 0.

Proof.

The claim follows from Proposition 4, the fact that L=Diag(X𝟏)X𝐿Diag𝑋1𝑋L=\text{Diag}(X{\mathbf{1}})-Xitalic_L = Diag ( italic_X bold_1 ) - italic_X and using v(2)𝟏,X=vv,Diag(X𝟏)superscriptsubscript𝑣2superscript1top𝑋subscript𝑣subscriptsuperscript𝑣topDiag𝑋1\langle{v}_{\ell}^{(2)}\mathbf{1}^{\top},X\rangle=\langle{v}_{\ell}{v}^{\top}_% {\ell},\text{Diag}(X{\mathbf{1}})\rangle⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩ = ⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , Diag ( italic_X bold_1 ) ⟩. ∎

We can now use the result of Corollary 4 to derive Chvátal-Gomory cuts for the QMSTP. Let β=2(1cos(πn))𝛽21𝜋𝑛\beta=2\left(1-\cos\left(\frac{\pi}{n}\right)\right)italic_β = 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ), then we have the following CG cut:

vvv(2)𝟏,Xvv,βn𝐉β𝐈[k],formulae-sequencesubscript𝑣subscriptsuperscript𝑣topsubscriptsuperscript𝑣2superscript1top𝑋subscript𝑣subscriptsuperscript𝑣top𝛽𝑛𝐉𝛽𝐈delimited-[]𝑘\displaystyle\langle{v}_{\ell}{v}^{\top}_{\ell}-{v}^{(2)}_{\ell}\mathbf{1}^{% \top},X\rangle\leq\left\lfloor\bigg{\langle}{v}_{\ell}{v}^{\top}_{\ell},\frac{% \beta}{n}\mathbf{J}-\beta\mathbf{I}\bigg{\rangle}\right\rfloor\quad\ell\in[k],⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT - italic_v start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩ ≤ ⌊ ⟨ italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG bold_J - italic_β bold_I ⟩ ⌋ roman_ℓ ∈ [ italic_k ] ,

where vsubscript𝑣v_{\ell}italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is defined as in Proposition 4. One can use the above cuts within a branch-and-cut framework to solve (6) and/or (7). In particular, those cuts may be used to separate matrices that are in \mathcal{F}caligraphic_F, see (4), but not in 𝒯Msubscript𝒯𝑀\mathcal{T}_{M}caligraphic_T start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT, see (5).

In the sequel, we derive the cut-set constraints (2) as CG cuts. Let SV𝑆𝑉S\subsetneq Vitalic_S ⊊ italic_V, S𝑆S\neq\emptysetitalic_S ≠ ∅ and X𝑋Xitalic_X be feasible for (6) or (7). Then, for the PSD matrix 𝟙S𝟙Ssubscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆top\mathbb{1}_{S}\mathbb{1}_{S}^{\top}blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT we have that

𝟙S𝟙S,Diag(X𝟏)X+α𝐉β𝐈0,subscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆topDiag𝑋1𝑋𝛼𝐉𝛽𝐈0\displaystyle\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top},\text{Diag}(X\mathbf{1% })-X+\alpha\mathbf{J}-\beta\mathbf{I}\rangle\geq 0,⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , Diag ( italic_X bold_1 ) - italic_X + italic_α bold_J - italic_β bold_I ⟩ ≥ 0 , (8)

is a valid inequality for (6) and (7). After rewriting (8) and exploiting 𝟙S𝟙S,Diag(X𝟏)=𝟙S𝟏,Xsubscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆topDiag𝑋1subscriptdouble-struck-𝟙𝑆superscript1top𝑋\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top},\text{Diag}(X{\mathbf{1}})\rangle=% \langle\mathbb{1}_{S}\mathbf{1}^{\top},X\rangle⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , Diag ( italic_X bold_1 ) ⟩ = ⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩, we have 𝟙S𝟙S𝟙S𝟏,X𝟙S𝟙S,α𝐉β𝐈.subscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆topsubscriptdouble-struck-𝟙𝑆superscript1top𝑋subscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆top𝛼𝐉𝛽𝐈\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top}-\mathbb{1}_{S}\mathbf{1}^{\top},X% \rangle\leq\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top},\alpha\mathbf{J}-\beta% \mathbf{I}\rangle.⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩ ≤ ⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_α bold_J - italic_β bold_I ⟩ . Since the left-hand side of this inequality is integer, we may round the right-hand side, which results in the following CG cut 𝟙S𝟙S𝟙S𝟏,X𝟙S𝟙S,α𝐉β𝐈,subscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆topsubscriptdouble-struck-𝟙𝑆superscript1top𝑋subscriptdouble-struck-𝟙𝑆superscriptsubscriptdouble-struck-𝟙𝑆top𝛼𝐉𝛽𝐈\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top}-\mathbb{1}_{S}\mathbf{1}^{\top},X% \rangle\leq\lfloor\langle\mathbb{1}_{S}\mathbb{1}_{S}^{\top},\alpha\mathbf{J}-% \beta\mathbf{I}\rangle\rfloor,⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_X ⟩ ≤ ⌊ ⟨ blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_α bold_J - italic_β bold_I ⟩ ⌋ , which after rewriting the left-hand side results in the following inequality

iSjSxij|S|(|S|αβ).subscript𝑖𝑆subscript𝑗𝑆subscript𝑥𝑖𝑗𝑆𝑆𝛼𝛽\displaystyle-\sum_{i\in S}\sum_{j\notin S}x_{ij}\leq\left\lfloor\lvert S% \rvert(\lvert S\rvert\alpha-\beta)\right\rfloor.- ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∉ italic_S end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≤ ⌊ | italic_S | ( | italic_S | italic_α - italic_β ) ⌋ . (9)

For α=βn𝛼𝛽𝑛\alpha=\frac{\beta}{n}italic_α = divide start_ARG italic_β end_ARG start_ARG italic_n end_ARG and β=2(1cos(πn))𝛽21𝜋𝑛\beta=2\left(1-\cos\left(\frac{\pi}{n}\right)\right)italic_β = 2 ( 1 - roman_cos ( divide start_ARG italic_π end_ARG start_ARG italic_n end_ARG ) ) we have that |S|(|S|αβ)=1𝑆𝑆𝛼𝛽1\lfloor\lvert S\rvert(\lvert S\rvert\alpha-\beta)\rfloor=-1⌊ | italic_S | ( | italic_S | italic_α - italic_β ) ⌋ = - 1, and the above CG cut implies the cut-set constraint eSxe1subscript𝑒𝑆subscript𝑥𝑒1\sum_{e\in\partial S}x_{e}\geq 1∑ start_POSTSUBSCRIPT italic_e ∈ ∂ italic_S end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ 1, see also (2). Let us summarize the previous discussion.

Proposition 5.

Let SV𝑆𝑉S\subsetneq Vitalic_S ⊊ italic_V, S𝑆S\neq\emptysetitalic_S ≠ ∅. Then, the cut-set constraint (2) is a Chvátal-Gomory cut with respect to the MISDPs (6) and (7).

Subsequently, we derive valid inequalities by exploiting the constraint (7c) that may be equivalently reformulated as Y(X)(X)𝟎succeeds-or-equals𝑌𝑋superscript𝑋top0Y-{\mathcal{B}}(X){\mathcal{B}}(X)^{\top}\succeq{\mathbf{0}}italic_Y - caligraphic_B ( italic_X ) caligraphic_B ( italic_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪰ bold_0. Let X𝑋Xitalic_X, Y𝑌Yitalic_Y be feasible for (7), iV𝑖𝑉i\in Vitalic_i ∈ italic_V, and 𝟙δ(i)subscriptdouble-struck-𝟙𝛿𝑖\mathbb{1}_{\delta(i)}blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT be the indicator vector of δ(i)𝛿𝑖\delta(i)italic_δ ( italic_i ). For fE𝑓𝐸f\in Eitalic_f ∈ italic_E, we define the following positive semidefinite matrix Pf𝐮k𝟙δ(i)+𝟙δ(i)𝐮k+𝐈m+(n1)𝐮k𝐮k,subscript𝑃𝑓subscript𝐮𝑘superscriptsubscriptdouble-struck-𝟙𝛿𝑖topsubscriptdouble-struck-𝟙𝛿𝑖superscriptsubscript𝐮𝑘topsubscript𝐈𝑚𝑛1subscript𝐮𝑘superscriptsubscript𝐮𝑘topP_{f}\coloneqq{\mathbf{u}}_{k}\mathbb{1}_{\delta(i)}^{\top}+\mathbb{1}_{\delta% (i)}{\mathbf{u}}_{k}^{\top}+{\mathbf{I}}_{m}+(n-1){\mathbf{u}}_{k}{\mathbf{u}}% _{k}^{\top},italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ≔ bold_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + bold_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + ( italic_n - 1 ) bold_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , where the index k𝑘kitalic_k corresponds to the ordering number of the edge f𝑓fitalic_f, i.e., (X)k=xfsubscript𝑋𝑘subscript𝑥𝑓{\mathcal{B}}(X)_{k}=x_{f}caligraphic_B ( italic_X ) start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT. Since Pf𝟎succeeds-or-equalssubscript𝑃𝑓0P_{f}\succeq{\mathbf{0}}italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ⪰ bold_0, it follows that Y(X)(X),Pf0.𝑌𝑋superscript𝑋topsubscript𝑃𝑓0\langle Y-{\mathcal{B}}(X){\mathcal{B}}(X)^{\top},P_{f}\rangle\geq 0.⟨ italic_Y - caligraphic_B ( italic_X ) caligraphic_B ( italic_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ⟩ ≥ 0 . By rewriting the left-hand side, we have

Y(X)(X),Pf=2eδ(i)yfe2xf(X)𝟙δ(i)0,𝑌𝑋superscript𝑋topsubscript𝑃𝑓2subscript𝑒𝛿𝑖subscript𝑦𝑓𝑒2subscript𝑥𝑓𝑋superscriptsubscriptdouble-struck-𝟙𝛿𝑖top0\langle Y-{\mathcal{B}}(X){\mathcal{B}}(X)^{\top},P_{f}\rangle=2\sum_{e\in% \delta(i)}y_{fe}-2x_{f}{\mathcal{B}}(X)\mathbb{1}_{\delta(i)}^{\top}\geq 0,⟨ italic_Y - caligraphic_B ( italic_X ) caligraphic_B ( italic_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ⟩ = 2 ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT - 2 italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT caligraphic_B ( italic_X ) blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≥ 0 ,

from where it follows eδ(i)yfexf(X)𝟙δ(i)xf,subscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑥𝑓𝑋superscriptsubscriptdouble-struck-𝟙𝛿𝑖topsubscript𝑥𝑓\sum_{e\in\delta(i)}y_{fe}\geq x_{f}{\mathcal{B}}(X)\mathbb{1}_{\delta(i)}^{% \top}\geq x_{f},∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT caligraphic_B ( italic_X ) blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≥ italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , since (X)𝟙δ(i)1𝑋superscriptsubscriptdouble-struck-𝟙𝛿𝑖top1{\mathcal{B}}(X)\mathbb{1}_{\delta(i)}^{\top}\geq 1caligraphic_B ( italic_X ) blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≥ 1 due to the fact that the underlying graph is connected. Moreover, (X)𝟙δ(i)1𝑋superscriptsubscriptdouble-struck-𝟙𝛿𝑖top1{\mathcal{B}}(X)\mathbb{1}_{\delta(i)}^{\top}\geq 1caligraphic_B ( italic_X ) blackboard_𝟙 start_POSTSUBSCRIPT italic_δ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≥ 1 is the cut-set constraint that is a CG cut. Thus, we have the following constraints

eδ(i)yfexffE,iV.formulae-sequencesubscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑥𝑓formulae-sequencefor-all𝑓𝐸for-all𝑖𝑉\displaystyle\sum_{e\in\delta(i)}y_{fe}\geq x_{f}\qquad\forall f\in E,~{}% \forall i\in V.∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_f ∈ italic_E , ∀ italic_i ∈ italic_V . (10)

Interestingly, these constraints follow also from the reformulation-linearization technique [38] applied to the cut-set constraints (2) with |S|=1𝑆1|S|=1| italic_S | = 1. Namely, after multiplying both sides of (2) by xfsubscript𝑥𝑓x_{f}italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT and replacing xfxesubscript𝑥𝑓subscript𝑥𝑒x_{f}x_{e}italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT by yfesubscript𝑦𝑓𝑒y_{fe}italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT, one obtains the constraints (10). We refer later to the constraints (10) as RLT-type constraints.

Proposition 6.

Let iV𝑖𝑉i\in Vitalic_i ∈ italic_V and fE𝑓𝐸f\in Eitalic_f ∈ italic_E. Then, the constraint (10) is a Chvátal-Gomory cut with respect to MISDP (7).

4 DNN relaxation

Here, we derive two doubly nonnegative relaxations for the QMSTP and derive their facially reduced formulations.

To this end, instead of the matrix X𝑋Xitalic_X, we introduce a vector y𝑦yitalic_y that results in relaxing (X)𝑋{\mathcal{B}}(X)caligraphic_B ( italic_X ) to y𝑦yitalic_y. We then use formulation (7) where we drop the linear matrix inequality (7d), and relax the constraint X𝑋X\in\mathcal{F}italic_X ∈ caligraphic_F, see (7e), to 𝟏y=n1superscript1top𝑦𝑛1\mathbf{1}^{\top}y=n-1bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1. Furthermore, we add the constraint Y𝟏=(n1)y𝑌1𝑛1𝑦Y\mathbf{1}=(n-1)yitalic_Y bold_1 = ( italic_n - 1 ) italic_y that can be derived from (7). Additionally, we impose nonnegativity constraints on the matrix variable, and obtain the following DNN relaxation:

min\displaystyle\min\ roman_min Q,Y𝑄𝑌\displaystyle\langle Q,Y\rangle⟨ italic_Q , italic_Y ⟩ (11a)
s.t. diag(Y)=ydiag𝑌𝑦\displaystyle\text{diag}(Y)=ydiag ( italic_Y ) = italic_y (11b)
Y𝟏=(n1)y,  1y=n1formulae-sequence𝑌1𝑛1𝑦superscript1top𝑦𝑛1\displaystyle Y\mathbf{1}=(n-1)y,\,\,\mathbf{1}^{\top}y=n-1italic_Y bold_1 = ( italic_n - 1 ) italic_y , bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1 (11c)
Y𝟎,(Yyy1)𝟎.formulae-sequence𝑌0succeeds-or-equalsmatrix𝑌𝑦superscript𝑦top10\displaystyle{Y\geq\mathbf{0}},~{}~{}\begin{pmatrix}Y&y\\ {y}^{\top}&1\end{pmatrix}\succeq\mathbf{0}.italic_Y ≥ bold_0 , ( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ⪰ bold_0 . (11d)

The above relaxation does not include the connectivity constraint (7d), because that constraint has only a small impact on the bound. However, it makes the relaxation more difficult to solve. In order to include a type of connectivity constraints in (11), we consider valid inequalities from Section 3.1. Preliminary numerical results show that by adding the cut-set constraints (2), see also Proposition 5, the resulting bound only marginally improves on the DNN bound (11). The RLT-type cuts (10), however, turn out to have a more positive impact on the bound value. We therefore present the following strengthening of the relaxation (11):

minQ,Ys.t. (11b)–(11d)eδ(i)yfeyffE,iV.𝑄𝑌s.t. (11b)–(11d)missing-subexpressionformulae-sequencesubscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑦𝑓formulae-sequencefor-all𝑓𝐸for-all𝑖𝑉\displaystyle\begin{aligned} \min\ &\langle Q,Y\rangle\\ \text{s.t. }&\text{\eqref{subeq:sdp1:diag}--\eqref{subeq:sdp1:psd}}\\ &\sum_{e\in\delta(i)}y_{fe}\geq y_{f}\qquad\forall f\in E,~{}\forall i\in V.% \end{aligned}start_ROW start_CELL roman_min end_CELL start_CELL ⟨ italic_Q , italic_Y ⟩ end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL ( )–( ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_f ∈ italic_E , ∀ italic_i ∈ italic_V . end_CELL end_ROW (12)

In the remaining part of this section, we perform facial reduction of the DNN relaxations. Let Y~=(Yyy1)~𝑌𝑌𝑦superscript𝑦top1\widetilde{Y}=\left(\begin{smallmatrix}Y&y\\ {y}^{\top}&1\end{smallmatrix}\right)over~ start_ARG italic_Y end_ARG = ( start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW ) and Q~=(Q𝟎m𝟎m0).~𝑄𝑄subscript0𝑚subscriptsuperscript0top𝑚0\widetilde{Q}=\left(\begin{smallmatrix}Q&\mathbf{0}_{m}\\ {\mathbf{0}}^{\top}_{m}&0\end{smallmatrix}\right).over~ start_ARG italic_Q end_ARG = ( start_ROW start_CELL italic_Q end_CELL start_CELL bold_0 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_0 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL end_ROW ) . It is not difficult to verify that

T=(𝟏m(n1))𝑇matrixsubscript1𝑚𝑛1\displaystyle T=\begin{pmatrix}\mathbf{1}_{m}\\ -(n-1)\end{pmatrix}italic_T = ( start_ARG start_ROW start_CELL bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - ( italic_n - 1 ) end_CELL end_ROW end_ARG ) (13)

is an eigenvector corresponding to the zero eigenvalue of any matrix Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG feasible for (11). Since there is no feasible matrix Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG which is positive definite, the DNN relaxation (11) has no Slater feasible point.

To provide a facially reduced DNN relaxation of (11), let W(m+1)×m𝑊superscript𝑚1𝑚W\in\mathbb{R}^{(m+1)\times m}italic_W ∈ roman_ℝ start_POSTSUPERSCRIPT ( italic_m + 1 ) × italic_m end_POSTSUPERSCRIPT be a matrix whose columns form a basis for 𝒲=null(T)𝒲nullsuperscript𝑇top{\mathcal{W}}={\rm null}(T^{\top})caligraphic_W = roman_null ( italic_T start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ), see (13). As we will show in Theorem 3 later on in this section, the relaxation (11) may be equivalently written as the following facially reduced relaxation:

minQ~,WRWs.t. diag(WRW)=(WRW)𝐮m+1(WRW)m+1,m+1=1WRW𝟎,R𝟎.~𝑄𝑊𝑅superscript𝑊tops.t. diag𝑊𝑅superscript𝑊top𝑊𝑅superscript𝑊topsubscript𝐮𝑚1missing-subexpressionsubscript𝑊𝑅superscript𝑊top𝑚1𝑚11missing-subexpressionformulae-sequence𝑊𝑅superscript𝑊top0succeeds-or-equals𝑅0\displaystyle\begin{aligned} \min\ &\langle\widetilde{Q},WRW^{\top}\rangle\\ \text{s.t. }&\text{diag}(WRW^{\top})=(WRW^{\top})\mathbf{u}_{m+1}\\ &(WRW^{\top})_{m+1,m+1}=1\\ &WRW^{\top}\geq\mathbf{0},\quad R\succeq\mathbf{0}.\end{aligned}start_ROW start_CELL roman_min end_CELL start_CELL ⟨ over~ start_ARG italic_Q end_ARG , italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⟩ end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL diag ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) bold_u start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_m + 1 , italic_m + 1 end_POSTSUBSCRIPT = 1 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≥ bold_0 , italic_R ⪰ bold_0 . end_CELL end_ROW (14)

We obtained this relaxation from (11) by replacing Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG with WRW𝑊𝑅superscript𝑊topWRW^{\top}italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and removing redundant constraints. Note that the feasible set of (11) is contained in W𝒮+mW𝑊subscriptsuperscript𝒮𝑚superscript𝑊topW\mathcal{S}^{m}_{+}W^{\top}italic_W caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, which is a face of 𝒮+m+1subscriptsuperscript𝒮𝑚1\mathcal{S}^{m+1}_{+}caligraphic_S start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. To show that (14) has an interior point, we use Theorem 3.15 from [22]. That theorem additionally takes into account a zero pattern in the feasible matrix, which is not present in our problem.

Theorem 2 (Theorem 3.15 in [22]).

Let 𝒬={ym:𝒜((yyyy1))=𝟎,y𝟎},𝒬conditional-set𝑦superscript𝑚formulae-sequence𝒜matrix𝑦superscript𝑦top𝑦limit-from𝑦top10𝑦0\mathcal{Q}=\Bigg{\{}y\in\mathbb{R}^{m}~{}:~{}\mathcal{A}\bigg{(}\bigg{(}% \begin{matrix}yy^{\top}&y\\ y\top&1\end{matrix}\bigg{)}\bigg{)}=\mathbf{0},\ y\geq\mathbf{0}\Bigg{\}},caligraphic_Q = { italic_y ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : caligraphic_A ( ( start_ARG start_ROW start_CELL italic_y italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y ⊤ end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ) = bold_0 , italic_y ≥ bold_0 } , where 𝒜𝒜\mathcal{A}caligraphic_A is a linear transformation, be the feasible set of a quadratically constrained program. Suppose aff(conv(𝒬))=affconv𝒬\text{aff}(\operatorname{conv}(\mathcal{Q}))=\mathcal{L}aff ( roman_conv ( caligraphic_Q ) ) = caligraphic_L with dim()=pdimension𝑝\dim(\mathcal{L})=proman_dim ( caligraphic_L ) = italic_p. Then, there exist a matrix C𝐶Citalic_C with full row rank and d𝑑ditalic_d such that ={ym:Cy=d}.conditional-set𝑦superscript𝑚𝐶𝑦𝑑\mathcal{L}=\big{\{}y\in\mathbb{R}^{m}:Cy=d\big{\}}.caligraphic_L = { italic_y ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : italic_C italic_y = italic_d } .

Let M=(Cd)𝑀matrix𝐶𝑑M=(\begin{matrix}C&-d\end{matrix})italic_M = ( start_ARG start_ROW start_CELL italic_C end_CELL start_CELL - italic_d end_CELL end_ROW end_ARG ) and W𝑊Witalic_W be a matrix such that its columns form a basis of null(M)null𝑀{\rm null}(M)roman_null ( italic_M ). Let 𝒥={(i,j):yiyj=0y𝒬}𝒥conditional-set𝑖𝑗subscript𝑦𝑖subscript𝑦𝑗0for-all𝑦𝒬\mathcal{J}=\big{\{}(i,j):y_{i}y_{j}=0\ \forall y\in\mathcal{Q}\big{\}}caligraphic_J = { ( italic_i , italic_j ) : italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 ∀ italic_y ∈ caligraphic_Q } and 𝒥csuperscript𝒥𝑐\mathcal{J}^{c}caligraphic_J start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT be its complement. Then, there exists a Slater point R^^𝑅\hat{R}over^ start_ARG italic_R end_ARG for the facially reduced, DNN feasible set:

𝒬^R={R𝒮p+1:R𝟎,(WRW)𝒥=0,(WRW)𝒥c𝟎,𝒜(WRW)=𝟎}.subscript^𝒬𝑅conditional-set𝑅superscript𝒮𝑝1formulae-sequencesucceeds-or-equals𝑅0formulae-sequencesubscript𝑊𝑅superscript𝑊top𝒥0formulae-sequencesubscript𝑊𝑅superscript𝑊topsuperscript𝒥𝑐0𝒜𝑊𝑅superscript𝑊top0\hat{\mathcal{Q}}_{R}=\Big{\{}R\in\mathcal{S}^{p+1}:R\succeq\mathbf{0},\ \big{% (}{WRW^{\top}}\big{)}_{\mathcal{J}}=0,\ \big{(}{WRW^{\top}}\big{)}_{\mathcal{J% }^{c}}\geq\mathbf{0},\ \mathcal{A}\big{(}WRW^{\top}\big{)}=\mathbf{0}\Big{\}}.over^ start_ARG caligraphic_Q end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT = { italic_R ∈ caligraphic_S start_POSTSUPERSCRIPT italic_p + 1 end_POSTSUPERSCRIPT : italic_R ⪰ bold_0 , ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT caligraphic_J end_POSTSUBSCRIPT = 0 , ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT caligraphic_J start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ bold_0 , caligraphic_A ( italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = bold_0 } .

We are now ready to state the following result on our facially reduced problem.

Theorem 3.

For n3𝑛3n\geq 3italic_n ≥ 3, the DNN relaxation (14) is a strictly feasible equivalent reformulation of (11).

Proof.

Let

𝒬n(m)subscript𝒬𝑛𝑚\displaystyle\mathcal{Q}_{n}(m)caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ) ={y{0,1}m:𝟏my=n1}={ym:𝟏my=n1,yiyi=yii[m]}absentconditional-set𝑦superscript01𝑚superscriptsubscript1𝑚top𝑦𝑛1conditional-set𝑦superscript𝑚formulae-sequencesuperscriptsubscript1𝑚top𝑦𝑛1subscript𝑦𝑖subscript𝑦𝑖subscript𝑦𝑖for-all𝑖delimited-[]𝑚\displaystyle=\big{\{}y\in\{0,1\}^{m}:\mathbf{1}_{m}^{\top}y=n-1\big{\}}=\big{% \{}y\in\mathbb{R}^{m}:\mathbf{1}_{m}^{\top}y=n-1,\ y_{i}y_{i}=y_{i}\ \forall i% \in[m]\big{\}}= { italic_y ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1 } = { italic_y ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1 , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∀ italic_i ∈ [ italic_m ] }
={ym:𝒜((yyyy1))=𝟎,y𝟎},absentconditional-set𝑦superscript𝑚formulae-sequence𝒜matrix𝑦superscript𝑦top𝑦superscript𝑦top10𝑦0\displaystyle=\bigg{\{}y\in\mathbb{R}^{m}:\mathcal{A}\bigg{(}\begin{pmatrix}yy% ^{\top}&y\\ y^{\top}&1\end{pmatrix}\bigg{)}=\mathbf{0},\ y\geq\mathbf{0}\bigg{\}},= { italic_y ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : caligraphic_A ( ( start_ARG start_ROW start_CELL italic_y italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ) = bold_0 , italic_y ≥ bold_0 } ,

where 𝒜(X)=(𝒜1(X),,𝒜2m+1(X))𝒜𝑋superscriptmatrixsubscript𝒜1𝑋subscript𝒜2𝑚1𝑋top\mathcal{A}(X)=\begin{pmatrix}\mathcal{A}_{1}(X),\cdots,\mathcal{A}_{2m+1}(X)% \end{pmatrix}^{\top}caligraphic_A ( italic_X ) = ( start_ARG start_ROW start_CELL caligraphic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_X ) , ⋯ , caligraphic_A start_POSTSUBSCRIPT 2 italic_m + 1 end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT with

𝒜i(X)subscript𝒜𝑖𝑋\displaystyle\mathcal{A}_{i}(X)caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_X ) =(𝐮i𝐮i12𝐮i12𝐮i0),Xabsentmatrixsubscript𝐮𝑖superscriptsubscript𝐮𝑖top12subscript𝐮𝑖12superscriptsubscript𝐮𝑖top0𝑋\displaystyle=\bigg{\langle}\begin{pmatrix}\mathbf{u}_{i}\mathbf{u}_{i}^{\top}% &-\frac{1}{2}\mathbf{u}_{i}\\ -\frac{1}{2}\mathbf{u}_{i}^{\top}&0\end{pmatrix},X\bigg{\rangle}= ⟨ ( start_ARG start_ROW start_CELL bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL - divide start_ARG 1 end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - divide start_ARG 1 end_ARG start_ARG 2 end_ARG bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , italic_X ⟩ for all i[m], andfor all 𝑖delimited-[]𝑚, and\displaystyle\text{for all }i\in[m]\text{, and}for all italic_i ∈ [ italic_m ] , and
𝒜m+i(X)subscript𝒜𝑚𝑖𝑋\displaystyle\mathcal{A}_{m+i}(X)caligraphic_A start_POSTSUBSCRIPT italic_m + italic_i end_POSTSUBSCRIPT ( italic_X ) =12(𝐮i(𝟏m(n1))+(𝟏m(n1))ui),Xabsent12subscript𝐮𝑖matrixsuperscriptsubscript1𝑚top𝑛1matrixsubscript1𝑚𝑛1superscriptsubscript𝑢𝑖top𝑋\displaystyle=\bigg{\langle}\frac{1}{2}\bigg{(}\mathbf{u}_{i}\begin{pmatrix}% \mathbf{1}_{m}^{\top}&-(n-1)\end{pmatrix}+\begin{pmatrix}\mathbf{1}_{m}\\ -(n-1)\end{pmatrix}u_{i}^{\top}\bigg{)},X\bigg{\rangle}= ⟨ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( start_ARG start_ROW start_CELL bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL - ( italic_n - 1 ) end_CELL end_ROW end_ARG ) + ( start_ARG start_ROW start_CELL bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - ( italic_n - 1 ) end_CELL end_ROW end_ARG ) italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) , italic_X ⟩ for all i[m+1].for all 𝑖delimited-[]𝑚1\displaystyle\text{for all }i\in[m+1].for all italic_i ∈ [ italic_m + 1 ] .

Note that in the definition of 𝒬n(m)subscript𝒬𝑛𝑚\mathcal{Q}_{n}(m)caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ), the equality 𝒜i(X)=0subscript𝒜𝑖𝑋0\mathcal{A}_{i}(X)=0caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_X ) = 0 models the constraint yi2=yisuperscriptsubscript𝑦𝑖2subscript𝑦𝑖y_{i}^{2}=y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i[m]𝑖delimited-[]𝑚i\in[m]italic_i ∈ [ italic_m ]. The constraint 𝒜2m+1(X)=0subscript𝒜2𝑚1𝑋0\mathcal{A}_{2m+1}(X)=0caligraphic_A start_POSTSUBSCRIPT 2 italic_m + 1 end_POSTSUBSCRIPT ( italic_X ) = 0 models the constraint 𝟏my=n1superscriptsubscript1𝑚top𝑦𝑛1\mathbf{1}_{m}^{\top}y=n-1bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1. For the indices i[m]𝑖delimited-[]𝑚i\in[m]italic_i ∈ [ italic_m ], the constraint 𝒜m+i(X)=0subscript𝒜𝑚𝑖𝑋0\mathcal{A}_{m+i}(X)=0caligraphic_A start_POSTSUBSCRIPT italic_m + italic_i end_POSTSUBSCRIPT ( italic_X ) = 0 models the redundant constraint yi(𝟏my)=(n1)yisubscript𝑦𝑖superscriptsubscript1𝑚top𝑦𝑛1subscript𝑦𝑖y_{i}(\mathbf{1}_{m}^{\top}y)=(n-1)y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y ) = ( italic_n - 1 ) italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all 1im1𝑖𝑚1\leq i\leq m1 ≤ italic_i ≤ italic_m.

The convex hull equals conv(𝒬n(m))={y[0,1]m:𝟏my=n1}convsubscript𝒬𝑛𝑚conditional-set𝑦superscript01𝑚superscriptsubscript1𝑚top𝑦𝑛1\operatorname{conv}(\mathcal{Q}_{n}(m))=\big{\{}y\in[0,1]^{m}:\mathbf{1}_{m}^{% \top}y=n-1\big{\}}roman_conv ( caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ) ) = { italic_y ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1 }. For each index i[m]𝑖delimited-[]𝑚i\in[m]italic_i ∈ [ italic_m ] there exist vectors y1,y2𝒬n(m)superscript𝑦1superscript𝑦2subscript𝒬𝑛𝑚y^{1},y^{2}\in\mathcal{Q}_{n}(m)italic_y start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ) such that yi1>0subscriptsuperscript𝑦1𝑖0y^{1}_{i}>0italic_y start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 and yi2<1subscriptsuperscript𝑦2𝑖1y^{2}_{i}<1italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1, hence, we get that the affine hull is aff(conv(𝒬n(m)))={ym:𝟏my=n1},affconvsubscript𝒬𝑛𝑚conditional-set𝑦superscript𝑚superscriptsubscript1𝑚top𝑦𝑛1\text{aff}(\operatorname{conv}(\mathcal{Q}_{n}(m)))=\{y\in\mathbb{R}^{m}:% \mathbf{1}_{m}^{\top}y=n-1\},aff ( roman_conv ( caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ) ) ) = { italic_y ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_y = italic_n - 1 } , and has dimension mrank(𝟏m)=m1𝑚ranksuperscriptsubscript1𝑚top𝑚1m-\text{rank}(\mathbf{1}_{m}^{\top})=m-1italic_m - rank ( bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = italic_m - 1. Hence, M=T𝑀superscript𝑇topM=T^{\top}italic_M = italic_T start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT where M𝑀Mitalic_M is from Theorem 2 and T𝑇Titalic_T given in (13). Let W(m+1)×m𝑊superscript𝑚1𝑚W\in\mathbb{R}^{(m+1)\times m}italic_W ∈ roman_ℝ start_POSTSUPERSCRIPT ( italic_m + 1 ) × italic_m end_POSTSUPERSCRIPT be a matrix whose columns form a basis of the nullspace of M𝑀Mitalic_M. Then, a face of 𝒮+m+1subscriptsuperscript𝒮𝑚1{\mathcal{S}}^{m+1}_{+}caligraphic_S start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT containing the feasible set of (11) is of the form W𝒮+mW𝑊subscriptsuperscript𝒮𝑚superscript𝑊topW{\mathcal{S}}^{m}_{+}W^{\top}italic_W caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Therefore, one can replace Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG with WRW𝑊𝑅superscript𝑊topWRW^{\top}italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT in (11).

Moreover, it holds that for each pair of indices (i,j)[m]×[m]𝑖𝑗delimited-[]𝑚delimited-[]𝑚(i,j)\in[m]\times[m]( italic_i , italic_j ) ∈ [ italic_m ] × [ italic_m ], there exists a vector y𝒬n(m)𝑦subscript𝒬𝑛𝑚y\in\mathcal{Q}_{n}(m)italic_y ∈ caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_m ) such that yi=yj=1subscript𝑦𝑖subscript𝑦𝑗1y_{i}=y_{j}=1italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1, and hence the index set 𝒥={(i,j):yiyj=0y𝒬}𝒥conditional-set𝑖𝑗subscript𝑦𝑖subscript𝑦𝑗0for-all𝑦𝒬\mathcal{J}=\big{\{}(i,j):y_{i}y_{j}=0\ \forall y\in\mathcal{Q}\big{\}}caligraphic_J = { ( italic_i , italic_j ) : italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 ∀ italic_y ∈ caligraphic_Q } is empty. Thus, by Theorem 2, there exists a Slater feasible point for the facially reduced DNN relaxation (14). ∎

On top of imposing strict feasibility, facial reduction reduces both the number of variables and constraints. Therefore, the relaxation (14) is preferred over (11). In a similar fashion, relaxation (12) can be rewritten by replacing Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG in (12) by WRW𝑊𝑅superscript𝑊topWRW^{\top}italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

5 Peaceman-Rachford splitting method for the QMSTP

Interior point solvers have difficulties computing our DNN relaxations for medium-sized problems in a reasonable time due to the large number of (inequality) constraints. Therefore, we use the Peaceman-Rachford splitting method (PRSM) for computing the bounds. The PRSM was first proposed in [32, 27] and is a symmetric variant of the alternating direction method of multipliers (ADMM). For more details and convergence results we refer to [21].

5.1 PRSM for solving the DNN relaxation

In this section, we outline the main steps of the Peaceman-Rachford splitting method for solving the DNN relaxation for the QMSTP (14).

Recall that the matrix W𝑊Witalic_W should be such that its columns provide a basis for 𝒲=null(T)𝒲nullsuperscript𝑇top\mathcal{W}={\rm null}(T^{\top})caligraphic_W = roman_null ( italic_T start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ). For reasons explained later, we additionally require the columns of W𝑊Witalic_W to be orthonormal. Therefore, we take W𝑊Witalic_W as the matrix obtained from applying a QR decomposition to ((n1)𝐈m𝟏m)superscriptmatrix𝑛1subscript𝐈𝑚subscript1𝑚top(\begin{matrix}(n-1)\mathbf{I}_{m}&\mathbf{1}_{m}\end{matrix})^{\top}( start_ARG start_ROW start_CELL ( italic_n - 1 ) bold_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL start_CELL bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

Now, we define the following sets

\displaystyle\mathcal{R}caligraphic_R {RSm:R𝟎,tr(R)=n},absentconditional-set𝑅superscript𝑆𝑚formulae-sequencesucceeds-or-equals𝑅0tr𝑅𝑛\displaystyle\coloneqq\left\{R\in S^{m}~{}\colon~{}R\succeq\mathbf{0},\ \text{% tr}(R)=n\right\},≔ { italic_R ∈ italic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT : italic_R ⪰ bold_0 , tr ( italic_R ) = italic_n } , (15)
𝒴𝒴\displaystyle\mathcal{Y}caligraphic_Y {Y~Sm+1:Y~=(Yyy1),diag(Y)=y, 0Y~𝐉,tr(Y~)=n},absentconditional-set~𝑌superscript𝑆𝑚1formulae-sequenceformulae-sequence~𝑌matrix𝑌𝑦superscript𝑦top1formulae-sequencediag𝑌𝑦 0~𝑌𝐉tr~𝑌𝑛\displaystyle\coloneqq\bigg{\{}\widetilde{Y}\in S^{m+1}~{}\colon~{}\widetilde{% Y}=\begin{pmatrix}Y&y\\ y^{\top}&1\end{pmatrix},\ \text{diag}(Y)=y,\ \mathbf{0}\leq\widetilde{Y}\leq% \mathbf{J},\ \text{tr}(\widetilde{Y})=n\bigg{\}},≔ { over~ start_ARG italic_Y end_ARG ∈ italic_S start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT : over~ start_ARG italic_Y end_ARG = ( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) , diag ( italic_Y ) = italic_y , bold_0 ≤ over~ start_ARG italic_Y end_ARG ≤ bold_J , tr ( over~ start_ARG italic_Y end_ARG ) = italic_n } , (16)

and rewrite (14) as

min{Q~,Y~:Y~=WRW,R,Y~𝒴}.:~𝑄~𝑌formulae-sequence~𝑌𝑊𝑅superscript𝑊topformulae-sequence𝑅~𝑌𝒴\min~{}\Big{\{}\big{\langle}\widetilde{Q},\widetilde{Y}\big{\rangle}~{}\colon~% {}\widetilde{Y}=WRW^{\top},\ R\in\mathcal{R},\ \widetilde{Y}\in\mathcal{Y}\Big% {\}}.roman_min { ⟨ over~ start_ARG italic_Q end_ARG , over~ start_ARG italic_Y end_ARG ⟩ : over~ start_ARG italic_Y end_ARG = italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_R ∈ caligraphic_R , over~ start_ARG italic_Y end_ARG ∈ caligraphic_Y } . (17)

Note that we added redundant constraints to 𝒴𝒴\mathcal{Y}caligraphic_Y and \mathcal{R}caligraphic_R, where the constraint tr(R)=ntr𝑅𝑛\text{tr}(R)=ntr ( italic_R ) = italic_n holds, since the columns in W𝑊Witalic_W are orthonormalized. Those redundant constraints help for the efficiency of the algorithm, see e.g., [16, 29, 26].

For a fixed penalty parameter β>0𝛽0\beta>0italic_β > 0, the augmented Lagrangian function of (17) w.r.t. the constraint Y~=WRW~𝑌𝑊𝑅superscript𝑊top\widetilde{Y}=WRW^{\top}over~ start_ARG italic_Y end_ARG = italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is

β(R,Y~,S)=Q~,Y~+S,Y~WRW+β2Y~WRWF2.subscript𝛽𝑅~𝑌𝑆~𝑄~𝑌𝑆~𝑌𝑊𝑅superscript𝑊top𝛽2subscriptsuperscriptdelimited-∥∥~𝑌𝑊𝑅superscript𝑊top2𝐹\mathcal{L}_{\beta}(R,\widetilde{Y},S)=\big{\langle}\widetilde{Q},\widetilde{Y% }\big{\rangle}+\big{\langle}S,\widetilde{Y}-WRW^{\top}\big{\rangle}+\frac{% \beta}{2}\big{\lVert}\widetilde{Y}-WRW^{\top}\big{\rVert}^{2}_{F}.caligraphic_L start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_R , over~ start_ARG italic_Y end_ARG , italic_S ) = ⟨ over~ start_ARG italic_Q end_ARG , over~ start_ARG italic_Y end_ARG ⟩ + ⟨ italic_S , over~ start_ARG italic_Y end_ARG - italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⟩ + divide start_ARG italic_β end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG italic_Y end_ARG - italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT .

The basic idea of the PRSM is to iteratively alternate between optimizing βsubscript𝛽\mathcal{L}_{\beta}caligraphic_L start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT over R𝑅Ritalic_R and Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG and updating the dual variable S𝑆Sitalic_S. The (k+1)𝑘1(k+1)( italic_k + 1 )-th iteration of the PRSM to minimize the augmented Lagrangian function is

Rk+1superscript𝑅𝑘1\displaystyle R^{k+1}italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT =argminRβ(R,Y~k,Sk)absentsubscriptargmin𝑅subscript𝛽𝑅superscript~𝑌𝑘superscript𝑆𝑘\displaystyle=\operatorname*{arg\,min}_{R\in\mathcal{R}}\mathcal{L}_{\beta}(R,% \widetilde{Y}^{k},S^{k})= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_R ∈ caligraphic_R end_POSTSUBSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_R , over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT , italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT )
Sk+12superscript𝑆𝑘12\displaystyle S^{\frac{k+1}{2}}italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT =Sk+γ1β(Y~kWRk+1W)absentsuperscript𝑆𝑘subscript𝛾1𝛽superscript~𝑌𝑘𝑊superscript𝑅𝑘1superscript𝑊top\displaystyle=S^{k}+\gamma_{1}\beta(\widetilde{Y}^{k}-WR^{k+1}W^{\top})= italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_β ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT )
Y~k+1superscript~𝑌𝑘1\displaystyle\widetilde{Y}^{k+1}over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT =argminY~𝒴β(Rk+1,Y~,Sk+12)absentsubscriptargmin~𝑌𝒴subscript𝛽superscript𝑅𝑘1~𝑌superscript𝑆𝑘12\displaystyle=\operatorname*{arg\,min}_{\widetilde{Y}\in\mathcal{Y}}\mathcal{L% }_{\beta}(R^{k+1},\widetilde{Y},S^{\frac{k+1}{2}})= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT over~ start_ARG italic_Y end_ARG ∈ caligraphic_Y end_POSTSUBSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT , over~ start_ARG italic_Y end_ARG , italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT )
Sk+1superscript𝑆𝑘1\displaystyle S^{k+1}italic_S start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT =Sk+12+γ2β(Y~k+1WRk+1W),absentsuperscript𝑆𝑘12subscript𝛾2𝛽superscript~𝑌𝑘1𝑊superscript𝑅𝑘1superscript𝑊top\displaystyle=S^{\frac{k+1}{2}}+\gamma_{2}\beta(\widetilde{Y}^{k+1}-WR^{k+1}W^% {\top}),= italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT + italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_β ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ,

with step lengths γ1(1,1)subscript𝛾111\gamma_{1}\in(-1,1)italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ ( - 1 , 1 ) and γ2(0,1+52)subscript𝛾20152\gamma_{2}\in\big{(}0,\frac{1+\sqrt{5}}{2}\big{)}italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ ( 0 , divide start_ARG 1 + square-root start_ARG 5 end_ARG end_ARG start_ARG 2 end_ARG ) satisfying γ1+γ2>0subscript𝛾1subscript𝛾20\gamma_{1}+\gamma_{2}>0italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 and |γ1|<1+γ2γ22subscript𝛾11subscript𝛾2superscriptsubscript𝛾22\lvert\gamma_{1}\rvert<1+\gamma_{2}-\gamma_{2}^{2}| italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | < 1 + italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, see [21]. The optimization problems occurring in this PRSM scheme can be simplified to projection problems. Namely, optimizing the augmented Lagrangian over \mathcal{R}caligraphic_R can be simplified to

Rk+1superscript𝑅𝑘1\displaystyle R^{k+1}italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT =argminRSk,WRW+β2Y~kWRWF=𝒫(W(Y~k+1βSk)W),absentsubscriptargmin𝑅superscript𝑆𝑘𝑊𝑅superscript𝑊top𝛽2subscriptdelimited-∥∥superscript~𝑌𝑘𝑊𝑅superscript𝑊top𝐹subscript𝒫superscript𝑊topsuperscript~𝑌𝑘1𝛽superscript𝑆𝑘𝑊\displaystyle=\operatorname*{arg\,min}_{R\in\mathcal{R}}~{}\langle S^{k},-WRW^% {\top}\rangle+\frac{\beta}{2}\big{\lVert}\widetilde{Y}^{k}-WRW^{\top}\big{% \rVert}_{F}=\mathcal{P}_{\mathcal{R}}\bigg{(}W^{\top}\bigg{(}\widetilde{Y}^{k}% +\frac{1}{\beta}S^{k}\bigg{)}W\bigg{)},= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_R ∈ caligraphic_R end_POSTSUBSCRIPT ⟨ italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT , - italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⟩ + divide start_ARG italic_β end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = caligraphic_P start_POSTSUBSCRIPT caligraphic_R end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_β end_ARG italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) italic_W ) ,

where we exploited the fact that the columns of W𝑊Witalic_W are orthonormal. The projection 𝒫(M)subscript𝒫𝑀\mathcal{P}_{\mathcal{R}}(M)caligraphic_P start_POSTSUBSCRIPT caligraphic_R end_POSTSUBSCRIPT ( italic_M ) of a matrix M𝒮m𝑀superscript𝒮𝑚M\in\mathcal{S}^{m}italic_M ∈ caligraphic_S start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT onto the set \mathcal{R}caligraphic_R can be computed by projecting the eigenvalues of M𝑀Mitalic_M in the spectral decomposition onto the n𝑛nitalic_n-simplex ΔnsubscriptΔ𝑛\Delta_{n}roman_Δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, see e.g., [26]. In more detail, let M=UDiag(λ)U𝑀𝑈Diag𝜆superscript𝑈topM=U\text{Diag}({\lambda})U^{\top}italic_M = italic_U Diag ( italic_λ ) italic_U start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT be the eigenvalue decomposition of M𝑀Mitalic_M with λ𝜆{\lambda}italic_λ denoting the vector of eigenvalues of M𝑀Mitalic_M, then 𝒫(M)=UDiag(𝒫Δn(λ))U.subscript𝒫𝑀𝑈Diagsubscript𝒫subscriptΔ𝑛𝜆superscript𝑈top\mathcal{P}_{\mathcal{R}}(M)=U\text{Diag}(\mathcal{P}_{\Delta_{n}}({\lambda}))% U^{\top}.caligraphic_P start_POSTSUBSCRIPT caligraphic_R end_POSTSUBSCRIPT ( italic_M ) = italic_U Diag ( caligraphic_P start_POSTSUBSCRIPT roman_Δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_λ ) ) italic_U start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . The projection onto the simplex can be performed efficiently. We refer to [9] for an overview of algorithms for projecting onto the simplex and their complexities.

Similarly, the optimization problem over the polyhedral set 𝒴𝒴\mathcal{Y}caligraphic_Y can be reformulated as

Y~k+1superscript~𝑌𝑘1\displaystyle\widetilde{Y}^{k+1}over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT =argminY~𝒴Q~,Y~+Sk+12,Y~+β2Y~WRk+1WF=𝒫𝒴(WRk+1W1β(Q~+Sk+12)).absentsubscriptargmin~𝑌𝒴~𝑄~𝑌superscript𝑆𝑘12~𝑌𝛽2subscriptdelimited-∥∥~𝑌𝑊superscript𝑅𝑘1superscript𝑊top𝐹subscript𝒫𝒴𝑊superscript𝑅𝑘1superscript𝑊top1𝛽~𝑄superscript𝑆𝑘12\displaystyle=\operatorname*{arg\,min}_{\widetilde{Y}\in\mathcal{Y}}\big{% \langle}\widetilde{Q},\widetilde{Y}\big{\rangle}+\big{\langle}S^{\frac{k+1}{2}% },\widetilde{Y}\big{\rangle}+\frac{\beta}{2}\big{\lVert}\widetilde{Y}-WR^{k+1}% W^{\top}\big{\rVert}_{F}=\mathcal{P}_{\mathcal{Y}}\Big{(}WR^{k+1}W^{\top}-% \frac{1}{\beta}\big{(}\widetilde{Q}+S^{\frac{k+1}{2}}\big{)}\Big{)}.= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT over~ start_ARG italic_Y end_ARG ∈ caligraphic_Y end_POSTSUBSCRIPT ⟨ over~ start_ARG italic_Q end_ARG , over~ start_ARG italic_Y end_ARG ⟩ + ⟨ italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT , over~ start_ARG italic_Y end_ARG ⟩ + divide start_ARG italic_β end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG italic_Y end_ARG - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = caligraphic_P start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_β end_ARG ( over~ start_ARG italic_Q end_ARG + italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) ) .

The projection onto 𝒴𝒴\mathcal{Y}caligraphic_Y can then be done in the following way

𝒫𝒴((Zzzω))=𝒫[0,1]((ZDiag(diag(Z))+vvv1)),subscript𝒫𝒴matrix𝑍𝑧superscript𝑧top𝜔subscript𝒫01matrix𝑍Diagdiag𝑍𝑣𝑣superscript𝑣top1\mathcal{P}_{\mathcal{Y}}\bigg{(}\begin{pmatrix}Z&z\\ z^{\top}&\omega\end{pmatrix}\bigg{)}=\mathcal{P}_{[0,1]}\Bigg{(}\begin{pmatrix% }Z-\text{Diag}(\text{diag}(Z))+v&v\\ v^{\top}&1\end{pmatrix}\Bigg{)},caligraphic_P start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( ( start_ARG start_ROW start_CELL italic_Z end_CELL start_CELL italic_z end_CELL end_ROW start_ROW start_CELL italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL italic_ω end_CELL end_ROW end_ARG ) ) = caligraphic_P start_POSTSUBSCRIPT [ 0 , 1 ] end_POSTSUBSCRIPT ( ( start_ARG start_ROW start_CELL italic_Z - Diag ( diag ( italic_Z ) ) + italic_v end_CELL start_CELL italic_v end_CELL end_ROW start_ROW start_CELL italic_v start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) ) ,

where v=𝒫Δ¯(n1)(13diag(Z)+23z)𝑣subscript𝒫¯Δ𝑛113diag𝑍23𝑧v=\mathcal{P}_{\bar{\Delta}(n-1)}\big{(}\frac{1}{3}\text{diag}(Z)+\frac{2}{3}z% \big{)}italic_v = caligraphic_P start_POSTSUBSCRIPT over¯ start_ARG roman_Δ end_ARG ( italic_n - 1 ) end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG 3 end_ARG diag ( italic_Z ) + divide start_ARG 2 end_ARG start_ARG 3 end_ARG italic_z ) and 𝒫[0,1]subscript𝒫01\mathcal{P}_{[0,1]}caligraphic_P start_POSTSUBSCRIPT [ 0 , 1 ] end_POSTSUBSCRIPT denotes the elementwise projection onto the interval [0,1]01[0,1][ 0 , 1 ].

5.2 PRSM for solving the strengthened DNN relaxation

In this subsection, we modify the previously described PRSM algorithm that solves the relaxation (14), so that it can handle additional RLT-type constraints.

Let us extend the set 𝒴𝒴\mathcal{Y}caligraphic_Y, see (16), by adding the RLT-type constraints, yielding

𝒴RLT={Y~Sm+1:Y~=(Yyy1),diag(Y)=y,tr(Y~)=n, 0Y~𝐉,eδ(i)yfeyffE,iV}.subscript𝒴𝑅𝐿𝑇conditional-set~𝑌superscript𝑆𝑚1formulae-sequenceformulae-sequence~𝑌matrix𝑌𝑦superscript𝑦top1formulae-sequencediag𝑌𝑦formulae-sequencetr~𝑌𝑛 0~𝑌𝐉formulae-sequencesubscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑦𝑓formulae-sequencefor-all𝑓𝐸for-all𝑖𝑉\mathcal{Y}_{RLT}=\bigg{\{}\widetilde{Y}\in S^{m+1}\colon\widetilde{Y}=\begin{% pmatrix}Y&y\\ y^{\top}&1\end{pmatrix},\ \text{diag}(Y)=y,\ \text{tr}(\widetilde{Y})=n,\ % \mathbf{0}\leq\widetilde{Y}\leq\mathbf{J},\\ \sum_{e\in\delta(i)}y_{fe}\geq y_{f}\quad\forall f\in E,~{}\forall i\in V\bigg% {\}}.start_ROW start_CELL caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT = { over~ start_ARG italic_Y end_ARG ∈ italic_S start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT : over~ start_ARG italic_Y end_ARG = ( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) , diag ( italic_Y ) = italic_y , tr ( over~ start_ARG italic_Y end_ARG ) = italic_n , bold_0 ≤ over~ start_ARG italic_Y end_ARG ≤ bold_J , end_CELL end_ROW start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_f ∈ italic_E , ∀ italic_i ∈ italic_V } . end_CELL end_ROW

Thus, the strengthened DNN relaxation (17) is as follows

min{Q~,Y~:Y~=WRW,R,Y~𝒴RLT}.:~𝑄~𝑌formulae-sequence~𝑌𝑊𝑅superscript𝑊topformulae-sequence𝑅~𝑌subscript𝒴𝑅𝐿𝑇\min~{}\Big{\{}\big{\langle}\widetilde{Q},\widetilde{Y}\big{\rangle}~{}\colon~% {}\widetilde{Y}=WRW^{\top},\ R\in\mathcal{R},\ \widetilde{Y}\in\mathcal{Y}_{% RLT}\Big{\}}.roman_min { ⟨ over~ start_ARG italic_Q end_ARG , over~ start_ARG italic_Y end_ARG ⟩ : over~ start_ARG italic_Y end_ARG = italic_W italic_R italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_R ∈ caligraphic_R , over~ start_ARG italic_Y end_ARG ∈ caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT } . (18)

The RLT-type constraints make the projection onto 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT significantly harder. To the best of our knowledge, there is no closed-form expression for the projection onto 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT. However, one may write 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT as an intersection of sets that are easier to project on and then use an algorithm to project onto the intersection of convex sets. The cyclic Dykstra’s projection algorithm [5] is a suitable algorithm. An overview and analysis of algorithms to project onto the intersection of convex sets can be found in [2].

To apply Dykstra’s cyclic projection algorithm, let 𝒦𝒦\mathcal{K}caligraphic_K denote a coloring of the graph G𝐺Gitalic_G, i.e., 𝒦={K1,,KN}𝒦subscript𝐾1subscript𝐾𝑁\mathcal{K}=\{K_{1},\ldots,K_{N}\}caligraphic_K = { italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_K start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } is a partitioning of V𝑉Vitalic_V into independent sets of G𝐺Gitalic_G. We then define the polyhedral sets 𝒴ksuperscript𝒴𝑘\mathcal{Y}^{k}caligraphic_Y start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT as

𝒴k{Y~(m+1)×(m+1):Y~=(Yyy1),diag(Y)=y,eδ(i)yfeyffE,iKk},superscript𝒴𝑘conditional-set~𝑌superscript𝑚1𝑚1formulae-sequence~𝑌matrix𝑌𝑦superscript𝑦top1formulae-sequencediag𝑌𝑦formulae-sequencesubscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑦𝑓formulae-sequencefor-all𝑓𝐸for-all𝑖subscript𝐾𝑘\displaystyle\mathcal{Y}^{k}\coloneqq\left\{\widetilde{Y}\in\mathbb{R}^{(m+1)% \times(m+1)}\,:\,\,\widetilde{Y}=\begin{pmatrix}Y&y\\ y^{\top}&1\end{pmatrix},\ \text{diag}(Y)=y,~{}\sum_{e\in\delta(i)}y_{fe}\geq y% _{f}\quad\forall f\in E,~{}\forall i\in K_{k}\right\},caligraphic_Y start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ≔ { over~ start_ARG italic_Y end_ARG ∈ roman_ℝ start_POSTSUPERSCRIPT ( italic_m + 1 ) × ( italic_m + 1 ) end_POSTSUPERSCRIPT : over~ start_ARG italic_Y end_ARG = ( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) , diag ( italic_Y ) = italic_y , ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_f ∈ italic_E , ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } ,

for k=1,,N𝑘1𝑁k=1,\ldots,Nitalic_k = 1 , … , italic_N. With this we can now rewrite 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT as 𝒴RLT=𝒴(k=1N𝒴k).subscript𝒴𝑅𝐿𝑇𝒴superscriptsubscript𝑘1𝑁superscript𝒴𝑘\mathcal{Y}_{RLT}=\mathcal{Y}\cap\left(\bigcap_{k=1}^{N}\mathcal{Y}^{k}\right).caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT = caligraphic_Y ∩ ( ⋂ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT caligraphic_Y start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) .

The projection onto the sets 𝒴ksuperscript𝒴𝑘\mathcal{Y}^{k}caligraphic_Y start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT can be performed independently over each row fE𝑓𝐸f\in Eitalic_f ∈ italic_E of Y𝑌Yitalic_Y and the corresponding entries of yfsubscript𝑦𝑓y_{f}italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT in Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG. This allows us to restrict ourselves to projections onto the following type of sets

Tfk{zm+2:zf=zm+1=zm+2,eδ(i)zezfiKk},subscriptsuperscript𝑇𝑘𝑓conditional-set𝑧superscript𝑚2formulae-sequencesubscript𝑧𝑓subscript𝑧𝑚1subscript𝑧𝑚2formulae-sequencesubscript𝑒𝛿𝑖subscript𝑧𝑒subscript𝑧𝑓for-all𝑖subscript𝐾𝑘\displaystyle T^{k}_{f}\coloneqq\left\{z\in\mathbb{R}^{m+2}\,:\,\,z_{f}=z_{m+1% }=z_{m+2},~{}\sum_{e\in\delta(i)}z_{e}\geq z_{f}\quad\forall i\in K_{k}\right\},italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ≔ { italic_z ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m + 2 end_POSTSUPERSCRIPT : italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } , (19)

where the first m+1𝑚1m+1italic_m + 1 entries correspond to the f𝑓fitalic_f-th row of Y~~𝑌\widetilde{Y}over~ start_ARG italic_Y end_ARG and the last entry zm+2subscript𝑧𝑚2z_{m+2}italic_z start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT corresponds to Y~m+1,fsubscript~𝑌𝑚1𝑓\widetilde{Y}_{m+1,f}over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_m + 1 , italic_f end_POSTSUBSCRIPT. The projection onto Tfksuperscriptsubscript𝑇𝑓𝑘T_{f}^{k}italic_T start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT can then be computed as presented in the following proposition.

Proposition 7.

Let am+2𝑎superscript𝑚2a\in\mathbb{R}^{m+2}italic_a ∈ roman_ℝ start_POSTSUPERSCRIPT italic_m + 2 end_POSTSUPERSCRIPT, fE𝑓𝐸f\in Eitalic_f ∈ italic_E and let 𝒦={K1,,KN}𝒦subscript𝐾1subscript𝐾𝑁\mathcal{K}=\{K_{1},\ldots,K_{N}\}caligraphic_K = { italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_K start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } denote a coloring of G𝐺Gitalic_G. For each iKk𝑖subscript𝐾𝑘i\in K_{k}italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, we define giaf+am+1+am+23eδ(i)aesubscript𝑔𝑖subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23subscript𝑒𝛿𝑖subscript𝑎𝑒g_{i}\coloneqq\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\sum_{e\in\delta(i)}a_{e}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≔ divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT and sort these values in non-increasing order, i.e., gσ(1)gσ(2)gσ(nk)subscript𝑔𝜎1subscript𝑔𝜎2subscript𝑔𝜎subscript𝑛𝑘g_{\sigma(1)}\geq g_{\sigma(2)}\geq\dots\geq g_{\sigma(n_{k})}italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT ≥ italic_g start_POSTSUBSCRIPT italic_σ ( 2 ) end_POSTSUBSCRIPT ≥ ⋯ ≥ italic_g start_POSTSUBSCRIPT italic_σ ( italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT, where nk=|Kk|subscript𝑛𝑘subscript𝐾𝑘n_{k}=|K_{k}|italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = | italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | and σ:[nk]Kk:𝜎delimited-[]subscript𝑛𝑘subscript𝐾𝑘\sigma\colon[n_{k}]\to K_{k}italic_σ : [ italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] → italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is an appropriate sorting permutation. For each p[nk]𝑝delimited-[]subscript𝑛𝑘p\in[n_{k}]italic_p ∈ [ italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ], let

ω(p)j=1pgσ(j)d(σ(j))3+j=1p1d(σ(j)),𝜔𝑝superscriptsubscript𝑗1𝑝subscript𝑔𝜎𝑗𝑑𝜎𝑗3superscriptsubscript𝑗1𝑝1𝑑𝜎𝑗\displaystyle\omega(p)\coloneqq\frac{\sum_{j=1}^{p}\frac{g_{\sigma(j)}}{d({% \sigma(j)})}}{3+\sum_{j=1}^{p}\frac{1}{d({\sigma(j)})}},italic_ω ( italic_p ) ≔ divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_σ ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_σ ( italic_j ) ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_σ ( italic_j ) ) end_ARG end_ARG ,

where d(σ(j))𝑑𝜎𝑗d(\sigma(j))italic_d ( italic_σ ( italic_j ) ) denotes the degree of vertex σ(j)𝜎𝑗\sigma(j)italic_σ ( italic_j ) in G𝐺Gitalic_G. If gi0subscript𝑔𝑖0g_{i}\leq 0italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 0 for all iKk𝑖subscript𝐾𝑘i\in K_{k}italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, then 𝒫Tfk(a)=zsubscript𝒫subscriptsuperscript𝑇𝑘𝑓𝑎𝑧\mathcal{P}_{T^{k}_{f}}(a)=zcaligraphic_P start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) = italic_z, where ze=aesubscript𝑧𝑒subscript𝑎𝑒z_{e}=a_{e}italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT for all eE{f}𝑒𝐸𝑓e\in E\setminus\{f\}italic_e ∈ italic_E ∖ { italic_f } and zf=zm+1=zm+2=af+am+1+am+23subscript𝑧𝑓subscript𝑧𝑚1subscript𝑧𝑚2subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23z_{f}=z_{m+1}=z_{m+2}=\frac{a_{f}+a_{m+1}+a_{m+2}}{3}italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT = divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG. Otherwise, let psuperscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT denote the largest index p𝑝pitalic_p for which gσ(p)>ω(p)subscript𝑔𝜎𝑝𝜔𝑝g_{\sigma(p)}>\omega(p)italic_g start_POSTSUBSCRIPT italic_σ ( italic_p ) end_POSTSUBSCRIPT > italic_ω ( italic_p ). Then, 𝒫Tfk(a)=zsubscript𝒫subscriptsuperscript𝑇𝑘𝑓𝑎𝑧\mathcal{P}_{T^{k}_{f}}(a)=zcaligraphic_P start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) = italic_z, where

zesubscript𝑧𝑒\displaystyle z_{e}italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ={af+am+1+am+23ω(p)if e{f,m+1,m+2},ae+1d(i)(giω(p))if eδ(i){f} for iKkV(f) with σ(i)p,ae1d(i)1eδ(i){f}aeif eδ(i){f} for iKkV(f) with eδ(i){f}ae<0,aeotherwise.absentcasessubscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23𝜔superscript𝑝if e{f,m+1,m+2}subscript𝑎𝑒1𝑑𝑖subscript𝑔𝑖𝜔superscript𝑝if eδ(i){f} for iKkV(f) with σ(i)p,subscript𝑎𝑒1𝑑𝑖1subscript𝑒𝛿𝑖𝑓subscript𝑎𝑒if eδ(i){f} for iKkV(f) with eδ(i){f}ae<0subscript𝑎𝑒otherwise.\displaystyle=\begin{cases}\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\omega(p^{*})&\text% {if $e\in\{f,m+1,m+2\}$},\\ a_{e}+\frac{1}{d(i)}(g_{i}-\omega(p^{*}))&\text{if $e\in\delta(i)\setminus\{f% \}$ for $i\in K_{k}\setminus V(f)$ with $\sigma(i)\leq p^{*}$,}\\ a_{e}-\frac{1}{d(i)-1}\sum_{e\in\delta(i)\setminus\{f\}}a_{e}&\text{if $e\in% \delta(i)\setminus\{f\}$ for $i\in K_{k}\cap V(f)$ with $\sum_{e\in\delta(i)% \setminus\{f\}}a_{e}<0$},\\ a_{e}&\text{otherwise.}\end{cases}= { start_ROW start_CELL divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - italic_ω ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_CELL start_CELL if italic_e ∈ { italic_f , italic_m + 1 , italic_m + 2 } , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_ω ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) end_CELL start_CELL if italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } for italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ) with italic_σ ( italic_i ) ≤ italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) - 1 end_ARG ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL if italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } for italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ italic_V ( italic_f ) with ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT < 0 , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL otherwise. end_CELL end_ROW
Proof.

First, observe that if gσ(1)0subscript𝑔𝜎10g_{\sigma(1)}\leq 0italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT ≤ 0, then gi0subscript𝑔𝑖0g_{i}\leq 0italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 0 for all iKk𝑖subscript𝐾𝑘i\in K_{k}italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Consequently, the projection of a𝑎aitalic_a onto Tfksubscriptsuperscript𝑇𝑘𝑓T^{k}_{f}italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT is given by z𝑧zitalic_z, where z𝑧zitalic_z is such that ze=aesubscript𝑧𝑒subscript𝑎𝑒z_{e}=a_{e}italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT for all eE{f}𝑒𝐸𝑓e\in E\setminus\{f\}italic_e ∈ italic_E ∖ { italic_f } and zf=zm+1=zm+2=af+am+1+am+23subscript𝑧𝑓subscript𝑧𝑚1subscript𝑧𝑚2subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23z_{f}=z_{m+1}=z_{m+2}=\frac{a_{f}+a_{m+1}+a_{m+2}}{3}italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT = divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG.

If gσ(1)>0subscript𝑔𝜎10g_{\sigma(1)}>0italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT > 0, then ω(1)=gσ(1)d(σ(1))3+1d(σ(1))<gσ(1)d(σ(1))gσ(1).𝜔1subscript𝑔𝜎1𝑑𝜎131𝑑𝜎1subscript𝑔𝜎1𝑑𝜎1subscript𝑔𝜎1\omega(1)=\frac{\frac{g_{\sigma(1)}}{d(\sigma(1))}}{3+\frac{1}{d(\sigma(1))}}<% \frac{g_{\sigma(1)}}{d(\sigma(1))}\leq g_{\sigma(1)}.italic_ω ( 1 ) = divide start_ARG divide start_ARG italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_σ ( 1 ) ) end_ARG end_ARG start_ARG 3 + divide start_ARG 1 end_ARG start_ARG italic_d ( italic_σ ( 1 ) ) end_ARG end_ARG < divide start_ARG italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_σ ( 1 ) ) end_ARG ≤ italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT . Hence, the largest index p𝑝pitalic_p for which gσ(p)>ω(p)subscript𝑔𝜎𝑝𝜔𝑝g_{\sigma(p)}>\omega(p)italic_g start_POSTSUBSCRIPT italic_σ ( italic_p ) end_POSTSUBSCRIPT > italic_ω ( italic_p ), i.e., the index psuperscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, exists. Next, we prove that the projection z=𝒫Tfk(a)𝑧subscript𝒫subscriptsuperscript𝑇𝑘𝑓𝑎z=\mathcal{P}_{T^{k}_{f}}(a)italic_z = caligraphic_P start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) is of the described form.

Using the fact that zf=zm+1=zm+2subscript𝑧𝑓subscript𝑧𝑚1subscript𝑧𝑚2z_{f}=z_{m+1}=z_{m+2}italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT, the vector z𝑧zitalic_z can be obtained as the solution of the following optimization problem, where we restrict to the support of the constraints in Tfksubscriptsuperscript𝑇𝑘𝑓T^{k}_{f}italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT.

minziKkeδ(i){f}aeze22+afzf22+am+1zf22+am+2zf22s.t.eδ(i)zezfiKk.subscript𝑧subscript𝑖subscript𝐾𝑘subscript𝑒𝛿𝑖𝑓superscriptsubscriptnormsubscript𝑎𝑒subscript𝑧𝑒22subscriptsuperscriptnormsubscript𝑎𝑓subscript𝑧𝑓22subscriptsuperscriptnormsubscript𝑎𝑚1subscript𝑧𝑓22subscriptsuperscriptnormsubscript𝑎𝑚2subscript𝑧𝑓22s.t.formulae-sequencesubscript𝑒𝛿𝑖subscript𝑧𝑒subscript𝑧𝑓for-all𝑖subscript𝐾𝑘\displaystyle\begin{aligned} \min_{z}\quad&\sum_{i\in K_{k}}\,\sum_{e\in\delta% (i)\setminus\{f\}}||a_{e}-z_{e}||_{2}^{2}+||a_{f}-z_{f}||^{2}_{2}+||a_{m+1}-z_% {f}||^{2}_{2}+||a_{m+2}-z_{f}||^{2}_{2}\\ \text{s.t.}\quad&\sum_{e\in\delta(i)}z_{e}\geq z_{f}\qquad\forall i\in K_{k}.% \end{aligned}start_ROW start_CELL roman_min start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + | | italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + | | italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + | | italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . end_CELL end_ROW (20)

Let λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, iKk𝑖subscript𝐾𝑘i\in K_{k}italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, denote the dual variables corresponding to the constraints of (20). We further denote by V(f)𝑉𝑓V(f)italic_V ( italic_f ) the two vertices in G𝐺Gitalic_G adjacent to fE𝑓𝐸f\in Eitalic_f ∈ italic_E. Then, the KKT optimality conditions for (20) are as follows

2(zeae)λi2subscript𝑧𝑒subscript𝑎𝑒subscript𝜆𝑖\displaystyle 2(z_{e}-a_{e})-\lambda_{i}2 ( italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ) - italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =0eδ(i){f},iKkformulae-sequenceabsent0formulae-sequencefor-all𝑒𝛿𝑖𝑓for-all𝑖subscript𝐾𝑘\displaystyle=0\qquad\forall e\in\delta(i)\setminus\{f\},~{}\forall i\in K_{k}= 0 ∀ italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } , ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (21)
6zf2(af+am+1+am+2)+iKkV(f)λi6subscript𝑧𝑓2subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚2subscript𝑖subscript𝐾𝑘𝑉𝑓subscript𝜆𝑖\displaystyle 6z_{f}-2(a_{f}+a_{m+1}+a_{m+2})+\sum_{i\in K_{k}\setminus V(f)}% \lambda_{i}6 italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT - 2 ( italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =0absent0\displaystyle=0= 0 (22)
eδ(i)zesubscript𝑒𝛿𝑖subscript𝑧𝑒\displaystyle\sum_{e\in\delta(i)}z_{e}∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT zfiKkformulae-sequenceabsentsubscript𝑧𝑓for-all𝑖subscript𝐾𝑘\displaystyle\geq z_{f}\quad\ \,\forall i\in K_{k}≥ italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (23)
λi(zfeδ(i)ze)subscript𝜆𝑖subscript𝑧𝑓subscript𝑒𝛿𝑖subscript𝑧𝑒\displaystyle\lambda_{i}(z_{f}-\sum_{e\in\delta(i)}z_{e})italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ) =0iKkformulae-sequenceabsent0for-all𝑖subscript𝐾𝑘\displaystyle=0\qquad\forall i\in K_{k}= 0 ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (24)
λisubscript𝜆𝑖\displaystyle\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 0iKk.formulae-sequenceabsent0for-all𝑖subscript𝐾𝑘\displaystyle\geq 0\qquad\forall i\in K_{k}.≥ 0 ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (25)

It follows from (21) and (22) that we have

zfsubscript𝑧𝑓\displaystyle z_{f}italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT =af+am+1+am+2316iKkV(f)λi,absentsubscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚2316subscript𝑖subscript𝐾𝑘𝑉𝑓subscript𝜆𝑖\displaystyle=\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\frac{1}{6}\sum_{i\in K_{k}% \setminus V(f)}\lambda_{i},= divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - divide start_ARG 1 end_ARG start_ARG 6 end_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , and (26)
zesubscript𝑧𝑒\displaystyle z_{e}italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT =ae+12λiabsentsubscript𝑎𝑒12subscript𝜆𝑖\displaystyle=a_{e}+\frac{1}{2}\lambda_{i}= italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT eδ(i){f},iKk.formulae-sequencefor-all𝑒𝛿𝑖𝑓for-all𝑖subscript𝐾𝑘\displaystyle\forall e\in\delta(i)\setminus\{f\},\ \forall i\in K_{k}.∀ italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } , ∀ italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT .

Suppose KKksuperscript𝐾subscript𝐾𝑘K^{*}\subseteq K_{k}italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ⊆ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the set of vertices for which λi>0subscript𝜆𝑖0\lambda_{i}>0italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 at an optimal solution of (20). The complementary slackness constraints (24) then imply that zf=eδ(i)zesubscript𝑧𝑓subscript𝑒𝛿𝑖subscript𝑧𝑒z_{f}=\sum_{e\in\delta(i)}z_{e}italic_z start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT for all iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\setminus V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) and eδ(i){f}ze=0subscript𝑒𝛿𝑖𝑓subscript𝑧𝑒0\sum_{e\in\delta(i)\setminus\{f\}}z_{e}=0∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = 0 for iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\cap V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∩ italic_V ( italic_f ). Note that |KV(f)|1superscript𝐾𝑉𝑓1\lvert K^{*}\cap V(f)\rvert\leq 1| italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∩ italic_V ( italic_f ) | ≤ 1 since Kksubscript𝐾𝑘K_{k}italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is an independent set in G𝐺Gitalic_G. By exploiting (26) and jKkV(f)λj=jKV(f)λjsubscript𝑗subscript𝐾𝑘𝑉𝑓subscript𝜆𝑗subscript𝑗superscript𝐾𝑉𝑓subscript𝜆𝑗\sum_{j\in K_{k}\setminus V(f)}\lambda_{j}=\sum_{j\in K^{*}\setminus V(f)}% \lambda_{j}∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, these equations can be rewritten to

af+am+1+am+23subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23\displaystyle\frac{a_{f}+a_{m+1}+a_{m+2}}{3}divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG 16jKV(f)λj=eδ(i)(ae+12λi)16subscript𝑗superscript𝐾𝑉𝑓subscript𝜆𝑗subscript𝑒𝛿𝑖subscript𝑎𝑒12subscript𝜆𝑖\displaystyle-\frac{1}{6}\sum_{j\in K^{*}\setminus V(f)}\lambda_{j}=\sum_{e\in% \delta(i)}\left(a_{e}+\frac{1}{2}\lambda_{i}\right)- divide start_ARG 1 end_ARG start_ARG 6 end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT ( italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )
λisubscript𝜆𝑖\displaystyle\Longleftrightarrow\quad\lambda_{i}⟺ italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =2d(i)(af+am+1+am+23eδ(i)ae16jKV(f)λj)absent2𝑑𝑖subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23subscript𝑒𝛿𝑖subscript𝑎𝑒16subscript𝑗superscript𝐾𝑉𝑓subscript𝜆𝑗\displaystyle=\frac{2}{d(i)}\left(\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\sum_{e\in% \delta(i)}a_{e}-\frac{1}{6}\sum_{j\in K^{*}\setminus V(f)}\lambda_{j}\right)= divide start_ARG 2 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG 6 end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT )
λisubscript𝜆𝑖\displaystyle\Longleftrightarrow\quad\lambda_{i}⟺ italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =2d(i)(gi16jKV(f)λj)absent2𝑑𝑖subscript𝑔𝑖16subscript𝑗superscript𝐾𝑉𝑓subscript𝜆𝑗\displaystyle=\frac{2}{d(i)}\left(g_{i}-\frac{1}{6}\sum_{j\in K^{*}\setminus V% (f)}\lambda_{j}\right)= divide start_ARG 2 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG 6 end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) (27)

for all iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\setminus V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ). Summing the latter equations over all iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\setminus V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) yields

iKV(f)λi=2iKV(f)gid(i)13iKV(f)1d(i)jKV(f)λj,subscript𝑖superscript𝐾𝑉𝑓subscript𝜆𝑖2subscript𝑖superscript𝐾𝑉𝑓subscript𝑔𝑖𝑑𝑖13subscript𝑖superscript𝐾𝑉𝑓1𝑑𝑖subscript𝑗superscript𝐾𝑉𝑓subscript𝜆𝑗\sum_{i\in K^{*}\setminus V(f)}\lambda_{i}=2\sum_{i\in K^{*}\setminus V(f)}% \frac{g_{i}}{d(i)}-\frac{1}{3}\sum_{i\in K^{*}\setminus V(f)}\frac{1}{d(i)}% \sum_{j\in K^{*}\setminus V(f)}\lambda_{j},∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 2 ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG - divide start_ARG 1 end_ARG start_ARG 3 end_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ,

or equivalently, iKV(f)λi=2iKV(f)gid(i)1+13iKV(f)1d(i).subscript𝑖superscript𝐾𝑉𝑓subscript𝜆𝑖2subscript𝑖superscript𝐾𝑉𝑓subscript𝑔𝑖𝑑𝑖113subscript𝑖superscript𝐾𝑉𝑓1𝑑𝑖\sum_{i\in K^{*}\setminus V(f)}\lambda_{i}=\frac{2\sum_{i\in K^{*}\setminus V(% f)}\frac{g_{i}}{d(i)}}{1+\frac{1}{3}\sum_{i\in K^{*}\setminus V(f)}\frac{1}{d(% i)}}.∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG 2 ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 1 + divide start_ARG 1 end_ARG start_ARG 3 end_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG . After substitution into (27), we obtain

λi=2d(i)(giiKV(f)gid(i)3+iKV(f)1d(i))>0subscript𝜆𝑖2𝑑𝑖subscript𝑔𝑖subscript𝑖superscript𝐾𝑉𝑓subscript𝑔𝑖𝑑𝑖3subscript𝑖superscript𝐾𝑉𝑓1𝑑𝑖0\displaystyle\lambda_{i}=\frac{2}{d(i)}\left(g_{i}-\frac{\sum_{i\in K^{*}% \setminus V(f)}\frac{g_{i}}{d(i)}}{3+\sum_{i\in K^{*}\setminus V(f)}\frac{1}{d% (i)}}\right)>0italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG 2 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG ) > 0 (28)

for all iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\setminus V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ). For each i(KkK)V(F)𝑖subscript𝐾𝑘superscript𝐾𝑉𝐹i\in(K_{k}\setminus K^{*})\setminus V(F)italic_i ∈ ( italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∖ italic_V ( italic_F ), we have λi=0subscript𝜆𝑖0\lambda_{i}=0italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0. The inequalities (23) for these i𝑖iitalic_i then read

eδ(i)aeaf+am+1+am+23iKV(f)gid(i)3+iKV(f)1d(i),subscript𝑒𝛿𝑖subscript𝑎𝑒subscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23subscript𝑖superscript𝐾𝑉𝑓subscript𝑔𝑖𝑑𝑖3subscript𝑖superscript𝐾𝑉𝑓1𝑑𝑖\sum_{e\in\delta(i)}a_{e}\geq\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\frac{\sum_{i\in K% ^{*}\setminus V(f)}\frac{g_{i}}{d(i)}}{3+\sum_{i\in K^{*}\setminus V(f)}\frac{% 1}{d(i)}},∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG ,

or equivalently,

giiKV(f)gid(i)3+iKV(f)1d(i)0.subscript𝑔𝑖subscript𝑖superscript𝐾𝑉𝑓subscript𝑔𝑖𝑑𝑖3subscript𝑖superscript𝐾𝑉𝑓1𝑑𝑖0g_{i}-\frac{\sum_{i\in K^{*}\setminus V(f)}\frac{g_{i}}{d(i)}}{3+\sum_{i\in K^% {*}\setminus V(f)}\frac{1}{d(i)}}\leq 0.italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG ≤ 0 . (29)

By combining (28) and (29) we obtain the following optimality conditions on the dual variables λ𝜆\lambdaitalic_λ concerning the indices in KkV(F)subscript𝐾𝑘𝑉𝐹K_{k}\setminus V(F)italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_F )

{2d(i)(giiKV(f)gid(i)3+iKV(f)1d(i))>0for all iKV(f),giiKV(f)gid(i)3+iKV(f)1d(i)0for all i(KkK)V(F).\displaystyle\left\{\begin{aligned} \frac{2}{d(i)}\left(g_{i}-\frac{\sum_{i\in K% ^{*}\setminus V(f)}\frac{g_{i}}{d(i)}}{3+\sum_{i\in K^{*}\setminus V(f)}\frac{% 1}{d(i)}}\right)&>0&&\text{for all $i\in K^{*}\setminus V(f)$,}\\ g_{i}-\frac{\sum_{i\in K^{*}\setminus V(f)}\frac{g_{i}}{d(i)}}{3+\sum_{i\in K^% {*}\setminus V(f)}\frac{1}{d(i)}}&\leq 0&&\text{for all $i\in(K_{k}\setminus K% ^{*})\setminus V(F)$.}\end{aligned}\right.{ start_ROW start_CELL divide start_ARG 2 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG ) end_CELL start_CELL > 0 end_CELL start_CELL end_CELL start_CELL for all italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) , end_CELL end_ROW start_ROW start_CELL italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG end_ARG end_CELL start_CELL ≤ 0 end_CELL start_CELL end_CELL start_CELL for all italic_i ∈ ( italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∖ italic_V ( italic_F ) . end_CELL end_ROW (30)

We conclude from the conditions (30) that the support of λ𝜆\lambdaitalic_λ restricted to KkV(f)subscript𝐾𝑘𝑉𝑓K_{k}\setminus V(f)italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ) always consists of the vertices for which gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT lies above a certain threshold value. To find this threshold value, we sort the gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s in non-increasing order and check all possible candidate sets for KV(f)superscript𝐾𝑉𝑓K^{*}\setminus V(f)italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) corresponding to the first r𝑟ritalic_r entries in this sorted list. Let σ:[nk]Kk:𝜎delimited-[]subscript𝑛𝑘subscript𝐾𝑘\sigma\colon[n_{k}]\to K_{k}italic_σ : [ italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] → italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote an according sorting permutation, i.e., σ𝜎\sigmaitalic_σ is bijective and fulfills gσ(1)gσ(2)gσ(nk)subscript𝑔𝜎1subscript𝑔𝜎2subscript𝑔𝜎subscript𝑛𝑘g_{\sigma(1)}\geq g_{\sigma(2)}\geq\dots\geq g_{\sigma(n_{k})}italic_g start_POSTSUBSCRIPT italic_σ ( 1 ) end_POSTSUBSCRIPT ≥ italic_g start_POSTSUBSCRIPT italic_σ ( 2 ) end_POSTSUBSCRIPT ≥ ⋯ ≥ italic_g start_POSTSUBSCRIPT italic_σ ( italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT. For each candidate set {σ(1),,σ(p)}KkV(f)𝜎1𝜎𝑝subscript𝐾𝑘𝑉𝑓\{\sigma(1),\ldots,\sigma(p)\}\subseteq K_{k}\setminus V(f){ italic_σ ( 1 ) , … , italic_σ ( italic_p ) } ⊆ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V ( italic_f ), it suffices to check whether gσ(p)subscript𝑔𝜎𝑝g_{\sigma(p)}italic_g start_POSTSUBSCRIPT italic_σ ( italic_p ) end_POSTSUBSCRIPT is strictly larger than the candidate threshold value

ω(p)j=1pgσ(j)d(σ(j))3+j=1p1d(σ(j)).𝜔𝑝superscriptsubscript𝑗1𝑝subscript𝑔𝜎𝑗𝑑𝜎𝑗3superscriptsubscript𝑗1𝑝1𝑑𝜎𝑗\omega(p)\coloneqq\frac{\sum_{j=1}^{p}\frac{g_{\sigma(j)}}{d(\sigma(j))}}{3+% \sum_{j=1}^{p}\frac{1}{d(\sigma(j))}}.italic_ω ( italic_p ) ≔ divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG italic_g start_POSTSUBSCRIPT italic_σ ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_σ ( italic_j ) ) end_ARG end_ARG start_ARG 3 + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_d ( italic_σ ( italic_j ) ) end_ARG end_ARG .

If psuperscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is the largest index for which this holds, then this candidate set equals KV(f)superscript𝐾𝑉𝑓K^{*}\setminus V(f)italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ). The existence of such a psuperscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is guaranteed by the existence of a solution to the projection problem (20).

Finally, we need to address the optimality conditions for all iKkV(f)𝑖subscript𝐾𝑘𝑉𝑓i\in K_{k}\cap V(f)italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ italic_V ( italic_f ). In case iKV(f)𝑖superscript𝐾𝑉𝑓i\in K^{*}\cap V(f)italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∩ italic_V ( italic_f ), we have λi>0subscript𝜆𝑖0\lambda_{i}>0italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0, and due to complementary slackness (24) it holds that

0=eδ(i){f}ze=eδ(i){f}(ae+12λi),or equivalently, λi=2d(i)1eδ(i){f}ae>0.formulae-sequence0subscript𝑒𝛿𝑖𝑓subscript𝑧𝑒subscript𝑒𝛿𝑖𝑓subscript𝑎𝑒12subscript𝜆𝑖or equivalently, subscript𝜆𝑖2𝑑𝑖1subscript𝑒𝛿𝑖𝑓subscript𝑎𝑒00=\sum_{e\in\delta(i)\setminus\{f\}}z_{e}=\sum_{e\in\delta(i)\setminus\{f\}}% \Big{(}a_{e}+\frac{1}{2}\lambda_{i}\Big{)},~{}~{}\text{or equivalently, }~{}~{% }\lambda_{i}=-\frac{2}{d(i)-1}\sum_{e\in\delta(i)\setminus\{f\}}a_{e}>0.0 = ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT ( italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , or equivalently, italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = - divide start_ARG 2 end_ARG start_ARG italic_d ( italic_i ) - 1 end_ARG ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT > 0 .

We note here that we may w.l.o.g. assume that d(i)>1𝑑𝑖1d(i)>1italic_d ( italic_i ) > 1. Namely, if d(i)=1𝑑𝑖1d(i)=1italic_d ( italic_i ) = 1, then the set δ(i){f}𝛿𝑖𝑓\delta(i)\setminus\{f\}italic_δ ( italic_i ) ∖ { italic_f } is empty, hence λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT will not appear anywhere in (26), making this dual variable redundant.

For the case i(KkK)V(f)𝑖subscript𝐾𝑘superscript𝐾𝑉𝑓i\in(K_{k}\setminus K^{*})\cap V(f)italic_i ∈ ( italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∩ italic_V ( italic_f ), and hence λi=0subscript𝜆𝑖0\lambda_{i}=0italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0, condition (24) with (23) reads as eδ(i){f}ae0subscript𝑒𝛿𝑖𝑓subscript𝑎𝑒0\sum_{e\in\delta(i)\setminus\{f\}}a_{e}\geq 0∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ 0. Combining both cases, we obtain the following optimality conditions for iKkV(F)𝑖subscript𝐾𝑘𝑉𝐹i\in K_{k}\cap V(F)italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ italic_V ( italic_F ):

{eδ(i){f}ae<0for iKV(f),eδ(i){f}ae0for i(KkK)V(f).\left\{\begin{aligned} \sum_{e\in\delta(i)\setminus\{f\}}a_{e}&<0&&\text{for $% i\in K^{*}\cap V(f)$,}\\ \sum_{e\in\delta(i)\setminus\{f\}}a_{e}&\geq 0&&\text{for $i\in(K_{k}\setminus K% ^{*})\cap V(f)$.}\end{aligned}\right.{ start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL < 0 end_CELL start_CELL end_CELL start_CELL for italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∩ italic_V ( italic_f ) , end_CELL end_ROW start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL ≥ 0 end_CELL start_CELL end_CELL start_CELL for italic_i ∈ ( italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∩ italic_V ( italic_f ) . end_CELL end_ROW

Altogether, the equations (26) then imply

zesubscript𝑧𝑒\displaystyle z_{e}italic_z start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ={af+am+1+am+23ω(p)if f{e,m+1,m+2},ae+1d(i)(giω(p))if eδ(i){f} and iKV(f),ae1d(i)1eδ(i){f}aeif eδ(i){f} and iKkV(f) with eδ(i){f}ae<0,aeotherwise.absentcasessubscript𝑎𝑓subscript𝑎𝑚1subscript𝑎𝑚23𝜔superscript𝑝if f{e,m+1,m+2},subscript𝑎𝑒1𝑑𝑖subscript𝑔𝑖𝜔superscript𝑝if eδ(i){f} and iKV(f),subscript𝑎𝑒1𝑑𝑖1subscript𝑒𝛿𝑖𝑓subscript𝑎𝑒if eδ(i){f} and iKkV(f) with eδ(i){f}ae<0,subscript𝑎𝑒otherwise.\displaystyle=\begin{cases}\frac{a_{f}+a_{m+1}+a_{m+2}}{3}-\omega(p^{*})&\text% {if $f\in\{e,m+1,m+2\}$,}\\ a_{e}+\frac{1}{d(i)}(g_{i}-\omega(p^{*}))&\text{if $e\in\delta(i)\setminus\{f% \}$ and $i\in K^{*}\setminus V(f)$,}\\ a_{e}-\frac{1}{d(i)-1}\sum_{e\in\delta(i)\setminus\{f\}}a_{e}&\text{if $e\in% \delta(i)\setminus\{f\}$ and $i\in K_{k}\cap V(f)$ with $\sum_{e\in\delta(i)% \setminus\{f\}}a_{e}<0,$}\\ a_{e}&\text{otherwise.}\end{cases}= { start_ROW start_CELL divide start_ARG italic_a start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT italic_m + 2 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG - italic_ω ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_CELL start_CELL if italic_f ∈ { italic_e , italic_m + 1 , italic_m + 2 } , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) end_ARG ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_ω ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) end_CELL start_CELL if italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } and italic_i ∈ italic_K start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∖ italic_V ( italic_f ) , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_d ( italic_i ) - 1 end_ARG ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL if italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } and italic_i ∈ italic_K start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ italic_V ( italic_f ) with ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) ∖ { italic_f } end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT < 0 , end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_CELL start_CELL otherwise. end_CELL end_ROW

It follows from Lemma 7 that the projection onto Tfksubscriptsuperscript𝑇𝑘𝑓T^{k}_{f}italic_T start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT involves both a sorting and an enumeration of a list of nksubscript𝑛𝑘n_{k}italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT elements. Hence, the worst-case time complexity is O(nklognk)𝑂subscript𝑛𝑘subscript𝑛𝑘O(n_{k}\log n_{k})italic_O ( italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ).

In fact, for computational purposes, we are not going to project on 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT but iteratively add violated cuts only. For that, we denote by 𝒞V×E𝒞𝑉𝐸\mathcal{C}\subseteq V\times Ecaligraphic_C ⊆ italic_V × italic_E the set of violated cuts that we to add to 𝒴𝒴\mathcal{Y}caligraphic_Y, where an element (i,f)𝑖𝑓(i,f)( italic_i , italic_f ) represents the cut eδ(i)yefyfsubscript𝑒𝛿𝑖subscript𝑦𝑒𝑓subscript𝑦𝑓\sum_{e\in\delta(i)}y_{ef}\geq y_{f}∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_e italic_f end_POSTSUBSCRIPT ≥ italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT. We further define analogously to 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT the polyhedral set

𝒴𝒞{Y~(m+1)×(m+1):Y~=(Yyy1),diag(Y)=y,eδ(i)yfeyf(i,f)𝒞}.subscript𝒴𝒞conditional-set~𝑌superscript𝑚1𝑚1formulae-sequence~𝑌matrix𝑌𝑦superscript𝑦top1formulae-sequencediag𝑌𝑦formulae-sequencesubscript𝑒𝛿𝑖subscript𝑦𝑓𝑒subscript𝑦𝑓for-all𝑖𝑓𝒞\mathcal{Y}_{\mathcal{C}}\coloneqq\left\{\widetilde{Y}\in\mathbb{R}^{(m+1)% \times(m+1)}\,:\,\,\widetilde{Y}=\begin{pmatrix}Y&y\\ y^{\top}&1\end{pmatrix},\ \text{diag}(Y)=y,~{}\sum_{e\in\delta(i)}y_{fe}\geq y% _{f}\quad\forall(i,f)\in\mathcal{C}\right\}.caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT ≔ { over~ start_ARG italic_Y end_ARG ∈ roman_ℝ start_POSTSUPERSCRIPT ( italic_m + 1 ) × ( italic_m + 1 ) end_POSTSUPERSCRIPT : over~ start_ARG italic_Y end_ARG = ( start_ARG start_ROW start_CELL italic_Y end_CELL start_CELL italic_y end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) , diag ( italic_Y ) = italic_y , ∑ start_POSTSUBSCRIPT italic_e ∈ italic_δ ( italic_i ) end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_f italic_e end_POSTSUBSCRIPT ≥ italic_y start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∀ ( italic_i , italic_f ) ∈ caligraphic_C } .

The projection follows the same idea as explained above for the projection onto 𝒴RLTsubscript𝒴𝑅𝐿𝑇\mathcal{Y}_{RLT}caligraphic_Y start_POSTSUBSCRIPT italic_R italic_L italic_T end_POSTSUBSCRIPT, but in this case, instead of partitioning the vertex set V𝑉Vitalic_V into independent sets, we can partition the constraints in 𝒞𝒞\mathcal{C}caligraphic_C for each edge f𝑓fitalic_f separately. For a fixed f𝑓fitalic_f, we partition the vertices occurring together with f𝑓fitalic_f in 𝒞𝒞\mathcal{C}caligraphic_C into independent sets K1f,,KNffsubscriptsuperscript𝐾𝑓1subscriptsuperscript𝐾𝑓subscript𝑁𝑓K^{f}_{1},\dots,K^{f}_{N_{f}}italic_K start_POSTSUPERSCRIPT italic_f end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_K start_POSTSUPERSCRIPT italic_f end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Note that the number of independent sets Nfsubscript𝑁𝑓N_{f}italic_N start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT for an edge will probably be way smaller than the number of colors needed to color the whole graph, which can, in the worst case of a complete graph, be the number of vertices. Furthermore, as mentioned above, it is possible to project independently over each row fE𝑓𝐸f\in Eitalic_f ∈ italic_E, which allows us to parallelize this step. Hence, we cluster the cut constraints in 𝒞k={(i,f)𝒞:fE,iKkf}subscript𝒞𝑘conditional-set𝑖𝑓𝒞formulae-sequence𝑓𝐸𝑖subscriptsuperscript𝐾𝑓𝑘\mathcal{C}_{k}=\big{\{}(i,f)\in{\mathcal{C}}:f\in E,\ i\in K^{f}_{k}\big{\}}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = { ( italic_i , italic_f ) ∈ caligraphic_C : italic_f ∈ italic_E , italic_i ∈ italic_K start_POSTSUPERSCRIPT italic_f end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } for 1kNmax1𝑘subscript𝑁𝑚𝑎𝑥1\leq k\leq N_{max}1 ≤ italic_k ≤ italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT with Nmaxmax{Nf:fE}subscript𝑁𝑚𝑎𝑥:subscript𝑁𝑓𝑓𝐸N_{max}\coloneqq\max\{N_{f}:f\in E\}italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT ≔ roman_max { italic_N start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT : italic_f ∈ italic_E } and obtain 𝒴𝒞=𝒴(k=1Nmax𝒴𝒞k),subscript𝒴𝒞𝒴superscriptsubscript𝑘1subscript𝑁𝑚𝑎𝑥subscript𝒴subscript𝒞𝑘\mathcal{Y}_{\mathcal{C}}=\mathcal{Y}\cap\Bigg{(}\bigcap_{k=1}^{N_{max}}% \mathcal{Y}_{\mathcal{C}_{k}}\Bigg{)},caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT = caligraphic_Y ∩ ( ⋂ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT caligraphic_Y start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , where we can easily project onto 𝒴𝒞ksubscript𝒴subscript𝒞𝑘\mathcal{Y}_{\mathcal{C}_{k}}caligraphic_Y start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT using Proposition 7. A pseudocode for the Cyclic Dykstra projection algorithm to project onto 𝒴𝒞subscript𝒴𝒞\mathcal{Y}_{\mathcal{C}}caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT can be found in Algorithm 1.

Algorithm 1 Dykstra’s cyclic projection algorithm to project onto 𝒴𝒞subscript𝒴𝒞\mathcal{Y}_{\mathcal{C}}caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT

Input: matrix M𝑀Mitalic_M, cuts 𝒞𝒞\mathcal{C}caligraphic_C, εprojsubscript𝜀𝑝𝑟𝑜𝑗\varepsilon_{proj}italic_ε start_POSTSUBSCRIPT italic_p italic_r italic_o italic_j end_POSTSUBSCRIPT
Output: the projection 𝒫𝒴𝒞(M)subscript𝒫subscript𝒴𝒞𝑀\mathcal{P}_{\mathcal{Y}_{\mathcal{C}}}(M)caligraphic_P start_POSTSUBSCRIPT caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_M ) of M𝑀Mitalic_M onto 𝒴𝒞subscript𝒴𝒞\mathcal{Y}_{\mathcal{C}}caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT

1:cluster 𝒞𝒞\mathcal{C}caligraphic_C into {𝒞1,,𝒞Nmax}subscript𝒞1subscript𝒞subscript𝑁𝑚𝑎𝑥\{\mathcal{C}_{1},\dots,\mathcal{C}_{N_{max}}\}{ caligraphic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , caligraphic_C start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT }
2:initialize X=M𝑋𝑀X=Mitalic_X = italic_M, P=𝟎𝑃0P=\mathbf{0}italic_P = bold_0, Q1==QNmax=𝟎subscript𝑄1subscript𝑄subscript𝑁𝑚𝑎𝑥0Q_{1}=\dots=Q_{N_{max}}=\mathbf{0}italic_Q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ⋯ = italic_Q start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT = bold_0
3:repeat
4:    Xold=Xsubscript𝑋𝑜𝑙𝑑𝑋X_{old}=Xitalic_X start_POSTSUBSCRIPT italic_o italic_l italic_d end_POSTSUBSCRIPT = italic_X
5:    Xtmp=X+Psubscript𝑋𝑡𝑚𝑝𝑋𝑃X_{tmp}=X+Pitalic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT = italic_X + italic_P
6:    X=𝒫𝒴(Xtmp)𝑋subscript𝒫𝒴subscript𝑋𝑡𝑚𝑝X=\mathcal{P}_{\mathcal{Y}}(X_{tmp})italic_X = caligraphic_P start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT )
7:    P=XtmpX𝑃subscript𝑋𝑡𝑚𝑝𝑋P=X_{tmp}-Xitalic_P = italic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT - italic_X
8:    for k=1,,Nmax𝑘1subscript𝑁𝑚𝑎𝑥k=1,\dots,N_{max}italic_k = 1 , … , italic_N start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT do
9:        Xtmp=X+Qksubscript𝑋𝑡𝑚𝑝𝑋subscript𝑄𝑘X_{tmp}=X+Q_{k}italic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT = italic_X + italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
10:        X=𝒫𝒴𝒞k(Xtmp)𝑋subscript𝒫subscript𝒴subscript𝒞𝑘subscript𝑋𝑡𝑚𝑝X=\mathcal{P}_{\mathcal{Y}_{\mathcal{C}_{k}}}(X_{tmp})italic_X = caligraphic_P start_POSTSUBSCRIPT caligraphic_Y start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT )
11:        Qk=XtmpXsubscript𝑄𝑘subscript𝑋𝑡𝑚𝑝𝑋Q_{k}=X_{tmp}-Xitalic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_t italic_m italic_p end_POSTSUBSCRIPT - italic_X
12:    end for
13:until XoldX<εprojdelimited-∥∥subscript𝑋𝑜𝑙𝑑𝑋subscript𝜀𝑝𝑟𝑜𝑗\lVert X_{old}-X\rVert<\varepsilon_{proj}∥ italic_X start_POSTSUBSCRIPT italic_o italic_l italic_d end_POSTSUBSCRIPT - italic_X ∥ < italic_ε start_POSTSUBSCRIPT italic_p italic_r italic_o italic_j end_POSTSUBSCRIPT
14:return X𝑋Xitalic_X

To compute the lower bound (18) with a PRSM algorithm, we first compute the DNN bound (17) with the PRSM, as explained in the previous subsection. Then, we separate violated cuts from the current solution and add the ncutsmax most violated ones to 𝒞𝒞\mathcal{C}caligraphic_C. We then proceed to compute (17) with the additional new cuts in 𝒞𝒞\mathcal{C}caligraphic_C with the PRSM and use the solution from before for a warm-start. This process of separating and adding new cuts to 𝒞𝒞\mathcal{C}caligraphic_C in an outer loop is iterated until one of the stopping criteria is met. Algorithm 2 provides a pseudocode for the described algorithm.

Algorithm 2 PRSM algorithm to compute lower bounds on the QMST

Input: graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ), cost matrix Q~~𝑄\widetilde{Q}over~ start_ARG italic_Q end_ARG
Output: (valid) lower bound LB

1:initialize Y~0superscript~𝑌0\widetilde{Y}^{0}over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, S0superscript𝑆0S^{0}italic_S start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, β𝛽\betaitalic_β, γ1subscript𝛾1\gamma_{1}italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, γ2subscript𝛾2\gamma_{2}italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, set 𝒞=𝒞\mathcal{C}=\emptysetcaligraphic_C = ∅\triangleright cf. Section 6
2:compute W𝑊Witalic_W, e.g., apply QR decomposition to ((n1)𝐈m𝟏m)superscriptmatrix𝑛1subscript𝐈𝑚subscript1𝑚top(\begin{matrix}(n-1)\mathbf{I}_{m}&\mathbf{1}_{m}\ \end{matrix})^{\top}( start_ARG start_ROW start_CELL ( italic_n - 1 ) bold_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL start_CELL bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT
3:k=0𝑘0k=0italic_k = 0
4:while no stopping criteria met do
5:    while no stopping criteria met do
6:        Rk+1=𝒫(W(Y~k+1βSk)W)superscript𝑅𝑘1subscript𝒫superscript𝑊topsuperscript~𝑌𝑘1𝛽superscript𝑆𝑘𝑊R^{k+1}=\mathcal{P}_{\mathcal{R}}(W^{\top}(\widetilde{Y}^{k}+\frac{1}{\beta}S^% {k})W)italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT = caligraphic_P start_POSTSUBSCRIPT caligraphic_R end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_β end_ARG italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) italic_W )
7:        Sk+12=Sk+γ1β(Y~kWRk+1W)superscript𝑆𝑘12superscript𝑆𝑘subscript𝛾1𝛽superscript~𝑌𝑘𝑊superscript𝑅𝑘1superscript𝑊topS^{\frac{k+1}{2}}=S^{k}+\gamma_{1}\beta(\widetilde{Y}^{k}-WR^{k+1}W^{\top})italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT = italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_β ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT )
8:        Y~k+1=𝒫𝒴𝒞(WRk+1W1β(Q~+Sk+12))superscript~𝑌𝑘1subscript𝒫subscript𝒴𝒞𝑊superscript𝑅𝑘1superscript𝑊top1𝛽~𝑄superscript𝑆𝑘12\widetilde{Y}^{k+1}=\mathcal{P}_{\mathcal{Y}_{\mathcal{C}}}\big{(}WR^{k+1}W^{% \top}-\frac{1}{\beta}\big{(}\widetilde{Q}+S^{\frac{k+1}{2}}\big{)}\big{)}over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT = caligraphic_P start_POSTSUBSCRIPT caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_β end_ARG ( over~ start_ARG italic_Q end_ARG + italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) )
9:        Sk+1=Sk+12+γ2β(Y~k+1WRk+1W)superscript𝑆𝑘1superscript𝑆𝑘12subscript𝛾2𝛽superscript~𝑌𝑘1𝑊superscript𝑅𝑘1superscript𝑊topS^{k+1}=S^{\frac{k+1}{2}}+\gamma_{2}\beta(\widetilde{Y}^{k+1}-WR^{k+1}W^{\top})italic_S start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT = italic_S start_POSTSUPERSCRIPT divide start_ARG italic_k + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT + italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_β ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT )
10:        k=k+1𝑘𝑘1k=k+1italic_k = italic_k + 1
11:    end while
12:    compute a valid lower bound LB from Sksuperscript𝑆𝑘S^{k}italic_S start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT\triangleright cf. Section 5.3
13:    separate violated cuts and add the ncutsmax most violated ones to 𝒞𝒞\mathcal{C}caligraphic_C
14:    cluster the cuts in 𝒞𝒞\mathcal{C}caligraphic_C
15:end while
16:return LB

5.3 Stopping criteria and post-processing of the PRSM algorithm

In this subsection, we briefly discuss the stopping criteria and the post-processing phase of our PRSM algorithm.

Stopping criteria

We use several criteria to decide when to stop the inner and outer iterations of Algorithm 2. The main stopping criteria for the inner while loop is when the primal and dual errors satisfy

max{Y~k+1WRk+1WF1+Y~k+1F,βW(Y~kY~k+1)WF1+Sk+1F}εPRSM,subscriptdelimited-∥∥superscript~𝑌𝑘1𝑊superscript𝑅𝑘1superscript𝑊top𝐹1subscriptdelimited-∥∥superscript~𝑌𝑘1𝐹𝛽subscriptdelimited-∥∥superscript𝑊topsuperscript~𝑌𝑘superscript~𝑌𝑘1𝑊𝐹1subscriptdelimited-∥∥superscript𝑆𝑘1𝐹subscript𝜀𝑃𝑅𝑆𝑀\max\Bigg{\{}\frac{\big{\lVert}\widetilde{Y}^{k+1}-WR^{k+1}W^{\top}\big{\rVert% }_{F}}{1+\big{\lVert}\widetilde{Y}^{k+1}\big{\rVert}_{F}},\ \beta\frac{\big{% \lVert}W^{\top}\big{(}\widetilde{Y}^{k}-\widetilde{Y}^{k+1}\big{)}W\big{\rVert% }_{F}}{1+\big{\lVert}S^{k+1}\big{\rVert}_{F}}\Bigg{\}}\leq\varepsilon_{PRSM},roman_max { divide start_ARG ∥ over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT - italic_W italic_R start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∥ over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG , italic_β divide start_ARG ∥ italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ) italic_W ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∥ italic_S start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG } ≤ italic_ε start_POSTSUBSCRIPT italic_P italic_R italic_S italic_M end_POSTSUBSCRIPT ,

cf. [4]. We further stop the inner iterations when the maximum number of total PRSM iterations or a time limit is reached. In that case, we compute a valid dual bound as described below, and stop the algorithm.

For the outer loop, we have the following possible stopping criteria. If an upper bound is known, the algorithm stops as soon as the obtained valid lower bound closes the gap. We further stop the algorithm if the number of new violated cuts found is below a certain threshold ncutsmin. If the improvement of the valid lower bound compared to the valid lower bound of the previous outer iteration is smaller than epslbimprov, we stop the algorithm as well. And finally, we stop after a maximum of noutermax outer iterations.

Valid lower bound

The value obtained as an output of Algorithm 2 does not necessarily provide a lower bound for the problem, as the convergence of the PRSM is typically not monotonic, and one stops the algorithm earlier. Therefore, it is necessary to perform a postprocessing procedure to obtain a valid lower bound. We apply the approach presented in [26]. The safe lower bound derived by this method is then given by

lb(Sout)=minY~𝒴𝒞Q~+Sout,Y~nλmax(WSoutW),lbsuperscript𝑆outsubscript~𝑌subscript𝒴𝒞~𝑄superscript𝑆out~𝑌𝑛subscript𝜆superscript𝑊topsuperscript𝑆out𝑊\text{lb}(S^{\text{out}})=\min_{\widetilde{Y}\in\mathcal{Y}_{\mathcal{C}}}% \langle\widetilde{Q}+S^{\text{out}},\widetilde{Y}\rangle-n\lambda_{\max}(W^{% \top}S^{\text{out}}W),lb ( italic_S start_POSTSUPERSCRIPT out end_POSTSUPERSCRIPT ) = roman_min start_POSTSUBSCRIPT over~ start_ARG italic_Y end_ARG ∈ caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⟨ over~ start_ARG italic_Q end_ARG + italic_S start_POSTSUPERSCRIPT out end_POSTSUPERSCRIPT , over~ start_ARG italic_Y end_ARG ⟩ - italic_n italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_S start_POSTSUPERSCRIPT out end_POSTSUPERSCRIPT italic_W ) ,

where Soutsuperscript𝑆outS^{\text{out}}italic_S start_POSTSUPERSCRIPT out end_POSTSUPERSCRIPT denotes the dual matrix variable resulting from (an early stop of) the PRSM. The computation of this lower bound boils down to computing the largest eigenvalue and solving a linear program. Similarly, one can obtain a valid lower bound from the PRSM algorithm that solves (17), by replacing 𝒴𝒞subscript𝒴𝒞\mathcal{Y}_{\mathcal{C}}caligraphic_Y start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT with 𝒴𝒴\mathcal{Y}caligraphic_Y, see (16), in the above expression.

6 Numerical results

We implemented777The code can be found on https://github.com/melaniesi/QMST.jl and as ancillary files on the arXiv page of this paper. our algorithm in Julia [3] version 1.10.0. For solving the linear program to compute a valid lower bound, we are using the solver HiGHS [23] with the modeling language JuMP [28]. The projection onto 𝒞ksubscript𝒞𝑘\mathcal{C}_{k}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is multithreaded. All computations were carried out on an AMD EPYC 7343 with 16 cores with 4.00GHz and 1024GB RAM, operated under Debian GNU/Linux 11.

Parameter setting

We initialize the matrices, penalty parameters, and step lengths as follows. As starting values for the matrices, we choose S0=𝟎superscript𝑆00S^{0}=\mathbf{0}italic_S start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = bold_0 and

Y~0=((n1)m𝐈+(n1)(n2)m(m1)(𝐉𝐈)(n1)m𝟏(n1)m𝟏1)superscript~𝑌0𝑛1𝑚𝐈𝑛1𝑛2𝑚𝑚1𝐉𝐈𝑛1𝑚1𝑛1𝑚superscript1top1\widetilde{Y}^{0}=\left(\begin{smallmatrix}\frac{(n-1)}{m}\mathbf{I}+\frac{(n-% 1)(n-2)}{m(m-1)}(\mathbf{J}-\mathbf{I})&~{}\frac{(n-1)}{m}\mathbf{1}\\[6.45831% pt] \frac{(n-1)}{m}\mathbf{1}^{\top}&1\end{smallmatrix}\right)over~ start_ARG italic_Y end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = ( start_ROW start_CELL divide start_ARG ( italic_n - 1 ) end_ARG start_ARG italic_m end_ARG bold_I + divide start_ARG ( italic_n - 1 ) ( italic_n - 2 ) end_ARG start_ARG italic_m ( italic_m - 1 ) end_ARG ( bold_J - bold_I ) end_CELL start_CELL divide start_ARG ( italic_n - 1 ) end_ARG start_ARG italic_m end_ARG bold_1 end_CELL end_ROW start_ROW start_CELL divide start_ARG ( italic_n - 1 ) end_ARG start_ARG italic_m end_ARG bold_1 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL 1 end_CELL end_ROW )

Based on the results of numerical tests, we have determined the values for the penalty parameter β𝛽\betaitalic_β and step lengths. We set the step length parameters to γ1=0.9subscript𝛾10.9\gamma_{1}=0.9italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.9, γ2=1subscript𝛾21\gamma_{2}=1italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1. For the penalty parameter, let qmaxmax{tr(Q),QF}subscript𝑞tr𝑄subscriptdelimited-∥∥𝑄𝐹q_{\max}\coloneqq\max\{\text{tr}(Q),\lVert Q\rVert_{F}\}italic_q start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ≔ roman_max { tr ( italic_Q ) , ∥ italic_Q ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT } and qminmin{tr(Q),QF}subscript𝑞tr𝑄subscriptdelimited-∥∥𝑄𝐹q_{\min}\coloneqq\min\{\text{tr}(Q),\lVert Q\rVert_{F}\}italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≔ roman_min { tr ( italic_Q ) , ∥ italic_Q ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT }, we then set

β={qminm+1QFif qmaxqmin<1.2,qmaxqminQFelse.𝛽casessubscript𝑞𝑚1subscriptdelimited-∥∥𝑄𝐹if subscript𝑞subscript𝑞1.2subscript𝑞subscript𝑞subscriptdelimited-∥∥𝑄𝐹else.\beta=\begin{cases}\sqrt{\frac{q_{\min}}{m+1}\lVert Q\rVert_{F}}&\text{if }% \frac{q_{\max}}{q_{\min}}<1.2,\\ \sqrt{\frac{q_{\max}}{q_{\min}}\lVert Q\rVert_{F}}&\text{else.}\end{cases}italic_β = { start_ROW start_CELL square-root start_ARG divide start_ARG italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_m + 1 end_ARG ∥ italic_Q ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG end_CELL start_CELL if divide start_ARG italic_q start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG start_ARG italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG < 1.2 , end_CELL end_ROW start_ROW start_CELL square-root start_ARG divide start_ARG italic_q start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG start_ARG italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG ∥ italic_Q ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG end_CELL start_CELL else. end_CELL end_ROW

We run our algorithm for all instances with εPRSM=104subscript𝜀𝑃𝑅𝑆𝑀superscript104\varepsilon_{PRSM}=10^{-4}italic_ε start_POSTSUBSCRIPT italic_P italic_R italic_S italic_M end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT and the parameter εprojsubscript𝜀𝑝𝑟𝑜𝑗\varepsilon_{proj}italic_ε start_POSTSUBSCRIPT italic_p italic_r italic_o italic_j end_POSTSUBSCRIPT is set to 105superscript10510^{-5}10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT. Violated cuts are considered if the violation is greater than 103superscript10310^{-3}10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT and after each outer iteration, the ncutsmax = m𝑚mitalic_m most violated cuts are added. No further cuts are added if the improvement of the lower bound is smaller than epslbimprov = 103superscript10310^{-3}10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT or the number of new violated cuts found is less than ncutsmin = 10. The maximum wall-clock time for running our algorithm is set to 3 hours per instance, and the maximum number of total iterations is set to 10 0001000010\,00010 000. We set the number of maximum outer iterations to noutermax = 10.

Benchmark instances

We test our algorithm on the following three benchmark sets. The first benchmark set OP was introduced in [30] by Öncan and Punnen. The benchmark set consists of 3 different classes, each consisting of 160 instances on complete graphs: the OPsym, OPvsym and OPesym instances. The OPsym instances have diagonal entries chosen uniformly from [100]delimited-[]100[100][ 100 ], and the off-diagonal values are uniformly distributed at random in [20]delimited-[]20[20][ 20 ]. For instances in the class OPvsym, the diagonal values are uniformly distributed in [10 000]delimited-[]10000[10\,000][ 10 000 ], and the off-diagonal values Q{i,j},{k,l}subscript𝑄𝑖𝑗𝑘𝑙Q_{\{i,j\},\{k,l\}}italic_Q start_POSTSUBSCRIPT { italic_i , italic_j } , { italic_k , italic_l } end_POSTSUBSCRIPT are computed as w(i)w(j)w(k)w(l)𝑤𝑖𝑤𝑗𝑤𝑘𝑤𝑙w(i)w(j)w(k)w(l)italic_w ( italic_i ) italic_w ( italic_j ) italic_w ( italic_k ) italic_w ( italic_l ), where w:V[10]:𝑤𝑉delimited-[]10w\colon V\to[10]italic_w : italic_V → [ 10 ] assigns to each vertex in the graph a uniformly distributed weight at random in [10]delimited-[]10[10][ 10 ]. The cost matrix for instances of the type OPesym is constructed in the following way. First, the vertex coordinates are randomly chosen in the box [0,100]×[0,100]01000100[0,100]\times[0,100][ 0 , 100 ] × [ 0 , 100 ], and the edges are represented as straight lines connecting vertices. The edge cost Qeesubscript𝑄𝑒𝑒Q_{ee}italic_Q start_POSTSUBSCRIPT italic_e italic_e end_POSTSUBSCRIPT is then set as the length of the edge e𝑒eitalic_e, and the interaction cost between two edges e𝑒eitalic_e and f𝑓fitalic_f is computed as the Euclidean distance between the midpoints of e𝑒eitalic_e and f𝑓fitalic_f. For each of those test sets, they randomly generated 10 instances each for n{6,7,,17,18}{20,30,50}𝑛671718203050n\in\{6,7,\dots,17,18\}\cup\{20,30,50\}italic_n ∈ { 6 , 7 , … , 17 , 18 } ∪ { 20 , 30 , 50 }. We do not include the benchmark instances of type OPesym and n=20𝑛20n=20italic_n = 20 in our study, as we were unable to locate the correct instances888 In the benchmark set https://data.mendeley.com/datasets/cmnh9xc6wb/1, the instances indicated as type OPesym for n=20𝑛20n=20italic_n = 20 are the OPvsym for n=6𝑛6n=6italic_n = 6..

The second family of benchmark instances CP was introduced by Cordone and Passeri in [10]. The benchmark set consists of 108 instances divided into 4 classes, specifying the sets from which the diagonal and off-diagonal values of the cost matrix are chosen uniformly at random. For each pair of the number of vertices n{10,15,20,25,30,35,40,45,50}𝑛101520253035404550n\in\{10,15,20,25,30,35,40,45,50\}italic_n ∈ { 10 , 15 , 20 , 25 , 30 , 35 , 40 , 45 , 50 }, density d{33%,67%,100%}𝑑percent33percent67percent100d\in\{33\%,67\%,100\%\}italic_d ∈ { 33 % , 67 % , 100 % } and class, one random graph was generated. The values of the cost matrix Q𝑄Qitalic_Q are uniformly distributed on the sets as listed below.

class CP1 CP2 CP3 CP4
diagonal values [10] [10] [100] [100]
off-diagonal values [10] [100] [10] [100]

The last benchmark set SV was introduced by Sotirov and Verchére in their recent paper [39]. It consists of 24 instances, with one random graph for each pair of n{10,12,14,16,18,20,25,30}𝑛1012141618202530n\in\{10,12,14,16,18,20,25,30\}italic_n ∈ { 10 , 12 , 14 , 16 , 18 , 20 , 25 , 30 } and d{33%,67%,100%}𝑑percent33percent67percent100d\in\{33\%,67\%,100\%\}italic_d ∈ { 33 % , 67 % , 100 % }. They constructed the cost matrices in such a way that for a given maximum cost for the diagonal entries, and a maximum cost for the off-diagonal entries, 10% of the edges have high interaction costs with each other (between 90 and 100% of the maximum off-diagonal cost) and low interaction costs with the rest (between 20 and 40% of the maximum off-diagonal cost). The other 90% of edges have an interaction cost of between 50 and 70% of the maximum off-diagonal cost with each other. The diagonal entries are chosen to be between 0 and 20% of the maximum diagonal cost.

Bounds from the literature

We compare our numerical results to lower bounds from [20, 34, 39]. The upper bounds on the benchmark instances are taken from the literature.

The bounds from [20], called LAGN and LAGP, are used in the to-date best exact algorithm for the QMSTP. Those bounds are obtained from two different ways of dualizing an SDP relaxation of QMSTP. For LAGN, the semidefiniteness constraint is dualized, and a subgradient method is used to compute the optimum. Whereas for computing LAGP, there is no semidefiniteness constraint present, but a semi-infinite reformulation together with polyhedral cutting planes is solved using a bundle method.

The lower bounds VS1 and VS2 were introduced by Sotirov and Verchére in [39]. These lower bounds are based on an extended formulation of the minimum quadratic spanning tree problem and are strengthened by facet defining inequalities of the Boolean Quadric polytope. The lower bound VS2 is stronger than VS1.

Pereira et al. [34] solved several benchmark problems of sizes up to 50 vertices using a RLT based relaxation RLT1. RLT1 is an incomplete first level RLT relaxation and is computed by dualizing the symmetry constraint, applying the GL procedure, and using a subgradient algorithm. Another RLT based bound among the strongest relaxations in the literature is RLT2, presented in [37]. The authors of [37] use a dual-ascent procedure for computing their relaxation based on the second-level of RLT.

Computational results

We first present a comparison of our algorithm to the results from [20], where the authors also compute SDP bounds. Their computations were carried out on a machine with 32 GB RAM and two E5645 Intel Xeon processors, with six 2.40GHz cores each.

The structure of Section 6 is analogous to Table 4 in [20] and reads as follows. The rows are grouped into 3 blocks, each reporting the results averaged over all CP instances with the same property as specified in the first column of the table. The first block of rows averages over instances of the same size, the second averages the results over the densities of the graphs, and the last block averages over the different classes of the CP instances. In the second column of Section 6, we report the average gap obtained by the valid lower bound obtained with our PRSM algorithm when stopping after the first outer iteration, cf. (17). We compute the relative gap between that lower bound (LBDNNsubscriptLB𝐷𝑁𝑁\text{LB}_{DNN}LB start_POSTSUBSCRIPT italic_D italic_N italic_N end_POSTSUBSCRIPT) and the best known upper bound (UB) from the literature using 100(UBLBDNN)/UB100𝑈𝐵subscriptLB𝐷𝑁𝑁𝑈𝐵100(UB-\text{LB}_{DNN})/UB100 ( italic_U italic_B - LB start_POSTSUBSCRIPT italic_D italic_N italic_N end_POSTSUBSCRIPT ) / italic_U italic_B. We remark here that the same gap was calculated in Guimarães et al. [20].999There was a typo in that paper that claims differently, but our statement can be easily verified. In the third column, we report the average wall clock time in seconds needed to compute this lower bound. In column 4, we report the average gap obtained by the bound returned by Algorithm 2, cf. (18), and in column 5, the average time needed to compute this bound. In the sixth and seventh column, we list the average gaps and computation times for the bound LAGN of [20], which is used in the best up-to-date exact algorithm for the QMSTP. The average gaps and computation times of LAGP, the second lower bound introduced in [20], are given in the last two columns of Section 6.

The results in Section 6 show that for the CP instances, our lower bounds are, on average, significantly stronger than the SDP bounds LAGN and LAGP. Except for the instances with n{10,15}𝑛1015n\in\{10,15\}italic_n ∈ { 10 , 15 }, the average computation times for solving our relaxations are smaller than those reported for computing SDP bounds LAGN and LAGP. The average time to compute the DNN + CUTS bound, that is (18), over all CP instances is 51515151 seconds, compared to 1 36013601\,3601 360 and 5 65256525\,6525 652 seconds for LAGN and LAGP, respectively. More significant difference in the computation times and relative gaps can be seen for larger instances. One can also observe that the less dense the instances are, the smaller the average relative gap. Furthermore, the effect of adding cuts is more significant for sparse graphs than for dense graphs. Guimarães et al. [20, Table 4] compare their bounds to RLT1 [34], which can be computed approximately three times faster than LAGN but yields much weaker bounds. The average gap of bound RLT2 [37] over all instances of size n35𝑛35n\leq 35italic_n ≤ 35 for each of the four CP classes is at least three times larger than our reported average gaps for (17). Overall, Section 6 shows that, especially for larger CP instances, our bounds are significantly stronger and faster to compute than any other bounds.

In the LABEL:tab:CP1, LABEL:tab:CP2, LABEL:tab:CP3, LABEL:tab:CP4 and LABEL:tab:SV we report the numerical results for all benchmark instances of the test sets CP and SV. The first four columns give details about the instance as the number of vertices, the edge density, the number of edges and an upper bound on the QMST. The next three columns report the valid lower bound (17) obtained after the first outer loop of our PRSM algorithm, the relative gap to the upper bound 100(UBLBDNN)/UB100𝑈𝐵subscriptLB𝐷𝑁𝑁𝑈𝐵100(UB-\text{LB}_{DNN})/UB100 ( italic_U italic_B - LB start_POSTSUBSCRIPT italic_D italic_N italic_N end_POSTSUBSCRIPT ) / italic_U italic_B, and the wall clock time in seconds needed to compute that bound. The last six columns outline the numerical results of our algorithm to compute (18). In columns 8 to 10, we provide the valid lower bound returned by our algorithm, the relative gap, and the wall clock time needed to compute the lower bound. The next two columns list the total number of iterations and the total number of cuts added. In the last column, we report the relative gap closed by adding the RLT-type cuts to the DNN relaxation (17). This performance measurement is computed as 100(LBDNN+CUTSLBDNN)/(UBLBDNN),100subscriptLB𝐷𝑁𝑁𝐶𝑈𝑇𝑆subscriptLB𝐷𝑁𝑁UBsubscriptLB𝐷𝑁𝑁100(\text{LB}_{DNN+CUTS}-\text{LB}_{DNN})/(\text{UB}-\text{LB}_{DNN}),100 ( LB start_POSTSUBSCRIPT italic_D italic_N italic_N + italic_C italic_U italic_T italic_S end_POSTSUBSCRIPT - LB start_POSTSUBSCRIPT italic_D italic_N italic_N end_POSTSUBSCRIPT ) / ( UB - LB start_POSTSUBSCRIPT italic_D italic_N italic_N end_POSTSUBSCRIPT ) , where LBDNN refers to the lower bound (17) reported in column 5 and LBDNN+CUTS is the lower bound (18) reported in column 8 in each table. This metric gives information on how much the gap to the upper bound was improved.

LABEL:tab:CP1, LABEL:tab:CP2, LABEL:tab:CP3 and LABEL:tab:CP4 show that especially for CP instances with n30𝑛30n\geq 30italic_n ≥ 30 vertices and edge density 100% there were only a few violated cuts found. Hence, the relative improvement of the DNN relaxation by adding those cuts was only marginal. One can further observe that the improvement of the relative gap and the relative gap closed, is better for smaller instances. For larger instances, adding cuts such as the RLT-type of the cut-set constraints for subsets S𝑆Sitalic_S of size 2 and larger, might further improve the DNN bounds.

LABEL:tab:SV presents the results of our algorithm for the benchmark set SV introduced in [39]. To the best of our knowledge, there are no results on LAGN, LAGP, and RLT2 for this benchmark set. The by far best lower bound up to date for the SV instance set was VS2. Our DNN relaxation bound without cuts outperforms VS2 for all instances, with the number of edges m45𝑚45m\geq 45italic_m ≥ 45, except for the instance with n=12𝑛12n=12italic_n = 12 and d=67%𝑑percent67d=67\%italic_d = 67 %. Both our relaxations yield a relative gap of less than 1%. The relative gap of VS2 ranges between 0 and 16.4%. The maximum runtime to compute the DNN bound for these instances is less than 5 seconds, whereas computation time for bound VS2 of n=30𝑛30n=30italic_n = 30 and d=100%𝑑percent100d=100\%italic_d = 100 % was reported to be 45 minutes. Computing the DNN bound with cuts is faster than the reported time to compute VS2 for all instances with more than 80 edges.

LABEL:tab:OPsym, LABEL:tab:OPesym and LABEL:tab:OPvsym read similarly to the tables for the CP and SV benchmark sets but the results are averaged over all instances of the same size. Again, to the best of our knowledge, we are not aware of any detailed and complete results for LAGN and LAGP on the OP benchmark set.

LABEL:tab:OPsym reports the results obtained for the benchmark set OPsym. The lower bound (18) with cuts outperforms VS2 for n10𝑛10n\geq 10italic_n ≥ 10, and RLT2 for n8𝑛8n\geq 8italic_n ≥ 8 with the exception of n=18𝑛18n=18italic_n = 18, where the average relative gap for RLT2 is reported to be 33% and is 33.41% for the DNN bound with cuts. For n=50𝑛50n=50italic_n = 50, no bounds were reported. One can observe that the absolute improvement by adding RLT cuts to (17) for n9𝑛9n\geq 9italic_n ≥ 9 is approximately 20.

LABEL:tab:OPesym shows that for the benchmark set OPesym adding the RLT-type cuts to (17) yields a substantial improvement of the relative gap. The DNN lower bound with cuts yields better bounds compared to VS2 but is clearly dominated by RLT1, giving an average relative gap between 0.2%percent0.20.2\%0.2 % and 1.7%percent1.71.7\%1.7 % for instances with n30𝑛30n\leq 30italic_n ≤ 30.

The authors of [39] report that the relative gap of the VS1 lower bound is less than or equal 0.2% for all instances of the class OPvsym. Although, on average, not many violated cuts to be added were found, the averaged relative bound closed is above 49% for all instances except that with n{6,7}𝑛67n\in\{6,7\}italic_n ∈ { 6 , 7 }, where on average only 0.5 violated cuts were found. Considering the instances with n11𝑛11n\geq 11italic_n ≥ 11, the average relative bound closed is even above 80%.

The time limit of 3 hours was reached by all instances from OPesym and OPvsym of size n=50𝑛50n=50italic_n = 50 and almost all of those instances of size n=30𝑛30n=30italic_n = 30. The higher computational costs for those two classes of benchmark instances can be explained, among other things, by the high number of clusters Nmaxsubscript𝑁N_{\max}italic_N start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, cf. Section 5.2. The number of clusters has a direct effect on the computation time of Dykstra’s algorithm, which accounts for a substantial part of the overall computation time. The average number of clusters needed for the OPvsym and OPesym instances are 6.43 and 6.38, whereas the average over all other benchmark instances is 3.26. Note that for those two classes of instances, added RTL-type constraints significantly improve lower bounds. Additionally, as for the CP3 instances, one can observe the higher number of iterations until convergence of the algorithm compared to other classes in our benchmark sets.

This study Guimarães et al. [20]
DNN DNN + CUTS LAGN LAGP
gap (%) time (s) gap (%) time (s) gap (%) time (s) gap (%) time (s)
\csvreader[head to column names, late after line =
\lagptime \csvreader[head to column names, late after line = \dnntime \lbgapp \lbtime \lagngap \lagntime \lagpgap
\lagptime \csvreader[head to column names, late after line = \dnntime \lbgapp \lbtime \lagngap \lagntime \lagpgap
\lagptime \dnntime \lbgapp \lbtime \lagngap \lagntime \lagpgap
Table 1: Comparison to averaged results on lower bounds for CP instances.

7 Conclusion

This paper provides two mixed-integer semidefinite programming formulations for the quadratic minimum spanning tree problem. Each of these formulations includes only one connectivity constraint, which is a linear matrix inequality based on the algebraic connectivity of trees. By exploiting the MISDP formulations, we derive a DNN relaxation for the QMSTP. We also derive the cut-set and RLT-type constraints as Chvátal-Gomory cuts of the MISDP by applying a CG procedure for mixed integer conic programming. The RLT-type constraints are added to the DNN relaxation, resulting in a strengthened DNN relaxation. An iterative cutting plane Peaceman-Rachford splitting method is designed to compute the DNN relaxation with the RLT-type constraints of the QMSTP efficiently.

The computational experiments on the benchmark instances from the literature demonstrate that our bounds significantly outperform existing bounds both in quality and computation time. While other approaches struggled to compute bounds for larger instances, we compute strong bounds in short time.

Given these results, incorporating our new bounds in a branch-and-bound algorithm would be the obvious next step for further research. Another topic for future research would be to incorporate additional RLT-type cut-set constraints to further strengthen the DNN relaxation.

Instance DNN DNN + CUTS
n𝑛nitalic_n d𝑑ditalic_d (%) m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 2: Results for CP1.
Instance DNN DNN + CUTS
n𝑛nitalic_n d𝑑ditalic_d (%) m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 3: Results for CP2.
Instance DNN DNN + CUTS
n𝑛nitalic_n d𝑑ditalic_d (%) m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 4: Results for CP3.
Instance DNN DNN + CUTS
n𝑛nitalic_n d𝑑ditalic_d (%) m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 5: Results for CP4.
Instance DNN DNN + CUTS
n𝑛nitalic_n d𝑑ditalic_d (%) m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 6: Results for SV instances.
Instance DNN DNN + CUTS
n𝑛nitalic_n m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 7: Results for OPsym instances.
Instance DNN DNN + CUTS
n𝑛nitalic_n m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 8: Results for OPesym instances.
Instance DNN DNN + CUTS
n𝑛nitalic_n m𝑚mitalic_m UB LB gap (%) time (s) LB gap (%) time (s) iterations cuts closed (%)
\csvreader[head to column names, late after line =
\relgapclosed
Table 9: Results for OPvsym instances.

References

  • [1] Arjang Assad and Weixuan Xu. The quadratic minimum spanning tree problem. Naval Res. Logist., 39(3):399–417, 1992.
  • [2] Heinz H. Bauschke and Valentin R. Koch. Projection methods: Swiss army knives for solving feasibility and best approximation problems with halfspaces. In Infinite products of operators and their applications, volume 636 of Contemp. Math., pages 1–40. AMS, Providence, RI, 2015.
  • [3] Jeff Bezanson, Alan Edelman, Stefan Karpinski, and Viral B Shah. Julia: A fresh approach to numerical computing. SIAM Review, 59(1):65–98, 2017.
  • [4] Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine Learning, 3(1):1–122, 2011.
  • [5] James P. Boyle and Richard L. Dykstra. A method for finding projections onto the intersection of convex sets in hilbert spaces. In Richard Dykstra, Tim Robertson, and Farroll T. Wright, editors, Advances in Order Restricted Statistical Inference, pages 28–47, New York, NY, 1986.
  • [6] M. Tolga Çezik and Garud N. Iyengar. Cuts for mixed 0-1 conic programming. Math. Program., 104:179–202, 2005.
  • [7] Tzu-Chiang Chiang, Chien-Hung Liu, and Yueh-Min Huang. A near-optimal multicast scheme for mobile ad hoc networks using a hybrid genetic algorithm. Expert Syst. Appl., 33(3):734 – 742, 2007.
  • [8] Wushow Chou and Aaron Kershenbaum. A unified algorithm for designing multidrop teleprocessing networks. IEEE Trans. Commun., 22:1762–1772, 1974.
  • [9] Laurent Condat. Fast projection onto the simplex and the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ball. Math. Program., 158(1–2):575–585, 2016.
  • [10] Roberto Cordone and Gianluca Passeri. Heuristic and exact approaches to the quadratic minimum spanning tree problem. In Seventh Cologne Twente Workshop on Graphs and Comb. Opt., Gargano, Italy, 13-15 May, 2008, pages 52–55. University of Milan, 2008.
  • [11] Roberto Cordone and Gianluca Passeri. Solving the quadratic minimum spanning tree problem. Appl. Math. Comput., 218:11597–11612, 2012.
  • [12] Ante Ćustić, Ruonan Zhang, and Abraham P. Punnen. The quadratic minimum spanning tree problem and its variations. Discrete Optim., 27:73–87, 2018.
  • [13] D. Cvetković, M. Čangalović, and V. Kovačević-Vujčić. Semidefinite programming methods for the symmetric traveling salesman problem. In G. Cornuj́ols, R.E. Burkard, and G.J. Woeginger, editors, Integer programming and Combinatorial Optimization (IPCO 1999), volume 1610 of Lecture Notes in Comput. Sci. Springer, Berlin, Heidelberg, 1999.
  • [14] Frank de Meijer and Renata Sotirov. The Chvátal-Gomory procedure for integer SDPs with applications in combinatorial optimization. Math. Program., 2024.
  • [15] Frank de Meijer and Renata Sotirov. On integrality in semidefinite programming for discrete optimization. SIAM J. Optim., 34(1), 2024.
  • [16] Frank de Meijer, Renata Sotirov, Angelika Wiegele, and Shudian Zhao. Partitioning through projections: Strong SDP bounds for large graph partition problems. Comput. Oper. Res., 151:106088, 2023.
  • [17] Miroslav Fiedler. Algebraic connectivity of graphs. Czechoslov. Math. J., 23(2):298–305, 1973.
  • [18] Paul C. Gilmore. Optimal and suboptimal algorithms for the quadratic assignment problem. J Soc Ind Appl Math, 10(2):305–313, 1962.
  • [19] Robert Grone and Russell Merris. Ordering trees by algebraic connectivity. Graphs Combin., 6:229–237, 1990.
  • [20] Dilson A. Guimarães, Alexandre S. da Cunha, and Dilson L. Pereira. Semidefinite programming lower bounds and branch-and-bound algorithms for the quadratic minimum spanning tree problem. European J. of Oper. Res., 280(1):46–58, 2020.
  • [21] Bingsheng He, Feng Ma, and Xiaoming Yuan. Convergence study on the symmetric version of ADMM with larger step sizes. SIAM J. on Imaging Sci., 9(3):1467–1501, 2016.
  • [22] Hao Hu, Renata Sotirov, and Henry Wolkowicz. Facial reduction for symmetry reduced semidefinite and doubly nonnegative programs. Math. Program., 200(1):475–529, 2023.
  • [23] Qi Huangfu and J. A. Julian Hall. Parallelizing the dual revised simplex method. Math. Program. Comput., 10(1):119–142, 2018.
  • [24] Joseph B. Kruskal. On the shortest spanning subtree of a graph and the traveling salesman problem. In Proc. Amer. Math. Soc., 7, 1956.
  • [25] Eugene L. Lawler. The quadratic assignment problem. Manag. Sci., 9(4):586–599, 1963.
  • [26] Xinxin Li, Ting Kei Pong, Hao Sun, and Henry Wolkowicz. A strictly contractive Peaceman-Rachford splitting method for the doubly nonnegative relaxation of the minimum cut problem. Comput. Optim. Appl., 78(3):853–891, 2021.
  • [27] Pierre-Louis Lions and Bertrand Mercier. Splitting algorithms for the sum of two nonlinear operators. SIAM J. Numer. Anal., 16(6):964–979, 1979.
  • [28] Miles Lubin, Oscar Dowson, Joaquim Dias Garcia, Joey Huchette, Benoît Legat, and Juan Pablo Vielma. JuMP 1.0: Recent improvements to a modeling language for mathematical optimization. Math. Program. Comput., 15:581–589, 2023.
  • [29] Danilo E. Oliveira, Henry Wolkowicz, and Yangyang Xu. ADMM for the SDP relaxation of the QAP. Math. Program. Comput., 10:631–658, 2018.
  • [30] Temel Öncan and Abraham P. Punnen. The quadratic minimum spanning tree problem: A lower bounding procedure and an efficient search algorithm. Comput. Oper. Res., 37(10):176–1773, 2010.
  • [31] Gintaras Palubeckis, Dalius Rubliauskas, and Aleksandras Targamadz. Metaheuristic approaches for the quadratic minimum spanning tree problem. Inform. Tech. Control, 29:257––268, 2010.
  • [32] Donald W. Peaceman and Henry H. Rachford. The numerical solution of parabolic and elliptic differential equations. SIAM J. Appl. Math., 3(1):28–41, 1955.
  • [33] Dilson L. Pereira, Michel Gendreau, and Alexandre S. da Cunha. Branch-and-cut and branch-and-cut-and-price algorithms for the adjacent only quadratic minimum spanning tree problem. Networks, 65:367–379, 2015.
  • [34] Dilson L. Pereira, Michel Gendreau, and Alexandre S. da Cunha. Lower bounds and exact algorithms for the quadratic minimum spanning tree problem. Comput. Oper. Res., 63:149 – 160, 2015.
  • [35] Robert C. Prim. Shortest connection networks and some generalizations. The Bell Systems Technical Journal, 36(6):1389–1401, 1957.
  • [36] Abraham P. Punnen. Combinatorial optimization with multiplicative objective function. Int. J. Oper. Quant. Manag., 7:205–209, 2001.
  • [37] Borzou Rostami and Federico Malucelli. Lower bounds for the quadratic minimum spanning tree problem based on reduced cost computation. Comput. Oper. Res., 64:178–188, 2015.
  • [38] Hanif D. Sherali and Warren P. Adams. A reformulation-linearization technique for solving discrete and continuous nonconvex problems, volume 31. SSBM, 2013.
  • [39] Renata Sotirov and Zoe Verchére. The quadratic minimum spanning tree problem: Lower bounds via extended formulations. Vietnam J. Math., 2024.
  • [40] Henry Wolkowicz, Romesh Saigal, and Lieven Vandenberghe. Handbook of Semidefinite Programming: Theory, Algorithms, and Applications. Internat. Ser. Oper. Res. Management Sci. Springer, 2000.