Balance with Memory in Signed Networks via Mittag-Leffler Matrix Functions

Yu Tian Nordita, Stockholm University and KTH Royal Institute of Technology, SE-106 91 Stockholm, Sweden (yu.tian@su.se). Ernesto Estrada Institute of Cross-Disciplinary Physics and Complex Systems, IFISC (UIB-CSIC), Palma de Mallorca, 07122, Spain (estrada@ifisc.uib-csic.es).

Abstract

Structural balance is an important characteristic of graphs/networks where edges can be positive or negative, with direct impact on the study of real-world complex systems. When a network is not structurally balanced, it is important to know how much balance still exists in it. Although several measures have been proposed to characterize the degree of balance, the use of matrix functions of the signed adjacency matrix emerges as a very promising area of research. Here, we take a step forward by using Mittag-Leffler (ML) matrix functions to quantify the notion of balance of signed networks. We show that the ML balance index can be obtained from first principles on the basis of a nonconservative diffusion dynamics, and that it accounts for the memory of the system about the past, by diminishing the penalization that long cycles typically receive in other matrix functions. Finally, we demonstrate the important information in the ML balance index with both artificial signed networks and real-world networks in various contexts, ranging from biological and ecological to social ones.

1 Introduction

The use of matrix functions [31] has represented a significant advance in the development of mathematical models of networks $G=\left(V,E\right)$ in the last 20 years [7, 21]. In particular, the use of functions of the adjacency matrix $A$ of a network, $f\left(A\right)$, has impacted the study of vertex centrality measures [7] as well as our understanding of the navigability of networks [17, 44]. The mathematical roots of these developments come from the fact that $\left(A^{k}\right)_{uv}$ counts the number of walks of length $k$ connecting the vertices $u,v\in V$, where a walk is a sequence of (not necessarily different) consecutive vertices and edges in the network (see [8, 25, 33] for original sources). Therefore, defining matrix functions of the type $f\left(A\right)=\sum_{k=0}^{\infty}c_{k}A^{k}$ allows one to quantify the “importance”, or centrality, of a vertex $u\in V$, by taking $\left(f\left(A\right)\right)_{uu}$ as a weighted count of all self-returning walks starting at vertex $u$, giving more weight to the shorter walks than to the longer ones through the constants $\{c_{k}\}$ [22]. Similarly, the term $\left(f\left(A\right)\right)_{uv}$ accounts for the “communicability” capacity between the vertices [18]. Building on the field of Euclidean matrix theory [6, 28, 35], circum-Euclidean distances [1, 47], a.k.a. spherical Euclidean distances, between pairs of vertices can also be obtained by defining $\left(f\left(A\right)\right)_{uu}+\left(f\left(A\right)\right)_{vv}-2\left(f\left(A\right)\right)_{uv}$ for positive-definite matrix functions $f\left(A\right)$ [12] and angles $\left(f\left(A\right)\right)_{uv}/\sqrt{\left(f\left(A\right)\right)_{uu}\left(f\left(A\right)\right)_{vv}}$ [19] (see also [23]).

The historical background for the use of matrix functions to study networks can be traced back to the work of Katz [33], who proposed $\left(I-\varepsilon A\right)^{-1}$, with $0<\varepsilon<\left(\lambda_{1}\left(A\right)\right)^{-1}$ where $\lambda_{1}\left(A\right)$ is the spectral radius of $A$, to define a vertex centrality index, nowadays known as Katz centrality. However, the resolvent $\left(I-\varepsilon A\right)^{-1}$ depends on the parameter $\varepsilon$, which is upper-bounded by the reciprocal of $\lambda_{1}\left(A\right)$. When $\lambda_{1}\left(A\right)$ is large, $\varepsilon$ is forced to be small, and most of the structural information stored in $A$ makes almost no contribution. This has recently been shown in examples where $\lambda_{1}\left(A\right)\gg 1$ and the resolvent of $A$ does not provide reasonable results [15]. Hence, the definitions of subgraph centrality $\left(e^{A}\right)_{uu}$ [22] and communicability $\left(e^{A}\right)_{uv}$ [18] have triggered much recent interest. Another advantage of the matrix exponential is that it can be interpreted and derived in different contexts, ranging from coupled quantum harmonic oscillators [20] and compartmental epidemiological models [36] to nonconservative diffusion [11]. Last but not least, we can generalize $e^{A}=\sum_{k=0}^{\infty}\left(k!\right)^{-1}A^{k}$ by replacing the factorial with its more general definition based on the Euler gamma function: $E_{\alpha,\beta}\left(A\right)=\sum_{k=0}^{\infty}\left(\varGamma\left(\alpha k+\beta\right)\right)^{-1}A^{k}$, $\alpha,\beta>0$. It reduces to the exponential when $\alpha=1$ and $\beta=1$, but in general defines the Mittag-Leffler matrix functions of $A$. The idea of using $E_{\alpha,\beta}\left(A\right)$ to define centrality and communicability indices was previously developed independently by Arrigo and Durastante [4] and by Estrada [14].

In this work, we take a step forward by using Mittag-Leffler matrix functions to quantify the degree of balance of signed graphs. A signed graph $G_{s}$ can have both positive and negative edges [51]. The signs of the edges emerge in various real-world scenarios. For instance, in social networks, positive signs may represent friendship, collaboration, alliances, etc., while negative ones may represent enmity, hostility, conflicts, etc. [2, 32]. In voting systems, they may represent whether two voters support the same or different candidates; in recommendation systems, they can correspond to whether two users recommend the same product or have discrepant opinions about it. In transcriptional networks, edges represent the action of a transcription factor on one of its target genes, and the sign means activation ($+$) or inhibition ($-$) [46]. Cooperation and competition between species in ecological networks [43] and between products in economic networks [48, 50] can also be assigned to positive and negative edges, respectively.

The important notion of balance can be defined through the sign of a cycle, which is the product of the signs of its edges [9, 29]. Specifically, a graph is balanced if and only if all its cycles are positive; otherwise it is unbalanced. If we focus on a signed triangle, it is balanced if either (i) all its edges are positive or (ii) two edges are negative and one is positive. The stability of the first triangle is self-evident, while in the second, the structure indicating that the “enemy of my enemy is my friend” conveys a sense of stability through the formation of a coalition against the common enemy. The all-negative triangle is clearly unbalanced, the same as the one with only one negative edge. In the latter case, there are clear tensions between the two vertices sharing the negative edge and the one with whom they share positive ones. Now think about the tensions in a longer cycle in which all pairs are friends except for one pair in conflict. We would expect the tensions among the members to decay as the length of the cycle increases. Hence, Estrada and Benzi have proposed the index $K\left(G_{s}\right)=tr\left[e^{A}\right]/tr\left[e^{\left|A\right|}\right]$, where $\left|A\right|$ is the entrywise absolute value of $A$, to quantify the degree of balance [16]. In this way, $K\left(C_{3}^{-}\right)\approx 0.686$, $K\left(C_{4}^{-}\right)\approx 0.915$, and $K\left(C_{5}^{-}\right)\approx 0.983$, where $C_{k}^{-}$ denotes the cycle of length $k$ with one negative edge. Further, $K\left(C_{10}^{-}\right)\approx 0.99999947$, which is very close to balance (where $K=1$). Could it be that the factorial penalization used in the exponential is too heavy and fools us in this case? Here, by the time a closed walk of length $10$ is completed, the information contained in the negative edge present in this cycle is almost completely forgotten.

In this paper, we start by showing that the balance index $K\left(G_{s}\right)$ can be obtained from first principles on the basis of a nonconservative (NC) diffusion dynamic taking place on the graph $G_{s}$ relative to its underlying unsigned graph. Using this approach, we generalize the NC diffusion on graphs to a temporal-fractional model using Caputo fractional derivative. In this way, we generalize the balance index $K(G_{s})$ to indices based on Mittag-Leffler (ML) matrix functions of $A$ . These new indices are derived from first-principles diffusion processes which are temporally non-local. Therefore, the ML balance index accounts for certain memory of the system about the past, by diminishing the penalization that long cycles typically receive. We illustrate our results with the use of some artificial signed graphs, as well as real-world networks representing gene transcription networks, ecological competition between plant species in vast regions of Spain, and social networks in rural villages in Honduras.

2 Preliminaries

Let us consider an undirected connected signed graph $G_{s}=\left(V,E,\rho\right)$ where $V=\{1,2,\dots,n\}$ is the vertex set, an edge $(i,j)\in E$ is an unordered pair of two distinct nodes in the set $V$, and $\rho:E\to\varSigma$, $\varSigma=\left\{\pm 1\right\}$, associates each edge with a sign. Let $A\left(G_{s}\right)=A\in\mathbb{R}^{n\times n}$ be the adjacency matrix of $G_{s}$. Specifically, if there is no edge between nodes $i,j$, $A_{ij}=0$; otherwise, $A_{ij}=\rho((i,j))$ denotes the edge sign. We will also consider the underlying graph $\tilde{G}$, in which the edge signs are ignored, and its unsigned adjacency matrix $\left|A\right|$, where the absolute values are taken entrywise.

2.1 Structural balance

A fundamental notion in the study of signed networks is the so-called structural balance [9, 30]. A signed graph is structurally balanced if and only if there is no cycle with an odd number of negative edges, which can be effectively defined through the following theorem.

Theorem 2.1 (structure theorem for balance [29]).

A signed graph $G$ is structurally balanced if and only if there is a bipartition of the node set into $V=V_{1}\cup V_{2}$ with $V_{1}$ and $V_{2}$ being mutually disjoint and one of them being nonempty, s.t. any edge between the two is negative while any edge within each node subset is positive.

There are several indices proposed to quantify the degree of balance, e.g., [13, 16, 24, 27, 29, 34, 45, 49]. One of the first measures based on the walk lengths was proposed by Estrada and Benzi [16],

K\left(G\right)\coloneqq\dfrac{Tr\left(e^{A}\right)}{Tr\left(e^{\left|A\right|}\right)}=\dfrac{\sum_{j=1}^{n}e^{\lambda_{j}}}{\sum_{j=1}^{n}e^{\mu_{j}}},

(2.1)

where $\lambda_{j}$ and $\mu_{j}$ denote the eigenvalues of $A$ and $\left|A\right|$ , respectively.
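As a quick numerical check of Eq. (2.1), the following minimal sketch evaluates $K$ for small signed cycles with one negative edge. It is not the authors' implementation: the helper names (`cycle_adjacency`, `exp_trace`, `balance_index`) and the truncation at 60 terms are illustrative assumptions, and the traces are obtained from the power series $Tr\left(e^{A}\right)=\sum_{k}Tr\left(A^{k}\right)/k!$ in plain Python rather than from the eigenvalues.

```python
from math import factorial


def cycle_adjacency(n, negative_edges=()):
    """Signed adjacency matrix of the n-cycle (n >= 3); edge i joins i and (i + 1) % n."""
    A = [[0] * n for _ in range(n)]
    for i in range(n):
        j = (i + 1) % n
        sign = -1 if i in negative_edges else 1
        A[i][j] = A[j][i] = sign
    return A


def matmul(X, Y):
    """Product of two square integer matrices (exact arithmetic)."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def exp_trace(A, terms=60):
    """Truncated series sum_k Tr(A^k) / k!, approximating Tr(exp(A))."""
    n = len(A)
    P = [[int(i == j) for j in range(n)] for i in range(n)]  # A^0 = I
    total = 0.0
    for k in range(terms):
        total += sum(P[i][i] for i in range(n)) / factorial(k)
        P = matmul(P, A)
    return total


def balance_index(A):
    """K(G) = Tr(exp(A)) / Tr(exp(|A|)), Eq. (2.1)."""
    abs_A = [[abs(x) for x in row] for row in A]
    return exp_trace(A) / exp_trace(abs_A)


for n in (3, 4, 5):
    print(n, round(balance_index(cycle_adjacency(n, negative_edges={0})), 3))
```

The printed values can be compared with those quoted in the Introduction for $C_{3}^{-}$, $C_{4}^{-}$, and $C_{5}^{-}$; an all-positive cycle gives exactly $1$ because $A=\left|A\right|$.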

We now introduce switching equivalence, which generalizes the idea of balance.

Definition 2.2.

The operation of reversing the signs of all edges connecting a subset $S\subseteq V$ and its complement is called switching the subset $S$ . Two signed configurations $\rho,\rho^{\prime}:E\to\varSigma$ are said to be switching equivalent if there exists $S\subseteq V$ such that $\rho^{\prime}$ can be obtained from $\rho$ by switching the subset $S$ , denoted by $\rho\approx\rho^{\prime}$ .

Switching equivalence is an equivalence relation on sign configurations of a fixed underlying graph, and the corresponding equivalence classes are called switching classes. Clearly, balanced graphs comprise one switching class. It is also known that the spectra of signed graphs are switching invariant [5, 51].
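Switching can be written compactly as a similarity transformation: if $D$ is the diagonal matrix with $D_{ii}=-1$ for $i\in S$ and $D_{ii}=+1$ otherwise, the switched adjacency matrix is $DAD$, which is why the spectrum does not change. The sketch below illustrates this on a small example; the function names and the example graph are hypothetical and chosen only for illustration.

```python
def switch(A, S):
    """Return D A D, i.e. flip the sign of every edge with exactly one endpoint in S."""
    n = len(A)
    d = [-1 if i in S else 1 for i in range(n)]
    return [[d[i] * A[i][j] * d[j] for j in range(n)] for i in range(n)]


def power_traces(A, kmax):
    """Exact values of Tr(A^k) for k = 1, ..., kmax."""
    n = len(A)
    P = [row[:] for row in A]
    traces = []
    for _ in range(kmax):
        traces.append(sum(P[i][i] for i in range(n)))
        P = [[sum(P[i][m] * A[m][j] for m in range(n)) for j in range(n)] for i in range(n)]
    return traces


# Hypothetical example: the 4-cycle 0-1-2-3-0 with chord 0-2 and one negative edge (0, 1).
A = [
    [0, -1, 1, 1],
    [-1, 0, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
]
B = switch(A, S={0})

print(B[0])                                      # every edge at vertex 0 changes sign
print(power_traces(A, 6) == power_traces(B, 6))  # power traces are unchanged
```

Since the traces of the first $n$ powers determine the eigenvalues of an $n\times n$ matrix, equal power traces are a direct check of the spectral invariance stated above. Switching the same subset twice returns the original signs.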

2.2 Laplacians and Mittag-Leffler matrix function

We consider the signed Laplacian $L_{A}$ as

L_{A}\left(i,j\right)=\left\{\begin{array}{cc}\sum_{k=1}^{n}\left|A_{ik}\right|&i=j\\ -A_{ij}&i\neq j.\end{array}\right.

(2.2)

It governs the diffusion dynamics by Altafini’s consensus model [3] that we will talk about in more detail later. We also introduce the Lerman-Ghosh Laplacian [26, 37],

\displaystyle L_{\chi}=\chi I-A,

(2.3)

where $\chi\in\mathbb{R}^{+}$ , and $I$ is the identity matrix. The index we will propose in this paper is closely related to the dynamics governed by this Laplacian, and we will show that it shares an important property with the dynamics by the signed Laplacian. Specifically, we will apply the time-fractional Caputo derivative,

\displaystyle D_{t}^{\alpha}u(t)=\frac{1}{\Gamma(1-\alpha)}\int_{0}^{t}\frac{u^{\prime}(\tau)}{(t-\tau)^{\alpha}}d\tau,

(2.4)

where $u^{\prime}(\tau)$ denotes the usual derivative. We assume that $u$ is differentiable and that the convolution is well defined. Here, $0<\alpha\leq 1$, $0<t<\infty$, and $\varGamma\left(x\right)$ is the Euler gamma function. We also recall that a diffusion process is said to be conservative if the number of diffusive particles is constant over time; otherwise, it is called a nonconservative diffusion [11]. Finally, we introduce the building block of the balance index we will propose, the Mittag-Leffler (ML) function of a matrix, say $M$,

E_{\alpha}\left(M\right)\coloneqq\sum_{k=0}^{\infty}\dfrac{M^{k}}{\varGamma\left(\alpha k+1\right)}.

(2.5)

These matrix functions were previously studied for networks by Arrigo and Durastante [4], who also proposed using $E_{\alpha}^{\gamma}\coloneqq E_{\alpha}\left(\gamma M\right)$ with $\gamma\leq\Gamma(\alpha+1)$ to account for a fair contribution of walks in graphs. In our implementations, we adopt this suggestion, with $\gamma=\Gamma(\alpha+1)$.
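To make the series (2.5) concrete, the sketch below evaluates its scalar version by direct truncation and compares it with three standard closed forms: $E_{1}\left(z\right)=e^{z}$, $E_{2}\left(z\right)=\cosh\sqrt{z}$ for $z\geq 0$, and $E_{1/2}\left(z\right)=e^{z^{2}}\operatorname{erfc}\left(-z\right)$. The function name `mittag_leffler` and the number of terms are assumptions made for illustration, and plain truncation is only adequate for moderate $\left|z\right|$; it is not meant as a general-purpose algorithm.

```python
from math import cosh, erfc, exp, lgamma, sqrt


def mittag_leffler(z, alpha, terms=200):
    """Truncated series E_alpha(z) = sum_k z^k / Gamma(alpha k + 1) for a real scalar z."""
    total = 0.0
    power = 1.0  # holds z^k, updated iteratively
    for k in range(terms):
        # exp(-lgamma(x)) equals 1 / Gamma(x) and avoids overflow of Gamma for large x
        total += power * exp(-lgamma(alpha * k + 1.0))
        power *= z
    return total


z = 1.5
print(mittag_leffler(z, 1.0), exp(z))                  # alpha = 1: exponential
print(mittag_leffler(z, 2.0), cosh(sqrt(z)))           # alpha = 2: cosh(sqrt(z))
print(mittag_leffler(z, 0.5), exp(z * z) * erfc(-z))   # alpha = 1/2: exp(z^2) erfc(-z)
```

For a symmetric matrix $M$ the same scalar function can be applied to the eigenvalues, or the series can be summed over the traces $Tr\left(M^{k}\right)$ when only $Tr\left(E_{\alpha}\left(M\right)\right)$ is needed.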

3 Motivation

There are $2^{15}$ ways to put signs on the edges of the Petersen graph, of which only five (excluding the unsigned one) are essentially different [52] (see Fig. 1). We consider the diffusion dynamics given by Altafini’s consensus model [3]. Let $u\left(t\right)$ be the vector representing the state of the vertices in $G_{s}$ at time $t$, with $u\left(0\right)=u^{0}$, and let $\dot{u}\left(t\right)$ be the vector of their time derivatives. Then,

\dot{u}_{i}\left(t\right)=-\sum_{\left(i,j\right)\in E}\left|A_{ij}\right|\left(u_{i}\left(t\right)-\textnormal{sgn}\left(A_{ij}\right)u_{j}\left(t\right)\right),

(3.1)

where $\text{sgn}(\cdot)$ returns the sign of the value. Hence,

\dot{u}\left(t\right)=-L_{A}u\left(t\right);u\left(0\right)=u^{0}.

(3.2)

Figure 1: The five switching isomorphism types of the signed Petersen graph, excluding the unsigned one. Solid lines represent positive edges and dashed lines represent negative ones.

We consider the convergence time, $t_{c}$, at which the state values are sufficiently close to each other, with tolerance $10^{-5}$, i.e., $\left|u_{v}\left(t_{c}\right)-u_{u}\left(t_{c}\right)\right|<10^{-5}$, $\forall u,v\in G_{s}$. We note that the graphs in Fig. 1 differ only in their sign patterns, and we denote a negative cycle of length $k$ by $C_{k}^{-}$. We find that the graph with the most $C_{5}^{-}$, graph e), reaches consensus fastest, with 12 $C_{5}^{-}$ and $t_{c}=11$. The graph with the fewest $C_{5}^{-}$, graph a), is the slowest, with 4 $C_{5}^{-}$ and $t_{c}=48$. It is known that consensus is never reached if a graph is balanced; instead, the dynamics reaches a dissensus state. Therefore, graph a) is more similar to a balanced graph than graph e), in the sense that it takes longer to reach consensus. However, this simple arithmetic breaks down for graphs b) and d): both have 6 $C_{5}^{-}$, yet the first takes almost twice as long as the second to reach consensus ($24$ versus $14$). We can then extend the analysis to consider $C_{6}^{-}$, which clearly indicates that graph d) is less similar to a balanced graph than graph b), with $10$ versus $6$ $C_{6}^{-}$, respectively. Under this kind of semiquantitative analysis, a problem emerges when considering graph c) with $t_{c}=22$, which has $8$ $C_{5}^{-}$, more than graphs b) and d), but only $4$ $C_{6}^{-}$, fewer than either of those two graphs. We defer more details to the Supplementary Material.

Since the Petersen graph is cubic, $L_{A}\left(i,i\right)=3$, $\forall i\in V$, which allows us to write $L_{A}=L_{\chi}=\chi I-A$, where $L_{\chi}$ is the Lerman-Ghosh Laplacian (2.3) introduced in Section 2, and $\chi=3$ here. The solution to the Cauchy problem (3.2) is

u\left(t\right)=e^{-tL_{\chi}}u^{0}=e^{-t\chi}e^{tA}u^{0}.

(3.3)

Then, at a given time $t$ , the concentration at a vertex $v$ is

u_{v}\left(t\right)=\sum_{j}\left(e^{-t\chi}e^{tA}\right)_{v,j}u_{j}^{0}.

(3.4)

Suppose that the initial concentration is totally located at the vertex $v$ , $u_{j}^{0}=\delta_{j,v}$ , where $\delta_{i,j}$ is the Kronecker delta, then

u_{v}\left(t\right)=e^{-t\chi}\left(e^{tA}\right)_{vv}.

(3.5)

Then the total concentration remaining at the vertices when the initial concentration has been totally allocated at them is

T_{s}\coloneqq\sum_{v}u_{v}\left(t\right)=e^{-t\chi}\sum_{v}\left(e^{tA}\right)_{vv}=e^{-t\chi}Tr\left(e^{tA}\right),

(3.6)

where $Tr\left(\cdot\right)$ returns the trace of a matrix. In a similar way, we can ignore the edge sign and consider the underlying graph $\tilde{G}$ ,

T_{u}\coloneqq\sum_{v}u_{v}\left(t\right)=e^{-t\chi}\sum_{v}\left(e^{t\left|A\right|}\right)_{vv}=e^{-t\chi}Tr\left(e^{t\left|A\right|}\right).

(3.7)

A way to account for the “influence” of the edge signs on the diffusion is $T_{s}/T_{u}$ , such that for $t=1$ we recover the measure $K(G)$ in (2.1). This balance index can be easily generalized by taking any value of $t=\beta$ , $K(G,\beta)=Tr(\exp(\beta A))/Tr(\exp(\beta\left|A\right|))$ .

For the five essentially different signed Petersen graphs, although there is a good correlation between $K\left(G\right)$ and $t_{c}$ (Pearson correlation: $r^{2}\approx 0.924$), there is an important inversion in the values of $K\left(G\right)$ for graphs c) and d). Specifically, for $K(G)$, graph c) has value $0.941$ while graph d) has value $0.947$, but graph c) has larger $t_{c}$ than graph d); see the Supplementary Material for details. The problem seems to be produced by the differences in the penalization that the cycles of length 5 and 6 receive in the exponential function. To see this, we examine the difference between $Tr\left(A^{k}\right)/k!$ for graph c) and graph d) for values of $1\leq k\leq 10$; see Fig. 2. We find that the largest contribution is the one of $k=5$, which is about $-0.333$, followed by that of $k=6$, which is $0.2$. This reflects the fact that graph c) has more negative cycles of length 5 than d), that d) has more negative cycles of length 6 than c), but that cycles of length 6 are much more heavily penalized than those of length 5. We can put it in the following way. If one has to pay \$1 for every $C_{5}^{-}$ but only \$0.10 for each $C_{6}^{-}$, graph c) will have to pay \$8.40, while only \$7.00 is needed for graph d). But if the penalty for $C_{6}^{-}$ increases to \$0.50, then graph c) will need to pay \$10 while \$11 will be paid by graph d). Therefore, the problem we raise in this paper is how to tune the penalization of longer cycles, such that their contribution to the balance/unbalance of networks becomes more relevant when necessary. We propose to achieve this while keeping the first principles explained before, which connect the balance index $K\left(G\right)$ with a (nonconservative) diffusion on graphs.
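The two leading values can be recovered by hand from the cycle counts given above, under two standard facts that are assumed here rather than taken from the text: the Petersen graph contains 12 cycles of length 5 and 10 cycles of length 6, and in a graph of girth 5 the only closed walks of length 5 or 6 that can be negative are single traversals of a 5-cycle (10 closed walks per cycle) or of a 6-cycle (12 closed walks per cycle). Let $A_{c}$ and $A_{d}$ denote the signed adjacency matrices of graphs c) and d), a notation introduced only for this check. Graph c) has 8 $C_{5}^{-}$ and 4 $C_{6}^{-}$, and graph d) has 6 $C_{5}^{-}$ and 10 $C_{6}^{-}$, so that

\dfrac{Tr\left(A_{c}^{5}\right)-Tr\left(A_{d}^{5}\right)}{5!}=\dfrac{10\left[\left(12-2\cdot 8\right)-\left(12-2\cdot 6\right)\right]}{120}=-\dfrac{40}{120}\approx-0.333,

\dfrac{Tr\left(A_{c}^{6}\right)-Tr\left(A_{d}^{6}\right)}{6!}=\dfrac{12\left[\left(10-2\cdot 4\right)-\left(10-2\cdot 10\right)\right]}{720}=\dfrac{144}{720}=0.2.

The trivial closed walks of length 6 are identical in both graphs, since they share the same underlying graph, and therefore cancel in the difference.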

We end this section by proving that Altafini’s model of consensus on signed graphs is nonconservative, unless the graph is balanced and the initial vector is the eigenvector corresponding to the eigenvalue $0$. We should also note that when the graph does not contain any negative edge, Altafini’s consensus model reduces to the consensus model with the standard graph Laplacian, which is conservative.

Proposition 3.1.

The diffusion by Altafini’s consensus model is nonconservative, unless the graph is balanced and the initial vector $u^{0}$ is the eigenvector corresponding to the smallest eigenvalue $0$.

Proof.

The solution to Altafini’s consensus model is

\displaystyle u(t)=e^{-tL_{A}}u^{0}.

(3.8)

Let $0\leq\mu_{1}\leq\mu_{2}\leq\dots\leq\mu_{n}$ be the eigenvalues of $L_{A}$ , and let $\phi_{i}$ be the orthonormal eigenvector associated with $\mu_{i}$ . Then,

\displaystyle u(t)=e^{-t\mu_{1}}\phi_{1}\phi_{1}^{T}u^{0}+e^{-t\mu_{2}}\phi_{2}\phi_{2}^{T}u^{0}+\dots+e^{-t\mu_{n}}\phi_{n}\phi_{n}^{T}u^{0}.

(3.9)

We know that if a signed graph is unbalanced, $\mu_{1}>0$ . Then,

\displaystyle\lim_{t\to\infty}u(t)=\mathbf{0},

where $\mathbf{0}$ is the all-zero vector. Hence, the diffusion is nonconservative.

We now consider the balanced case, where $\mu_{1}=0$ and $\mu_{2}>0$ . Then,

\displaystyle\lim_{t\to\infty}u(t)=\phi_{1}\phi_{1}^{T}u^{0}.

Hence $\lim_{t\to\infty}\mathbf{1}^{T}u(t)=\mathbf{1}^{T}u^{0}$ if and only if $u^{0}=\phi_{1}$ . In the case of $u^{0}=\phi_{1}$ , from Eq. (3.9), we have $\mathbf{1}^{T}u(t)=\mathbf{1}^{T}u^{0}$ , by the orthogonality of eigenvectors. Hence, the diffusion is conservative if and only if the graph is balanced and $u^{0}=\phi_{1}$ . ∎
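The two limits used in the proof can be observed numerically on signed triangles. The sketch below integrates $\dot{u}=-L_{A}u$ with the explicit Euler method; the integrator, the step size, the final time, and the two example triangles are illustrative assumptions and are not taken from the experiments reported in this paper.

```python
def signed_laplacian(A):
    """L_A = diag(sum_j |A_ij|) - A, Eq. (2.2); A has zero diagonal."""
    n = len(A)
    return [[(sum(abs(x) for x in A[i]) if i == j else 0) - A[i][j] for j in range(n)]
            for i in range(n)]


def altafini(A, u0, t_end=20.0, dt=0.001):
    """Integrate du/dt = -L_A u with explicit Euler steps (a simple illustrative choice)."""
    L = signed_laplacian(A)
    n = len(A)
    u = list(u0)
    for _ in range(round(t_end / dt)):
        du = [-sum(L[i][j] * u[j] for j in range(n)) for i in range(n)]
        u = [u[i] + dt * du[i] for i in range(n)]
    return u


# Triangles on vertices 0, 1, 2; all mass starts at vertex 0.
unbalanced = [[0, 1, -1], [1, 0, 1], [-1, 1, 0]]   # one negative edge
balanced = [[0, -1, -1], [-1, 0, 1], [-1, 1, 0]]   # two negative edges, V1 = {0}, V2 = {1, 2}
u0 = [1.0, 0.0, 0.0]

u_unbalanced = altafini(unbalanced, u0)
u_balanced = altafini(balanced, u0)
print(u_unbalanced)  # decays towards the zero vector
print(u_balanced)    # approaches phi_1 phi_1^T u0 = (1/3, -1/3, -1/3)
```

In the unbalanced case the state vanishes, and in the balanced case the entries settle at values of opposite sign on the two sides of the bipartition, so the sum of the entries is not preserved for this initial condition.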

4 Main results

4.1 Nonconservative fractional diffusion and balance

We know that Altafini’s dynamics on signed graphs is nonconservative. Let us now consider a more general nonconservative diffusive model on the signed graph based on the Lerman-Ghosh Laplacian $L_{\chi}$ [26, 37]. To make the process more general, we also replace the standard time derivative $\dot{u}\left(t\right)$ by the time-fractional Caputo derivative $D_{t}^{\alpha}$ as in Eq. (2.4). Hence, the nonconservative diffusion on the signed graph we consider is

D_{t}^{\alpha}u\left(t\right)=-L_{\chi}u\left(t\right);u\left(0\right)=u^{0}.

(4.1)

The solution of Eq. (4.1) is given by

\displaystyle u(t)=E_{\alpha}(-t^{\alpha}L_{\chi})u^{0},

(4.2)

where $E_{\alpha}(\cdot)$ is the Mittag-Leffler function as in Eq. (2.5). Let us focus again on the concentration at a vertex designated by $v$ ,

u_{v}\left(t\right)=\sum_{j}\left(E_{\alpha}\left(-t^{\alpha}L_{\chi}\right)\right)_{vj}u_{j}^{0},

(4.3)

and if the initial concentration is totally located at the vertex $v$ , $u_{j}^{0}=\delta_{j,v}$ , we get

u_{v}\left(t\right)=\left(E_{\alpha}\left(-t^{\alpha}L_{\chi}\right)\right)_{vv}.

(4.4)

One main difference between the exponential and the Mittag-Leffler function is that in general $E_{\alpha}\left(P+Q\right)\neq E_{\alpha}\left(P\right)E_{\alpha}\left(Q\right)$ , even when $P$ and $Q$ commute [42]. This equality holds in general only when (i) $P$ and $Q$ commute and (ii) $\alpha=1$ .

Here, we consider the special case when $\chi=0$ , such that

\displaystyle D_{t}^{\alpha}u\left(t\right)=Au\left(t\right);u\left(0\right)=u^{0}.

(4.5)

The concentration at vertex $v$ with $u_{j}^{0}=\delta_{j,v}$ is

\displaystyle u_{v}\left(t\right)=\left(E_{\alpha}\left(t^{\alpha}A\right)\right)_{vv}.

Then the total concentration remaining at the vertices when the initial concentration has been totally allocated at them is

\tilde{T}_{s}\coloneqq\sum_{v}u_{v}\left(t\right)=\sum_{v}\left(E_{\alpha}\left(t^{\alpha}A\right)\right)_{vv}=Tr\left(E_{\alpha}\left(t^{\alpha}A\right)\right).

(4.6)

Similarly, we can ignore the edge sign and obtain the total concentration in $\tilde{G}$ ,

\tilde{T}_{u}\coloneqq\sum_{v}u_{v}\left(t\right)=\sum_{v}\left(E_{\alpha}\left(t^{\alpha}\left|A\right|\right)\right)_{vv}=Tr\left(E_{\alpha}\left(t^{\alpha}\left|A\right|\right)\right).

(4.7)

Finally, we summarise the influence of the edge signs on the diffusion as the ratio $\tilde{T}_{s}/\tilde{T}_{u}$ , such that for $t=1$ we have

K_{\alpha}\left(G_{s}\right)\coloneqq\dfrac{Tr\left(E_{\alpha}\left(A\right)\right)}{Tr\left(E_{\alpha}\left(\left|A\right|\right)\right)}.

(4.8)

We note that $K\left(G\right)$ is the particular case when $\alpha=1$ .

4.2 How global balance is accounted for

A closed walk (CW) is said to be positive (negative) if the product of the signs of all its composing edges is positive (negative). Let $M_{k}\left(i,i\right)=\left(A^{k}\right)_{ii}$ be the total “number” of CWs of length $k$ starting at vertex $i$ , then

M_{k}\left(i,i\right)=\mu_{k}^{+}\left(i,i\right)-\mu_{k}^{-}\left(i,i\right),

(4.9)

where $\mu_{k}^{+}\left(i,i\right)$ is the number of positive CWs of length $k$ starting at $i$ , and $\mu_{k}^{-}\left(i,i\right)$ is the same for negative CWs [10]. Obviously,

Tr\left(E_{\alpha}\left(A\right)\right)=\sum_{i=1}^{n}\sum_{k=0}^{\infty}\dfrac{M_{k}\left(i,i\right)}{\varGamma\left(\alpha k+1\right)}=Tr\left(E_{\alpha}^{+}\left(A\right)\right)-Tr\left(E_{\alpha}^{-}\left(A\right)\right),

(4.10)

where $Tr\left(E_{\alpha}^{\pm}\left(A\right)\right)=\sum_{i=1}^{n}\sum_{k=0}^{\infty}\mu_{k}^{\pm}\left(i,i\right)/\varGamma\left(\alpha k+1\right)$ are the positive and negative contributions to $Tr\left(E_{\alpha}\left(A\right)\right)$. We note that they are not the same as $Tr\left(E_{\alpha}\left(A^{\pm}\right)\right)$, where $A^{\pm}$ are the adjacency matrices only for positive and negative edges of $G_{s}$, respectively. Similarly,

Tr\left(E_{\alpha}\left(\left|A\right|\right)\right)=Tr\left(E_{\alpha}^{+}\left(A\right)\right)+Tr\left(E_{\alpha}^{-}\left(A\right)\right).

(4.11)

Hence,

K_{\alpha}\left(G_{s}\right)=\dfrac{Tr\left(E_{\alpha}^{+}\left(A\right)\right)-Tr\left(E_{\alpha}^{-}\left(A\right)\right)}{Tr\left(E_{\alpha}^{+}\left(A\right)\right)+Tr\left(E_{\alpha}^{-}\left(A\right)\right)}.

(4.12)

We now interpret $K_{\alpha}(G_{s})$ through its two different terms. Let us recall that a trivial CW is a walk starting and ending at the same vertex but not involving any cycle in the graph. Hence, any trivial CW is always positive. Therefore, $Tr\left(E_{\alpha}^{+}\left(A\right)\right)$ accounts for all trivial CWs and nontrivial positive CWs. In a nontrivial positive CW, there can be any number of balanced cycles, and also an even number of unbalanced cycles. We can understand it as follows. Consider a negative triangle with sign pattern $+,+,-$ for edges $(A,B),(B,C),(A,C)$, respectively. Then, a voting system on this triangle will end up in contradictions after one round of information passing. For example, if A votes Y(N), then B will vote Y(N), and C will also vote Y(N), but then A will need to vote N(Y) since it is in conflict with C, contradicting its initial vote. However, if the number of rounds is even, such contradictions disappear, eliminating the tension in the system. In summary, the term $Tr\left(E_{\alpha}^{+}\left(A\right)\right)$ accounts for all CWs in the signed graph that involve no tensions from the perspective of balance. This necessarily means that all tensions are encoded in $Tr\left(E_{\alpha}^{-}\left(A\right)\right)$. Indeed, any negative CW necessarily contains a negative cycle, which by definition is unbalanced. Therefore, the difference $Tr\left(E_{\alpha}^{+}\left(A\right)\right)-Tr\left(E_{\alpha}^{-}\left(A\right)\right)$ accounts for the magnitude of “tensions” existing in the signed graph in terms of balance, such that $K_{\alpha}\left(G_{s}\right)$ will be $0$ if the balanced and unbalanced contributions are equal, and will be $1$ if there are no unbalanced contributions.
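Adding and subtracting Eqs. (4.10) and (4.11) makes the two contributions explicit in terms of quantities that can be computed directly from $A$ and $\left|A\right|$:

Tr\left(E_{\alpha}^{\pm}\left(A\right)\right)=\dfrac{1}{2}\left[Tr\left(E_{\alpha}\left(\left|A\right|\right)\right)\pm Tr\left(E_{\alpha}\left(A\right)\right)\right].

As a small worked instance, take the triangle with sign pattern $+,+,-$ and $k=3$. Then $Tr\left(A^{3}\right)=-6$ and $Tr\left(\left|A\right|^{3}\right)=6$, so $\sum_{i}\mu_{3}^{+}\left(i,i\right)=0$ and $\sum_{i}\mu_{3}^{-}\left(i,i\right)=6$: all six closed walks of length 3 go once around the unbalanced triangle and are negative.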

4.3 How memory is accounted for

We first show that the time-fractional Caputo derivative accounts for the memory of the system about its past. We start by writing

D_{t}^{\alpha}u\left(t\right)=\dfrac{1}{\varGamma\left(1-\alpha\right)}\int_{0}^{t}\left[\dfrac{1}{\left(t-\tau\right)^{\alpha}}\right]u^{\prime}\left(\tau\right)d\tau=\dfrac{1}{\varGamma\left(1-\alpha\right)}\int_{0}^{t}w\left(\tau\right)u^{\prime}\left(\tau\right)d\tau,

(4.13)

where $w\left(\tau\right)=1/\left(t-\tau\right)^{\alpha}$ acts as a weight on $u^{\prime}\left(\tau\right)$ in the integral, with $w\left(\tau\rightarrow 0\right)$ significantly smaller than $w\left(\tau\rightarrow t\right)$. Odibat [40] has proved that

D_{t}^{\alpha}u\left(t\right)=C\left(u,h,\alpha\right)-E_{C}\left(u,h,\alpha\right),

(4.14)

where $E_{C}\left(u,h,\alpha\right)\leq\mathcal{O}\left(h^{2}\right)$ is the error term, and

\begin{split}C\left(u,h,\alpha\right)&=\dfrac{h^{1-\alpha}}{\varGamma\left(3-\alpha\right)}\left[\underset{\textnormal{remote past}}{\underbrace{\left(\left(k-1\right)^{2-\alpha}-\left(k+\alpha-2\right)k^{1-\alpha}\right)u^{\prime}\left(0\right)}}\right.\\ &\left.+\underset{\textnormal{recent past}}{\underbrace{\sum_{j=1}^{k-1}\left(\left(k-j+1\right)^{2-\alpha}-2\left(k-j\right)^{2-\alpha}+\left(k-j-1\right)^{2-\alpha}\right)u^{\prime}\left(t_{j}\right)}}+\underset{\textnormal{present}}{\underbrace{u^{\prime}\left(t\right)}}\right],\end{split}

(4.15)

where the interval $\left[0,t\right]$ has been subdivided into $k$ subintervals $\left[t_{j},t_{j+1}\right]$ of equal length $h=t/k$, with nodes $t_{j}=jh$ for $j=0,\ldots,k$. The term $C\left(u,h,\alpha\right)$ confirms that, unlike the standard time derivative, which considers only the present, the Caputo derivative takes into account the “remote past” and “recent past” together with the “present” state of the evolution of the function $u\left(\cdot\right)$. Additionally, the time-fractional Caputo derivative gives smaller weight to the remote past, and the weight increases as we approach the present, which receives the largest weight.
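The approximation (4.15) is straightforward to evaluate. The sketch below is a direct transcription of that formula, assuming uniform nodes $t_{j}=jh$; the function name `caputo_trapezoid` and the test function $u\left(t\right)=t^{2}$ are illustrative choices. For this $u$ the Caputo derivative is known in closed form, $D_{t}^{\alpha}t^{2}=2t^{2-\alpha}/\varGamma\left(3-\alpha\right)$, and since $u^{\prime}$ is linear the formula should reproduce it up to rounding.

```python
from math import gamma


def caputo_trapezoid(du, t, alpha, k):
    """Approximation C(u, h, alpha) of Eq. (4.15); `du` is the ordinary derivative u'."""
    h = t / k
    p = 2.0 - alpha
    # weight of u'(0): the "remote past"
    remote = ((k - 1) ** p - (k + alpha - 2) * k ** (1.0 - alpha)) * du(0.0)
    # weights of u'(t_j), j = 1, ..., k - 1: the "recent past"
    recent = sum(
        ((k - j + 1) ** p - 2 * (k - j) ** p + (k - j - 1) ** p) * du(j * h)
        for j in range(1, k)
    )
    present = du(t)  # the "present" term has weight 1
    return h ** (1.0 - alpha) / gamma(3.0 - alpha) * (remote + recent + present)


t, alpha = 1.5, 0.5
approx = caputo_trapezoid(lambda s: 2.0 * s, t, alpha, k=20)   # u(t) = t^2, u'(t) = 2t
exact = 2.0 * t ** (2.0 - alpha) / gamma(3.0 - alpha)
print(approx, exact)
print(caputo_trapezoid(lambda s: 2.0 * s, t, 1.0, k=20), 2.0 * t)  # alpha = 1 gives u'(t)
```

Setting $\alpha=1$ makes the weights of the remote and recent past vanish, so only the present term $u^{\prime}\left(t\right)$ survives.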

Let us now see the special case of $\alpha=1$ and how memory could be incorporated while changing $\alpha$ . We note that $C\left(u,h,\alpha=1\right)=u^{\prime}\left(t\right)$ . The solution of the NC diffusion (4.5) with $u_{j}^{0}=\delta_{j,v}$ is given by $u_{v}\left(t\right)=\left(e^{tA}\right)_{vv}$ , i.e., the exponential of the adjacency matrix. For $t=1$ , we know that

e^{A}=I+A+\dfrac{A^{2}}{2!}+\ldots+\dfrac{A^{k}}{k!}+\ldots,

(4.16)

which means that walks taking a large number of steps are so heavily penalized by $1/\left(k!\right)$ that they are almost forgotten. Let us consider again an unbalanced cycle of $10$ vertices with only one negative edge, $C_{10}^{-}$. Here, we truncate the series (4.16) and the corresponding series for $e^{\left|A\right|}$ at a given power $k$, and denote the truncations by $e^{A}\left(k\right)$ and $e^{\left|A\right|}\left(k\right)$, respectively. Then, for any $k<10$, we have that $Tr(e^{A}\left(k\right))=Tr(e^{\left|A\right|}\left(k\right))$, because no closed walk of length less than $10$ can go around the cycle. Therefore, any penalization $c_{k}$ in $f\left(A\right)=\sum_{k=0}^{\infty}c_{k}A^{k}$ that makes $c_{10}Tr(\left|A\right|^{10})\approx 0$ will lead to $Tr\left(f\left(A\left(C_{10}^{-}\right)\right)\right)\approx Tr\left(f\left(\left|A\left(C_{10}^{-}\right)\right|\right)\right)$. This is exactly what happens with $c_{k}=1/k!$, where $Tr\left(\exp\left(A\left(C_{10}^{-}\right)\right)\right)\approx 22.7958$ and $Tr\left(\exp\left(\left|A\left(C_{10}^{-}\right)\right|\right)\right)\approx 22.7959$, leading to $K_{1}\left(C_{10}^{-}\right)\approx 0.99999947$. That is, the index has almost completely forgotten that the graph contains a negative edge. However, the extra freedom introduced by $\alpha$ allows us to incorporate the information from the past in an appropriate manner. For $C_{10}^{-}$, $\alpha=0.5$ ensures that even the remote past receives some weight in the navigation of the diffusive particles, remembering the presence of the negative edge, with $K_{0.5}\approx 0.98290619$. The memory effect can be strengthened further by decreasing $\alpha$, which may be considered as the memory effect parameter, e.g., $K_{0.25}\approx 0.109$.
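The numbers quoted for $C_{10}^{-}$ can be checked with a short script. The sketch below is not the authors' code: it sums the series (2.5) over exact integer traces $Tr\left(A^{k}\right)$, truncated at an assumed $k_{\max}=300$, and it uses the unscaled function $E_{\alpha}\left(A\right)$ (that is, $\gamma=1$), an assumption made here because it is the choice that matches the values quoted above. The helper names are illustrative.

```python
from math import exp, lgamma


def cycle_adjacency(n, negative_edges=()):
    """Signed adjacency matrix of the n-cycle (n >= 3); edge i joins i and (i + 1) % n."""
    A = [[0] * n for _ in range(n)]
    for i in range(n):
        j = (i + 1) % n
        A[i][j] = A[j][i] = -1 if i in negative_edges else 1
    return A


def power_traces(A, kmax):
    """Exact integers Tr(A^k) for k = 0, ..., kmax."""
    n = len(A)
    P = [[int(i == j) for j in range(n)] for i in range(n)]
    traces = []
    for _ in range(kmax + 1):
        traces.append(sum(P[i][i] for i in range(n)))
        P = [[sum(P[i][m] * A[m][j] for m in range(n)) for j in range(n)] for i in range(n)]
    return traces


def ml_balance(A, alpha, kmax=300):
    """K_alpha = Tr(E_alpha(A)) / Tr(E_alpha(|A|)), Eq. (4.8), from truncated series."""
    abs_A = [[abs(x) for x in row] for row in A]
    weights = [exp(-lgamma(alpha * k + 1.0)) for k in range(kmax + 1)]  # 1 / Gamma(alpha k + 1)
    signed = sum(w * t for w, t in zip(weights, power_traces(A, kmax)))
    unsigned = sum(w * t for w, t in zip(weights, power_traces(abs_A, kmax)))
    return signed / unsigned


A = cycle_adjacency(10, negative_edges={0})
abs_A = [[abs(x) for x in row] for row in A]
ts, tu = power_traces(A, 10), power_traces(abs_A, 10)
print(ts[:10] == tu[:10], tu[10] - ts[10])   # traces agree for k < 10 and first differ at k = 10

results = {alpha: ml_balance(A, alpha) for alpha in (1.0, 0.5, 0.25)}
for alpha, value in results.items():
    print(alpha, value)
```

The first printed line confirms that the sign pattern is invisible to all powers below $10$; the difference at $k=10$ comes from the $20$ closed walks that go once around the cycle, each counted with opposite sign in $A$ and $\left|A\right|$.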

We now proceed to find the analytical expression for the degree of balance with memory for unbalanced cycles, i.e., cycles with an odd number of negative edges. We start by proving that unbalanced cycles share the same spectrum, independently of the exact number of negative edges.

Proposition 4.1.

There are two switching classes for signed cycles of length $n$ , one corresponding to balance and the other corresponding to unbalance.

Proof.

For balanced cycles of length $n$ , we know that they form a switching class. We then prove that all unbalanced signed cycles form one switching class. Let $C_{n}$ denote an unbalanced signed cycle of length $n$ . If we remove an arbitrary edge $e$ , it becomes a tree $T_{n}$ . We know that every signed tree is balanced, hence $T_{n}$ is balanced and is switching equivalent to the all-positive configuration. After this switching, edge $e$ must be negative, since otherwise $C_{n}$ would be balanced. Hence, only one edge violates the balance structure, and $C_{n}$ is switching equivalent to the unbalanced cycle of length $n$ with one negative edge. Hence, all unbalanced signed cycles of length $n$ form one switching class. ∎

Corollary 4.2.

All unbalanced signed cycles of length $n$ share the same eigenvalues, i.e., they are cospectral.

As a consequence of the previous results we will focus on the analytical study of unbalanced cycles as a general class.
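As an illustration of Proposition 4.1 and Corollary 4.2, the sketch below compares two unbalanced cycles of length $9$ with one and three negative edges. The helper `signed_cycle`, the choice of negative edges, and the switching set are assumptions made only for this example.

```python
import numpy as np

def signed_cycle(n, negative_edges):
    """Adjacency matrix of the n-cycle with the listed edges given sign -1."""
    A = np.zeros((n, n))
    for u in range(n):
        v = (u + 1) % n
        A[u, v] = A[v, u] = 1.0
    for u, v in negative_edges:
        A[u, v] = A[v, u] = -1.0
    return A

n = 9
one_negative = signed_cycle(n, [(0, 1)])
three_negative = signed_cycle(n, [(0, 1), (3, 4), (6, 7)])

# Switching the vertex set {4, 5, 6} flips exactly the edges (3,4) and (6,7).
d = np.ones(n)
d[[4, 5, 6]] = -1.0
D = np.diag(d)
switched = D @ three_negative @ D

spec_one = np.linalg.eigvalsh(one_negative)
spec_three = np.linalg.eigvalsh(three_negative)
# Spectrum of an unbalanced n-cycle: 2*cos((2k+1)*pi/n), k = 0, ..., n-1.
closed_form = np.sort(2.0 * np.cos((2 * np.arange(n) + 1) * np.pi / n))

print(np.array_equal(switched, one_negative))
print(np.allclose(spec_one, spec_three), np.allclose(spec_one, closed_form))
```

Since switching is a similarity transformation by a diagonal $\pm 1$ matrix, the two spectra agree up to rounding.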

Definition 4.3 ([38]).

Let $E_{\alpha}\left(z\right)$ be the Mittag-Leffler function of $z.$ Then, we define the following integral:

\mathcal{E}_{\nu,\alpha}\left(z\right)\coloneqq\frac{1}{\pi}\int_{0}^{\pi}\cos\left(\nu\theta\right)E_{\alpha}\left(z\cos\theta\right)d\theta,\>\nu\in\mathbb{Z}.

(4.17)

Remark 4.4.

Notice that $\mathcal{E}_{\nu,\alpha=1}\left(z\right)=\frac{1}{\pi}\int_{0}^{\pi}e^{z\cos\theta}\cos\left(\nu\theta\right)d\theta\eqqcolon I_{\nu}\left(z\right)$ is the modified Bessel function of the first kind. The fractional modified Bessel function of the first kind can be calculated by using the following result.

Lemma 4.5 ([38]).

Let $\mathcal{E}_{\nu,\alpha}(z)$ be the fractional modified Bessel function of the first kind of $z$ with fractional parameter $\alpha$ and $\nu\in\mathbb{Z}$ . Then,

\mathcal{E}_{\nu,\alpha}(z)=\sum_{k=0}^{\infty}\frac{\left(2k+\nu\right)!}{\varGamma\left(\alpha\left(2k+\nu\right)+1\right)k!\left(k+\nu\right)!}\left(\frac{z}{2}\right)^{2k+\nu}.

(4.18)
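The agreement between the integral definition (4.17) and the series (4.18) can be verified numerically. The sketch below is only illustrative: the function names are hypothetical, the series is restricted to integer $\nu\geq 0$ and $z>0$ , the integral is approximated by a simple midpoint rule, and the Mittag-Leffler function is summed directly from its power series.

```python
import math

def mittag_leffler(z, alpha, terms=400):
    """E_alpha(z) = sum_k z^k / Gamma(alpha*k + 1), summed directly (real z, moderate |z|)."""
    if z == 0.0:
        return 1.0
    log_abs_z = math.log(abs(z))
    total = 1.0  # k = 0 term
    for k in range(1, terms):
        magnitude = math.exp(k * log_abs_z - math.lgamma(alpha * k + 1.0))
        sign = -1.0 if (z < 0 and k % 2 == 1) else 1.0
        total += sign * magnitude
    return total

def fractional_bessel_integral(nu, alpha, z, nodes=200):
    """Midpoint-rule approximation of (1/pi) * int_0^pi cos(nu*t) E_alpha(z*cos t) dt."""
    h = math.pi / nodes
    total = 0.0
    for i in range(nodes):
        t = (i + 0.5) * h
        total += math.cos(nu * t) * mittag_leffler(z * math.cos(t), alpha)
    return total * h / math.pi

def fractional_bessel_series(nu, alpha, z, terms=150):
    """Series (4.18) for integer nu >= 0 and z > 0, with coefficients formed in log space."""
    total = 0.0
    for k in range(terms):
        m = 2 * k + nu
        log_term = (math.lgamma(m + 1) - math.lgamma(alpha * m + 1)
                    - math.lgamma(k + 1) - math.lgamma(k + nu + 1)
                    + m * math.log(z / 2.0))
        total += math.exp(log_term)
    return total

# alpha = 1 recovers the modified Bessel function I_0(2).
print(fractional_bessel_series(0, 1.0, 2.0))
# A fractional case: both evaluations should agree to many digits.
print(fractional_bessel_series(2, 0.5, 1.5), fractional_bessel_integral(2, 0.5, 1.5))
```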

Theorem 4.6.

Let $C_{n}^{-}$ be the cycle graph with an odd number of negative edges and $0<\alpha\leq 1$ . Then,

K_{\alpha}^{\gamma}\left(C_{n}^{-}\right)=\dfrac{\sum_{k=1}^{n}E_{\alpha}\left(2\gamma\cos\left(\dfrac{\left(2k+1\right)\pi}{n}\right)\right)}{\sum_{k=1}^{n}E_{\alpha}\left(2\gamma\cos\left(\dfrac{2k\pi}{n}\right)\right)},

(4.19)

where $K_{\alpha}^{\gamma}\coloneqq Tr\left(E_{\alpha}\left(\gamma A\right)\right)/Tr\left(E_{\alpha}\left(\gamma\left|A\right|\right)\right)$ is an even more general form of the index $K_{\alpha}$ , $\gamma$ is a positive constant, and

\underset{n\rightarrow\infty}{\lim}\frac{1}{n}\sum_{k=1}^{n}E_{\alpha}\left(2\gamma\cos\left(\dfrac{2k\pi}{n}\right)\right)=\mathcal{E}_{0,\alpha}\left(2\gamma\right).

(4.20)

Proof.

We obtain Eq. (4.19) from the eigenvalues of the all-positive cycle of size $n$ and those of the cycle of the same size with one negative edge [39], together with Corollary 4.2. Then for the limit, since the angles $2k\pi/n$ , $k=1,2,\dots,n$ , are equally spaced over the interval $[0,2\pi]$ , the normalized sum is a Riemann sum and we can write

\underset{n\rightarrow\infty}{\lim}\left(\dfrac{1}{n}\sum_{k=1}^{n}E_{\alpha}\left(2\gamma\cos\left(\dfrac{2k\pi}{n}\right)\right)\right)=\left(\frac{1}{2\pi}\int_{0}^{2\pi}E_{\alpha}\left(2\gamma\cos\vartheta\right)d\vartheta\right),

(4.21)

where

\left(\frac{1}{2\pi}\int_{0}^{2\pi}E_{\alpha}\left(2\gamma\cos\vartheta\right)d\vartheta\right)=\left(\frac{1}{\pi}\int_{0}^{\pi}E_{\alpha}\left(2\gamma\cos\vartheta\right)d\vartheta\right)=\mathcal{E}_{0,\alpha}\left(2\gamma\right).

(4.22)

∎
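The convergence stated in (4.20) can be observed numerically. The following sketch uses hypothetical helper names, sums the Mittag-Leffler function directly from its power series, and takes $\gamma=1$ by default as an assumption for the example; the limit is evaluated from the series (4.18) with $\nu=0$ and $z=2\gamma$ .

```python
import math

def mittag_leffler(z, alpha, terms=400):
    """E_alpha(z) = sum_k z^k / Gamma(alpha*k + 1), summed directly (real z, moderate |z|)."""
    if z == 0.0:
        return 1.0
    log_abs_z = math.log(abs(z))
    total = 1.0  # k = 0 term
    for k in range(1, terms):
        magnitude = math.exp(k * log_abs_z - math.lgamma(alpha * k + 1.0))
        sign = -1.0 if (z < 0 and k % 2 == 1) else 1.0
        total += sign * magnitude
    return total

def cycle_trace_mean(n, alpha, gamma=1.0):
    """(1/n) * sum_k E_alpha(2*gamma*cos(2*k*pi/n)), the left-hand side of (4.20) before the limit."""
    values = (mittag_leffler(2.0 * gamma * math.cos(2.0 * math.pi * k / n), alpha)
              for k in range(1, n + 1))
    return sum(values) / n

def limit_value(alpha, gamma=1.0, terms=150):
    """Series (4.18) with nu = 0 and z = 2*gamma, so that (z/2)^(2k) = gamma^(2k)."""
    return sum(
        math.exp(math.lgamma(2 * k + 1) - math.lgamma(2 * alpha * k + 1)
                 - 2.0 * math.lgamma(k + 1) + 2 * k * math.log(gamma))
        for k in range(terms)
    )

alpha = 0.5
target = limit_value(alpha)
for n in (10, 20, 40):
    print(n, cycle_trace_mean(n, alpha) - target)  # the gap shrinks rapidly with n
```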

We cannot apply the same approximation as in the proof of (4.20) to the numerator of $K_{\alpha}^{\gamma}$ , because how well the numerator is approximated by the denominator for very large $n$ depends strongly on the value of $\alpha$ . For instance, for $\alpha=1$ when $n=10$ the difference between the two terms is of the order of $10^{-6}$ and it drops to $10^{-15}$ for $n=20.$ However, for $\alpha=0.25$ , this difference is of the order of $10^{7}$ for $n=10$ and remains of the order of $10^{6}$ for $n=20,$ and of $10^{3}$ for $n=40.$ Therefore, because the denominator can be approximated by $n\,\mathcal{E}_{0,\alpha}\left(2\gamma\right)$ , we have that for $\alpha=1$ , the balance index approaches $1$ for relatively small unbalanced cycles, while this is far from being the case for $\alpha=0.25$ . This is visually clear when we examine the change of $K_{\alpha}\left(C_{n}^{-}\right)$ as a function of both $n$ and $\alpha$ (note that $\gamma=\varGamma\left(\alpha+1\right)$ throughout the paper so we ignore the superscript); see Fig. 3 (left). Specifically, for values of $\alpha$ close to $1$ , the values of $K_{\alpha}\left(C_{n}^{-}\right)$ are close to $1$ for almost all cycles with size $n\geq 10$ . At the other extreme, when $\alpha$ is close to $0$ , the values of the balance index are extremely low for almost every cycle with $n\leq 20$ .

4.4 Properties of the balance index with memory

We start with the range of the balance index we have proposed.

Theorem 4.7.

The index $K_{\alpha}\left(G\right)$ is bounded as

0\leq K_{\alpha}\left(G\right)\leq 1,

(4.23)

where the upper bound is reached if and only if the signed graph $G$ is balanced.

Proof.

It is clear from Eq. (4.12) that $K_{\alpha}(G)\leq 1$ . We now examine the lower bound. Let $\left\{\lambda_{j}\left(A\right)\right\}=\left\{\lambda_{j}^{+}\left(A\right)\right\}\cup\left\{\lambda_{j}^{-}\left(A\right)\right\}$ be the eigenvalues of $A$ , where $\left\{\lambda_{j}^{+}\left(A\right)\right\}$ and $\left\{\lambda_{j}^{-}\left(A\right)\right\}$ are the sets of nonnegative and negative eigenvalues of $A$ , respectively. Clearly, $E_{\alpha}\left(\lambda_{j}^{+}\left(A\right)\right)\geq 0$ . We then consider negative eigenvalues $\lambda_{j}^{-}\left(A\right)$ , and $E_{\alpha}\left(-\left|\lambda_{j}^{-}\left(A\right)\right|\right)$ . We note that

\left(-1\right)^{k}\dfrac{d^{k}E_{\alpha}\left(-x\right)}{dx^{k}}\geq 0,

(4.24)

for all $x\geq 0$ and for $0\leq\alpha\leq 1$ [41]. In particular, the case $k=0$ gives $E_{\alpha}\left(-\left|\lambda_{j}^{-}\left(A\right)\right|\right)\geq 0$ and

Tr\left(E_{\alpha}\left(A\right)\right)=\sum_{j=1}^{n}E_{\alpha}\left(\lambda_{j}\left(A\right)\right)\geq 0.

(4.25)

We note that $\left|A\right|$ is a nonnegative matrix, and so is any power of $\left|A\right|$ , and thus so is $E_{\alpha}\left(\left|A\right|\right)$ . Hence, $Tr\left(E_{\alpha}\left(\left|A\right|\right)\right)>0$ (the $k=0$ term of the series alone contributes $n$ ), and then $0\leq K_{\alpha}\left(G\right)$ .

It is clear from Eq. (4.12) that $K_{\alpha}=1$ if and only if $Tr\left(E^{-}_{\alpha}\left(A\right)\right)=0$ , if and only if there are no negative closed walks of any length involving any vertices, if and only if there are no negative cycles, i.e., the graph is balanced. For the lower bound, in contrast, we require $Tr\left(E^{-}_{\alpha}\left(A\right)\right)=Tr\left(E^{+}_{\alpha}\left(A\right)\right)$ , which can only happen when the number of negative (unbalanced) closed walks is sufficiently large in a signed graph. ∎
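The bounds of Theorem 4.7 can be illustrated on the two signed triangles. This is a minimal sketch with hypothetical helper names, a direct power-series evaluation of the Mittag-Leffler function, and $\gamma=1$ assumed for simplicity.

```python
import math
import numpy as np

def mittag_leffler(z, alpha, terms=400):
    """E_alpha(z) = sum_k z^k / Gamma(alpha*k + 1), summed directly (real z, moderate |z|)."""
    if z == 0.0:
        return 1.0
    log_abs_z = math.log(abs(z))
    total = 1.0  # k = 0 term
    for k in range(1, terms):
        magnitude = math.exp(k * log_abs_z - math.lgamma(alpha * k + 1.0))
        sign = -1.0 if (z < 0 and k % 2 == 1) else 1.0
        total += sign * magnitude
    return total

def balance_index(A, alpha, gamma=1.0):
    """Tr(E_alpha(gamma*A)) / Tr(E_alpha(gamma*|A|)) from the two spectra (gamma = 1 assumed)."""
    signed = sum(mittag_leffler(gamma * lam, alpha) for lam in np.linalg.eigvalsh(A))
    unsigned = sum(mittag_leffler(gamma * mu, alpha) for mu in np.linalg.eigvalsh(np.abs(A)))
    return signed / unsigned

# Triangle with two negative edges: the product of signs is positive, so it is balanced.
balanced_triangle = np.array([[0.0, 1.0, -1.0],
                              [1.0, 0.0, -1.0],
                              [-1.0, -1.0, 0.0]])
# Triangle with one negative edge: unbalanced.
unbalanced_triangle = np.array([[0.0, 1.0, 1.0],
                                [1.0, 0.0, -1.0],
                                [1.0, -1.0, 0.0]])

for alpha in (1.0, 0.7, 0.5):
    print(alpha,
          balance_index(balanced_triangle, alpha),
          balance_index(unbalanced_triangle, alpha))
```

The balanced triangle attains the upper bound for every $\alpha$ , while the unbalanced one stays strictly between $0$ and $1$ .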

We now consider again the signed Petersen graphs, specifically the two labelled as c) and d) in Fig. 1. We recall that although graph d) reaches consensus at a time significantly smaller than graph c), the former has a larger value of $K_{1}$ , due to the heavy penalization of relatively long walks imposed by the exponential (see section 3). We now consider the difference in $K_{\alpha}$ between these two graphs as a function of $\alpha$ ; see Fig. 3 (right). Specifically, at $\alpha=1$ , graph d) is more balanced than graph c), corresponding to a negative value of the difference between $K_{\alpha}$ of c) and that of d). This negative difference grows in magnitude when $\alpha$ drops from $1$ , reaching a minimum at about $\alpha=0.84.$ However, after this point, the trend reverses towards positive values, reaching the maximum at around $\alpha=0.5.$ At this value of $\alpha$ , the penalization of longer cycles is not as heavy, so that the larger number of negative hexagons in d) outweighs the larger number of negative pentagons in c). If we now correlate the values of $K_{0.5}$ with $t_{c}$ , the squared Pearson correlation coefficient is $0.981$ , which clearly contrasts with the value of $0.924$ for $K_{1}$ , implying that $K_{0.5}$ provides a better indicator of balance in terms of the convergence of the diffusion (the values of $K_{0.5}$ for the five signed Petersen graphs of Fig. 1, from a) to e), are: $0.3878$ ; $0.1973$ ; $0.1514$ ; $0.1302$ ; $0.0787$ ).

To gain more insight into the significance of the use of memory to account for balance in signed graphs, let us further explore the changes. When $\alpha$ drops from $1.0$ to about $0.8$ , the balance index of the signed Petersen graph d) increases relative to that of graph c). This can be explained by the fact that these graphs are triangle and quadrilateral free, and the smallest cycle is of length five. As we initially drop the value of $\alpha$ from $1$ , the contribution of $C_{5}^{-}$ increases, and because graph c) has more of these cycles than d), it is less balanced relative to d). However, as we continue dropping $\alpha$ , the contribution of $C_{6}^{-}$ grows significantly. In this case, graph d) exceeds graph c) in the number of $C_{6}^{-}$ , which makes c) more balanced than d) after some critical value, and the difference reaches a maximum at around $\alpha=0.5$ . Below this value of the memory parameter $\alpha$ , the longer cycles, namely $C_{8}^{-}$ and $C_{9}^{-}$ , make their contribution. In this case, graph c) exceeds d) in the number of $C_{8}^{-}$ , but d) contains slightly more $C_{9}^{-}$ than c); see Supplementary Material for details. The effect of these longer unbalanced cycles is a further decay of the balance index of both graphs for $\alpha<0.5$ .

5 Examples of applications

5.1 Gene regulatory networks

We first consider the gene regulatory networks of Saccharomyces cerevisiae (yeast) and of Bacillus subtilis, previously studied as signed undirected graphs by Soranzo et al. [46]. We maintain the undirected versions of these networks, and consider only their giant connected components. The balance index at $\alpha=1$ indicates that the network of S. cerevisiae is more balanced than that of B. subtilis, with $K_{1}\approx 0.933$ versus $K_{1}\approx 0.643$ , respectively. We note that the difference between the values of $K_{1}$ for the two networks is smaller than $0.3$ and neither is far from $1$ , which suggests that there is no large difference in their degree of balance and that they form relatively balanced systems. However, if we allow for an increment of the memory in the system by dropping $\alpha$ , then the results change significantly. As can be seen in Fig. 4 (left), the difference in balance between the two gene regulatory networks increases up to $0.9$ (of a maximum of $1.0$ ) when $\alpha$ drops from $1.0$ to $0.62$ , where $K_{0.62}\approx 0.928$ for S. cerevisiae and $K_{0.62}\approx 0.014$ for B. subtilis. That is, while the gene regulatory network of yeast is highly balanced, the one of B. subtilis is extremely unbalanced.

This difference is mainly due to the fact that the network of B. subtilis has a large number of relatively large unbalanced cycles. We observe that although the number of unbalanced cycles $C_{n}^{-}$ grows exponentially fast with $n$ in both networks, it grows faster for the network of B. subtilis; see Supplementary Material for details. This can be inferred from Fig. 4 (middle), where we visualise the percentages of negative cycles of increasing lengths. It can be seen that the network of yeast has relatively more unbalanced triangles and pentagons but a significantly smaller percentage of negative squares than the one of B. subtilis. This explains why both networks have comparable values of the balance index when $\alpha$ is close to one, i.e., when the memory of the system is relatively low, although the one of yeast is slightly more balanced than the one of B. subtilis. However, when cycles of longer lengths ( $n\geq 6$ ) are taken into account, the gene regulatory network of B. subtilis has a systematically larger percentage of unbalanced cycles than the one of yeast. This clearly explains why the network of B. subtilis is significantly less balanced than the one of yeast for relatively low values of $\alpha$ , i.e., when the memory of the system increases.

Regarding the change of $K_{\alpha}$ with respect to the drop of the memory parameter $\alpha$ in signed graphs, another interesting characteristic is the possibility of nonmonotonicity; see the case of the gene regulatory network of yeast in Fig. 4 (right). Specifically, for the network of yeast, $K_{\alpha}$ increases when $\alpha$ drops from $1.0$ to about $0.75$ , and then decays very quickly for values $\alpha<0.75$ . In practical terms, this means that there is an “optimal” value of the memory that maximizes the degree of balance of this network, and that such value is different from $\alpha=1$ . The structural explanation for this nonmonotonicity can also be found in the plot in Fig. 4 (middle). We observe that there is a significant drop in the percentage of negative squares in this network, which contributes to increasing balance when we drop $\alpha$ from $1.0$ to about $0.75$ . However, as the value of $\alpha$ decays below $0.75$ , the longer negative cycles become more important, and the global balance of the network quickly decays.

5.2 Spatial ecological networks

We now study a series of $31$ signed networks representing patterns of spatial (co)occurrence of plants in four major locations in Spain, specifically, Cabo de Gata-Nijar National Park (36.77N, –2.11W), Monegros (41.65N, –0.71W), Sierra de Guara (42.27N, 0.18W) and Ordesa-Monte Perdido National Park (42.63N, –0.11W). The vertices of these networks represent plant species, and two vertices form an edge in the graph if the corresponding plants have a spatial association, which was calculated by comparing the number of times that the two species appeared at the same point on the transects. Two plant species share a positive edge if they appeared associated in a close region of space, while negative associations correspond to plants appearing separated by a significant distance in space [43]. Therefore, patterns of signed cycles appear in these networks. The meaning of these patterns is self-explanatory: a fully positive triangle, for instance, indicates that the three plants have a certain type of cooperative relation that allows them to coexist in the same spatial region. A fully negative triangle indicates competitive interactions between the three species that prevent their coexistence in the same location.

We give the average values of $K_{\alpha=1}$ and $K_{\alpha=0.8}$ for the networks in each of the four major locations in Table 1. We also reproduce the values of the mean temperature and precipitation of those regions as reported by Saiz et al. [43]. The results for $\left\langle K_{\alpha=1}\right\rangle$ are qualitatively similar to those in [43], where an index of balance $R$ based on triangles only was used. These results indicate that the balance in places of higher temperature and lower precipitation is higher than in those where the temperature is low and the precipitation is high. Both $\left\langle K_{\alpha=1}\right\rangle$ and $R$ identify Monegros as the site with the largest balance and Sierra de Guara as the one most out of balance. However, when we increase the memory of the system by considering a lower value of $\alpha$ , e.g., $\left\langle K_{\alpha=0.8}\right\rangle$ , a swap in the balance ranking of Cabo de Gata and Monegros appears; see Table 1.

location	Temp. ( ${}^{\circ}C$ )	Prec. (mm)	$\left\langle K_{\alpha=1}\right\rangle$	$\left\langle K_{\alpha=0.8}\right\rangle$
Cabo de Gata	24	328	$0.335\pm 0.265$	$0.165\pm 0.192$
Monegros	21	360	$0.376\pm 0.221$	$0.129\pm 0.130$
Sierra de Guara	17	927	$0.0883\pm 0.033$	$0.0061\pm 0.0042$
Ordesa-Monte Perdido	11	1485	$0.0985\pm 0.072$	$0.014\pm 0.016$

Table 1: Values of the balance degree indices $\left\langle K_{\alpha=1}\right\rangle$ and $\left\langle K_{\alpha=0.8}\right\rangle$ averaged for all the networks in the four main geographic locations studied here. The values of the mean temperature and precipitation in those regions are reported as in Saiz et al. [43].

We visualise the average and standard deviations of $K_{\alpha}$ for $0.4\leq\alpha\leq 1$ for the $31$ signed ecological networks grouped in the four major sites under study in Fig. 5 (left). The crossing between the balance rankings of Cabo de Gata and Monegros occurs around $\alpha=0.8$ . Furthermore, if we try to explain the degree of balance of these sites by considering a single parameter like the precipitation – while noticing the risks of doing any correlation for only four points – we observe some interesting patterns. A power-law fitting of the type: $\left\langle K_{\alpha=1}\right\rangle\sim P^{-2.116}$ gives a correlation coefficient of $r\approx 0.88$ . Similarly, $R\sim P^{-1.428}$ with $r\approx 0.89$ . However, when memory effects take place, we obtain: $\left\langle K_{\alpha=0.8}\right\rangle\sim P^{-3.156}$ with $r\approx 0.94$ . That is, the memory effects increase the amount of variance in the index explained from $77\%$ with $\left\langle K_{\alpha=1}\right\rangle$ to $88\%$ with $\left\langle K_{\alpha=0.8}\right\rangle$ .
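The percentages of explained variance quoted above follow from squaring the reported correlation coefficients; as a worked check using only the rounded values given in the text:

```latex
r^{2}\left(\left\langle K_{\alpha=1}\right\rangle\right)\approx 0.88^{2}\approx 0.774\approx 77\%,
\qquad
r^{2}\left(\left\langle K_{\alpha=0.8}\right\rangle\right)\approx 0.94^{2}\approx 0.884\approx 88\%.
```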

The question of the existence of an optimal value for the memory effect remains. We obtain the power-law correlation between the balance indices $K_{\alpha}$ for $0.4\leq\alpha\leq 1$ and the mean precipitation in the corresponding main locations. The correlation coefficient increases when $\alpha$ drops from $1$ down to $0.6$ , and then it decays very quickly; see Fig. 5 (right). This suggests that memory effects in ecological systems may have an optimum. However, more research in this area is needed to obtain more conclusive insights about this important question, and we leave it to future work.

5.3 Social networks in rural villages

Finally, we consider a set of social networks constructed from the data of $24696$ people aged $12$ to $93$ years in geographically isolated villages in western Honduras [32]. The vertices of these networks represent residents within each village, and they are connected by a positive (negative) edge if either of them identifies the other as a friend (an enemy); if one identifies the other as a friend while the other identifies the first as an enemy, we connect them by a negative edge. We note that the case in which they have no opinion of each other is also allowed. By design, the networks are solely within-village networks, and we select $11$ of them for our analysis, labelled $A$ to $K$ . Cycles of various lengths, positive or negative, can frequently occur in such social networks. According to balance theory [9], positive cycles indicate that the residents can be partitioned into one or two communities without conflicts inside each community, while negative cycles indicate the existence of conflicts in the relationships between residents.

We consider the balance index $K_{\alpha}$ for $0.4\leq\alpha\leq 1$ for the $11$ signed social networks; see Fig. 6. We find that none of the networks is completely balanced: all $11$ networks reach the maximum value of the balance index $K_{\alpha}$ at $\alpha=1$ , and $K_{\alpha}$ quickly decreases as $\alpha$ deviates from $1$ , with values that are almost $0$ for all networks at $\alpha=0.5$ . For example, village D has the maximum index value among all $11$ networks at $\alpha=1$ , with $K_{1}\approx 0.723$ , which suggests that the network is relatively close to being balanced. However, the index becomes less than $0.5$ at $\alpha=0.8$ , and continues to decrease as we increase the memory effect by lowering the parameter $\alpha$ . These observations imply the abundance of long negative cycles in these social networks, which is consistent with the results in [32], such as the homophily of negative relationships.

Specifically, we observe a clear crossing between the index values of village D and those of village E as $\alpha$ deviates from $1$ . In order to understand the differences between the balance of the villages $D$ and $E$ , we start by defining the following truncated series: $M_{r}^{s}\coloneqq\sum_{k=0}^{r}Tr\left(A^{k}\right)/\varGamma\left(\alpha k+1\right)$ , which accounts for the signed contributions of the different spectral moments; $M_{r}^{u}\coloneqq\sum_{k=0}^{r}Tr\left(\left|A\right|^{k}\right)/\varGamma\left(\alpha k+1\right)$ , which accounts for the total contribution of closed walks; and $M_{r}=M_{r}^{s}/M_{r}^{u}$ . Obviously, $M_{r\rightarrow\infty}=M_{r\rightarrow\infty}^{s}/M_{r\rightarrow\infty}^{u}=Tr\left(E_{\alpha}\left(A\right)\right)/Tr\left(E_{\alpha}\left(\left|A\right|\right)\right)$ recovers the balance index $K_{\alpha}$ . We start by truncating the series at $r=3$ , which is where the first signed cycles appear, and then continue increasing $r$ . First, we plot the results for the two graphs when $\alpha=1$ in the left of Fig. 7. The network of village $E$ (purple circles) appears to be more balanced than the one of village $D$ if we truncate the sum of spectral moments below $r=8$ . Indeed, a simple index based only on triangles indicates that $E$ is more balanced than $D$ . At about $r=8$ the cumulative ratio for graph $D$ becomes larger than that of graph $E$ , indicating that now the former graph is more balanced. This swap in the balance order is not directly caused by a larger number of longer signed cycles in one graph over the other, as we have seen in previous examples, but by the fact that for graph $D$ the ratio of the cumulative sums of moments of length smaller than $8$ is smaller than that for graph $E$ , whereas extending the sums to higher-order moments makes the ratio larger for graph $D$ than for graph $E$ .
For example, the ratio $M_{7}\left(D\right)\approx 2336/2850\approx 0.8197$ is smaller than $M_{7}\left(E\right)\approx 4442/5394\approx 0.8236$ . However, $M_{8}\left(D\right)\approx(2336+239)/(2850+393)\approx 0.7941$ is larger than $M_{8}\left(E\right)\approx(4442+815)/(5394+1278)\approx 0.7880$ . This happens despite the fact that $(239/393)<(815/1278)$ , and depends on the rates at which the numerators and denominators of $M_{7}\left(D\right)$ and $M_{7}\left(E\right)$ grow by the addition of the individual terms.
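The truncated ratio $M_{r}$ and the arithmetic of the swap can be sketched as follows. The function `truncated_ratio` is a hypothetical helper shown on a toy unbalanced triangle, since the village adjacency matrices are not reproduced here; the cumulative sums for $D$ and $E$ are the rounded values quoted in the text.

```python
import math
import numpy as np

def truncated_ratio(A, r, alpha=1.0):
    """M_r = M_r^s / M_r^u with weights 1 / Gamma(alpha*k + 1), k = 0, ..., r."""
    abs_A = np.abs(A)
    power_signed = np.eye(len(A))    # A^k, starting at k = 0
    power_unsigned = np.eye(len(A))  # |A|^k, starting at k = 0
    signed_sum = unsigned_sum = 0.0
    for k in range(r + 1):
        weight = 1.0 / math.gamma(alpha * k + 1.0)
        signed_sum += weight * np.trace(power_signed)
        unsigned_sum += weight * np.trace(power_unsigned)
        power_signed = power_signed @ A
        power_unsigned = power_unsigned @ abs_A
    return signed_sum / unsigned_sum

# Toy example: a triangle with one negative edge; the ratio first drops at r = 3.
triangle = np.array([[0.0, 1.0, 1.0],
                     [1.0, 0.0, -1.0],
                     [1.0, -1.0, 0.0]])
print(truncated_ratio(triangle, 2), truncated_ratio(triangle, 3))

# Rounded cumulative sums quoted in the text for alpha = 1.
m7_D, m7_E = 2336 / 2850, 4442 / 5394
m8_D = (2336 + 239) / (2850 + 393)
m8_E = (4442 + 815) / (5394 + 1278)
print(m7_D < m7_E, m8_D > m8_E, 239 / 393 < 815 / 1278)
```

The last line shows that the order of the ratios flips at $r=8$ even though the added term for $D$ has the smaller signed-to-unsigned ratio, because the new ratio is a weighted combination of the previous ratio and the added term.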

This effect seen for $\alpha=1$ disappears when we increase the memory effect; see the right of Fig. 7. Specifically, penalizing the walks less increases the difference in balance in favor of graph $E$ relative to $D$ . Therefore, we may regard the fact that the factorial penalization identifies graph $D$ as more balanced than $E$ as an artifact of this type of penalization, which is indeed resolved when the memory of the system increases. This emphasizes the importance of selecting an optimal memory effect parameter $\alpha$ to understand the level of balance of the signed social networks.

6 A useful approximation

We know that

Tr\left(E_{\alpha}\left(\gamma A\right)\right)=\sum_{j=1}^{n}E_{\alpha}\left(\gamma\lambda_{j}\right).

(6.1)

Let $\lambda_{1}^{\left(m_{1}\right)}>\lambda_{2}^{\left(m_{2}\right)}>\ldots>\lambda_{r}^{\left(m_{r}\right)}$ be the distinct eigenvalues of $A$ together with their multiplicities $m_{i}$ . For relatively low $\alpha$ , the function $E_{\alpha}\left(z\right)$ grows extremely fast with $z$ . Therefore, for relatively small values of $\alpha$ the gap $\lambda_{1}>\lambda_{2}$ is magnified into $E_{\alpha}\left(\lambda_{1}\right)\ggg E_{\alpha}\left(\lambda_{2}\right),$ which implies that

\underset{\alpha\rightarrow 0}{\lim}Tr\left(E_{\alpha}\left(\gamma A\right)\right)=m_{1}E_{\alpha}\left(\gamma\lambda_{1}\right).

(6.2)

If the eigenvalues of $\left|A\right|$ are $\mu_{1}>\mu_{2}\geq\ldots\geq\mu_{n}$ we will have that

\tilde{K}_{\alpha}\coloneqq\underset{\alpha\rightarrow 0}{\lim}K_{\alpha}\left(G_{s}\right)=\dfrac{m_{1}E_{\alpha}\left(\gamma\lambda_{1}\right)}{E_{\alpha}\left(\gamma\mu_{1}\right)}.

(6.3)

We experimentally verify the quality of this approximation in some of the networks we have studied previously, specifically the signed networks of the rural villages. In Fig. 8 (left), we plot the relative error in the balance index with memory $K_{\alpha}$ when approximated by $\tilde{K}_{\alpha}$ for the $11$ signed networks of the rural villages. We observe that for values of $\alpha$ close to $1$ , the relative error is relatively large for most of the networks, with values of up to $0.7$ for village K. However, the error drops significantly as $\alpha$ decreases towards $0$ , and in particular, for $\alpha=0.5$ the average relative error for the $11$ networks is $0.071$ with only network D having a relatively large error of about $0.3$ . When $\alpha=0.4$ all networks display errors below $0.1$ , and most of them have extremely low errors, e.g., below $10^{-10}.$

As in the derivation, the main driver for this approximation is the spectral gap of the adjacency matrix of the signed graph, i.e., $\lambda_{1}-\lambda_{2}$ . Here we obtain the value of $\alpha$ for which the relative error in the approximation of $K_{\alpha}$ by $\tilde{K}_{\alpha}$ drops below $0.1$ (for illustrative purposes), denoted by $\alpha_{c}$ . In Fig. 8 (right), we plot the values of $\alpha_{c}$ for every network as a function of the relative spectral gap $\left(\lambda_{1}-\lambda_{2}\right)/\lambda_{1}$ . We observe that increasing the spectral gap makes the approximation work better even for relatively large values of $\alpha$ (Pearson correlation: $0.942$ ). In contrast, for networks like the one of village D, where $\lambda_{1}\approx 6.333$ and $\lambda_{2}\approx 6.258$ , the approximation converges slowly even for relatively low values of $\alpha$ . However, the trend still holds: the relative error of this approximation is significantly lower for relatively low values of $\alpha$ than for $\alpha=1$ , where the balance index corresponds to the exponential.
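The approximation (6.3) can be tried on a small example. The sketch below is illustrative only: the helper names are hypothetical, the test graph (the complete graph on five vertices with one negative edge) is not one of the networks studied here, the Mittag-Leffler function is summed directly from its power series, and $\gamma=1$ is assumed.

```python
import math
import numpy as np

def mittag_leffler(z, alpha, terms=400):
    """E_alpha(z) = sum_k z^k / Gamma(alpha*k + 1), summed directly (real z, moderate |z|)."""
    if z == 0.0:
        return 1.0
    log_abs_z = math.log(abs(z))
    total = 1.0  # k = 0 term
    for k in range(1, terms):
        magnitude = math.exp(k * log_abs_z - math.lgamma(alpha * k + 1.0))
        sign = -1.0 if (z < 0 and k % 2 == 1) else 1.0
        total += sign * magnitude
    return total

def balance_index(A, alpha, gamma=1.0):
    """Tr(E_alpha(gamma*A)) / Tr(E_alpha(gamma*|A|)) from the two spectra (gamma = 1 assumed)."""
    signed = sum(mittag_leffler(gamma * lam, alpha) for lam in np.linalg.eigvalsh(A))
    unsigned = sum(mittag_leffler(gamma * mu, alpha) for mu in np.linalg.eigvalsh(np.abs(A)))
    return signed / unsigned

def approx_balance_index(A, alpha, gamma=1.0, tol=1e-8):
    """m_1 * E_alpha(gamma*lambda_1) / E_alpha(gamma*mu_1), as in (6.3)."""
    lam = np.linalg.eigvalsh(A)          # ascending order, lam[-1] is lambda_1
    mu = np.linalg.eigvalsh(np.abs(A))   # mu[-1] is mu_1
    m1 = int(np.sum(lam > lam[-1] - tol))  # multiplicity of lambda_1
    return m1 * mittag_leffler(gamma * lam[-1], alpha) / mittag_leffler(gamma * mu[-1], alpha)

def relative_error(A, alpha):
    exact = balance_index(A, alpha)
    return abs(exact - approx_balance_index(A, alpha)) / exact

# Complete graph on 5 vertices with the edge (0, 1) made negative.
A = np.ones((5, 5)) - np.eye(5)
A[0, 1] = A[1, 0] = -1.0

for alpha in (1.0, 0.7, 0.4):
    print(alpha, relative_error(A, alpha))  # the error shrinks as alpha decreases
```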

Acknowledgments

We would like to acknowledge Dr. H. Saiz and Prof. C. Altafini for sharing datasets used in this work. E.E. acknowledges support from the Maria de Maeztu project CEX2021-001164-M funded by the MCIN/AEI/10.13039/501100011033. Y.T. is funded by the Wallenberg Initiative on Networks and Quantum Information (WINQ).

References

[1] A. Alfakih. Euclidean distance matrices and their applications in rigidity theory. Springer, 2018.
[2] C. Altafini. Dynamics of opinion forming in structurally balanced social networks. PLoS ONE, 7(6):1–9, 2012.
[3] C. Altafini. Consensus problems on networks with antagonistic interactions. IEEE Transactions on Automatic Control, 58(4):935–946, 2013.
[4] Francesca Arrigo and Fabio Durastante. Mittag–leffler functions and their applications in network science. SIAM Journal on Matrix Analysis and Applications, 42(4):1581–1601, 2021.
[5] F. Atay and S. Liu. Cheeger constants, structural balance, and spectral clustering analysis for signed graphs. Discrete Math., 343(1):111616, 2020.
[6] R. Balaji and R. Bapat. On euclidean distance matrices. Linear Algebra Appl., 424(1):108–117, 2007.
[7] Michele Benzi and Paola Boito. Matrix functions in network analysis. GAMM-Mitteilungen, 43(3):e202000012, 2020.
[8] C. Berge. The theory of graphs. Courier Corporation, 2001.
[9] Dorwin Cartwright and Frank Harary. Structural balance: a generalization of heider’s theory. Psychological review, 63(5):277, 1956.
[10] F. Diaz-Diaz and E. Estrada. Signed graphs in data sciences via communicability geometry. arXiv preprint arXiv:2403.07493, 2024.
[11] E. Estrada. Conservative vs. non-conservative diffusion towards a target in a networked environment. In The Target Problem. Springer, Berlin, 2024.
[12] Ernesto Estrada. The communicability distance in graphs. Linear Algebra and its Applications, 436(11):4317–4328, 2012.
[13] Ernesto Estrada. Rethinking structural balance in signed social networks. Discrete Applied Mathematics, 268:70–90, 2019.
[14] Ernesto Estrada. The many facets of the estrada indices of graphs and networks. SeMA Journal, 79(1):57–125, 2022.
[15] Ernesto Estrada. Communicability cosine distance: similarity and symmetry in graphs/networks. Computational and Applied Mathematics, 43(1):49, 2024.
[16] Ernesto Estrada and Michele Benzi. Walk-based measure of balance in signed networks: Detecting lack of balance in social networks. Physical Review E, 90(4):042802, 2014.
[17] Ernesto Estrada, Jesús Gómez-Gardeñes, and Lucas Lacasa. Network bypasses sustain complexity. Proceedings of the National Academy of Sciences, 120(31):e2305001120, 2023.
[18] Ernesto Estrada and Naomichi Hatano. Communicability in complex networks. Physical Review E, 77(3):036111, 2008.
[19] Ernesto Estrada and Naomichi Hatano. Communicability angle and the spatial efficiency of networks. SIAM Review, 58(4):692–715, 2016.
[20] Ernesto Estrada, Naomichi Hatano, and Michele Benzi. The physics of communicability in complex networks. Physics reports, 514(3):89–119, 2012.
[21] Ernesto Estrada and Desmond J Higham. Network properties revealed through matrix functions. SIAM review, 52(4):696–714, 2010.
[22] Ernesto Estrada and Juan A Rodriguez-Velazquez. Subgraph centrality in complex networks. Physical Review E, 71(5):056103, 2005.
[23] Ernesto Estrada, MG Sanchez-Lirola, and José Antonio De La Peña. Hyperspherical embedding of graphs and networks in communicability spaces. Discrete Applied Mathematics, 176:53–77, 2014.
[24] G. Facchetti, G. Iacono, and C. Altafini. Computing global structural balance in large-scale signed social networks. Proc. Natl. Acad. Sci., 108(52):20953–20958, 2011.
[25] L. Festinger. The analysis of sociograms using matrix algebra. Hum. Relat., 2(2):153–158, 1949.
[26] Rumi Ghosh, Kristina Lerman, Tawan Surachawala, Konstatin Voevodski, and Shanghua Teng. Non-conservative diffusion and its application to social network analysis. Journal of Complex Networks, 12(1):cnae006, 2024.
[27] P. Giscard, P. Rochet, and R. Wilson. Evaluating balance on social networks from their simple cycles. J. Complex Netw., 5(5):750–775, 05 2017.
[28] J. Gower. Properties of euclidean and non-euclidean distance matrices. Linear Algebra Appl., 67:81–97, 1985.
[29] F. Harary. On the notion of balance of a signed graph. Michigan Math. J., 2(2):143–146, 1953.
[30] F. Heider. Attitudes and cognitive organization. J. Psychol., 21(1):107–112, 1946.
[31] Nicholas J Higham. Functions of matrices: theory and computation. SIAM, 2008.
[32] Alexander Isakov, James H Fowler, Edoardo M Airoldi, and Nicholas A Christakis. The structure of negative social ties in rural village networks. Sociological science, 6:197, 2019.
[33] Leo Katz. A new status index derived from sociometric analysis. Psychometrika, 18(1):39–43, 1953.
[34] A. Kirkley, G. Cantwell, and M. Newman. Balance in signed networks. Phys. Rev. E, 99:012320, Jan 2019.
[35] N. Krislock and H. Wolkowicz. Euclidean distance matrices and applications. Springer, 2012.
[36] Chul-Ho Lee, Srinivas Tenneti, and Do Young Eun. Transient dynamics of epidemic spreading and its mitigation on large networks. In Proceedings of the twentieth ACM international symposium on mobile ad hoc networking and computing, pages 191–200, 2019.
[37] Kristina Lerman and Rumi Ghosh. Network structure, topology, and dynamics in generalized models of synchronization. Physical Review E, 86(2):026108, 2012.
[38] Andrés Martín and Ernesto Estrada. Fractional-modified bessel function of the first kind of integer order. Mathematics, 11(7):1630, 2023.
[39] A. Mathai and T. Zaslavsky. On adjacency matrices and descriptors of signed cycle graphs. J. Comb. Inf. Syst. Sci., 37(2-4):369–382, 2012.
[40] Zaid Odibat. Approximations of fractional integrals and Caputo fractional derivatives. Applied Mathematics and Computation, 178(2):527–533, 2006.
[41] H. Pollard. The completely monotonic character of the Mittag-Leffler function. Bull. Am. Math. Soc., 52:908–910, 1948.
[42] Amir Sadeghi and João R Cardoso. Some notes on properties of the matrix Mittag-Leffler function. Applied Mathematics and Computation, 338:733–738, 2018.
[43] Hugo Saiz, Jesús Gómez-Gardeñes, Paloma Nuche, Andrea Girón, Yolanda Pueyo, and Concepción L Alados. Evidence of structural balance in spatial ecological networks. Ecography, 40(6):733–741, 2017.
[44] Caio Seguin, Martijn P Van Den Heuvel, and Andrew Zalesky. Navigation of brain networks. Proceedings of the National Academy of Sciences, 115(24):6297–6302, 2018.
[45] R. Singh and B. Adhikari. Measuring the balance of signed networks and its application to sign prediction. J. Stat. Mech. Theory Exp., 2017(6):063302, jun 2017.
[46] Nicola Soranzo, Fahimeh Ramezani, Giovanni Iacono, and Claudio Altafini. Decompositions of large-scale biological systems based on dynamical properties. Bioinformatics, 28(1):76–83, 2012.
[47] P. Tarazaga, T. Hayden, and J. Wells. Circum-Euclidean distance matrices and faces. Linear Algebra Appl., 232:77–96, 1996.
[48] Y. Tian. Role Extraction, Dynamics, and Optimisation on Networks. PhD thesis, University of Oxford, October 2022. Available at https://ora.ox.ac.uk/objects/uuid:8145297d-3f88-4d67-9c34-575beb1a4c6c.
[49] Y. Tian and R. Lambiotte. Spreading and structural balance on signed networks. SIAM J. Appl. Dyn. Syst., 23(1):50–80, 2024.
[50] Y. Tian, S. Lautz, A. Wallis, and R. Lambiotte. Extracting complements and substitutes from sales data: a network perspective. EPJ Data Sci., 10(1):45, 2021.
[51] Thomas Zaslavsky. Signed graphs. Discrete Applied Mathematics, 4(1):47–74, 1982.
[52] Thomas Zaslavsky. Six signed Petersen graphs, and their automorphisms. Discrete Mathematics, 312(9):1558–1583, 2012.

Appendix A Tables

We give more details of the results from the signed Petersen graphs in Tables 2 and 3.

graph	$t_{c}$	$C_{5}^{-}$	$C_{6}^{-}$	$K\left(G\right)$
a	48	4	4	0.968
b	24	6	6	0.951
c	22	8	4	0.941
d	14	6	10	0.947
e	11	12	0	0.919

Table 2: Values of the time of consensus $t_{c}$ using Altafini’s model [3] for the graphs in Fig. 1, as well as the numbers of negative cycles of length 5, $C_{5}^{-}$, and of length 6, $C_{6}^{-}$, in those graphs. The values of the balance degree index $K\left(G\right)$ are also given for each graph.

	graph c		graph d
$n$	$C_{n}^{+}$	$C_{n}^{-}$	$C_{n}^{+}$	$C_{n}^{-}$
5	4	8	6	6
6	6	4	0	10
8	7	8	15	0
9	12	8	10	10

Table 3: Values of the number of positive $C_{n}^{+}$ and negative $C_{n}^{-}$ cycles of length $5\leq n\leq 9$ for the signed Petersen graphs c) and d) of Fig. 1.
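
As a quick consistency check on Table 3, the positive and negative counts for each length should add up to the total number of cycles of that length in the underlying unsigned Petersen graph. The sketch below uses the standard totals (12, 10, 15 and 20 cycles of lengths 5, 6, 8 and 9), which are an assumption of this check and are not stated in the table; the dictionary and function names are illustrative only.

```python
# Cycle counts (positive, negative) per length n, copied from Table 3.
table3 = {
    "c": {5: (4, 8), 6: (6, 4), 8: (7, 8), 9: (12, 8)},
    "d": {5: (6, 6), 6: (0, 10), 8: (15, 0), 9: (10, 10)},
}

# Total number of n-cycles in the unsigned Petersen graph.
# Assumption of this check: standard values, not taken from the table.
petersen_cycles = {5: 12, 6: 10, 8: 15, 9: 20}


def totals(counts):
    """Return {n: C_n^+ + C_n^-} for one signed graph."""
    return {n: pos + neg for n, (pos, neg) in counts.items()}


for name, counts in table3.items():
    # Changing edge signs cannot change how many cycles exist.
    assert totals(counts) == petersen_cycles, name
    print(name, totals(counts))
```

Both signed graphs pass the check, so they differ only in how the same set of cycles is split between positive and negative ones.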

We summarise the exact numbers of positive and negative cycles in the gene regulatory networks in Table 4.

	S. cerevisiae		B. subtilis
$n$	$C_{n}^{+}$	$C_{n}^{-}$	$C_{n}^{+}$	$C_{n}^{-}$
3	35	27	129	75
4	1174	114	1366	547
5	152	135	1780	1031
6	3855	763	8003	4975
7	1875	1119	20645	14261
8	34321	13332	72722	48309
9	29473	16740	179137	122709
10	271954	128469	547246	364806
11	476800	279258	1443443	1002857

Table 4: Values of the number of positive $C_{n}^{+}$ and negative $C_{n}^{-}$ cycles of length $3\leq n\leq 11$ for the gene regulatory networks of yeast (S. cerevisiae) and B. subtilis.
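
The raw counts in Table 4 are easier to compare across lengths as proportions. The following minimal sketch computes, for each network and each cycle length, the share of positive cycles $C_{n}^{+}/\left(C_{n}^{+}+C_{n}^{-}\right)$. This ratio is a simple derived quantity used here for illustration; it is not the ML balance index, and the names `table4` and `positive_share` are hypothetical.

```python
# Cycle counts (positive, negative) per length n, copied from Table 4.
table4 = {
    "yeast": {
        3: (35, 27), 4: (1174, 114), 5: (152, 135),
        6: (3855, 763), 7: (1875, 1119), 8: (34321, 13332),
        9: (29473, 16740), 10: (271954, 128469), 11: (476800, 279258),
    },
    "B. subtilis": {
        3: (129, 75), 4: (1366, 547), 5: (1780, 1031),
        6: (8003, 4975), 7: (20645, 14261), 8: (72722, 48309),
        9: (179137, 122709), 10: (547246, 364806), 11: (1443443, 1002857),
    },
}


def positive_share(pos, neg):
    """Fraction of cycles of a given length that are positive."""
    return pos / (pos + neg)


for name, counts in table4.items():
    for n, (pos, neg) in sorted(counts.items()):
        print(f"{name:12s} n={n:2d}  positive share = {positive_share(pos, neg):.3f}")
```

For every length listed in Table 4, positive cycles outnumber negative ones in both networks, so each share is above one half.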