Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: tabstackengine

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2402.18203v1 [physics.soc-ph] 28 Feb 2024

Exploring the space of graphs with fixed discrete curvatures

Michelle Roost Karel Devriendt Giulio Zucal Jürgen Jost
Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
(February 2024)
Abstract

Discrete curvatures are quantities associated to the nodes and edges of a graph that reflect the local geometry around them. These curvatures have a rich mathematical theory and they have recently found success as a tool to analyze networks across a wide range of domains. In this work, we consider the problem of constructing graphs with a prescribed set of discrete edge curvatures, and explore the space of such graphs. We address this problem in two ways: first, we develop an evolutionary algorithm to sample graphs with discrete curvatures close to a given set. We use this algorithm to explore how other network statistics vary when constrained by the discrete curvatures in the network. Second, we solve the exact reconstruction problem for the specific case of Forman–Ricci curvature. By leveraging the theory of Markov bases, we obtain a finite set of rewiring moves that connects the space of all graphs with a fixed discrete curvature.
 
Keywords: network geometry, discrete curvature, Markov bases, Markov chain Monte Carlo methods, evolutionary algorithm, edge-based network measures

1 Introduction

Networks are used ubiquitously as a tool to model, analyze and design complex systems in a wide range of domains of science and technology such as neuroscience [24], epidemiology [48], economics [28, 6], urban systems [4], energy infrastructure [47] and many more. Across these different disciplines, networks, graphs and their higher-order generalizations [35, 5, 8] provide a unifying mathematical language that allows to translate concepts and tools from one setting to another. Moreover, a series of observations at the turn of the 21st century found that the networks extracted from real-world systems often exhibit common features: heavy-tailed degree distributions [3], a “small world” structure [60] and rich mesoscale connectivity patterns such as communities [26]. This initiated an ongoing pursuit to find models and theories that explain the presence of these features, and develop new mathematical methods to detect, describe and quantify them. We point to [11, 58, 12] for a critical perspective on these observations.

Among the many approaches to study complex networks, this article is concerned with geometry and more specifically with discrete curvature as a tool to describe the structure of graphs; see [10, 62] for an overview of different notions of geometry associated with complex systems and their applications. We now explain in more detail the main topic of the article: discrete curvature.
 
Curvature is a concept from Riemannian geometry, the study of smooth objects such as surfaces equipped with a metric, and quantifies how much these objects differ locally from being flat. While classical definitions of curvature need the structure of a Riemannian manifold, more recent works have generalized these definitions so they can be applied to other metric spaces such as graphs; this results in discrete notions of curvature — see Section 2 for a concrete example.

Definitions of discrete curvature on graphs111We note that these definitions cover many types of discrete curvature such as scalar curvature and Ricci curvature. Furthermore, in many cases these discrete curvatures are defined for more general objects, with graphs arising as a special case. were given for instance by Forman based on Laplacian operators [23, 53, 32], by Ollivier and later Lin, Lu and Yau based on neighborhood overlap [44, 45, 36], by Bakry and Émery based on the Bochner identity [2], by Erbar and Maas based on a certain entropy functional [21], by Steinerberger based on solutions to an equilibrium problem [55], and by Devriendt and Lambiotte based on the effective resistance [14, 15]. Importantly, many of these discrete curvatures satisfy properties and results analogous to the powerful theorems from differential geometry; see for instance [37, 36, 7, 31, 39, 23, 38, 15, 55]. Of particular relevance to the central question of emergent structure and organization in complex systems, we note that many notions of discrete curvature satisfy powerful ‘local-to-global’ theorems, where local constraints on the curvature lead to statements on the global structure of the graph.

Aside from a rich mathematical theory, discrete curvature has also found successful applications as a tool to analyze and characterize the structure of complex real-world networks. It has been used for instance in the context of community detection [43, 27, 22], in the study of biological networks [51, 19], in characterizing connectivity patterns in brain networks [20], in studying market instabilities in financial networks [49], and to detect and address the problem of oversquashing in graph neural networks [57, 9].
 
In this article, we consider the following question: How to construct graphs with prescribed discrete curvatures? To our knowledge, this inverse problem has not yet been addressed for any notion of discrete curvature, and we consider it a basic methodological gap in trying to understand how the geometry of a graph or network influences its other features. As a first step in addressing this methodological gap, this article provides two different graph reconstruction approaches for specific choices of discrete curvatures. We furthermore explore, both experimentally and theoretically, the resulting space of graphs with fixed discrete curvatures.

In the first part, Section 3, we consider the problem of approximate reconstruction and develop a methodology that works for any notion of discrete curvature. We use a Markov chain Monte Carlo-type algorithm (MCMC) to iteratively construct graphs with discrete curvatures close to a target set of curvatures. We then use this algorithm to explore the ensemble of graphs with a given discrete curvature sequence in a number of experiments. For computational reasons, our discussion focuses on two simple notions of discrete curvature, Forman–Ricci and augmented Forman–Ricci curvature, but we have verified the methodology for other curvatures as well.

In the second part, Section 4, we take a more theoretical approach and consider the problem of exact reconstruction of graphs from a given target Forman–Ricci curvature sequence. In this specific case, the problem reduces to a combinatorial question involving the so-called joint degrees in the graph; see for instance [54]. Using known characterizations of possible joint degrees in combination with the theory of Markov bases [17], we show that there exists a finite set of graph rewiring moves that connect any two graphs with the same Forman–Ricci curvatures and node degrees; this is the content of Theorem 4.12. The set of rewiring moves can be computed using existing computer algebra packages, and we give explicit examples for the case of graphs with a small maximum degree. Our result naturally leads to an ergodic Markov chain over the set of all graphs with equal Forman–Ricci curvatures and degrees.
 
Organization: The rest of this article is organized as follows. Section 2 introduces the relevant definitions from graph theory and discrete curvature and formalizes the reconstruction question. In Section 3, we propose an evolutionary MCMC algorithm for approximate reconstruction from a target curvature sequence. We then use the algorithm in a number of experiments and describe network statistics of graphs sampled using the algorithm. In Section 4, we consider the exact reconstruction problem. We formalize the problem and solve it using the theory of joint degree matrices and Markov bases. Section 5 concludes the article with a summary of the results and an outlook towards future applications and extensions.

2 Background and Forman–Ricci curvatures

We consider simple graphs G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) where V𝑉Vitalic_V is the set of nodes and E𝐸Eitalic_E is the set of edges, which are pairs {u,v}E𝑢𝑣𝐸\{u,v\}\in E{ italic_u , italic_v } ∈ italic_E of distinct nodes that are connected in the graph. As mentioned in the introduction, we will characterize the structural properties of graphs by considering their discrete curvature. Our focus will be on two definitions of discrete Ricci curvature on edges.

The Forman–Ricci curvature for graphs is defined with respect to a choice of positive weights w𝑤witalic_w for the nodes and edges. The curvature of an edge e={u,v}𝑒𝑢𝑣e=\{u,v\}italic_e = { italic_u , italic_v } is equal to [53]

Fw(e)=wu(1eueewewe)+wv(1eveewewe),subscript𝐹𝑤𝑒subscript𝑤𝑢1subscript𝑢superscript𝑒superscript𝑒𝑒subscript𝑤𝑒subscript𝑤superscript𝑒subscript𝑤𝑣1subscript𝑣superscript𝑒superscript𝑒𝑒subscript𝑤𝑒subscript𝑤superscript𝑒F_{w}(e)=w_{u}\Biggl{(}1-\sum_{\begin{subarray}{c}e^{\prime}\ni u\\ e^{\prime}\neq e\end{subarray}}\sqrt{\frac{w_{e}}{w_{e^{\prime}}}}\Biggr{)}+w_% {v}\Biggl{(}1-\sum_{\begin{subarray}{c}e^{\prime}\ni v\\ e^{\prime}\neq e\end{subarray}}\sqrt{\frac{w_{e}}{w_{e^{\prime}}}}\Biggr{)},italic_F start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ( italic_e ) = italic_w start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( 1 - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∋ italic_u end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_e end_CELL end_ROW end_ARG end_POSTSUBSCRIPT square-root start_ARG divide start_ARG italic_w start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_ARG start_ARG italic_w start_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG end_ARG ) + italic_w start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( 1 - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∋ italic_v end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_e end_CELL end_ROW end_ARG end_POSTSUBSCRIPT square-root start_ARG divide start_ARG italic_w start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_ARG start_ARG italic_w start_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG end_ARG ) ,

where esuperscript𝑒e^{\prime}italic_e start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT runs over all edges that contain u𝑢uitalic_u (resp. v𝑣vitalic_v) and are distinct from e𝑒eitalic_e. Setting all weights to w=1𝑤1w=1italic_w = 1 results in the combinatorial formula for the Forman–Ricci curvature of an edge that will be used in this article:

F(e)=4deg(u)deg(v),𝐹𝑒4degree𝑢degree𝑣F(e)=4-\deg(u)-\deg(v),italic_F ( italic_e ) = 4 - roman_deg ( italic_u ) - roman_deg ( italic_v ) , (1)

for e={u,v}𝑒𝑢𝑣e=\{u,v\}italic_e = { italic_u , italic_v } and where deg(u)degree𝑢\deg(u)roman_deg ( italic_u ) denotes the degree of node u𝑢uitalic_u, i. e., its number of neighbors in G𝐺Gitalic_G. Clearly, Forman–Ricci curvature is a very simple notion of discrete curvature that only depends on the local degree information of the graph. For this reason, we will also consider an extended notion of discrete Ricci curvature based on Forman’s original curvature for CW complexes [23]. The augmented Forman–Ricci curvature for unweighted graphs is defined in [50] as

F#(e)=4deg(u)deg(v)+3𝒯(e),superscript𝐹#𝑒4degree𝑢degree𝑣3𝒯𝑒F^{\#}(e)=4-\deg(u)-\deg(v)+3\mathcal{T}(e),italic_F start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ( italic_e ) = 4 - roman_deg ( italic_u ) - roman_deg ( italic_v ) + 3 caligraphic_T ( italic_e ) , (2)

where 𝒯(e)𝒯𝑒\mathcal{T}(e)caligraphic_T ( italic_e ) is the number of triangles in the graph that contain the edge e={u,v}𝑒𝑢𝑣e=\{u,v\}italic_e = { italic_u , italic_v }. The augmented Forman–Ricci curvature is widely used because it can be computed efficiently while still capturing the local geometry of a graph well; for instance, it correlates highly with other notions of discrete curvature such as Ollivier–Ricci curvature [50] and Bakry–Émery curvature [40]. One can further augment F#(e)superscript𝐹#𝑒F^{\#}(e)italic_F start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ( italic_e ) by considering cycles of length four and more, but this involves a tradeoff between expressivity and computational complexity of the discrete curvature; see for instance [32, 56, 22] for more on this. Figure 1 shows a small graph with the Forman–Ricci and augmented Forman–Ricci curvatures computed for all edges.

Refer to caption
Figure 1: The Forman–Ricci curvatures F(e)𝐹𝑒F(e)italic_F ( italic_e ) and augmented Forman–Ricci curvatures F#(e)superscript𝐹#𝑒F^{\#}(e)italic_F start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ( italic_e ) in a graph. The differences due to the presence of a triangle are highlighted in red.

For a given graph G𝐺Gitalic_G, the curvature sequence is the multiset (set with possible repeats) of all discrete curvatures of its edges. For instance, the Forman–Ricci curvature sequence is the multiset {F(e)}eEsubscript𝐹𝑒𝑒𝐸\{F(e)\}_{e\in E}{ italic_F ( italic_e ) } start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT, which for the example in Figure 1 equals {3,3,2,2,1,1,0,0}33221100\{-3,-3,-2,-2,-1,-1,0,0\}{ - 3 , - 3 , - 2 , - 2 , - 1 , - 1 , 0 , 0 }. We can now make the main methodological question of this article more precise:

Given a multiset 𝒞superscript𝒞normal-⋆\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT of numbers, construct a graph with curvature sequence 𝒞superscript𝒞normal-⋆\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. (Q1)

The answer to this question can then be used to address a second goal of the article: to explore the space of all graphs that have a given discrete curvature.

Section 3 addresses question (Q1) algorithmically by proposing an algorithm that samples graphs G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG such that 𝒞(G^)𝒞𝒞^𝐺superscript𝒞\mathcal{C}(\hat{G})\approx\mathcal{C}^{\star}caligraphic_C ( over^ start_ARG italic_G end_ARG ) ≈ caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, for any notion of discrete curvature. Section 4 addresses question (Q1) theoretically for the case of Forman–Ricci curvature. We show that there exists a finite set of graph operations, such that any two graphs G,G𝐺superscript𝐺G,G^{\prime}italic_G , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with the same Forman–Ricci curvature sequence 𝒞(G)=𝒞(G)𝒞𝐺𝒞superscript𝐺\mathcal{C}(G)=\mathcal{C}(G^{\prime})caligraphic_C ( italic_G ) = caligraphic_C ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) and degree sequence can be transformed into each other by using these graph operations. Throughout the article, we limit our scope to choices of 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT for which there exists at least one graph G𝐺Gitalic_G with curvature sequence 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. Characterizing the multisets that are realizable as discrete curvature sequences of a graph is an interesting problem in itself, and it is still unexplored for most notions of discrete curvature.

3 Approximate graph reconstruction from curvatures

The main methodological question of this article (Q1) asks to construct graphs with predefined discrete curvatures. Similar to reconstructing graphs from statistics such as motif counts or eigenvalues [29], this is a very difficult task in general and it highly depends on the choice of discrete curvature. Therefore, this section proposes an algorithm for approximate reconstruction that works for any choice of curvature. For a given multiset 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, the algorithm samples graphs G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG such that 𝒞(G^)𝒞𝒞^𝐺superscript𝒞\mathcal{C}(\hat{G})\approx\mathcal{C}^{\star}caligraphic_C ( over^ start_ARG italic_G end_ARG ) ≈ caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. We use this algorithm to explore the ensemble of graphs with curvatures close to a given target sequence. In particular, we study how a number of standard network summary statistics vary over this ensemble.

3.1 Evolutionary algorithm for sampling graphs with given curvatures

We follow the methodology of [29] and use an evolutionary Markov chain Monte Carlo-type algorithm (MCMC) that consists of three steps: initialization, mutation and selection. The algorithm starts with an initial graph G0subscript𝐺0G_{0}italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT which is then evolved iteratively to make its curvature sequence gradually closer to the target sequence. After choosing a notion of discrete curvature of interest, the algorithm takes three inputs: (i) the number of nodes n𝑛nitalic_n and the target sequence 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, which is a multiset of numbers, (ii) a parameter θ𝜃\thetaitalic_θ which introduces variability to avoid the algorithm from getting stuck and (iii) a parameter T𝑇Titalic_T, which is the number of steps after which the algorithm halts. In practice we choose the parameters θ𝜃\thetaitalic_θ and T𝑇Titalic_T for a given sequence using grid search. We now describe the algorithm steps in more detail.
\blacktriangleright Initialization: Construct a random graph G0subscript𝐺0G_{0}italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with n𝑛nitalic_n nodes and m:=|𝒞|assign𝑚superscript𝒞m:=|\mathcal{C}^{\star}|italic_m := | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT | edges, starting from a random tree (to guarantee connectedness) and with the remaining edges randomly added while avoiding multi-edges. The iteration parameter t𝑡titalic_t is set to zero.
\blacktriangleright Mutation: Select a random edge {u,v}E(Gt)𝑢𝑣𝐸subscript𝐺𝑡\{u,v\}\in E(G_{t}){ italic_u , italic_v } ∈ italic_E ( italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) in the graph and construct a graph Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT in which the edge {u,v}𝑢𝑣\{u,v\}{ italic_u , italic_v } is rewired to {u,v}𝑢superscript𝑣\{u,v^{\prime}\}{ italic_u , italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT }, where vsuperscript𝑣v^{\prime}italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is chosen uniformly at random while avoiding isolated nodes and multi-edges.
\blacktriangleright Selection: Compute the discrete curvature sequences 𝒞(Gt)𝒞subscript𝐺𝑡\mathcal{C}(G_{t})caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) and 𝒞(G)𝒞superscript𝐺\mathcal{C}(G^{\prime})caligraphic_C ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) and compare with the target curvature sequence by computing the mean squared error222We found that choosing a different measure to compare the curvature sequences does not have much influence on the convergence speed of the algorithm.:

MSEt=1mi=1m(𝒞(Gt)i𝒞i)2 and MSE=1mi=1m(𝒞(G)i𝒞i)2,formulae-sequencesubscriptMSE𝑡1𝑚superscriptsubscript𝑖1𝑚superscript𝒞subscriptsubscript𝐺𝑡𝑖subscriptsuperscript𝒞𝑖2 and superscriptMSE1𝑚superscriptsubscript𝑖1𝑚superscript𝒞subscriptsuperscript𝐺𝑖subscriptsuperscript𝒞𝑖2\text{MSE}_{t}=\frac{1}{m}\sum_{i=1}^{m}(\mathcal{C}(G_{t})_{i}-\mathcal{C}^{% \star}_{i})^{2}\quad\text{~{}and~{}}\quad\text{MSE}^{\prime}=\frac{1}{m}\sum_{% i=1}^{m}(\mathcal{C}(G^{\prime})_{i}-\mathcal{C}^{\star}_{i})^{2},MSE start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and MSE start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( caligraphic_C ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

where the curvature sequences are sorted in ascending order. The mutation Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is accepted if it results in a decrease in the mean squared error ΔMSE=MSEMSEt<0ΔMSEsuperscriptMSEsubscriptMSE𝑡0\Delta\text{MSE}=\text{MSE}^{\prime}-\text{MSE}_{t}<0roman_Δ MSE = MSE start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - MSE start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT < 0, or otherwise it is accepted with probability p=exp(ΔMSE/(MSEtθ))𝑝ΔMSEsubscriptMSE𝑡𝜃p=\exp(-\Delta\text{MSE}/(\text{MSE}_{t}\cdot\theta))italic_p = roman_exp ( - roman_Δ MSE / ( MSE start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⋅ italic_θ ) ). Finally, set tt+1𝑡𝑡1t\leftarrow t+1italic_t ← italic_t + 1 and GtGsubscript𝐺𝑡superscript𝐺G_{t}\leftarrow G^{\prime}italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ← italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT if the mutation is accepted or GtGt1subscript𝐺𝑡subscript𝐺𝑡1G_{t}\leftarrow G_{t-1}italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ← italic_G start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT otherwise, and continue the process until t=T𝑡𝑇t=Titalic_t = italic_T.

Figure 2 illustrates the evolution of the curvature sequences throughout the algorithm. It shows the histogram of the target sequence 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and some curvature sequences 𝒞(Gt)𝒞subscript𝐺𝑡\mathcal{C}(G_{t})caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) during the algorithm; note the good convergence of the final curvature sequence 𝒞(GT)𝒞subscript𝐺𝑇\mathcal{C}(G_{T})caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ). While this methodology is applicable to any choice of discrete curvature, we focus on Forman–Ricci and augmented Forman–Ricci curvature for computational reasons in our experiments. We found that the algorithm still converges for Ollivier–Ricci curvature and resistance curvature target sequences, but with prohibitively long computation times.

Refer to caption
Figure 2: Evolution of curvature sequences throughout the sampling algorithm. Top left: Histogram of the target sequence 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. This is the Forman–Ricci curvature sequence of the C. elegans metabolic network (see Section 3.2.1). Bottom left: Evolution of the mean squared error MSEt𝑡{}_{t}start_FLOATSUBSCRIPT italic_t end_FLOATSUBSCRIPT between the target sequence and the Forman–Ricci curvature sequences. Highlight box: Histogram of Forman–Ricci curvatures 𝒞(Gt)𝒞subscript𝐺𝑡\mathcal{C}(G_{t})caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) at four points during the algorithm: (A) the initial graph (B) after 5×1045E45\text{\times}{10}^{4}start_ARG 5 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 4 end_ARG end_ARG accepted mutations, (C) after 11×10411E411\text{\times}{10}^{4}start_ARG 11 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 4 end_ARG end_ARG accepted mutations and (D) at the end of the algorithm, after 16×10416E416\text{\times}{10}^{4}start_ARG 16 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 4 end_ARG end_ARG accepted mutations and T=2×106𝑇2E6T=$2\text{\times}{10}^{6}$italic_T = start_ARG 2 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG total iterations.

3.2 Approximate sampling experiments

We now use the algorithm described in Section 3.1 to explore ensembles of graphs with a given discrete curvature sequence. For both Forman–Ricci and augmented Forman–Ricci curvature, we consider four target curvature sequences and sample 1000100010001000 graphs. We then compute a number of network summary statistics and explore how they vary over these graphs.

3.2.1 Target sequences

To guarantee target sequences which are realizable, we use curvature sequences of existing graphs. We consider graphs obtained from three widely-studied random graph models [42] and one real-world network. The considered graphs are as follows:

  • Erdős–Rényi (ER) random graph. The graph has n=500𝑛500n=500italic_n = 500 nodes and every pair of nodes is connected independently with probability 0.010.010.010.01. These graphs result in a unimodal target sequence.

  • Stochastic block model (SBM) random graph. The graph has n=500𝑛500n=500italic_n = 500 nodes divided in two equal sized groups. Pairs of nodes in group 1111 are connected with probability p=0.05𝑝0.05p=0.05italic_p = 0.05, in group 2222 with probability 0.010.010.010.01 and pairs of nodes from different groups with probability p=0.003𝑝0.003p=0.003italic_p = 0.003. These graphs result in a bimodal target sequence.

  • Barabási–Albert (BA) random graph. The graph has n=500𝑛500n=500italic_n = 500 nodes and is constructed by starting from a star graph with four nodes and then adding new nodes. Each new node is connected to three of the existing nodes with probability proportional to their degrees. These graphs result in a heavy-tailed target sequence.

  • Real-world network. We consider the metabolic network of the Caenorhabditis elegans worm (C. elegans) [34, 18, 30, 46], this is a well-studied graph in biology. This graph has n=453𝑛453n=453italic_n = 453 nodes and m=2025𝑚2025m=2025italic_m = 2025 edges and results in a heterogeneous/complex target sequence.

Refer to caption
Figure 3: Histograms of the Forman–Ricci curvature of an Erdős–Rényi, stochastic block model and Barabási–Albert random graph and the C. elegans metabolic network.

3.2.2 Summary statistics

To summarize the structure of the graphs obtained from our algorithm, we compute four global network statistics:

  • Number of triangles: the number of triangles in a graph is a simple but important summary statistic. It contains information about the density of the graph and it is closely related to transitivity [59] and discrete curvature [31].

  • Average clustering coefficient: the clustering coefficient Cvsubscript𝐶𝑣C_{v}italic_C start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT of a node v𝑣vitalic_v is the number of triangles that contain v𝑣vitalic_v divided by the number of possible triangles (deg(v)2)binomialdegree𝑣2\binom{\deg(v)}{2}( FRACOP start_ARG roman_deg ( italic_v ) end_ARG start_ARG 2 end_ARG ). The average clustering coefficient 1nCv1𝑛subscript𝐶𝑣\frac{1}{n}\sum C_{v}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ italic_C start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT is an important graph property that reflects its geometry [31, 33].

  • Average shortest path length: this is the average, over all pairs of nodes u𝑢uitalic_u and v𝑣vitalic_v, of the smallest number of steps between u𝑢uitalic_u and v𝑣vitalic_v. It is an intuitive summary of the metric structure of a graph.

  • First positive eigenvalue: the normalized graph Laplacian is the n×n𝑛𝑛n\times nitalic_n × italic_n matrix L=ID1A𝐿𝐼superscript𝐷1𝐴L=I-D^{-1}Aitalic_L = italic_I - italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_A where I𝐼Iitalic_I is the identity matrix, D𝐷Ditalic_D is the diagonal degree matrix and A𝐴Aitalic_A is the adjacency matrix. The eigenvalues of the Laplacian contain a lot of information about the graph structure and dynamical processes such as diffusion and random walks associated with the graph [13, 41]. In particular, L𝐿Litalic_L is a positive semidefinite matrix and its first positive eigenvalue λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT quantifies how well-connected the graph is.

3.2.3 Experiments

Before sampling the ensemble of graphs, we determine the optimal parameters θ𝜃\thetaitalic_θ and T𝑇Titalic_T for each of the target sequences. Table 1 shows the parameter values obtained via grid search.

ER SBM BA C. elegans
Forman–Ricci curvature sequence θ𝜃\thetaitalic_θ -.3×1033E-33\text{\times}{10}^{-3}start_ARG 3 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.3×1033E-33\text{\times}{10}^{-3}start_ARG 3 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.3×1033E-33\text{\times}{10}^{-3}start_ARG 3 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.8×1048E-48\text{\times}{10}^{-4}start_ARG 8 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 4 end_ARG end_ARG
T𝑇Titalic_T 1×1061E61\text{\times}{10}^{6}start_ARG 1 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 1×1061E61\text{\times}{10}^{6}start_ARG 1 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 1×1061E61\text{\times}{10}^{6}start_ARG 1 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 2×1062E62\text{\times}{10}^{6}start_ARG 2 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG
augmented Forman–Ricci curvature sequence θ𝜃\thetaitalic_θ -.3×1033E-33\text{\times}{10}^{-3}start_ARG 3 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.3×1033E-33\text{\times}{10}^{-3}start_ARG 3 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.4×1034E-34\text{\times}{10}^{-3}start_ARG 4 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 3 end_ARG end_ARG -.6×1046E-46\text{\times}{10}^{-4}start_ARG 6 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG - 4 end_ARG end_ARG
T𝑇Titalic_T 2×1062E62\text{\times}{10}^{6}start_ARG 2 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 2×1062E62\text{\times}{10}^{6}start_ARG 2 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 2×1062E62\text{\times}{10}^{6}start_ARG 2 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG 2.5×1062.5E62.5\text{\times}{10}^{6}start_ARG 2.5 end_ARG start_ARG times end_ARG start_ARG power start_ARG 10 end_ARG start_ARG 6 end_ARG end_ARG
Table 1: Optimal θ𝜃\thetaitalic_θ and T𝑇Titalic_T parameters determined via grid search.

We now consider the four graphs (ER, SBM, BA, C. elegans), compute their Forman–Ricci curvature sequence as a target sequence 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and then compute 1000100010001000 graphs with 𝒞(G^)𝒞𝒞^𝐺superscript𝒞\mathcal{C}(\hat{G})\approx\mathcal{C}^{\star}caligraphic_C ( over^ start_ARG italic_G end_ARG ) ≈ caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT using our algorithm. As an indication of the quality of the approximation, Table 2 shows the mean squared errors for the sampled graphs. Generally, we observe that the MSE increases as the graphs become less uniform, as is the case for the Barabási–Albert and the C. elegans network.
 
Figure 4 shows the histograms of the network statistics (in blue) and the value of this statistic for the starting graph (red line). For each statistic, we also plot the standard score |ss|σsuperscript𝑠delimited-⟨⟩𝑠𝜎\tfrac{|s^{\star}-\langle s\rangle|}{\sigma}divide start_ARG | italic_s start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - ⟨ italic_s ⟩ | end_ARG start_ARG italic_σ end_ARG, where ssuperscript𝑠s^{\star}italic_s start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the value of the statistic for the target graph, sdelimited-⟨⟩𝑠\langle s\rangle⟨ italic_s ⟩ is the ensemble average of the statistic and σ𝜎\sigmaitalic_σ is the standard deviation of the statistic over the ensemble. This number is a proxy for how typical or atypical the statistics of the target graph are with respect to the ensemble.

Refer to caption
Figure 4: Network statistics for graphs sampled using the algorithm from Section 3.1, based on four target Forman–Ricci curvature sequences (ER, SBM, BA, C. elegans). The histograms of the observed statistics in the sampled graph ensemble are shown in blue and the red line indicates the statistic of the target graph together with its standard score.

We note a few observations: the standard scores for the ER-based target sequence are all below 1111. This suggests that the ensemble of graphs obtained from the algorithm is not very different from an ensemble of ER random graphs. Second, for the SBM-based target sequence, we see that the average shortest path length is much smaller for the sampled graphs than for the target graph, and that the first positive eigenvalue is much larger. Both observations suggest that the block structure of the SBM graph, i. e., two groups of nodes which are poorly connected, is not well-reproduced by the sampled graphs. Finally, for the C. elegans-based target sequence, we see that all standard scores are very high. This implies that the structure of this graph is very different from the sampled graphs. This is not surprising since only fixing the degree sums of edges is a very local and coarse structural constraint.
 
We repeat the same experiment with the augmented Forman–Ricci curvature. The mean squared error between the sampled and target curvature sequences are shown in Table 2 and the MSE is again higher for the Barabási–Albert graph and the C. elegans network. The results of the experiment are shown in Figure 5.

Refer to caption
Figure 5: Network statistics for graphs sampled using the algorithm from Section 3.1, based on four target augmented Forman–Ricci curvature sequences (ER, SBM, BA, C. elegans). The histograms of the observed statistics in the sampled graph ensemble are shown in blue and the red line indicates the statistic of the target graph together with its standard score.

We note a few observations: compared to Figure 4, the standard scores for the ER-based target sequence have all increased. This is not unexpected, since the graphs are sampled based on the augmented Forman–Ricci curvature which takes into account the presence of triangles in the graph, whereas ER graphs have no correlations between different edges. Secondly, the same observation as before holds for the SBM-based target sequence: the smaller shortest path lengths and larger first positive eigenvalue suggest that the two-block structure of the SBM is not reproduced in the sampled graphs.

MSE Forman Augmented Forman
ER SBM BA C. elegans ER SBM BA C. elegans
min 0.0 0.016 0.038 0.111 0.018 0.153 0.195 15.539
max 0.005 0.039 0.193 0.416 0.191 0.285 3.757 21.469
mean 0.001 0.027 0.088 0.033 0.094 0.212 1.115 18.188
Table 2: Minimal, maximal and mean of the MSE between 𝒞superscript𝒞\mathcal{C}^{\star}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and 𝒞(GT)𝒞subscript𝐺𝑇\mathcal{C}(G_{T})caligraphic_C ( italic_G start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) for the 1000100010001000 sampled graphs in the experiments of Figures 4 and 5.

Finally, we note that one should take care in interpreting the histograms of network statistics, since the specific design of our algorithm does not give much control or insight into the distribution from which we sample graphs. Furthermore, we do not have any mixing time results on which we can base our choice of stopping parameter T𝑇Titalic_T. This makes it hard to control the influence of the initialization G0subscript𝐺0G_{0}italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT on the ensemble of graphs which we sample from. In future work, these issues could be addressed by designing a more tailored algorithm or by trying to find exact expressions for the probability distribution over graphs, for instance using a soft maximum entropy model [52].

4 Exact graph reconstruction from curvatures and degrees

In this section, we take a second approach to the main question (Q1) and consider the exact reconstruction problem of finding graphs whose discrete curvatures precisely match a given multiset of numbers. This problem clearly depends on the notion of discrete curvature, and we will focus on Forman–Ricci curvature F({u,v})=4deg(u)deg(v)𝐹𝑢𝑣4degree𝑢degree𝑣F(\{u,v\})=4-\deg(u)-\deg(v)italic_F ( { italic_u , italic_v } ) = 4 - roman_deg ( italic_u ) - roman_deg ( italic_v ). For reasons which will be explained later, we not only consider fixed curvatures but also fixed degrees. In the main result, Theorem 4.12, we find that there exists a finite set of graph rewiring moves which can transform any two multigraphs with the same Forman–Ricci curvatures and degree sequences into each other. This result follows from encoding the degree and curvature constraints algebraically using a joint degree matrix and using the theory of Markov bases. Our approach then decomposes into two parts: first, finding all joint degree matrices that are compatible with the curvature and degree constraints and second, finding all graphs with the same joint degree matrix.

The rest of this section will use the following conventions: a multigraph can have multi-edges and self-loops while a simple graph cannot, and we write “graph” if the distinction is not relevant. The maximum degree is denoted by ΔΔ\Deltaroman_Δ, the degree sequence is the ΔΔ\Deltaroman_Δ-tuple (# nodes of degree a)a=1Δsuperscriptsubscript# nodes of degree a𝑎1Δ(\text{\# nodes of degree $a$})_{a=1}^{\Delta}( # nodes of degree italic_a ) start_POSTSUBSCRIPT italic_a = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT and the curvature sequence is the (2Δ1)2Δ1(2\Delta-1)( 2 roman_Δ - 1 )-tuple (# edges with Forman–Ricci curvature 4κ4𝜅4-\kappa4 - italic_κ)2Δκ=2superscriptsubscriptabsent𝜅22Δ{}_{\kappa=2}^{2\Delta}start_FLOATSUBSCRIPT italic_κ = 2 end_FLOATSUBSCRIPT start_POSTSUPERSCRIPT 2 roman_Δ end_POSTSUPERSCRIPT; see Figure 6 for an example. Note that this is different from how we defined the curvature sequence 𝒞𝒞\mathcal{C}caligraphic_C before; it records the frequency of all possible curvatures rather than the multiset of curvatures itself. Since both definitions contain the same information, we will use the same name.

4.1 Joint degree matrix and transpositions

The Forman–Ricci curvature of an edge depends on the sum of the degrees of its end points. This information can be represented by the following matrix: the joint degree matrix (JDM) of a graph G𝐺Gitalic_G is a symmetric Δ×ΔΔΔ\Delta\times\Deltaroman_Δ × roman_Δ matrix J𝐽Jitalic_J with entries

Jab=Jbasubscript𝐽𝑎𝑏subscript𝐽𝑏𝑎\displaystyle J_{ab}=J_{ba}italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = italic_J start_POSTSUBSCRIPT italic_b italic_a end_POSTSUBSCRIPT =|{{u,v}E(G):{deg(u),deg(v)}={a,b}}|absentconditional-set𝑢𝑣𝐸𝐺degree𝑢degree𝑣𝑎𝑏\displaystyle=\left|\big{\{}\{u,v\}\in E(G)\,:\,\{\deg(u),\deg(v)\}=\{a,b\}% \big{\}}\right|= | { { italic_u , italic_v } ∈ italic_E ( italic_G ) : { roman_deg ( italic_u ) , roman_deg ( italic_v ) } = { italic_a , italic_b } } |
= # edges whose end points have degrees a and b.absent # edges whose end points have degrees a and b.\displaystyle=\text{~{}\# edges whose end points have degrees $a$ and $b$.}= # edges whose end points have degrees italic_a and italic_b .

We note that the degree and curvature sequences of a graph can be computed from its JDM: the number of nodes with degree a𝑎aitalic_a is equal to the row sum (bJab+Jaa)/asubscript𝑏subscript𝐽𝑎𝑏subscript𝐽𝑎𝑎𝑎(\sum_{b}J_{ab}+J_{aa})/a( ∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ) / italic_a and the number of edges with Forman–Ricci curvature κ𝜅\kappaitalic_κ is equal to the anti-diagonal sum ab,a+b=κJabsubscriptformulae-sequence𝑎𝑏𝑎𝑏𝜅subscript𝐽𝑎𝑏\sum_{a\leq b,a+b=\kappa}J_{ab}∑ start_POSTSUBSCRIPT italic_a ≤ italic_b , italic_a + italic_b = italic_κ end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT. Figure 6 shows the JDM and associated sequences for a small graph that will be our running example.

Refer to caption
Figure 6: A graph G𝐺Gitalic_G and its joint degree matrix (JDM). The figures on the right show the degree and curvature sequence of G𝐺Gitalic_G and highlight how these can be obtained from the JDM.

The following result from [54] gives a complete characterization of JDMs.

Theorem 4.1.

A symmetric matrix JΔ×Δ𝐽superscriptnormal-Δnormal-ΔJ\in\mathbb{N}^{\Delta\times\Delta}italic_J ∈ blackboard_N start_POSTSUPERSCRIPT roman_Δ × roman_Δ end_POSTSUPERSCRIPT is the JDM of a simple graph if and only if it satisfies, for all distinct a,b{1,,Δ}𝑎𝑏1normal-…normal-Δa,b\in\{1,\dots,\Delta\}italic_a , italic_b ∈ { 1 , … , roman_Δ }, the conditions

  • (i)

    |Va|=(c=1ΔJac+Jaa)/asubscript𝑉𝑎superscriptsubscript𝑐1Δsubscript𝐽𝑎𝑐subscript𝐽𝑎𝑎𝑎|V_{a}|=(\sum_{c=1}^{\Delta}J_{ac}+J_{aa})/a| italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | = ( ∑ start_POSTSUBSCRIPT italic_c = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_c end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ) / italic_a   is an integer,

  • (ii)

    Jab|Va||Vb|subscript𝐽𝑎𝑏subscript𝑉𝑎subscript𝑉𝑏J_{ab}\leq|V_{a}|\cdot|V_{b}|italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ≤ | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT |, and

  • (iii)

    Jaa(|Va|2)subscript𝐽𝑎𝑎binomialsubscript𝑉𝑎2J_{aa}\leq\binom{|V_{a}|}{2}italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ≤ ( FRACOP start_ARG | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ).

The authors of [54] gave an algorithm to construct a simple graph from a given JDM. More importantly, they also showed that the set of all graphs with the same JDM is connected through the graph operation shown in Figure 7: let ={{u,v},{x,y}}E(G)𝑢𝑣𝑥𝑦𝐸𝐺\mathcal{E}=\big{\{}\{u,v\},\{x,y\}\big{\}}\subseteq E(G)caligraphic_E = { { italic_u , italic_v } , { italic_x , italic_y } } ⊆ italic_E ( italic_G ) be two edges such that deg(u)=deg(x)degree𝑢degree𝑥\deg(u)=\deg(x)roman_deg ( italic_u ) = roman_deg ( italic_x ) and let ={{u,y},{x,v}}superscript𝑢𝑦𝑥𝑣\mathcal{E}^{\prime}=\big{\{}\{u,y\},\{x,v\}\big{\}}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { { italic_u , italic_y } , { italic_x , italic_v } }. The transposition of the edges \mathcal{E}caligraphic_E in G𝐺Gitalic_G is a new graph Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT obtained by rewiring these two edges. Its nodes and edges are

V(G)=V(G) and E(G)=(E(G)\).formulae-sequence𝑉superscript𝐺𝑉𝐺 and 𝐸superscript𝐺\𝐸𝐺superscriptV(G^{\prime})=V(G)\quad\text{~{}and~{}}\quad E(G^{\prime})=\big{(}E(G)% \backslash\mathcal{E}\big{)}\cup\mathcal{E}^{\prime}.italic_V ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_V ( italic_G ) and italic_E ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ( italic_E ( italic_G ) \ caligraphic_E ) ∪ caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
Refer to caption
Figure 7: A transposition rewires two edges in a graph without changing the joint degrees. In this example, edges {u,v},{x,y}𝑢𝑣𝑥𝑦\{u,v\},\{x,y\}{ italic_u , italic_v } , { italic_x , italic_y } are rewired to {u,y},{x,v}𝑢𝑦𝑥𝑣\{u,y\},\{x,v\}{ italic_u , italic_y } , { italic_x , italic_v }.

Transpositions do not change the JDM of a graph and the theory of matchings in convex bipartite graphs guarantees that the set of multigraphs with the same JDM is connected through transpositions [16, 54]. In general, transpositions in simple graphs may create multi-edges or self-loops depending on the choice of \mathcal{E}caligraphic_E, but the following result shows that there always exists a choice for which this is not the case.

Theorem 4.2 ([54]).

The set of all simple graphs with a given JDM is connected through transpositions.

In other words, all simple graphs with the same JDM can be found by performing transpositions, in such a way that every intermediate step is also a simple graph. Since transpositions do not change the JDM, these graphs will also have the same curvature and degree sequences. This is the first step towards our main result.
 
We note the following count for the number of multigraphs with a given JDM:

Proposition 4.3.

The number of node-labeled multigraphs with a given JDM is equal to

1aΔ(a|Va|)!(a!)|Va|2Jaa1abΔ1Jab!.subscriptproduct1𝑎Δ𝑎subscript𝑉𝑎superscript𝑎subscript𝑉𝑎superscript2subscript𝐽𝑎𝑎subscriptproduct1𝑎𝑏Δ1subscript𝐽𝑎𝑏\prod_{1\leq a\leq\Delta}\frac{(a|V_{a}|)!}{(a!)^{|V_{a}|}\cdot 2^{J_{aa}}}% \cdot\prod_{1\leq a\leq b\leq\Delta}\frac{1}{J_{ab}!}.∏ start_POSTSUBSCRIPT 1 ≤ italic_a ≤ roman_Δ end_POSTSUBSCRIPT divide start_ARG ( italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ! end_ARG start_ARG ( italic_a ! ) start_POSTSUPERSCRIPT | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT ⋅ 2 start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ⋅ ∏ start_POSTSUBSCRIPT 1 ≤ italic_a ≤ italic_b ≤ roman_Δ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ! end_ARG . (3)

Proof. We have |Va|subscript𝑉𝑎|V_{a}|| italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | nodes of degree a𝑎aitalic_a. Each such node has a𝑎aitalic_a “stubs” that need to be matched to a𝑎aitalic_a of the a|Va|𝑎subscript𝑉𝑎a|V_{a}|italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | “half-edges”. For the first node, there are (a|Va|a)binomial𝑎subscript𝑉𝑎𝑎\binom{a|V_{a}|}{a}( FRACOP start_ARG italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG italic_a end_ARG ) ways to match its stubs, for the second node there are (a(|Va|1)a)binomial𝑎subscript𝑉𝑎1𝑎\binom{a(|V_{a}|-1)}{a}( FRACOP start_ARG italic_a ( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | - 1 ) end_ARG start_ARG italic_a end_ARG ) ways, and so on. This gives a total of

(a|Va|a)(a(|Va|1)a)(aa)=(a|Va|)!a!(a(|Va|1))!(a(|Va|1)!a!(a(|Va|2))!a!a!=(a|Va|)!(a!)|Va|\displaystyle\binom{a|V_{a}|}{a}\binom{a(|V_{a}|-1)}{a}\dots\binom{a}{a}=\frac% {(a|V_{a}|)!}{a!(a(|V_{a}|-1))!}\frac{(a(|V_{a}|-1)!}{a!(a(|V_{a}|-2))!}\dots% \frac{a!}{a!}=\frac{(a|V_{a}|)!}{(a!)^{|V_{a}|}}( FRACOP start_ARG italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG italic_a end_ARG ) ( FRACOP start_ARG italic_a ( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | - 1 ) end_ARG start_ARG italic_a end_ARG ) … ( FRACOP start_ARG italic_a end_ARG start_ARG italic_a end_ARG ) = divide start_ARG ( italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ! end_ARG start_ARG italic_a ! ( italic_a ( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | - 1 ) ) ! end_ARG divide start_ARG ( italic_a ( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | - 1 ) ! end_ARG start_ARG italic_a ! ( italic_a ( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | - 2 ) ) ! end_ARG … divide start_ARG italic_a ! end_ARG start_ARG italic_a ! end_ARG = divide start_ARG ( italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ! end_ARG start_ARG ( italic_a ! ) start_POSTSUPERSCRIPT | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT end_ARG

ways to connect nodes of degree a𝑎aitalic_a to half-edges. This counting procedure distinguishes the half-edges, so to obtain edge-unlabeled multigraphs we have to forget the half-edge labels. This makes any permutation of the Jabsubscript𝐽𝑎𝑏J_{ab}italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT edges which are connected to nodes with degree a𝑎aitalic_a and b𝑏bitalic_b indistinguishable, so we divide the count by Jab!subscript𝐽𝑎𝑏J_{ab}!italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT !. Second, forgetting the half-edge labels makes any flip of the Jaasubscript𝐽𝑎𝑎J_{aa}italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT edges connected to two nodes of equal degree a𝑎aitalic_a indistinguishable, so we divide the count by 2Jaasuperscript2subscript𝐽𝑎𝑎2^{J_{aa}}2 start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. This results in (3) and completes the proof. ∎

For the JDM in Figure 6 with degree sequence (2,2,2,1)2221(2,2,2,1)( 2 , 2 , 2 , 1 ), the number (3) equals 2!4!6!4!25(3!)24!=302464superscript25superscript32430\frac{2!4!6!4!}{2^{5}(3!)^{2}4!}=30divide start_ARG 2 ! 4 ! 6 ! 4 ! end_ARG start_ARG 2 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ( 3 ! ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 4 ! end_ARG = 30, so there are 30303030 node-labeled multigraphs with this JDM. In practice, we may be interested in simple and non-isomorphic graphs for which this number serves as an upper bound.

4.2 Markov bases

In the previous section, we found that all graphs with a given JDM are connected by transpositions. As a second step, we now want to find all JDMs which are compatible with a given curvature and degree sequence. More precisely, these are all nonnegative integer matrices J𝐽Jitalic_J that satisfy conditions (i)—(iii) in Theorem 4.1 (valid JDMs) and have prescribed row sums (degrees) and anti-diagonal sums (curvatures). We will show how this set of matrices can be obtained using Markov bases.

The theory of Markov bases was developed by Diaconis and Sturmfels in [17] to solve the problem of sampling contingency tables, i. e., nonnegative integer matrices with fixed row and column sums, in the context of statistics. The main result in this theory is that this sampling problem can be solved using concepts and tools from commutative algebra. We first give the main definition and theorem and then apply it to our problem.

Definition 4.4.

Let Ad×n𝐴superscript𝑑𝑛A\in\mathbb{Z}^{d\times n}italic_A ∈ blackboard_Z start_POSTSUPERSCRIPT italic_d × italic_n end_POSTSUPERSCRIPT be an integer matrix. A Markov basis for A𝐴Aitalic_A is a finite set ={β1,,βM}ker(A)subscript𝛽1subscript𝛽𝑀subscriptkernel𝐴\mathcal{M}=\{\beta_{1},\dots,\beta_{M}\}\subseteq\ker_{\mathbb{Z}}(A)caligraphic_M = { italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_β start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } ⊆ roman_ker start_POSTSUBSCRIPT blackboard_Z end_POSTSUBSCRIPT ( italic_A ) of integer vectors in the kernel of A𝐴Aitalic_A, such that for every u,vn𝑢𝑣superscript𝑛u,v\in\mathbb{N}^{n}italic_u , italic_v ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with Au=Av𝐴𝑢𝐴𝑣Au=Avitalic_A italic_u = italic_A italic_v, there exists a sequence (i1,s1),(i2,s2),,(iL,sL){1,,M}×{±1}subscript𝑖1subscript𝑠1subscript𝑖2subscript𝑠2subscript𝑖𝐿subscript𝑠𝐿1𝑀plus-or-minus1(i_{1},s_{1}),(i_{2},s_{2}),\dots,(i_{L},s_{L})\in\{1,\dots,M\}\times\{\pm 1\}( italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , ( italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , … , ( italic_i start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) ∈ { 1 , … , italic_M } × { ± 1 } that satisfies

  • (i)

    u+j=1Lsjβij=v𝑢superscriptsubscript𝑗1𝐿subscript𝑠𝑗subscript𝛽subscript𝑖𝑗𝑣u+\sum_{j=1}^{L}s_{j}\beta_{i_{j}}=vitalic_u + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_v, and

  • (ii)

    u+j=1sjβij0 for all 1<<L𝑢superscriptsubscript𝑗1subscript𝑠𝑗subscript𝛽subscript𝑖𝑗0 for all 1𝐿u+\sum_{j=1}^{\ell}s_{j}\beta_{i_{j}}\geq 0\text{~{}for all~{}}1<\ell<Litalic_u + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ 0 for all 1 < roman_ℓ < italic_L.

For un𝑢superscript𝑛u\in\mathbb{N}^{n}italic_u ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT we call A(u)={vn:Au=Av}subscript𝐴𝑢conditional-set𝑣superscript𝑛𝐴𝑢𝐴𝑣\mathcal{F}_{A}(u)=\{v\in\mathbb{N}^{n}:Au=Av\}caligraphic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_u ) = { italic_v ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_A italic_u = italic_A italic_v } the fiber of u𝑢uitalic_u. The adjective ‘Markov’ reflects that a Markov basis can be used to design a Markov chain on a fiber: points in the fiber are states of the Markov chain and the transitions out of any state x𝑥xitalic_x are those vectors β𝛽\betaitalic_β in the Markov basis for which x+β0𝑥𝛽0x+\beta\geq 0italic_x + italic_β ≥ 0. The Markov basis properties guarantee that this chain is ergodic.

The question of how to find a Markov basis for a given integer matrix is addressed by the fundamental theorem of Markov bases (see also Appendix A).

Theorem 4.5 (Fundamental theorem of Markov bases [17]).

Let Ad×n𝐴superscript𝑑𝑛A\in\mathbb{Z}^{d\times n}italic_A ∈ blackboard_Z start_POSTSUPERSCRIPT italic_d × italic_n end_POSTSUPERSCRIPT, then ker(A)subscriptkernel𝐴\mathcal{M}\subseteq\ker_{\mathbb{Z}}(A)caligraphic_M ⊆ roman_ker start_POSTSUBSCRIPT blackboard_Z end_POSTSUBSCRIPT ( italic_A ) is a Markov basis for A𝐴Aitalic_A if and only if the binomials {𝐱β+𝐱β:β}conditional-setsuperscript𝐱superscript𝛽superscript𝐱superscript𝛽𝛽\{\mathbf{x}^{\beta^{+}}-\mathbf{x}^{\beta^{-}}:\beta\in\mathcal{M}\}{ bold_x start_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT - bold_x start_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT : italic_β ∈ caligraphic_M } generate the ideal 𝐱β+𝐱β:βker(A)delimited-⟨⟩normal-:superscript𝐱superscript𝛽superscript𝐱superscript𝛽𝛽subscriptkernel𝐴\langle\mathbf{x}^{\beta^{+}}-\mathbf{x}^{\beta^{-}}:\beta\in\ker_{\mathbb{Z}}% (A)\rangle⟨ bold_x start_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT - bold_x start_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT : italic_β ∈ roman_ker start_POSTSUBSCRIPT blackboard_Z end_POSTSUBSCRIPT ( italic_A ) ⟩.

Appendix A provides further background on the relevant algebraic concepts, but the details of the theorem are not relevant for the purpose of this article. The most important fact is that a finite Markov basis always exists and that it can be computed using computer algebra systems for sufficiently small matrices A𝐴Aitalic_A. For instance, using the software package 4ti2 [1] we compute that

A=(11110123) has Markov basis ={(0121),(1210),(1111)}.𝐴matrix11110123 has Markov basis matrix0121matrix1210matrix1111A=\begin{pmatrix}1&1&1&1\\ 0&1&2&3\end{pmatrix}\text{~{}has Markov basis~{}}\mathcal{M}=\left\{\begin{% pmatrix}[r]0\\ 1\\ -2\\ 1\end{pmatrix},\begin{pmatrix}[r]1\\ -2\\ 1\\ 0\end{pmatrix},\begin{pmatrix}[r]1\\ -1\\ -1\\ 1\end{pmatrix}\right\}.italic_A = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL end_ROW end_ARG ) has Markov basis caligraphic_M = { ( start_ARG start_ROW start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL - 2 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW end_ARG ) , ( start_ARG start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL - 2 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL end_ROW end_ARG ) , ( start_ARG start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW end_ARG ) } .

For larger matrices, these computations can become infeasible in practice and other approaches such as relaxed notions of Markov bases or restrictions to particular fibers may be necessary [25].
 
We now show that the set of all JDMs with a given degree and curvature sequence is the fiber for some matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT. This means that the associated Markov basis ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT can be used to explore this fiber. We stress that this matrix and its Markov basis only depend on the maximum degree ΔΔ\Deltaroman_Δ, and not on the specific degree and curvature sequences.

We start by encoding a JDM and its constraints as a vector γn𝛾superscript𝑛\gamma\in\mathbb{N}^{n}italic_γ ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where n=2(Δ+12)𝑛2binomialΔ12n=2\binom{\Delta+1}{2}italic_n = 2 ( FRACOP start_ARG roman_Δ + 1 end_ARG start_ARG 2 end_ARG ). This vector contains the upper-triangular JDM entries (Jab)absubscriptsubscript𝐽𝑎𝑏𝑎𝑏(J_{ab})_{a\leq b}( italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ≤ italic_b end_POSTSUBSCRIPT and the slack variables (Sab)absubscriptsubscript𝑆𝑎𝑏𝑎𝑏(S_{ab})_{a\leq b}( italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ≤ italic_b end_POSTSUBSCRIPT which we define as

Saa=(|Va|2)Jaa and Sab=|Va||Vb|Jab for 1a<bΔ,formulae-sequencesubscript𝑆𝑎𝑎binomialsubscript𝑉𝑎2subscript𝐽𝑎𝑎 and subscript𝑆𝑎𝑏subscript𝑉𝑎subscript𝑉𝑏subscript𝐽𝑎𝑏 for 1a<bΔ,S_{aa}=\binom{|V_{a}|}{2}-J_{aa}\quad\text{~{}and~{}}\quad S_{ab}=|V_{a}|\cdot% |V_{b}|-J_{ab}\quad\text{~{}for $1\leq a<b\leq\Delta$,}italic_S start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT = ( FRACOP start_ARG | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ) - italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT and italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT | - italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT for 1 ≤ italic_a < italic_b ≤ roman_Δ ,

with |Va|subscript𝑉𝑎|V_{a}|| italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | defined as before from J𝐽Jitalic_J. The slack variables are used to encode the inequalities on the JDM entries given in Theorem 4.1: a matrix J𝐽Jitalic_J is a valid JDM if and only if the associated slack variables are nonnegative integers, and thus γn𝛾superscript𝑛\gamma\in\mathbb{N}^{n}italic_γ ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. The matrix J𝐽Jitalic_J can be reconstructed from γ𝛾\gammaitalic_γ by forgetting the slack variables, so we can write J(γ)𝐽𝛾J(\gamma)italic_J ( italic_γ ).

Next, to construct the matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT, we consider the equations in the J𝐽Jitalic_J and S𝑆Sitalic_S variables that are invariant when fixing the degree and curvature sequences:

  • bJab+Jaasubscript𝑏subscript𝐽𝑎𝑏subscript𝐽𝑎𝑎\sum_{b}J_{ab}+J_{aa}∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT for all a{1,,Δ}𝑎1Δa\in\{1,\dots,\Delta\}italic_a ∈ { 1 , … , roman_Δ } (fixed degree sequence333To be precise, this equation fixes a|Va|𝑎subscript𝑉𝑎a|V_{a}|italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | from which |Va|subscript𝑉𝑎|V_{a}|| italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | can be retrieved.),

  • ab,a+b=κJabsubscriptformulae-sequence𝑎𝑏𝑎𝑏𝜅subscript𝐽𝑎𝑏\sum_{a\leq b,a+b=\kappa}J_{ab}∑ start_POSTSUBSCRIPT italic_a ≤ italic_b , italic_a + italic_b = italic_κ end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT for all κ={2,,2Δ}𝜅22Δ\kappa=\{2,\dots,2\Delta\}italic_κ = { 2 , … , 2 roman_Δ } (fixed curvature sequence),

  • Sab+Jabsubscript𝑆𝑎𝑏subscript𝐽𝑎𝑏S_{ab}+J_{ab}italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT for all 1abΔ1𝑎𝑏Δ1\leq a\leq b\leq\Delta1 ≤ italic_a ≤ italic_b ≤ roman_Δ (definition of slack variables).

These d=(Δ+12)+3Δ1𝑑binomialΔ123Δ1d=\binom{\Delta+1}{2}+3\Delta-1italic_d = ( FRACOP start_ARG roman_Δ + 1 end_ARG start_ARG 2 end_ARG ) + 3 roman_Δ - 1 equations are linear functions of γ𝛾\gammaitalic_γ with integer coefficients and they determine the rows of the d×n𝑑𝑛d\times nitalic_d × italic_n matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT.

To illustrate, we give AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT explicitly for Δ=4,5Δ45\Delta=4,5roman_Δ = 4 , 5 in Examples 4.8 and 4.9 below. The relevance of this matrix is shown by the following proposition.

Proposition 4.6.

Let J𝐽Jitalic_J be a JDM with corresponding vector γ𝛾\gammaitalic_γ. The fiber AΔ(γ)subscriptsubscript𝐴normal-Δ𝛾\mathcal{F}_{A_{\Delta}}(\gamma)caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_γ ) contains all JDMs with the same degree and curvature sequences as J𝐽Jitalic_J.

Proof. Let J𝐽Jitalic_J be a JDM with corresponding vector γ𝛾\gammaitalic_γ. We consider the fiber

AΔ(γ)={γn:AΔγ=AΔγ}.subscriptsubscript𝐴Δ𝛾conditional-setsuperscript𝛾superscript𝑛subscript𝐴Δsuperscript𝛾subscript𝐴Δ𝛾\mathcal{F}_{A_{\Delta}}(\gamma)=\{\gamma^{\prime}\in\mathbb{N}^{n}\,:\,A_{% \Delta}\gamma^{\prime}=A_{\Delta}\gamma\}.caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_γ ) = { italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT italic_γ } .

Let γAΔsuperscript𝛾subscriptsubscript𝐴Δ\gamma^{\prime}\in\mathcal{F}_{A_{\Delta}}italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT and write Jijsubscriptsuperscript𝐽𝑖𝑗J^{\prime}_{ij}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and Sijsubscriptsuperscript𝑆𝑖𝑗S^{\prime}_{ij}italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for the corresponding entries in this vector. From the rows of AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT that encode the fixed degree sequence, we obtain

a|Va|:=bJab+Jaa=bJab+Jaa=a|Va| for all a{1,,Δ}.assign𝑎subscriptsuperscript𝑉𝑎subscript𝑏subscriptsuperscript𝐽𝑎𝑏subscriptsuperscript𝐽𝑎𝑎subscript𝑏subscript𝐽𝑎𝑏subscript𝐽𝑎𝑎𝑎subscript𝑉𝑎 for all a{1,,Δ}a|V^{\prime}_{a}|:=\sum_{b}J^{\prime}_{ab}+J^{\prime}_{aa}=\sum_{b}J_{ab}+J_{% aa}=a|V_{a}|\text{~{}for all $a\in\{1,\dots,\Delta\}$}.italic_a | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | := ∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT = italic_a | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | for all italic_a ∈ { 1 , … , roman_Δ } . (4)

From the rows of AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT corresponding to the off-diagonal slack equations (1a<bΔ)1\leq a<b\leq\Delta)1 ≤ italic_a < italic_b ≤ roman_Δ ) and the fact that γn0superscript𝛾superscript𝑛0\gamma^{\prime}\in\mathbb{N}^{n}\geq 0italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ≥ 0 and thus Sab0subscriptsuperscript𝑆𝑎𝑏0S^{\prime}_{ab}\geq 0italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ≥ 0, we know that

Sab+Jab=Sab+Jab=|Va||Vb|=|Va||Vb|Jab|Va||Vb|.subscriptsuperscript𝑆𝑎𝑏subscriptsuperscript𝐽𝑎𝑏subscript𝑆𝑎𝑏subscript𝐽𝑎𝑏subscript𝑉𝑎subscript𝑉𝑏subscriptsuperscript𝑉𝑎subscriptsuperscript𝑉𝑏subscriptsuperscript𝐽𝑎𝑏subscriptsuperscript𝑉𝑎subscriptsuperscript𝑉𝑏S^{\prime}_{ab}+J^{\prime}_{ab}=S_{ab}+J_{ab}=|V_{a}|\cdot|V_{b}|=|V^{\prime}_% {a}|\cdot|V^{\prime}_{b}|\Longrightarrow J^{\prime}_{ab}\leq|V^{\prime}_{a}|% \cdot|V^{\prime}_{b}|.italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT | = | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT | ⟹ italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ≤ | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT | .

Similarly, from the diagonal slack equations (1aΔ1𝑎Δ1\leq a\leq\Delta1 ≤ italic_a ≤ roman_Δ) and Saa0subscriptsuperscript𝑆𝑎𝑎0S^{\prime}_{aa}\geq 0italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ≥ 0, we obtain

Saa+Jaa=Saa+Jaa=(|Va|2)=(|Va|2)Jaa(|Va|2).subscriptsuperscript𝑆𝑎𝑎subscriptsuperscript𝐽𝑎𝑎subscript𝑆𝑎𝑎subscript𝐽𝑎𝑎binomialsubscript𝑉𝑎2binomialsubscriptsuperscript𝑉𝑎2subscriptsuperscript𝐽𝑎𝑎binomialsubscriptsuperscript𝑉𝑎2S^{\prime}_{aa}+J^{\prime}_{aa}=S_{aa}+J_{aa}=\binom{|V_{a}|}{2}=\binom{|V^{% \prime}_{a}|}{2}\Longrightarrow J^{\prime}_{aa}\leq\binom{|V^{\prime}_{a}|}{2}.italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT + italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT = ( FRACOP start_ARG | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ) = ( FRACOP start_ARG | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ) ⟹ italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ≤ ( FRACOP start_ARG | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ) .

By Theorem 4.1, this proves that the matrix J=J(γ)superscript𝐽𝐽superscript𝛾J^{\prime}=J(\gamma^{\prime})italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_J ( italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) is a JDM. The degree sequences (|Va|)1aΔsubscriptsubscriptsuperscript𝑉𝑎1𝑎Δ(|V^{\prime}_{a}|)_{1\leq a\leq\Delta}( | italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) start_POSTSUBSCRIPT 1 ≤ italic_a ≤ roman_Δ end_POSTSUBSCRIPT and (|Va|)1aΔsubscriptsubscript𝑉𝑎1𝑎Δ(|V_{a}|)_{1\leq a\leq\Delta}( | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) start_POSTSUBSCRIPT 1 ≤ italic_a ≤ roman_Δ end_POSTSUBSCRIPT are equal by (4), and the rows of AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT which encode the fixed curvature sequence guarantee that

a+b=κabJab=a+b=κabJabsubscript𝑎𝑏𝜅𝑎𝑏subscriptsuperscript𝐽𝑎𝑏subscript𝑎𝑏𝜅𝑎𝑏subscript𝐽𝑎𝑏\sum_{\begin{subarray}{c}{a+b}=\kappa\\ a\leq b\end{subarray}}J^{\prime}_{ab}=\sum_{\begin{subarray}{c}{a+b}=\kappa\\ a\leq b\end{subarray}}J_{ab}∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_a + italic_b = italic_κ end_CELL end_ROW start_ROW start_CELL italic_a ≤ italic_b end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_a + italic_b = italic_κ end_CELL end_ROW start_ROW start_CELL italic_a ≤ italic_b end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT

and thus that the curvature sequences of J𝐽Jitalic_J and Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are equal. This holds for every γAΔ(γ)superscript𝛾subscriptsubscript𝐴Δ𝛾\gamma^{\prime}\in\mathcal{F}_{A_{\Delta}}(\gamma)italic_γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_γ ) which proves that every vector in the fiber indeed corresponds to a JDM with degree and curvature sequences equal to J𝐽Jitalic_J. The same approach shows that every such JDM is also an element of the fiber, which completes the proof. ∎

Following Proposition 4.6, we can thus use a Markov basis ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT corresponding to the matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT to obtain all JDMs with a fixed curvature and degree sequence. At this point, it has become clear why we need to consider fixed degrees: to encode the inequalities in the characterization of JDM matrices using slack variables as Sab+Jabsubscript𝑆𝑎𝑏subscript𝐽𝑎𝑏S_{ab}+J_{ab}italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT, we need (|Va|2)binomialsubscript𝑉𝑎2\binom{|V_{a}|}{2}( FRACOP start_ARG | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | end_ARG start_ARG 2 end_ARG ) and |Va||Vb|subscript𝑉𝑎subscript𝑉𝑏|V_{a}|\cdot|V_{b}|| italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ⋅ | italic_V start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT | to be constant.

A first result on the Markov basis ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT follows from the specific structure of AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT.

Proposition 4.7.

The entries of a vector βΔ𝛽subscriptnormal-Δ\beta\in\mathcal{M}_{\Delta}italic_β ∈ caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT corresponding to the slack variables are the negative of the entries corresponding to the J𝐽Jitalic_J variables.

Proof. By construction, the equation Jab+Sabsubscript𝐽𝑎𝑏subscript𝑆𝑎𝑏J_{ab}+S_{ab}italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT is invariant so any change Jab+xsubscript𝐽𝑎𝑏𝑥J_{ab}+xitalic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT + italic_x when applying β𝛽\betaitalic_β must be compensated by the opposite change Sabxsubscript𝑆𝑎𝑏𝑥S_{ab}-xitalic_S start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT - italic_x. ∎

Following Proposition 4.7, we can just consider the J𝐽Jitalic_J-entries of vectors in the Markov basis and we will furthermore rearrange these as a symmetric Δ×ΔΔΔ\Delta\times\Deltaroman_Δ × roman_Δ matrix and write δJΔ𝛿𝐽subscriptΔ\delta J\in\mathcal{M}_{\Delta}italic_δ italic_J ∈ caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT. We now consider the cases Δ=4,5Δ45\Delta=4,5roman_Δ = 4 , 5 in more detail to illustrate the definitions and results.

Example 4.8 (Δ=4Δ4\Delta=4roman_Δ = 4).

The matrix A4subscript𝐴4A_{4}italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT is a 21×20212021\times 2021 × 20 matrix. It can be written as the block-matrix

A4=(B0I10I10),subscript𝐴4matrix𝐵0subscript𝐼10subscript𝐼10A_{4}=\begin{pmatrix}B&0\\ I_{10}&I_{10}\end{pmatrix},italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL italic_B end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_I start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_CELL start_CELL italic_I start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ,

where I10subscript𝐼10I_{10}italic_I start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT is the 10×10101010\times 1010 × 10 identity matrix, 00 is the zero matrix and B𝐵Bitalic_B is the 11×10111011\times 1011 × 10 matrix whose rows encode the fixed degree sequences (first 4444 rows) and fixed curvature sequences (last 7777 rows) in terms of the J𝐽Jitalic_J variables:

\setstacktabbedgap.4em\setstackgapL.9em\TABstackTextstyle\fixTABwidthT\savestack\collabels\tabbedCenterstack11&121314222324333444\savestack\rowlabels\tabbedCenterstack[c]|V1||V2||V3||V4|κ=2κ=3κ=4κ=5κ=6κ=7κ=8\ensurestackMathB=\stackon\parenMatrixstack2&1111211112111121111111111\collabels\rowlabels\setstacktabbedgap.4𝑒𝑚\setstackgap𝐿.9𝑒𝑚\TABstackTextstyle\fixTABwidth𝑇\savestack\collabels\tabbedCenterstack11&121314222324333444\savestack\rowlabels\tabbedCenterstackdelimited-[]𝑐subscript𝑉1subscript𝑉2subscript𝑉3subscript𝑉4𝜅2𝜅3𝜅4𝜅5𝜅6𝜅7𝜅8\ensurestackMath𝐵\stackon\parenMatrixstack2&1111211112111121111111111\collabels\rowlabels\setstacktabbedgap{.4em}\setstackgap{L}{.9em}\TABstackTextstyle{\scriptsize% \color[rgb]{1,0,0}}\fixTABwidth{T}\savestack\collabels{\tabbedCenterstack{11&1% 21314222324333444}}\savestack\rowlabels{\tabbedCenterstack[c]{|V_{1}|\\ |V_{2}|\\ |V_{3}|\\ |V_{4}|\\ \kappa=2\\ \kappa=3\\ \kappa=4\\ \kappa=5\\ \kappa=6\\ \kappa=7\\ \kappa=8}}\ensurestackMath{B=\stackon{\parenMatrixstack{\makebox[0.0pt]{$2$}&1% 11\\ 1211\\ 1121\\ 1112\\ 1\\ 1\\ 11\\ 11\\ 11\\ 1\\ 1}}{\collabels}\rowlabels}.4 italic_e italic_m italic_L .9 italic_e italic_m italic_T 11 & 121314222324333444 [ italic_c ] | italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | italic_κ = 2 italic_κ = 3 italic_κ = 4 italic_κ = 5 italic_κ = 6 italic_κ = 7 italic_κ = 8 italic_B = 2 & 111 1211 1121 1112 1 1 11 11 11 1 1

The top indices show the correspondence between the columns of A4subscript𝐴4A_{4}italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT and the entries of the J𝐽Jitalic_J matrix, while the indices on the right indicate the equation encoded by the row. For instance, the second row reflects the sum 2J22+J12+J23+J242subscript𝐽22subscript𝐽12subscript𝐽23subscript𝐽242J_{22}+J_{12}+J_{23}+J_{24}2 italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT; this equation fixes 2|V1|2subscript𝑉12\cdot|V_{1}|2 ⋅ | italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | and thus the number of nodes with degree a=2𝑎2a=2italic_a = 2. The 7777th row reflects the sum J13+J22subscript𝐽13subscript𝐽22J_{13}+J_{22}italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT; this equation fixes the number of edges with Forman–Ricci curvature 4κ=44=04𝜅4404-\kappa=4-4=04 - italic_κ = 4 - 4 = 0.

The matrix A4subscript𝐴4A_{4}italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT has rank 19191919 (see also Proposition 4.10) and a one-dimensional kernel. The corresponding Markov basis 4subscript4\mathcal{M}_{4}caligraphic_M start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT thus consists of a single vector

4=(0,0,1,1,1,1,1,1,0,0)T,subscript4superscript0011111100𝑇\mathcal{M}_{4}=(0,0,1,-1,-1,1,1,-1,0,0)^{T},caligraphic_M start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = ( 0 , 0 , 1 , - 1 , - 1 , 1 , 1 , - 1 , 0 , 0 ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ,

where we only show the entries corresponding to the J𝐽Jitalic_J variables, following Proposition 4.7. We can also represent this vector as the symmetric matrix

δJ=(0011011111101100).𝛿𝐽matrix0011011111101100\delta J=\begin{pmatrix}[r]0&0&1&-1\\ 0&-1&1&1\\ 1&1&-1&0\\ -1&1&0&0\end{pmatrix}.italic_δ italic_J = ( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) .
Example 4.9 (Δ=5Δ5\Delta=5roman_Δ = 5).

The matrix A5subscript𝐴5A_{5}italic_A start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT is a 29×30293029\times 3029 × 30 matrix. It can be written in block-matrix form as

A5=(B0I15I15),subscript𝐴5matrix𝐵0subscript𝐼15subscript𝐼15A_{5}=\begin{pmatrix}B&0\\ I_{15}&I_{15}\end{pmatrix},italic_A start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL italic_B end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_I start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT end_CELL start_CELL italic_I start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ,

where I15subscript𝐼15I_{15}italic_I start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT is the 15×15151515\times 1515 × 15 identity matrix, 00 is the zero matrix of appropriate dimensions and B𝐵Bitalic_B is the 14×15141514\times 1514 × 15 matrix whose rows encode the fixed degree sequences (first 5555 rows) and fixed curvature sequences (last 9999 rows) in terms of the J𝐽Jitalic_J variables:

\setstacktabbedgap.4em\setstackgapL.9em\TABstackTextstyle\fixTABwidthT\savestack\collabels\tabbedCenterstack11&1213141522232425333435444555\savestack\rowlabels\tabbedCenterstack[c]|V1||V2||V3||V4||V5|κ=2κ=3κ=4κ=5κ=6κ=7κ=8κ=9κ=10\ensurestackMathB=\stackon\parenMatrixstack2&11111211111211112111112111111111111111\collabels\rowlabels\setstacktabbedgap.4𝑒𝑚\setstackgap𝐿.9𝑒𝑚\TABstackTextstyle\fixTABwidth𝑇\savestack\collabels\tabbedCenterstack11&1213141522232425333435444555\savestack\rowlabels\tabbedCenterstackdelimited-[]𝑐subscript𝑉1subscript𝑉2subscript𝑉3subscript𝑉4subscript𝑉5𝜅2𝜅3𝜅4𝜅5𝜅6𝜅7𝜅8𝜅9𝜅10\ensurestackMath𝐵\stackon\parenMatrixstack2&11111211111211112111112111111111111111\collabels\rowlabels\setstacktabbedgap{.4em}\setstackgap{L}{.9em}\TABstackTextstyle{\scriptsize% \color[rgb]{1,0,0}}\fixTABwidth{T}\savestack\collabels{\tabbedCenterstack{11&1% 213141522232425333435444555}}\savestack\rowlabels{\tabbedCenterstack[c]{|V_{1}% |\\ |V_{2}|\\ |V_{3}|\\ |V_{4}|\\ |V_{5}|\\ \kappa=2\\ \kappa=3\\ \kappa=4\\ \kappa=5\\ \kappa=6\\ \kappa=7\\ \kappa=8\\ \kappa=9\\ \kappa=10}}\ensurestackMath{B=\stackon{\parenMatrixstack{\makebox[0.0pt]{$2$}&% 1111\\ 12111\\ 11211\\ 1121\\ 11112\\ 1\\ 1\\ 11\\ 11\\ 111\\ 11\\ 11\\ 1\\ 1}}{\collabels}\rowlabels}.4 italic_e italic_m italic_L .9 italic_e italic_m italic_T 11 & 1213141522232425333435444555 [ italic_c ] | italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | | italic_V start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | italic_κ = 2 italic_κ = 3 italic_κ = 4 italic_κ = 5 italic_κ = 6 italic_κ = 7 italic_κ = 8 italic_κ = 9 italic_κ = 10 italic_B = 2 & 1111 12111 11211 1121 11112 1 1 11 11 111 11 11 1 1

For instance, the first row encodes the equation 2J11+J12+J13+J14+J152subscript𝐽11subscript𝐽12subscript𝐽13subscript𝐽14subscript𝐽152J_{11}+J_{12}+J_{13}+J_{14}+J_{15}2 italic_J start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 14 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT, which keeps the number |V1|subscript𝑉1|V_{1}|| italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | of nodes with degree a=1𝑎1a=1italic_a = 1 fixed. The 8888th row encodes the equation J13+J22subscript𝐽13subscript𝐽22J_{13}+J_{22}italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT, which keeps the number of edges with Forman–Ricci curvature 4κ=44=04𝜅4404-\kappa=4-4=04 - italic_κ = 4 - 4 = 0 fixed.

The matrix A5subscript𝐴5A_{5}italic_A start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT has rank 27 and using 4ti2 we compute the following Markov basis 5subscript5\mathcal{M}_{5}caligraphic_M start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT consisting of 9999 elements:

δJ1𝛿subscript𝐽1\displaystyle\delta J_{1}italic_δ italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== (0000000011001110111001100),matrix0000000011001110111001100\displaystyle\begin{pmatrix}[r]0&0&0&0&0\\ \phantom{-}0&0&0&1&-1\\ 0&0&-1&1&1\\ 0&1&1&-1&0\\ 0&-1&1&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ2𝛿subscript𝐽2\displaystyle\delta J_{2}italic_δ italic_J start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =\displaystyle== (0001100101011101010011000),matrix0001100101011101010011000\displaystyle\begin{pmatrix}[r]0&0&0&1&-1\\ 0&0&-1&0&1\\ 0&-1&1&-1&0\\ 1&0&-1&0&0\\ -1&1&0&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ3𝛿subscript𝐽3\displaystyle\delta J_{3}italic_δ italic_J start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT =\displaystyle== (0001100110010011101010100),matrix0001100110010011101010100\displaystyle\begin{pmatrix}[r]0&0&0&1&-1\\ 0&0&-1&1&0\\ 0&-1&0&0&1\\ 1&1&0&-1&0\\ -1&0&1&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) ,
δJ4𝛿subscript𝐽4\displaystyle\delta J_{4}italic_δ italic_J start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT =\displaystyle== (0012101200121012001010100),matrix0012101200121012001010100\displaystyle\begin{pmatrix}[r]0&0&1&-2&1\\ 0&-1&2&0&0\\ 1&2&-1&0&-1\\ -2&0&0&1&0\\ 1&0&-1&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 2 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 2 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL - 2 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ5𝛿subscript𝐽5\displaystyle\delta J_{5}italic_δ italic_J start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT =\displaystyle== (0011001101110111011001100),matrix0011001101110111011001100\displaystyle\begin{pmatrix}[r]0&0&1&-1&0\\ 0&-1&1&0&1\\ 1&1&0&-1&-1\\ -1&0&-1&1&0\\ 0&1&-1&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ6𝛿subscript𝐽6\displaystyle\delta J_{6}italic_δ italic_J start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT =\displaystyle== (0011001110111001100000000),matrix0011001110111001100000000\displaystyle\begin{pmatrix}[r]0&0&1&-1&\phantom{-}0\\ 0&-1&1&1&0\\ 1&1&-1&0&0\\ -1&1&0&0&0\\ 0&0&0&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) ,
δJ7𝛿subscript𝐽7\displaystyle\delta J_{7}italic_δ italic_J start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT =\displaystyle== (0010101002101210021012100),matrix0010101002101210021012100\displaystyle\begin{pmatrix}[r]0&0&1&0&-1\\ 0&-1&0&0&2\\ 1&0&1&-2&-1\\ 0&0&-2&1&0\\ -1&2&-1&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 2 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 2 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL - 2 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 2 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ8𝛿subscript𝐽8\displaystyle\delta J_{8}italic_δ italic_J start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT =\displaystyle== (0010101011100100110011000),matrix0010101011100100110011000\displaystyle\begin{pmatrix}[r]0&0&1&0&-1\\ 0&-1&0&1&1\\ 1&0&0&-1&0\\ 0&1&-1&0&0\\ -1&1&0&0&0\end{pmatrix},\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , δJ9𝛿subscript𝐽9\displaystyle\delta J_{9}italic_δ italic_J start_POSTSUBSCRIPT 9 end_POSTSUBSCRIPT =\displaystyle== (0010101020101010201010100).matrix0010101020101010201010100\displaystyle\begin{pmatrix}[r]0&0&1&0&-1\\ 0&-1&0&2&0\\ 1&0&-1&0&1\\ 0&2&0&-1&0\\ -1&0&1&0&0\end{pmatrix}.\,\qquad( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) .

If we define444This notation refers to the fact that deg(δJ)degree𝛿𝐽\deg(\delta J)roman_deg ( italic_δ italic_J ) is the degree of the homogeneous binomial 𝐱δJ+𝐱δJsuperscript𝐱𝛿superscript𝐽superscript𝐱𝛿superscript𝐽\mathbf{x}^{\delta J^{+}}-\mathbf{x}^{\delta J^{-}}bold_x start_POSTSUPERSCRIPT italic_δ italic_J start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT - bold_x start_POSTSUPERSCRIPT italic_δ italic_J start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT (see Appendix A). This should not be confused with the degree of a node. deg(δJ):=12ab|Jab|assigndegree𝛿𝐽12subscript𝑎𝑏subscript𝐽𝑎𝑏\deg(\delta J):=\frac{1}{2}\sum_{a\leq b}|J_{ab}|roman_deg ( italic_δ italic_J ) := divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_a ≤ italic_b end_POSTSUBSCRIPT | italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT |, we see that there are five bases with deg(δJ)=3degree𝛿𝐽3\deg(\delta J)=3roman_deg ( italic_δ italic_J ) = 3, two bases with deg(δJ)=4degree𝛿𝐽4\deg(\delta J)=4roman_deg ( italic_δ italic_J ) = 4 and two bases with deg(δJ)=5degree𝛿𝐽5\deg(\delta J)=5roman_deg ( italic_δ italic_J ) = 5. We will see later that this corresponds to the number of edges involved in the graph operation associated with δJ𝛿𝐽\delta Jitalic_δ italic_J.

We now give some further results related to the matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT.

Proposition 4.10.

rank(AΔ)=(Δ+12)+3Δ3=d2ranksubscript𝐴ΔbinomialΔ123Δ3𝑑2\operatorname{rank}(A_{\Delta})=\binom{\Delta+1}{2}+3\Delta-3=d-2roman_rank ( italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ) = ( FRACOP start_ARG roman_Δ + 1 end_ARG start_ARG 2 end_ARG ) + 3 roman_Δ - 3 = italic_d - 2 for all Δ3normal-Δ3\Delta\geq 3roman_Δ ≥ 3.

Proof. We first give two independent linear dependencies between the rows of AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT to show that rank(AΔ)d2ranksubscript𝐴Δ𝑑2\operatorname{rank}(A_{\Delta})\leq d-2roman_rank ( italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ) ≤ italic_d - 2. The first linear relation between the rows is

a=1Δ(row |Va|)2κ=22Δ(row κ)=a=1Δ(cJac+Jaa)2κ=22Δ(a+b=κabJab)=0.superscriptsubscript𝑎1Δrow |Va|2superscriptsubscript𝜅22Δrow κsuperscriptsubscript𝑎1Δsubscript𝑐subscript𝐽𝑎𝑐subscript𝐽𝑎𝑎2superscriptsubscript𝜅22Δsubscript𝑎𝑏𝜅𝑎𝑏subscript𝐽𝑎𝑏0\sum_{a=1}^{\Delta}(\text{row $|V_{a}|$})-2\sum_{\kappa=2}^{2\Delta}(\text{row% $\kappa$})=\sum_{a=1}^{\Delta}\Biggl{(}\sum_{c}J_{ac}+J_{aa}\Biggr{)}-2\sum_{% \kappa=2}^{2\Delta}\Biggl{(}\sum_{\begin{subarray}{c}a+b=\kappa\\ a\leq b\end{subarray}}J_{ab}\Biggr{)}=0.∑ start_POSTSUBSCRIPT italic_a = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT ( row | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) - 2 ∑ start_POSTSUBSCRIPT italic_κ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 roman_Δ end_POSTSUPERSCRIPT ( row italic_κ ) = ∑ start_POSTSUBSCRIPT italic_a = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_c end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ) - 2 ∑ start_POSTSUBSCRIPT italic_κ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 roman_Δ end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_a + italic_b = italic_κ end_CELL end_ROW start_ROW start_CELL italic_a ≤ italic_b end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ) = 0 .

The second linear relation between the rows is

a=1Δa×(row |Va|)κ=22Δκ×(row κ)=a=1Δa(cJac+Jaa)κ=22Δκ(a+b=κabJab)=0.superscriptsubscript𝑎1Δ𝑎row |Va|superscriptsubscript𝜅22Δ𝜅row κsuperscriptsubscript𝑎1Δ𝑎subscript𝑐subscript𝐽𝑎𝑐subscript𝐽𝑎𝑎superscriptsubscript𝜅22Δ𝜅subscript𝑎𝑏𝜅𝑎𝑏subscript𝐽𝑎𝑏0\displaystyle\sum_{a=1}^{\Delta}a\times(\text{row $|V_{a}|$})-\sum_{\kappa=2}^% {2\Delta}\kappa\times(\text{row $\kappa$})=\sum_{a=1}^{\Delta}a\left(\sum_{c}J% _{ac}+J_{aa}\right)-\sum_{\kappa=2}^{2\Delta}\kappa\Biggl{(}\sum_{\begin{% subarray}{c}a+b=\kappa\\ a\leq b\end{subarray}}J_{ab}\Biggr{)}=0.∑ start_POSTSUBSCRIPT italic_a = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT italic_a × ( row | italic_V start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) - ∑ start_POSTSUBSCRIPT italic_κ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 roman_Δ end_POSTSUPERSCRIPT italic_κ × ( row italic_κ ) = ∑ start_POSTSUBSCRIPT italic_a = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT italic_a ( ∑ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_c end_POSTSUBSCRIPT + italic_J start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_κ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 roman_Δ end_POSTSUPERSCRIPT italic_κ ( ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_a + italic_b = italic_κ end_CELL end_ROW start_ROW start_CELL italic_a ≤ italic_b end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ) = 0 .

To prove that the latter relation holds, we simplify the first sum to vVdeg(v)2\sum_{v\in V}\deg(v)^{2}∑ start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT roman_deg ( italic_v ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and the second sum to eEκ(e)subscript𝑒𝐸𝜅𝑒\sum_{e\in E}\kappa(e)∑ start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT italic_κ ( italic_e ), where κ({u,v})=deg(u)+deg(v)𝜅𝑢𝑣degree𝑢degree𝑣\kappa(\{u,v\})=\deg(u)+\deg(v)italic_κ ( { italic_u , italic_v } ) = roman_deg ( italic_u ) + roman_deg ( italic_v ). We further simplify

eEκ(e)={u,v}E(deg(u)+deg(v))=vVuvdeg(v)=vVdeg(v)2,\displaystyle\sum_{e\in E}\kappa(e)=\sum_{\{u,v\}\in E}(\deg(u)+\deg(v))=\sum_% {v\in V}\sum_{u\sim v}\deg(v)=\sum_{v\in V}\deg(v)^{2},∑ start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT italic_κ ( italic_e ) = ∑ start_POSTSUBSCRIPT { italic_u , italic_v } ∈ italic_E end_POSTSUBSCRIPT ( roman_deg ( italic_u ) + roman_deg ( italic_v ) ) = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_u ∼ italic_v end_POSTSUBSCRIPT roman_deg ( italic_v ) = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT roman_deg ( italic_v ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

which confirms that the second linear relation equals zero. Next, we show that the matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT has a nonsingular (d2)𝑑2(d-2)( italic_d - 2 )-minor A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG. This implies that rank(AΔ)d2ranksubscript𝐴Δ𝑑2\operatorname{rank}(A_{\Delta})\geq d-2roman_rank ( italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ) ≥ italic_d - 2 which then completes the proof.

First, note that AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT can be written as AΔ=(B0II)subscript𝐴Δ𝐵0𝐼𝐼A_{\Delta}=\left(\begin{smallmatrix}B&0\\ I&I\end{smallmatrix}\right)italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT = ( start_ROW start_CELL italic_B end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_I end_CELL start_CELL italic_I end_CELL end_ROW ), where 00 is the zero matrix, I𝐼Iitalic_I is the identity matrix of size (Δ+12)binomialΔ12\binom{\Delta+1}{2}( FRACOP start_ARG roman_Δ + 1 end_ARG start_ARG 2 end_ARG ) and B𝐵Bitalic_B a (Δ1)×(Δ+12)Δ1binomialΔ12(\Delta-1)\times\binom{\Delta+1}{2}( roman_Δ - 1 ) × ( FRACOP start_ARG roman_Δ + 1 end_ARG start_ARG 2 end_ARG ) matrix. The rows corresponding to the identity matrices encode the fixed slack variables and the rows corresponding to B𝐵Bitalic_B encode the fixed degree and curvature sequences; as a result, the columns of B𝐵Bitalic_B correspond to J𝐽Jitalic_J variables. See Examples 4.8 and 4.9 for a concrete example. We show that B𝐵Bitalic_B has a nonsingular (3Δ3)3Δ3(3\Delta-3)( 3 roman_Δ - 3 )-minor B~~𝐵\tilde{B}over~ start_ARG italic_B end_ARG which implies that A𝐴Aitalic_A has the required nonsingular (d2)𝑑2(d-2)( italic_d - 2 )-minor A~=(B~0II)~𝐴~𝐵0𝐼𝐼\tilde{A}=\left(\begin{smallmatrix}\tilde{B}&0\\ I&I\end{smallmatrix}\right)over~ start_ARG italic_A end_ARG = ( start_ROW start_CELL over~ start_ARG italic_B end_ARG end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_I end_CELL start_CELL italic_I end_CELL end_ROW ).

Consider the following submatrices of B𝐵Bitalic_B:

\setstacktabbedgap0.2em\setstackgapL.9em\TABstackTextstyle\fixTABwidthT\savestack\collabels\tabbedCenterstackΔΔ&(Δ1)Δ(Δ1)(Δ1)\savestack\rowlabels\tabbedCenterstack[l]κ=2Δκ=2Δ1κ=2Δ2\ensurestackMathB1=\stackon\parenMatrixstack1&00010001\collabels\rowlabels\setstacktabbedgap0.2𝑒𝑚\setstackgap𝐿.9𝑒𝑚\TABstackTextstyle\fixTABwidth𝑇\savestack\collabels\tabbedCenterstackΔΔ&Δ1ΔΔ1Δ1\savestack\rowlabels\tabbedCenterstackdelimited-[]𝑙𝜅2Δ𝜅2Δ1𝜅2Δ2\ensurestackMathsubscript𝐵1\stackon\parenMatrixstack1&00010001\collabels\rowlabels\setstacktabbedgap{-0.2em}\setstackgap{L}{.9em}\TABstackTextstyle{\scriptsize% \color[rgb]{1,0,0}}\fixTABwidth{T}\savestack\collabels{\tabbedCenterstack{% \Delta\Delta&(\Delta-1)\Delta(\Delta-1)(\Delta-1)}}\savestack\rowlabels{% \tabbedCenterstack[l]{\kappa=2\Delta\\ \kappa=2\Delta-1\\ \kappa=2\Delta-2}}\ensurestackMath{B_{1}=\stackon{\parenMatrixstack{\makebox[0% .0pt]{1}&00\\ 010\\ 001}}{\collabels}\rowlabels}- 0.2 italic_e italic_m italic_L .9 italic_e italic_m italic_T roman_Δ roman_Δ & ( roman_Δ - 1 ) roman_Δ ( roman_Δ - 1 ) ( roman_Δ - 1 ) [ italic_l ] italic_κ = 2 roman_Δ italic_κ = 2 roman_Δ - 1 italic_κ = 2 roman_Δ - 2 italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 & 00 010 001
\setstacktabbedgap0.4em\setstackgapL.9em\TABstackTextstyle\fixTABwidthT\savestack\collabels\tabbedCenterstack(Δi)(Δ+2i)&(Δi)(Δ+1i)(Δi)(Δi)\savestack\rowlabels\tabbedCenterstack[l]|VΔi|κ=2Δ2i+1κ=2Δ2i\ensurestackMathBi=\stackon\parenMatrixstack1&12010001\collabels\rowlabels\setstacktabbedgap0.4𝑒𝑚\setstackgap𝐿.9𝑒𝑚\TABstackTextstyle\fixTABwidth𝑇\savestack\collabels\tabbedCenterstackΔ𝑖Δ2𝑖&Δ𝑖Δ1𝑖Δ𝑖Δ𝑖\savestack\rowlabels\tabbedCenterstackdelimited-[]𝑙subscript𝑉Δ𝑖𝜅2Δ2𝑖1𝜅2Δ2𝑖\ensurestackMathsuperscript𝐵𝑖\stackon\parenMatrixstack1&12010001\collabels\rowlabels\setstacktabbedgap{0.4em}\setstackgap{L}{.9em}\TABstackTextstyle{\scriptsize% \color[rgb]{1,0,0}}\fixTABwidth{T}\savestack\collabels{\tabbedCenterstack{(% \Delta-i)(\Delta+2-i)&(\Delta-i)(\Delta+1-i)(\Delta-i)(\Delta-i)}}\savestack% \rowlabels{\tabbedCenterstack[l]{|V_{\Delta-i}|\\ \kappa=2\Delta-2i+1\\ \kappa=2\Delta-2i}}\ensurestackMath{B^{i}=\stackon{\parenMatrixstack{\makebox[% 0.0pt]{1}&12\\ 010\\ 001}}{\collabels}\rowlabels}0.4 italic_e italic_m italic_L .9 italic_e italic_m italic_T ( roman_Δ - italic_i ) ( roman_Δ + 2 - italic_i ) & ( roman_Δ - italic_i ) ( roman_Δ + 1 - italic_i ) ( roman_Δ - italic_i ) ( roman_Δ - italic_i ) [ italic_l ] | italic_V start_POSTSUBSCRIPT roman_Δ - italic_i end_POSTSUBSCRIPT | italic_κ = 2 roman_Δ - 2 italic_i + 1 italic_κ = 2 roman_Δ - 2 italic_i italic_B start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT = 1 & 12 010 001

for i{2,,Δ1}𝑖2Δ1i\in\{2,\dots,\Delta-1\}italic_i ∈ { 2 , … , roman_Δ - 1 }. After reordering rows and columns, we can write matrix B~~𝐵\tilde{B}over~ start_ARG italic_B end_ARG as

B~=(B1B20B(Δ1)),~𝐵subscript𝐵1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscript𝐵2missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0missing-subexpressionsuperscript𝐵Δ1missing-subexpression\tilde{B}=\left(\begin{array}[]{ccccc}B_{1}\\ &B^{2}&&\text{\huge$\star$}\\ &&\ddots\\ &\text{\huge 0}&&B^{(\Delta-1)}\end{array}\right),over~ start_ARG italic_B end_ARG = ( start_ARRAY start_ROW start_CELL italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_B start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL ⋆ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL ⋱ end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL italic_B start_POSTSUPERSCRIPT ( roman_Δ - 1 ) end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL end_ROW end_ARRAY ) ,

where \star indicates entries that are not relevant. This is by construction a (3Δ33Δ33\Delta-33 roman_Δ - 3)-minor of B𝐵Bitalic_B. It is an upper-triangular matrix with ones along the diagonal, which means that it is nonsingular. As noted, this implies that the (d2)𝑑2(d-2)( italic_d - 2 )-minor A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG is nonsingular, which completes the proof. ∎

Understanding the size of the fiber |AΔ(γ)|subscriptsubscript𝐴Δ𝛾|\mathcal{F}_{A_{\Delta}}(\gamma)|| caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_γ ) | is a hard problem in general. However, making use of the special property that ker(A4)kernelsubscript𝐴4\ker(A_{4})roman_ker ( italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) is one-dimensional, we obtain the following bound on the size of the fiber in this case.

Proposition 4.11.

Let J𝐽Jitalic_J be a JDM with Δ=4normal-Δ4\Delta=4roman_Δ = 4. Then there are at most

|A4(γ(J))|1+min{J22,J14,J33}+min{J13,J23,J24}subscriptsubscript𝐴4𝛾𝐽1subscript𝐽22subscript𝐽14subscript𝐽33subscript𝐽13subscript𝐽23subscript𝐽24|\mathcal{F}_{A_{4}}(\gamma(J))|\leq 1+\min\{J_{22},J_{14},J_{33}\}+\min\{J_{1% 3},J_{23},J_{24}\}| caligraphic_F start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_γ ( italic_J ) ) | ≤ 1 + roman_min { italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 14 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT } + roman_min { italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT }

JDMs, including J𝐽Jitalic_J, with the same degree and curvature sequences as J𝐽Jitalic_J.

Proof. We recall from Example 4.8 above that the Markov basis for A4subscript𝐴4A_{4}italic_A start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT contains a single element

δJ=(0011011111101100).𝛿𝐽matrix0011011111101100\delta J=\begin{pmatrix}[r]0&0&1&-1\\ 0&-1&1&1\\ 1&1&-1&0\\ -1&1&0&0\end{pmatrix}.italic_δ italic_J = ( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL - 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) .

This means that every JDM with the same degree and curvature sequences as J𝐽Jitalic_J is of the form Jk=J+k×δJsubscript𝐽𝑘𝐽𝑘𝛿𝐽J_{k}=J+k\times\delta Jitalic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_J + italic_k × italic_δ italic_J with k𝑘k\in\mathbb{Z}italic_k ∈ blackboard_Z. For Jksubscript𝐽𝑘J_{k}italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT to be a JDM, it is necessary that Jk0subscript𝐽𝑘0J_{k}\geq 0italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 and for this reason we know that kmin{J22,J14,J33}𝑘subscript𝐽22subscript𝐽14subscript𝐽33k\leq\min\{J_{22},J_{14},J_{33}\}italic_k ≤ roman_min { italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 14 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT } and kmin{J13,J23,J24}𝑘subscript𝐽13subscript𝐽23subscript𝐽24k\geq\min\{J_{13},J_{23},J_{24}\}italic_k ≥ roman_min { italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT }. These conditions are necessary but in general not sufficient, since Jk0subscript𝐽𝑘0J_{k}\geq 0italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 alone does not guarantee that the slack variables are also nonnegative. The possible JDMs from the proposition are thus a subset of Jksubscript𝐽𝑘J_{k}italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for k𝑘kitalic_k within the given bounds, which completes the proof. ∎

We note that the bound is the same for all elements of the fiber. To illustrate that the bound can be strict, consider the JDM in the running example in Figure 6. Here, J+δJ𝐽𝛿𝐽J+\delta Jitalic_J + italic_δ italic_J is a valid JDM but JδJ𝐽𝛿𝐽J-\delta Jitalic_J - italic_δ italic_J is not and we thus have just two JDMs in the fiber, whereas 1+min{J22,J14,J33}+min{J13,J23,J24}=31subscript𝐽22subscript𝐽14subscript𝐽33subscript𝐽13subscript𝐽23subscript𝐽2431+\min\{J_{22},J_{14},J_{33}\}+\min\{J_{13},J_{23},J_{24}\}=31 + roman_min { italic_J start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 14 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT } + roman_min { italic_J start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT , italic_J start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT } = 3.

4.3 Markov moves and main result

To combine Theorem 4.2 on transpositions with Proposition 4.6 on JDMs and Markov bases, we make the observation that applying an element of the Markov basis JJ+δJ𝐽𝐽𝛿𝐽J\rightarrow J+\delta Jitalic_J → italic_J + italic_δ italic_J to a JDM can be interpreted as a graph operation; this is illustrated in Figure 8.

Refer to caption
Figure 8: The figure left illustrates how a Markov basis element δJΔ𝛿𝐽subscriptΔ\delta J\in\mathcal{M}_{\Delta}italic_δ italic_J ∈ caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT can be interpreted as a graph operation that rewires certain edges; we call this a Markov move. The figure right shows a Markov move on the running example.

We define a Markov move as follows: Let G𝐺Gitalic_G be a graph and δJ𝛿𝐽\delta Jitalic_δ italic_J an element of the Markov basis such that J(G)+δJ𝐽𝐺𝛿𝐽J(G)+\delta Jitalic_J ( italic_G ) + italic_δ italic_J is a JDM. Consider a pair (,π)𝜋(\mathcal{E},\pi)( caligraphic_E , italic_π ) where =({i,j})=1deg(δJ)E(G)superscriptsubscriptsubscript𝑖subscript𝑗1degree𝛿𝐽𝐸𝐺\mathcal{E}=(\{i_{\ell},j_{\ell}\})_{\ell=1}^{\deg(\delta J)}\subseteq E(G)caligraphic_E = ( { italic_i start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_j start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } ) start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_deg ( italic_δ italic_J ) end_POSTSUPERSCRIPT ⊆ italic_E ( italic_G ) is a subset of edges that has |δJab|𝛿subscript𝐽𝑎𝑏|\delta J_{ab}|| italic_δ italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT | edges with degrees a,b𝑎𝑏a,bitalic_a , italic_b whenever δJab<0𝛿subscript𝐽𝑎𝑏0\delta J_{ab}<0italic_δ italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT < 0, and where π𝜋\piitalic_π is a permutation such that the set =({i,π(j)})=1deg(δJ)superscriptsuperscriptsubscriptsubscript𝑖𝜋subscript𝑗1degree𝛿𝐽\mathcal{E}^{\prime}=(\{i_{\ell},\pi(j_{\ell})\})_{\ell=1}^{\deg(\delta J)}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( { italic_i start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_π ( italic_j start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) } ) start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_deg ( italic_δ italic_J ) end_POSTSUPERSCRIPT has |δJab|𝛿subscript𝐽𝑎𝑏|\delta J_{ab}|| italic_δ italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT | edges with degrees a,b𝑎𝑏a,bitalic_a , italic_b whenever δJab>0𝛿subscript𝐽𝑎𝑏0\delta J_{ab}>0italic_δ italic_J start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT > 0. Existence of such a pair follows from the definition of the Markov basis δJ𝛿𝐽\delta Jitalic_δ italic_J. A Markov move of G𝐺Gitalic_G is a new graph Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with nodes and edges

V(G)=V(G) and E(G)=(E(G)\).formulae-sequence𝑉superscript𝐺𝑉𝐺 and 𝐸superscript𝐺\𝐸𝐺superscriptV(G^{\prime})=V(G)\quad\text{~{}and~{}}\quad E(G^{\prime})=(E(G)\backslash% \mathcal{E})\cup\mathcal{E}^{\prime}.italic_V ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_V ( italic_G ) and italic_E ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ( italic_E ( italic_G ) \ caligraphic_E ) ∪ caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .

See Figure 8 for an example. We now know that all multigraphs with a given degree and curvature sequence are connected by a finite set of graph operations: transpositions, which leave the JDM unchanged, and Markov moves corresponding to the elements of ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT, which change the JDM. This is summarized in the following theorem.

Theorem 4.12.

The set of multigraphs with a given Forman–Ricci curvature and degree sequence is connected through transpositions and Markov bases. For small Δnormal-Δ\Deltaroman_Δ, there exist Markov bases of size

ΔΔ\Deltaroman_Δ 1 2 3 4 5 6 7 8 9absent9\geq 9≥ 9
|AΔ|subscriptsubscript𝐴normal-Δ|\mathcal{M}_{A_{\Delta}}|| caligraphic_M start_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT | 0 0 0 1 9 111 2662 171964 ?

Proof. Let G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT be two multigraphs with the same curvature and degree sequences. If G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT have the same JDM then Theorem 4.12 in the case of multigraphs (see also [54, 16]) says that G𝐺Gitalic_G can be transformed into Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT by transpositions. If G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT do not have the same JDM, i. e., JJ𝐽superscript𝐽J\neq J^{\prime}italic_J ≠ italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, then by Proposition 4.6 and the fundamental theorem of Markov bases, matrix J𝐽Jitalic_J can be transformed into Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT by adding or subtracting a sequence of elements from the Markov basis ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT. This implies that G𝐺Gitalic_G can be transformed using Markov moves into a graph G′′superscript𝐺′′G^{\prime\prime}italic_G start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT such that J′′=Jsuperscript𝐽′′𝐽J^{\prime\prime}=Jitalic_J start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT = italic_J. Using Theorem 4.12 again, G′′superscript𝐺′′G^{\prime\prime}italic_G start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT can then be transformed into Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT using transpositions. All intermediate steps in this procedure can be guaranteed to be multigraphs. This concludes the proof of the first part. The results on the size of the Markov bases follows from computations: for Δ{1,2,3}Δ123\Delta\in\{1,2,3\}roman_Δ ∈ { 1 , 2 , 3 } the matrix AΔsubscript𝐴ΔA_{\Delta}italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT has full rank and for Δ{4,5,6,7,8}Δ45678\Delta\in\{4,5,6,7,8\}roman_Δ ∈ { 4 , 5 , 6 , 7 , 8 } we computed the Markov bases using the software 4ti2. ∎

We note that the set of simple graphs with a given curvature and degree sequence sits inside this bigger set of multigraphs. So in particular, the set of simple graphs is connected by transpositions and Markov moves, with intermediate steps potentially being multigraphs. It is an interesting open question whether all intermediate steps can be simple graphs as well, which would imply a version of Theorem 4.12 for simple graphs.

Finally, we note that Theorem 4.12 may be used to sample graphs with a given degree and curvature sequence. Algorithm 1 shows one possible approach.

Algorithm 1 Sampling graphs with given curvature and degree sequence
graph G𝐺Gitalic_G, Markov basis ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT, number of steps T𝑇Titalic_T, ratio of Markov moves to transpositions Θ[0,1]Θ01\Theta\in[0,1]roman_Θ ∈ [ 0 , 1 ]
t0𝑡0t\leftarrow 0italic_t ← 0
J𝐽absentJ\leftarrowitalic_J ← JDM of G𝐺Gitalic_G
S𝑆absentS\leftarrowitalic_S ← slack variables
while tT𝑡𝑇t\leq Titalic_t ≤ italic_T do
     p𝑝absentp\leftarrowitalic_p ← sample random variable in [0,1]01[0,1][ 0 , 1 ]
     if pΘ𝑝Θp\leq\Thetaitalic_p ≤ roman_Θ then \triangleright do a Markov move
         δJ𝛿𝐽absent\delta J\leftarrowitalic_δ italic_J ← random element in ±Δplus-or-minussubscriptΔ\pm\mathcal{M}_{\Delta}± caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT
         while J+δJ0not-greater-than-or-equals𝐽𝛿𝐽0J+\delta J\not\geq 0italic_J + italic_δ italic_J ≱ 0 or SδJ0not-less-than-or-equals𝑆𝛿𝐽0S-\delta J\not\leq 0italic_S - italic_δ italic_J ≰ 0 do
              δJ𝛿𝐽absent\delta J\leftarrowitalic_δ italic_J ← random element in ±Δplus-or-minussubscriptΔ\pm\mathcal{M}_{\Delta}± caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT
         end while
         Gsuperscript𝐺absentG^{\prime}\leftarrowitalic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ← Markov move of G𝐺Gitalic_G associated with δJ𝛿𝐽\delta Jitalic_δ italic_J
         (G,J,S)(G,J+δJ,SδJ)𝐺𝐽𝑆superscript𝐺𝐽𝛿𝐽𝑆𝛿𝐽(G,J,S)\leftarrow(G^{\prime},J+\delta J,S-\delta J)( italic_G , italic_J , italic_S ) ← ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_J + italic_δ italic_J , italic_S - italic_δ italic_J )
     else if p>Θ𝑝Θp>\Thetaitalic_p > roman_Θ then \triangleright do a transposition
         G𝐺absentG\leftarrowitalic_G ← transposition of G𝐺Gitalic_G
     end if
     tt+1𝑡𝑡1t\leftarrow t+1italic_t ← italic_t + 1
end while
Return: graph Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT

The performance of any sampling algorithm based on transpositions and Markov moves will depend on how fast the associated Markov chain mixes. The mixing behavior of transpositions is partially understood via its relation to perfect matching sampling [54], but understanding and controlling the mixing behavior of the Markov chain associated with general Markov bases is much more challenging [25, 61].

5 Conclusion and discussion

In this work, we consider the problem of constructing graphs with prescribed discrete curvatures. We take a first step in addressing this methodological gap in network geometry and discrete curvature by approaching the problem in two ways. First, in Section 3 we develop an evolutionary MCMC-type algorithm to construct graphs whose curvatures approximate a given set. Using this algorithm, we then explore the ensemble of graphs with a fixed discrete curvature sequence. Second, in Section 4 we solve the exact reconstruction problem for Forman–Ricci curvature and degree sequences. We show that there exists a finite set of graph rewiring moves — Markov moves and transpositions — which connect the set of all graphs with a given Forman–Ricci curvature and degree sequence.

Our work opens several new directions for future research. Most results obtained in this article are limited to specific choices of discrete curvatures, but the same questions are equally relevant for all other notions of discrete curvature. More specifically, our algorithmic approach could be extended by optimizing the expensive computations in the mutation/selection phases for specific curvatures. This would make the sampling algorithm practical for a wider range of curvatures and provide a tool to compare among them. Furthermore, a more careful choice of algorithm design might lead to an understanding of and control over the mixing times and convergence rates of the algorithm.

Our results on exact reconstruction for Forman–Ricci target sequences are not as directly extendable to other discrete curvatures. The analysis and solution heavily depends on the simple combinatorial nature of Forman–Ricci curvatures and its connection to joint degrees in a graph. However, an important follow-up question would be to find ways to decrease the prohibitively large size of the Markov bases ΔsubscriptΔ\mathcal{M}_{\Delta}caligraphic_M start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT. Adding constraints to the curvature or degree sequences and thus restricting to subspaces of ker(AΔ)subscriptkernelsubscript𝐴Δ\ker_{\mathbb{Z}}(A_{\Delta})roman_ker start_POSTSUBSCRIPT blackboard_Z end_POSTSUBSCRIPT ( italic_A start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ) is one possible approach. If the Markov basis size can be controlled, this would enable a practical implementation of the proposed Algorithm 1.
 
Acknowledgements: The authors thank Florentin Münch for a discussion that lead to the proof of Proposition 4.10.

Appendix A Algebraic background for Markov bases

This brief appendix defines the algebraic concepts that appear in the fundamental theorem of Markov bases, Theorem 4.5, and that may not be commonly known. We refer to [17, 25] for further background.

Let K=[x1,,xn]𝐾subscript𝑥1subscript𝑥𝑛K=\mathbb{C}[x_{1},\dots,x_{n}]italic_K = blackboard_C [ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] be the ring of polynomials in variables x1,,xnsubscript𝑥1subscript𝑥𝑛x_{1},\dots,x_{n}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and complex coefficients. For a vector u=(u1,,un)n𝑢subscript𝑢1subscript𝑢𝑛superscript𝑛u=(u_{1},\dots,u_{n})\in\mathbb{N}^{n}italic_u = ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∈ blackboard_N start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT we write 𝐱u=x1u1xnunKsuperscript𝐱𝑢superscriptsubscript𝑥1subscript𝑢1superscriptsubscript𝑥𝑛subscript𝑢𝑛𝐾\mathbf{x}^{u}=x_{1}^{u_{1}}\cdot\dots\cdot x_{n}^{u_{n}}\in Kbold_x start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ ⋯ ⋅ italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ italic_K for the monomial with exponent vector u𝑢uitalic_u, and for a vector βn𝛽superscript𝑛\beta\in\mathbb{Z}^{n}italic_β ∈ blackboard_Z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT we write β+superscript𝛽\beta^{+}italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT for the vector with entries (β+)i=βisubscriptsuperscript𝛽𝑖subscript𝛽𝑖(\beta^{+})_{i}=\beta_{i}( italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT if βi0subscript𝛽𝑖0\beta_{i}\geq 0italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 and zero otherwise, and β=β+βsuperscript𝛽superscript𝛽𝛽\beta^{-}=\beta^{+}-\betaitalic_β start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT = italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT - italic_β.

An ideal in K𝐾Kitalic_K is a subset IK𝐼𝐾I\subseteq Kitalic_I ⊆ italic_K such that fI,gKfgIformulae-sequence𝑓𝐼𝑔𝐾𝑓𝑔𝐼f\in I,g\in K\Rightarrow fg\in Iitalic_f ∈ italic_I , italic_g ∈ italic_K ⇒ italic_f italic_g ∈ italic_I and f,gIf+gI𝑓𝑔𝐼𝑓𝑔𝐼f,g\in I\Rightarrow f+g\in Iitalic_f , italic_g ∈ italic_I ⇒ italic_f + italic_g ∈ italic_I. An ideal I𝐼Iitalic_I is generated by f1,,fkIsubscript𝑓1subscript𝑓𝑘𝐼f_{1},\dots,f_{k}\subseteq Iitalic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⊆ italic_I if every gI𝑔𝐼g\in Iitalic_g ∈ italic_I can be written as a polynomial combination g=i=1khifi𝑔superscriptsubscript𝑖1𝑘subscript𝑖subscript𝑓𝑖g=\sum_{i=1}^{k}h_{i}f_{i}italic_g = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for some h1,,hkKsubscript1subscript𝑘𝐾h_{1},\dots,h_{k}\in Kitalic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ italic_K.

The fundamental theorem of Markov bases is thus an algebraic statement which says that Markov bases are precisely the generators of some ideal associated with the kernel ker(A)subscriptkernel𝐴\ker_{\mathbb{Z}}(A)roman_ker start_POSTSUBSCRIPT blackboard_Z end_POSTSUBSCRIPT ( italic_A ) of an integer matrix Ad×n𝐴superscript𝑑𝑛A\in\mathbb{Z}^{d\times n}italic_A ∈ blackboard_Z start_POSTSUPERSCRIPT italic_d × italic_n end_POSTSUPERSCRIPT; these are called toric ideals.

References

  • [1] 4ti2 team. 4ti2—a software package for algebraic, geometric and combinatorial problems on linear spaces. available at: https://4ti2.github.io.
  • [2] D. Bakry, I. Gentil, and M. Ledoux. Analysis and Geometry of Markov Diffusion Operators. Grundlehren der mathematischen Wissenschaften. Springer, Cham, 2013.
  • [3] A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509–512, 1999.
  • [4] M. Barthelemy. The structure and dynamics of cities: Urban data analysis and theoretical modeling. Cambridge University Press, Cambridge, UK, 2016.
  • [5] F. Battiston, G. Cencetti, I. Iacopini, V. Latora, M. Lucas, A. Patania, J.-G. Young, and G. Petri. Networks beyond pairwise interactions: Structure and dynamics. Physics Reports, 874:1–92, 2020.
  • [6] S. Battiston, M. Puliga, R. Kaushik, P. Tasca, and G. Caldarelli. DebtRank: Too central to fail? Financial networks, the FED and systemic risk. Scientific Reports, 2(1):541–541, 2012.
  • [7] F. Bauer, J. Jost, and S. Liu. Ollivier-Ricci curvature and the spectrum of the normalized graph Laplace operator. Math. Res. Lett., 19(6):1185–1205, 2012.
  • [8] C. Bick, E. Gross, H. A. Harrington, and M. T. Schaub. What are higher-order networks? SIAM Review, 65(3):686–731, 2023.
  • [9] J. Bober, A. Monod, E. Saucan, and K. N. Webster. Rewiring networks for graph neural network training using discrete geometry, 2022. arXiv:2207.08026 [stat.ML].
  • [10] M. Boguñá, I. Bonamassa, M. De Domenico, S. Havlin, D. Krioukov, and M. A. Serrano. Network geometry. Nature Reviews Physics, 3(2):114–135, 2021.
  • [11] A. D. Broido and A. Clauset. Scale-free networks are rare. Nature Communications, 10(1):1017–1017, 2019.
  • [12] G. T. Cantwell, Y. Liu, B. F. Maier, A. C. Schwarze, C. A. Serván, J. Snyder, and G. St-Onge. Thresholding normally distributed data creates complex networks. Phys. Rev. E, 101:062302, 2020.
  • [13] F. R. K. Chung. Spectral Graph Theory. American Mathematical Society, Providence, RI, US, 1997.
  • [14] K. Devriendt and R. Lambiotte. Discrete curvature on graphs from the effective resistance. Journal of Physics: Complexity, 3(2):025008, 2022.
  • [15] K. Devriendt, A. Ottolini, and S. Steinerberger. Graph curvature via resistance distance. Discrete Applied Mathematics, 348:68–78, 2024.
  • [16] P. Diaconis, R. Graham, and S. P. Holmes. Statistical problems involving permutations with restricted positions. Lecture notes-monograph series, 36:195–222, 2001.
  • [17] P. Diaconis and B. Sturmfels. Algebraic algorithms for sampling from conditional distributions. The Annals of Statistics, 26(1):363–397, 1998.
  • [18] J. Duch and A. Arenas. Community detection in complex networks using extremal optimization. Phys. Rev. E, 72(2):027104, 2005.
  • [19] M. Eidi, A. Farzam, W. Leal, A. Samal, and J. Jost. Edge-based analysis of networks: curvatures of graphs and hypergraphs. Theory in Biosciences, 139:337 – 348, 2020.
  • [20] P. Elumalai, Y. Yadav, N. Williams, E. Saucan, J. Jost, and A. Samal. Graph Ricci curvatures reveal atypical functional connectivity in autism spectrum disorder. Scientific Reports, 12(1), 2022.
  • [21] M. Erbar and J. Maas. Ricci curvature of finite Markov chains via convexity of the entropy. Archive for rational mechanics and analysis, 206(3):997–1038, 2012.
  • [22] L. Fesser, S. S. de Haro Iváñez, K. Devriendt, M. Weber, and R. Lambiotte. Augmentations of Forman’s Ricci curvature and their applications in community detection, 2023. arXiv:2306.06474 [math.CO].
  • [23] R. Forman. Bochner’s method for cell complexes and combinatorial Ricci curvature. Discrete & Computational Geometry, 29:323–374, 2003.
  • [24] A. Fornito, A. Zalesky, and E. T. Bullmore. Fundamentals of brain network analysis. Academic Press, London, UK, first edition, 2016.
  • [25] J. A. D. Félix Almendra-Hernández and S. Petrović. Markov bases: A 25 year update. Journal of the American Statistical Association, pages 1–32, 2024.
  • [26] M. Girvan and M. E. J. Newman. Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12):7821–7826, 2002.
  • [27] A. Gosztolai and A. Arnaudon. Unfolding the multiscale structure of networks with dynamical Ollivier-Ricci curvature. Nature Communications, 12:4561, 2021.
  • [28] R. Hausmann, C. Hidalgo, S. Bustos, M. Coscia, S. Chung, J. Jimenez, A. Simoes, and M. Yildirim. The Atlas of Economic Complexity. Puritan Press, 2011.
  • [29] M. Ipsen and A. S. Mikhailov. Evolutionary reconstruction of networks. Phys. Rev. E, 66:046109, 2002.
  • [30] H. Jeong, B. Tombor, R. Albert, Z. N. Oltvai, and A.-L. Barabási. The large-scale organization of metabolic networks. Nature, 407(6804):651–654, 2000.
  • [31] J. Jost and S. Liu. Ollivier’s Ricci curvature, local clustering and curvature-dimension inequalities on graphs. Discrete & Computational Geometry, 51(2):300–322, 2014.
  • [32] J. Jost and F. Münch. Characterizations of Forman curvature, 2021. arXiv:2110.04554 [math.DG].
  • [33] D. Krioukov. Clustering implies geometry in networks. Phys. Rev. Lett., 116:208302, 2016.
  • [34] J. Kunegis. Konect: the Koblenz network collection. In Proceedings of the 22nd international conference on world wide web, pages 1343–1350, 2013.
  • [35] R. Lambiotte, M. Rosvall, and I. Scholtes. From networks to optimal higher-order models of complex systems. Nature Physics, 15(4):313–320, 2019.
  • [36] Y. Lin, L. Lu, and S.-T. Yau. Ricci curvature of graphs. Tohoku Mathematical Journal, 63(4):605 – 627, 2011.
  • [37] Y. Lin and S.-T. Yau. Ricci curvature and eigenvalue estimate on locally finite graphs. Mathematical research letters, 17(2):343–356, 2010.
  • [38] S. Liu, F. Münch, and N. Peyerimhoff. Bakry–Émery curvature and diameter bounds on graphs. Calculus of variations and partial differential equations, 57(2):1–9, 2018.
  • [39] B. Loisel and P. Romon. Ricci curvature on polyhedral surfaces via optimal transportation. Axioms, 3(1):119–139, 2014.
  • [40] M. Mondal, A. Samal, F. Münch, and J. Jost. Bakry-Émery-Ricci curvature: An alternative network geometry measure in the expanding toolbox of graph Ricci curvatures, 2024. arXiv:2402.06616 [physics.comp-ph].
  • [41] R. Mulas, D. Horak, and J. Jost. Graphs, simplicial complexes and hypergraphs: Spectral theory and topology. In Higher-Order Systems, Understanding Complex Systems, pages 1–58. Springer International Publishing AG, Switzerland, 2022.
  • [42] M. E. J. Newman. Networks an introduction. Oxford University Press, Oxford, UK, 2010.
  • [43] C. Ni, Y. Lin, F. Luo, and J. Gao. Community detection on networks with Ricci flow. Scientific Reports, 9:9984, 2019.
  • [44] Y. Ollivier. Ricci curvature of metric spaces. Comptes Rendus Mathematique, 345(11):643–646, 2007.
  • [45] Y. Ollivier. A survey of Ricci curvature for metric spaces and Markov chains. In Probabilistic approach to geometry, volume 57, pages 343–382. Mathematical Society of Japan, 2010.
  • [46] R. Overbeek, N. Larsen, G. D. Pusch, M. D’Souza, E. S. Jr, N. Kyrpides, M. Fonstein, N. Maltsev, and E. Selkov. WIT: Integrated system for high-throughput genome sequence analysis and metabolic reconstruction. Nucleic Acids Research, 28(1):123–125, 2000.
  • [47] G. A. Pagani and M. Aiello. The power grid as a complex network: A survey. Physica A: Statistical Mechanics and its Applications, 392(11):2688–2700, 2013.
  • [48] R. Pastor-Satorras, C. Castellano, P. Van Mieghem, and A. Vespignani. Epidemic processes in complex networks. Rev. Mod. Phys., 87:925–979, 2015.
  • [49] A. Samal, H. K. Pharasi, S. J. Ramaia, H. Kannan, E. Saucan, J. Jost, and A. Chakraborti. Network geometry and market instability. Royal Society Open Science, 8(2):201734, 2021.
  • [50] A. Samal, R. P. Sreejith, J. Gu, S. Liu, E. Saucan, and J. Jost. Comparative analysis of two discretizations of Ricci curvature for complex networks. Scientific Reports, 8(1):1–16, 2018.
  • [51] R. Sandhu, T. Georgiou, E. Reznik, L. Zhu, I. Kolesov, Y. Şenbabaoğlu, and A. Tannenbaum. Graph curvature for differentiating cancer networks. Scientific Reports, 5:12323, 2015.
  • [52] T. Squartini and D. Garlaschelli. Maximum-entropy networks: Pattern detection, network reconstruction and graph combinatorics. SpringerBriefs in complexity. Springer, Cham, Switzerland, 2017.
  • [53] R.-P. Sreejith, K. Mohanraj, J. Jost, E. Saucan, and A. Samal. Forman curvature for complex networks. Journal of Statistical Mechanics: Theory and Experiment, 2016(6):063206, 2016.
  • [54] I. Stanton and A. Pinar. Constructing and sampling graphs with a prescribed joint degree distribution. ACM J. Exp. Algorithmics, 17, 2012.
  • [55] S. Steinerberger. Curvature on graphs via equilibrium measures. Journal of Graph Theory, 103(3):415–436, 2023.
  • [56] P. Tee and C. A. Trugenberger. Enhanced Forman curvature and its relation to Ollivier curvature. Europhysics Letters, 133(6):60006, 2021.
  • [57] J. Topping, F. D. Giovanni, B. P. Chamberlain, X. Dong, and M. M. Bronstein. Understanding over-squashing and bottlenecks on graphs via curvature, 2022. arXiv:2111.14522 [stat.ML].
  • [58] I. Voitalov, P. van der Hoorn, R. van der Hofstad, and D. Krioukov. Scale-free networks well done. Phys. Rev. Res., 1:033034, 2019.
  • [59] S. Wasserman and K. Faust. Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge, UK, 1994.
  • [60] D. J. Watts and S. H. Strogatz. Collective dynamics of ‘small-world’ networks. Nature, 393(6684):440–442, 1998.
  • [61] T. Windisch. Rapid mixing and markov bases. SIAM Journal on Discrete Mathematics, 30(4):2130–2145, 2016.
  • [62] Z. Wu, G. Menichetti, C. Rahmede, and G. Bianconi. Emergent complex network geometry. Scientific Reports, 5, 2014.