Abstract
One of the most basic facts related to the famous Ulam reconstruction conjecture
is that the connectedness of a graph can be determined by the deck of its vertex-deleted
subgraphs, which are considered up to isomorphism. We strengthen this result
by proving that connectedness can still be determined when the subgraphs in the deck are given up to
equivalence under the color refinement isomorphism test.
Consequently, this implies that connectedness is recognizable by Reconstruction Graph Neural Networks,
a recently introduced GNN architecture inspired by the reconstruction conjecture
(Cotta, Morris, Ribeiro 2021).
1 Introduction
For a vertex of a graph , the vertex-deleted subgraph is obtained by
removing along with all incident edges from . The isomorphism type of
is sometimes referred to as a card, and then the multiset of all cards is called
the deck of . The famous Ulam reconstruction conjecture [19] says that every graph
with more than 2 vertices is, up to isomorphism, reconstructible from its deck.
Though published in 1960, the problem collection [19] is a descendant
of the much earlier famous Scottish Book [12]. According to Harary [9],
the conjecture has already been discussed since 1929 and, about a hundred years later,
it remains a notoriously hard open problem in graph theory.
The subject has a rich literature, and
one of the most frequently taken approaches is examination of how much information about a graph
can be retrieved from its deck. One of the earliest results in this direction
is referred to in [9] as a “little theorem”: is connected if and only if
at least two subgraphs in its deck are connected. This implies the following fact
(see also [15, Theorem 2.2])
Theorem 1 (Harary [8]).
The connectedness of a graph is determined by the multiset of all
vertex-deleted subgraphs of .
Recall that the subgraphs are considered up to isomorphism,
which means that the full structural information about them is available.
It is natural to ask which partial information suffices to decide the connectedness of .
For example, the aforementioned “little theorem” of Harary shows that it would be just enough
to know the connectedness of each card in the deck. The question addressed in this note
is whether connectedness can still be decided when the subgraphs are considered up to
an important relaxation of the isomorphism relation, namely up to equivalence
under the color refinement test (also known as one-dimensional Weisfeiler-Leman equivalence).
Color refinement iteratively computes an isomorphism-invariant coloring of
the vertex set of an input graph .
Let denote the mutisets of colors assigned by the algorithm to the vertices of
(see Section 2 for definitions).
Color refinement distinguishes two graphs and if .
If , the graphs are called CR-equivalent.
Since is a graph invariant, CR-equivalence is a coarser equivalence relation
than isomorphism of graphs.
CR-equivalence has natural characterizations discovered in different research contexts.
Two graphs are CR-equivalent exactly when they are equivalent in the
two-variable first-order logic with counting quantifiers
[10], when they have equally many vertices and share a
common covering [1], when they are fractionally
isomorphic [17], and when the number of homomorphisms
from every tree to these graphs is the same [5].
Color refinement is a versatile tool with applications in diverse fields [1, 7, 16].
Most prominently, it is a practical method of isomorphism testing.
The graph invariant can be computed efficiently [10]
and identifies the isomorphism type of for almost all graphs [2].
In machine learning, color refinement is used for analysis and comparison of
graph-structured data [18] and for understanding the expressiveness of graph
neural networks (GNNs) [13].
Our main result is the following strengthening of Theorem 1.
Theorem 2.
The connectedness of a graph is determined
by the multiset of the invariants for all
vertex-deleted subgraphs of .
Note that the invariant is not powerful enough to decide whether
or not is connected. Indeed, , where denotes
the cycle graph of length , and stands for the vertex-disjoint union of
two copies of . This cuts off the easiest route to Theorem 2
through the potential use of the “little theorem” of Harary and shows that
running color refinement on the deck of a graph can be more beneficial than
running it on the graph alone. Theorem 2 is interesting in several respects,
which we discuss now.
Consequences for GNNs.
The expressive power of a representative class of GNNs is characterizable
in terms of color refinement [14]. In the search for a more expressive GNN architecture,
Cotta, Morris, Ribeiro [4] introduce the model of -Reconstruction GNNs,
whose conceptual novelty consists in enhancing the standard GNNs by running them
on the deck of -vertex subgraphs of a graph rather than on itself. The simplest instructive example,
highlighted in [4], is given by considering the 6-cycle graph .
As we just mentioned, is not identified by . However,
is reconstructible from its deck, which consists of six copies of the 5-path graph .
Since is identified by , the cycle graph is identified by
color refinement applied to its deck. We state an immediate consequence of Theorem 2,
providing further evidence that -Reconstruction GNNs can often be more powerful.
Corollary 3.
The connecteness of an -vertex graph is recognizable by -Reconstruction GNNs.
Graph reconstruction.
Theorem 2 shows that a prototypical reconstructible
graph property can be recognized even from an incomplete information about the deck.
The research on the reconstruction conjecture provides many results of this kind,
and we here mention two of them which concern connectedness.
One can determine the connectedness of an -vertex graph from any
of its cards [3].
This can also be done using the -deck of a graph whenever [6].
Note in this respect that the -deck of a graph determines its -deck whenever
and, therefore, the -deck with a smaller potentially provides less information.
Turning back to Theorem 2, it is natural to combine it with the logical
characterization of color refinement [10]. As a consequence, the connectedness
of a graph can be determined based on the information about its deck expressible in
the two-variable logic with counting quantifiers.
Isomorphism invariants of graphs.
Let denote the multiset of over all vertices of .
This graph invariant is interesting to compare not only with with but
as well with other related invariants.
Color refinement is also commonly known as the one-dimensional Weisfeiler-Leman algorithm.
Weisfeiler and Leman [20] actually introduced the two-dimensional version, exploiting
the same idea as color refinement but coloring pairs of vertices instead of single vertices.
Let denote the graph invariant computed by this algorithm on input .
A higher dimension results in a more powerful isomorphism test and, in particular,
is determined by . It is not hard to show that
is also determined by . Whether , , and
form a hierarchy of graph invariants linearly ordered by their strength is an open question.
More precisely, it is unknown whether or not is determined by .
This question is discussed in [17, Section 6.8] and [4, Section 4.1].
As we mentioned above, does not allow us to see whether is connected.
On the other hand, it is well known that the connectedness of a
graph can be determined from . Theorem 2 provides
a formal strengthening of the last fact by showing that the connectedness of
is determined even by .
2 Color refinenemt
The vertex set of a graph is denoted by .
The neighborhood
of a vertex consists of all vertices adjacent to .
We write to denote the degree of .
On an input graph , the color-refinement algorithm (to be
abbreviated as CR) iteratively computes a sequence of colorings
of . The initial coloring is monochromatic,
i.e., for all , where the same initial color is used for all .
Each subsequent color of a vertex is obtained from the preceding coloring by the rule
|
|
|
where denotes a multiset.
Note that exactly when , that is,
the color can just be seen as the degree of the vertex .
We define as the sequence of for all
and set . We say that CR
distinguishes two graphs and if .
If and are indistinguishable by CR, i.e.,
, they are referred to as CR-equivalent.
Defining as an infinite sequence facilitates proving Theorem 2
in the subsequent sections. For example, this definition is needed for the equivalences
in Lemma 8 below. Color refinement is, nevertheless, an absolutely
practical procedure. Indeed, if -vertex graphs and are distinguishable by CR,
then they are distinguishable after the -th refinement round at latest in the sense that
.
Note in this respect that the color names of increase
exponentially with if they are encoded in a straightforward way. This can
be avoided by renaming the colors synchronously on and after
each round.
3 Useful lemmas
As , we have , which
can be referred to as the iterated degree of .
We begin with a particularly useful fact about the reconstructibility of iterated degrees.
Lemma 4 (Nash-Williams [15, Lemma 3.3]).
Suppose that graphs and have the same deck up to CR-equivalence, that is,
|
|
|
If
for a vertex and a vertex , then .
If two graphs have different number of vertices, then they are
straightforwardly distinguished by CR. Nevertheless, such a pair of graphs
might still be similar in the sense that the CR colors occurring in
them are the same, and the difference is only in the color
multiplicities. In the proof of Theorem 2, this
similarity concept will play an important role.
Definition 5.
We say that graphs and are CR-similar and write
if . That is, we require that the sets of the colors
produced by CR on and be equal (although the
multisets of these colors might be unequal).
Note that two graphs with different numbers of vertices can be
CR-similar, for example, .
Lemma 6.
-
1.
If is a proper subgraph of a connected graph , then .
-
2.
Let be a proper subgraph of and be a proper subgraph of .
Suppose that both and are connected. Then it is impossible that simultaneously
and .
The proof is based on a characterization of the CR-color in terms of
the universal cover of the graph rooted at the vertex . We first give
the relevant definitions.
Let and be connected graphs, where is finite graph, while can be
either finite or infinite. Let be a surjective homomorphism from onto
. We call a covering map if its restriction to the neighborhood of
each vertex in is bijective. If such an exists, we say that
covers . A graph is called a universal cover of
if covers every graph covering . A universal cover
of is unique up to isomorphism. It can be seen as an unfolding of
into a (possibly infinite) tree starting from an arbitrarily chosen vertex of .
Denote this unfolding by and note that whatever .
If is a tree, then . If contains a cycle, then is an infinite tree.
The unfolding can formally be described as follows. A sequence of vertices
in is called a walk starting at if every two successive vertices and
are adjacent. If for all , then the walk is non-backtracking.
Now, is the graph whose vertices are all non-backtracking walks in starting at ,
with any two walks of the form and
being adjacent.
A straightforward inductive argument shows that a covering map
preserves the coloring produced by CR, that is,
for all . This has the following consequence. Let be a vertex of a connected graph
and be a vertex of a connected graph . If and are isomorphic as rooted trees,
then . The converse implication is also true; see, e.g., Lemmas 2.3 and 2.4 in [11].
We state this equivalence in a more precise form. Let denote the rooted tree
truncated at level .
Lemma 7.
if and only if , where denotes the isomorphism of rooted trees.
The characterization of CR-colors in terms of universal covers readily implies the following equivalences.
Lemma 8.
The following conditions are equivalent for connected graphs and :
-
1.
;
-
2.
;
-
3.
.
Lemma 8 has a simple consequence about the CR-similarity of not necessarily
connected graphs. For graphs , we write to denote the
vertex-disjoint union of these graphs.
Lemma 9.
-
1.
If , then for every connected component of there is
a connected component of such that (and vice versa).
-
2.
If where is connected and , then .
Proof of Lemma 6.
Let us extend the notation to not necessarily connected graphs
by setting where is the connected component of containing
the vertex .
1. Let the positive integer be one more than the
diameter of . Fix a vertex of the subgraph maximizing
the number of vertices in the truncated tree . The
description of a universal cover in terms of non-backtracking walks
readily implies that has strictly more vertices than
and, hence, than for any vertex of
. Indeed, our choice of ensures that there is a walk of
length at most starting from and passing through an edge of
absent in . Lemma 7, therefore, implies
that for any .
2. Towards a contradiction, assume that and
. Let be the positive integer that is one more than
the larger of the diameters of and . Choose a vertex of
the subgraph maximizing the number of vertices in the truncated
tree . As in the proof of the first part, note that
has strictly more vertices than and, hence,
than for any vertex of . The assumption
implies by Lemma 7 that every tree
for has an isomorphic mate
for some and vice versa. Therefore, has
strictly more vertices than for any vertex of
. The same argument with the roles of and interchanged
provides us with a vertex in which has strictly more
vertices than for any vertex of , in particular,
than . This contradiction completes the proof.
∎
4 Proof of Theorem 2
Let be a connected graph and any disconnected graph. We have
to prove that the multisets and
are unequal. We will assume that these
multisets are equal and derive a contradiction.
Let , where , be the connected components of .
Let and for .
Identify with so that
|
|
|
(1) |
for all .
By Lemma 4, this yields
|
|
|
for all . Taking into account the decomposition of into connected components, we have
|
|
|
(2) |
Denote the set of articulation (or cut) vertices of by , and set .
Let . If , then is connected and
. By Part 2 of Lemma 9,
Equality (1) implies that
|
|
|
(3) |
Applying the similarity to any two vertices in ,
we derive the similarity whenever .
Applying Lemma 4 in the particular case of , we conclude that
|
|
|
(4) |
Let and assume that and .
Let and . By (3), we obtain
and , which contradicts Part 2 of Lemma 6.
Therefore, only one of the sets is not empty, that is, for some .
Without loss of generality, we suppose that .
Claim A.
.
Proof of Claim A.
Towards a contradiction, assume that contains a cut vertex of .
By Part 1 of Lemma 9, Equality (1) implies that
the disconnected graph has a connected component
such that . Choose a vertex in a connected component of
different from . Since , the similarity (3) implies that
and, therefore, . However, is a subgraph of the connected graph ,
contrary to Part 1 of Lemma 6.
By Claim 4 and Equality (4), we have
|
|
|
Taking into account Equality (2), this implies that
|
|
|
(5) |
In particular, is a regular graph, say, of degree .
Consider the block-cut tree of . Recall that it describes the decomposition
of into biconnected components, that is, maximal biconnected subgraphs, which are also
called blocks. The vertex set of the tree consists of the blocks and the cut vertices of .
A block is adjacent to a cut vertex if .
Let be a block that is a leaf in the block-cut tree of .
Let be the single cut vertex of belonging to .
By Claim 4, belongs to ,
and the other vertices of belong to .
Without loss of generality, suppose that .
Let be a vertex in adjacent to . Since , the similarity (3) implies that
|
|
|
(6) |
Since is a -regular graph, every vertex of has degree or .
By the similarity (6), the neighbors of in are of degree or .
By Equality (2), the same holds true for in .
Therefore, every connected component of has a vertex of degree or
(namely a neighbor of in ). As a consequence, for every . On the other hand,
Equality (1) implies that
|
|
|
By Part 1 of Lemma 9, we conclude that for some component of .
This contradiction completes the proof of Theorem 2.