Relations between average clustering coefficient and another centralities in graphs
Mikhail Tuzhilin
Affiliation: Moscow State University, Electronic address: mtu93@mail.ru;
Abstract
Relations between average clustering coefficient and global clustering coefficient, local efficiency, radiality, closeness, betweenness and stress centralities were obtained for simple graphs.
keywords:
Networks, centralities, local and global properties of graphs, Watts-Strogatz clustering coefficient, global clustering coefficient.
1 Introduction.
The centrality measure was introduced by Bonacich in [1]. Centrality is a local (with relation to a vertex) or global (with relation to a whole graph) measures in networks. There are many centrality measures (or shortly centralities) such as local efficiency, radiality, closeness, betweenness, stress centralities, etc. Calculation of centralities is very useful for finding intrinsic properties of “real” networks (which can be found in applications) [2]- [4]. One of most important centrality measure is a clustering coefficient, that differentiate “real” graphs (or small-world networks) and random generated graphs [5].
There are two definitions of clustering coefficient: the average clustering coefficient introduced by Watts-Strogatz [5] and the global clustering coefficient. It was shown in [6] that for windmill graphs the average clustering coefficient and the global clustering coefficient asymptotically different. More precisely, the average clustering coefficient tends to 1 and the global clustering coefficient tends to 0 if the number of vertices increasing. In this paper, author provides two large class of graphs for which the average clustering coefficient is less or equal than the global cluster coefficient and vise versa.
Nowadays, there are also many articles where centrality measures are used for calculations and predictions of certain network characteristics, but a very few with theoretical basis. In the article [7] relations between different centralities were obtained, also an estimation of the local efficiency was obtained in terms of the average clustering coefficient. In this article relations between the average clustering coefficient and another centralities are proved for simple undirected graphs, in particular, it is proved that the estimation of local efficiency in terms of the average cluster coefficient is in fact an equality.
2 Main definitions.
All subsequent definitions are given for a simple undirected graph without pendant vertices. It also can be defined to a simple graph with pendant vertices if every function where {vertex degree } is in the denominator are defined to be equal to for all cases where vertices degrees equal to , but this will be omitted in this article for the sake of brevity.
Let’s give necessary denotations. Let’s denote by
-
•
the set of vertices, the set of edges and adjacency matrix of graph .
-
•
Neighbourhood — the set of vertices which adjacent to the vertex ,
-
•
subgraph in on these vertices,
-
•
, where is any function , the restriction of this function on (for example will be the average shortest path between and with restriction to subgraph ),
-
•
,
-
•
,
-
•
for any — set or function corresponding to vertex ,
Let’s give definitions of centralities:
-
1.
Diameter .
-
2.
Density .
-
3.
Global efficiency .
-
4.
Average shortest path length .
-
5.
Local cluster coefficient
.
-
6.
Average clustering coefficient
.
-
7.
Global clustering coefficient
.
-
8.
Betweenness centrality , where is the total number of shortest paths from to and is the total number of shortest paths which contains vertex .
-
9.
Closeness centrality .
-
10.
Local efficiency
-
11.
Radiality .
-
12.
Stress , where is the total number of shortest paths from to which contains vertex .
Note that all centralities are non-negative and are less or equal 1.
3 Main results.
All subsequent lemmas and theorems are given for a simple undirected graph without pendant vertices. It also can be defined to a simple graph with pendant vertices if every function where is in the denominator are defined to be equal to for all cases .
First let’s prove a lemma about a relation between average shortest path length between vertices in the neighbourhood of and local clustering coefficient of this vertex.
Lemma 1.
|
|
|
Proof.
|
|
|
|
|
|
|
|
|
Note that shortest paths for vertices in are defined corresponding to whole graph .
∎
Let’s prove theorem about a connection between local efficiency and average clustering coefficient of a graph.
Theorem 1.
|
|
|
Proof.
Let’s give two proofs of this fact:
-
1.
Note that by definition . In the article [7] it was proved that
|
|
|
Using lemma 1
|
|
|
Note that shortest paths for vertices in are defined corresponding to whole graph . Averaging by ends the proof.
-
2.
Let’s rewrite the local clustering coefficient formula:
|
|
|
|
|
|
|
|
|
Averaging by ends the proof.
∎
Let’s prove theorem about a connection between average clustering coefficient and stress centrality.
Theorem 2.
|
|
|
Proof.
Note that the shortest path between and is . Therefore,
|
|
|
|
|
|
Averaging by
|
|
|
Note that for holds an equality.
∎
Let’s prove theorem about a relation between average clustering coefficient and betweenness centrality.
Theorem 3.
Let’s then
|
|
|
Proof.
Let’s note that
|
|
|
|
|
|
Averaging by
|
|
|
Note that for holds an equality.
∎
By using theorems 2 and 3 an estimation of average shortest path in the neighborhood of is obtained.
Corollary 1.
|
|
|
Note that shortest paths for vertices in are defined corresponding to whole graph .
Let’s prove lemma about a relation between average closeness centrality and average shortest path length in graph.
Lemma 2.
|
|
|
Proof.
By the inequality of harmonic mean and arithmetic mean
|
|
|
Note that an equality holds when all average shortest path lengths from any vertex to all remaining vertices are equal.
∎
Now let’s prove theorem about a relation between average clustering coefficient and closeness centrality.
Theorem 4.
|
|
|
Proof.
By lemma 2
|
|
|
By the inequality of harmonic mean and arithmetic mean (since ):
|
|
|
∎
Let’s prove lemma about a relation between average shortest path length and average radiality.
Lemma 3.
|
|
|
Proof.
The proof holds from definition
|
|
|
∎
Now let’s prove theorem about a relation between average clustering coefficient and radiality.
Theorem 5.
|
|
|
Proof.
By lemma 3
|
|
|
where
. Averaging by ends the proof.
∎
Let’s prove two theorems about a relation between average clustering coefficient and global clustering coefficient.
Theorem 6.
Let’s hold , then
|
|
|
Proof.
Let’s re-numerate vertices such that .
Note that
|
|
|
Indeed,
|
|
|
.
Therefore,
|
|
|
Let’s denote by . Since and the maximum number of edges in subgraph equals to , then Hence, using Chebyshev’s sum inequality ():
|
|
|
Therefore,
|
|
|
The equality holds when , that is for a graphs in which all vertices degrees are equal.
∎
Corollary 2.
Let’s hold , then
|
|
|
The proof is the same as in theorem 6.
References
-
[1]
Bonacich P. Factoring and weighting approaches to status scores and clique identification //Journal of mathematical sociology. 1972. 2. № 1. 113–120.
-
[2]
Borgatti S. P., Everett M. G. A graph-theoretic perspective on centrality //Social networks. 2006. 28. № 4. 466–484.
-
[3]
Kiss C., Bichler M. Identification of influencers—measuring influence in customer networks //Decision Support Systems. 2008. 46. № 1. 233–253.
-
[4]
Lee S. H. M., Cotte J., Noseworthy T. J. The role of network centrality in the flow of consumer influence //Journal of Consumer Psychology. 2010. 20. № 1. 66–77.
-
[5]
Watts D. J., Strogatz S. H. Collective dynamics of ‘small-world’networks //nature. 1998. 393. № 6684. 440–442.
-
[6]
Estrada E. When local and global clustering of networks diverge //Linear Algebra and its Applications. 2016. 488. 249–263
-
[7]
Strang A. et al. Generalized relationships between characteristic path length, efficiency, clustering coefficients, and density //Social Network Analysis and Mining. 2018. 8. 1–6.