Identifying vital nodes in complex networks by adjacency information entropy

Xu, Xiang; Zhu, Cheng; Wang, Qingyong; Zhu, Xianqiang; Zhou, Yun

doi:10.1038/s41598-020-59616-w


Article
Open access
Published: 14 February 2020


Xiang Xu¹ (ORCID: orcid.org/0000-0002-3031-2561), Cheng Zhu¹, Qingyong Wang¹, Xianqiang Zhu¹ & Yun Zhou¹

Scientific Reports volume 10, Article number: 2691 (2020)


Abstract

Identifying the vital nodes in networks is of great significance for understanding the function of nodes and the nature of networks. Many centrality indices, such as betweenness centrality (BC), eccentricity centrality (EC), closeness centrality (CC), structural holes (SH), degree centrality (DC), PageRank (PR) and eigenvector centrality (VC), have been proposed to identify the influential nodes of networks. However, some of these indices have limited application scopes. EC and CC are generally only applicable to undirected networks, while PR and VC are generally used for directed networks. To design a more applicable centrality measure, two vital node identification algorithms based on node adjacency information entropy are proposed in this paper. To validate the effectiveness and applicability of the proposed algorithms, contrast experiments are conducted with the BC, EC, CC, SH, DC, PR and VC indices in different kinds of networks. The results show that the index in this paper has a high correlation with the local metric DC, and it also has a certain correlation with the PR and VC indices for directed networks. In addition, the experimental results indicate that our algorithms can effectively identify the vital nodes in different networks.


Introduction

The vital nodes in networks are the nodes that have great impacts on the network structure and function¹. Previous studies have described many centralities that can rank the nodes in networks, such as degree centrality², eccentricity³, closeness centrality⁴, betweenness centrality⁵,⁶,⁷, eigenvector centrality⁸ and PageRank⁹. Identifying the influential nodes in networks is not only of theoretical significance but also of practical value. For example, identifying the important junctions in traffic networks can prevent the paralysis of traffic networks caused by traffic congestion. Locking key sources in virus transmission networks can significantly reduce the speed and scope of virus transmission. These examples and others are all related to identifying the vital nodes in networks. Gino et al. applied optimal percolation theory to predict the influential nodes in memory networks¹⁰.

Considering that the local metrics have lower computational complexity and the global metrics have higher computational accuracy, in recent work, many vital node identification methods that consider both local and global metrics have been proposed. A semi-local metric that balances the accuracy and efficiency was proposed by Chen et al.¹¹. Another neighbourhood centrality that takes into account the importance of a node and its neighbours was proposed¹². In the paper by Yu et al.¹³, an improved method called improved structural holes (ISH) that identifies the key nodes in complex networks was proposed; unlike the eccentricity and betweenness centrality, this method can be applied to large-scale and disconnected networks. Zhang et al.¹⁴ presented an effective method named VoteRank to identify a set of dispersive spreaders with the best spreading ability. By considering the propagation probability, Ma et al.¹⁵ proposed a new algorithm named hybrid degree centrality (HC) to improve the local metrics and combined it with degree centrality. Lü et al.¹⁶ gave a complete overview of the vital node identification methods in recent years.

In addition to the above perspectives, many other related references use network dynamics to study the importance of nodes in networks. Lü et al.¹⁷ devised an adaptive and parameter-free algorithm, LeaderRank, to measure the influence of users in social networks, and the experimental results show that the algorithm is more efficient than PageRank and more robust to noisy data. Min¹⁸ proposed a method using a message-passing approach for identifying the most influential spreaders in networks and found that the method can be easily applied to unweighted and weighted networks. Liu et al.¹⁹ presented dynamics-sensitive (DS) centrality for locating influential nodes by combining the topological and dynamic characteristics of the networks. Zhang et al.²⁰ designed a multiscale node-importance method to measure the importance of nodes in the process of network dynamics according to different network scales. Many related studies only identify the vital nodes for a certain type of network: refs. ²¹,²²,²³ only study node identification for weighted networks, Chen et al.²⁴ proposed an identification method for directed networks, ref. ²⁵ mined the vital nodes in directed weighted complex networks, and Edgar et al.²⁶ used Kuramoto and Ising dynamics to study the central role of peripheral nodes in directed networks. The last paper argued that a large key component does not uniquely ensure the emergence of collective phenomena, as it does for undirected networks.

To identify the vital nodes for different types (unweighted-undirected, unweighted-directed, weighted-undirected and weighted-directed) of networks, we propose an adjacency information entropy method to identify the vital nodes in different networks by considering the weights and directions of the edges in networks. For weighted networks, the node strength is used instead of the node degree. For directed networks, in order to refine the influence of the out-degree and in-degree on the node importance, we set the influence coefficient θ of the in-degree value. By adjusting the size of θ, we can control the different influences of the out-degree and in-degree on nodes.

The rest of this paper is organised as follows. In section 2, we provide detailed representations of the different types of networks. In section 3, two vital node identification algorithms and three related definitions are proposed. In section 4, four empirical experiments on the independent parts, largest components, network efficiency and correlations analysis are carried out, and the experimental results are compared and explained. Finally, the conclusion and future works are presented in section 5.

Representations

In this paper, we study vital node identification in four different types of networks, namely, unweighted-undirected networks, unweighted-directed networks, weighted-undirected networks and weighted-directed networks. Obviously, the representations of the different networks and the calculations of the related metrics in the networks are different.

Four different types of networks

Usually, an unweighted network is represented by G = (V, E), where V = {v₁, v₂, ⋯, v_n} and |V| is the number of nodes in the network. E = {e₁, e₂, ⋯, e_m} and |E| is the number of edges in the network. An adjacency matrix is used to represent the connections between the nodes in the network, and the topology of the network can be obtained by using the adjacency matrix. In Fig. 1(a), the left figure is an unweighted-undirected network and the right figure is its corresponding adjacency matrix. The adjacency matrix of an undirected network is symmetric.

The degree of nodes in unweighted-undirected networks can be calculated by ${k}_{i}={\sum }_{j=1}^{m}{a}_{ij}$, where j is the neighbour of node i and m is the number of neighbours of node i. a_ij = 1 if there is an edge between node i and node j, and otherwise it is 0.

Unlike unweighted-undirected networks, the edges between the nodes in unweighted-directed networks have directions. The asymmetry of the adjacency matrix can reflect the directions of the edges in networks. We can see that the matrix in Fig. 1(b) is different from the matrix in Fig. 1(a). There are two kinds of degrees of nodes in directed networks, namely, in-degree and out-degree. In directed networks, the in-degree of a node is the number of edges from its neighbours that point to it, and the out-degree of a node is the number of edges of the node that point to its neighbours. These two kinds of degrees can be calculated by Eqs. 1 and 2, respectively.

$${k}_{i}^{in}=\begin{cases}{\sum }_{j\in {\Gamma }_{i}}{a}_{ji} & \text{if the network is unweighted}\\ {\sum }_{j\in {\Gamma }_{i}}{w}_{ji} & \text{if the network is weighted}\end{cases}$$

(1)

$${k}_{i}^{out}=\begin{cases}{\sum }_{j\in {\Gamma }_{i}}{a}_{ij} & \text{if the network is unweighted}\\ {\sum }_{j\in {\Gamma }_{i}}{w}_{ij} & \text{if the network is weighted}\end{cases}$$

(2)

Generally, the calculation of the degree in directed networks adds the in-degree to the out-degree. Here, we consider that the in-degree and out-degree of nodes have different effects on nodes²⁷. Then, the degree of nodes in directed networks can be calculated by Eq. 3, where θ is the influence coefficient of the nodes' in-degree, and in this paper, we set θ = 0.75.

$${k}_{i}^{unweighted}=\theta {k}_{i}^{in}+(1-\theta ){k}_{i}^{out}=\theta \mathop{\sum }\limits_{j=1}^{m}{a}_{ji}+(1-\theta )\mathop{\sum }\limits_{j=1}^{m}{a}_{ij}$$

(3)

A weighted network can be represented by G = (V, E, W), where W is the adjacency weighted matrix of the network. The weights of the connected edges in weighted networks are not only 0 or 1, and edges' weights can reflect the strength of the relationships between nodes. Figure 1(c) presents a weighted-undirected network and the corresponding adjacency matrix. The degree of nodes in weighted-undirected networks can be obtained by ${k}_{i}={\sum }_{j=1}^{m}{w}_{ij}$, where w_ij is the weight of the edge between node i and node j.

Weighted-directed networks are the most complicated of the four types of networks. Figure 1(d) shows a simple weighted-directed network and its adjacency weighted matrix. According to the above degree calculation method for weighted networks and directed networks, naturally, the degree of nodes in weighted-directed networks can be obtained by Eq. 4. Figure 2 illustrates the relationships among the four different types of networks.

$${k}_{i}^{weighted}=\theta {k}_{i}^{in}+(1-\theta ){k}_{i}^{out}=\theta \mathop{\sum }\limits_{j=1}^{m}{w}_{ji}+(1-\theta )\mathop{\sum }\limits_{j=1}^{m}{w}_{ij}$$

(4)
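Eqs. 3 and 4 can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function name `mixed_degree`, the list-of-lists matrix layout (entry `W[i][j]` is the weight of the edge from i to j) and the three-node example network are hypothetical and do not come from Fig. 1. For an unweighted network the same function applies with 0/1 entries.

```python
def mixed_degree(W, i, theta=0.75):
    """Eq. 3 / Eq. 4: theta * in-degree + (1 - theta) * out-degree of node i.

    W is a square adjacency (or weight) matrix given as a list of lists,
    with W[i][j] the weight of the edge i -> j and 0 if there is no edge.
    """
    n = len(W)
    k_in = sum(W[j][i] for j in range(n))   # edges pointing to i (column i)
    k_out = sum(W[i][j] for j in range(n))  # edges leaving i (row i)
    return theta * k_in + (1 - theta) * k_out


# Hypothetical weighted-directed network: 0 -> 1 (w=2), 1 -> 2 (w=1), 2 -> 0 (w=4)
W = [
    [0, 2, 0],
    [0, 0, 1],
    [4, 0, 0],
]
degrees = [mixed_degree(W, i) for i in range(len(W))]
print(degrees)
```

With θ = 0.75, a node that mostly receives weight (node 0 here) ends up with a larger degree than a node that mostly sends it, which is the effect the influence coefficient is meant to capture.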

Methods

Related definitions

To identify the vital nodes in different types of networks, we propose three definitions as follows.

Definition 1. Adjacency degree A_i. We define the adjacency degree of a node in undirected networks by considering its nearest neighbours as ${A}_{i}={\sum }_{j\in {\Gamma }_{i}}{k}_{j}$, where j is a neighbour of node i, Γ_i is the set of neighbours of node i, and k_j is the degree of node j. For example, in Fig. 3(a), A₁ = k₂ + k₇ = 3 + 6 = 9. In directed networks, the adjacency degree of nodes is defined as follows (Eq. 5), where ${k}_{{j}_{in}}$ is the number of edges that point to node j from node i, and ${k}_{{j}_{out}}$ is the number of edges from node j that point to node i. For example, in Fig. 3(b), A_b = θ(k_a + k_c) + (1 − θ)k_g = θ × (1 + 1.75) + (1 − θ) × 2 = 2.5625.

$${A}_{i}=\theta \sum _{j\in {\Gamma }_{i}}{k}_{{j}_{in}}+(1-\theta )\sum _{j\in {\Gamma }_{i}}{k}_{{j}_{out}}$$

(5)

Definition 2. Selection probability ${P}_{{i}_{j}}$. We define the selection probability of node i in the network by considering the probability that it will be selected by its neighbour j, and the calculation formula is Eq. 6.

Borrowing an idea from information theory, a certain node in the network is taken as the information source point, and its neighbouring nodes are taken as the target points. In the process of information transmission or disease transmission, the information source point (or the infected person) selects a target point among its neighbouring nodes for information transmission or disease infection. The probability that a target node is selected is called the selection probability. This definition considers the importance of the selected node, that is, the influence of the degree of the selected node in the selection process.

$${P}_{{i}_{j}}={k}_{i}/{A}_{j},(j\in {\Gamma }_{i})$$

(6)

For example, in Fig. 3(a), ${P}_{{1}_{2}}={k}_{1}/{A}_{2}={k}_{1}/({k}_{1}+{k}_{3}+{k}_{7})=2/(2+3+6)\approx 0.18$. Similarly, ${P}_{{1}_{7}}={k}_{1}/{A}_{7}={k}_{1}/({k}_{1}+{k}_{2}+{k}_{3}+{k}_{4}+{k}_{5}+{k}_{6})=2/(2+3+3+3+3+2)=0.125$.
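The two fractions in this example can be reproduced with a few lines of Python. The degrees and neighbour sets below are read off the sums written in the text for Fig. 3(a); the names `adjacency_degree` and `selection_probability` are hypothetical helpers introduced for illustration, and only the neighbourhoods of nodes 2 and 7 are encoded because those are the only ones the text spells out.

```python
# Node degrees as they appear in the worked sums for Fig. 3(a).
k = {1: 2, 2: 3, 3: 3, 4: 3, 5: 3, 6: 2, 7: 6}

# Neighbour sets of nodes 2 and 7, taken from the terms of A_2 and A_7.
neighbours = {
    2: [1, 3, 7],
    7: [1, 2, 3, 4, 5, 6],
}


def adjacency_degree(j):
    """Definition 1 (undirected case): sum of the degrees of j's neighbours."""
    return sum(k[n] for n in neighbours[j])


def selection_probability(i, j):
    """Eq. 6: P_{i_j} = k_i / A_j for a neighbour j of node i."""
    return k[i] / adjacency_degree(j)


p_1_2 = selection_probability(1, 2)  # 2 / (2 + 3 + 6)
p_1_7 = selection_probability(1, 7)  # 2 / (2 + 3 + 3 + 3 + 3 + 2)
print(p_1_2, p_1_7)
```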

Definition 3. Adjacency information entropy E_i. We define the adjacency information entropy of nodes in undirected networks as Eq. 7 and that in directed networks as Eq. 8.

$${E}_{i}=-{\sum }_{j\in {\Gamma }_{i}}({P}_{{i}_{j}}{\log }_{2}{P}_{{i}_{j}})$$

(7)

$${E}_{i}={\sum }_{j\in {\Gamma }_{i}}| (-{P}_{{i}_{j}}{\log }_{2}{P}_{{i}_{j}})| $$

(8)
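As a small worked instance of Eq. 7, the entropy of node 1 in Fig. 3(a) can be evaluated from the two selection probabilities 2/11 and 2/16 derived from the sums in Definition 2. The helper name `adjacency_entropy` is an assumption made for this sketch, and skipping zero probabilities is a defensive choice of the sketch rather than something stated in the definitions.

```python
import math


def adjacency_entropy(probabilities):
    """Eq. 7: E_i = -sum_j P_{i_j} * log2(P_{i_j}) over the neighbours j of i.

    Zero probabilities are skipped so that log2 is never called on 0.
    """
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


# Node 1 has neighbours 2 and 7, with P_{1_2} = 2/11 and P_{1_7} = 2/16.
E_1 = adjacency_entropy([2 / 11, 2 / 16])
print(round(E_1, 4))
```

Each term is non-negative whenever the probability lies in (0, 1], so a node gains entropy from every neighbour whose adjacency degree exceeds the node's own degree.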

Vital node identification algorithms

According to the characteristics of the four different types of networks, the proposed algorithms in this paper can be applied to different networks. Before the algorithms can be applied, we need to obtain the adjacency matrix A or the adjacency weighted matrix W of the network. From the above definitions, we can rank the nodes in the network by the value of the node's adjacency information entropy (E_i). In outline, each algorithm computes the degree (or strength) of every node, then the adjacency degree, the selection probabilities and the adjacency information entropy, and finally sorts the nodes by E_i in descending order.
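The ranking procedure for undirected networks can be sketched in Python as below. This is a minimal sketch assembled from Definitions 1 to 3 (Eqs. 6 and 7), not the authors' implementation; the function name `adjacency_entropy_rank`, the dict-of-dicts graph layout and the seven-node example graph are assumptions made for illustration. Using edge weights other than 1 gives the weighted-undirected variant, since the degree is then the node strength.

```python
import math


def adjacency_entropy_rank(adj):
    """Rank the nodes of an undirected network by adjacency information entropy.

    adj maps each node to a dict {neighbour: weight}; it must be symmetric.
    Returns (ranking, E), where ranking lists the nodes by decreasing E_i.
    """
    # Degree of each node (node strength when the weights differ from 1).
    k = {i: sum(nbrs.values()) for i, nbrs in adj.items()}
    # Definition 1: adjacency degree = sum of the neighbours' degrees.
    A = {i: sum(k[j] for j in nbrs) for i, nbrs in adj.items()}
    # Definitions 2 and 3: selection probabilities and their entropy.
    E = {}
    for i, nbrs in adj.items():
        E[i] = -sum((k[i] / A[j]) * math.log2(k[i] / A[j]) for j in nbrs)
    ranking = sorted(E, key=E.get, reverse=True)
    return ranking, E


# Hypothetical unweighted-undirected example graph (all weights are 1).
edges = [("h", "a"), ("h", "b"), ("h", "c"), ("a", "b"),
         ("a", "d"), ("b", "e"), ("c", "f")]
adj = {}
for u, v in edges:
    adj.setdefault(u, {})[v] = 1
    adj.setdefault(v, {})[u] = 1

ranking, E = adjacency_entropy_rank(adj)
print(ranking)
```

Because node i is itself a neighbour of each of its neighbours j, A_j always contains k_i, so every selection probability lies in (0, 1] and the logarithm is well defined.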

Results and Discussion

To verify the accuracy and applicability of our proposed algorithms, four different kinds of networks are employed, which include (1) unweighted-undirected networks (UUNs), (2) unweighted-directed networks (UDNs), (3) weighted-undirected networks (WUNs), and (4) weighted-directed networks (WDNs). The statistical properties of the studied networks are listed in Table 1. With respect to the unweighted-undirected networks, the Astro network is a collaboration network of astrophysics scientists²⁸; the CA network is a large connected component of the arXiv collaboration network in high-energy physics theory²⁹; the Facebook network is an anonymised social network with 4039 users, where the data can be downloaded from http://snap.stanford.edu/data/; and the Hamster network is a friendship and family connections network among website users³⁰. With respect to the unweighted-directed networks, the Email network includes 1133 email users of the University Rovira i Virgili (URV)³¹; the PGP network is a communication network³²; Router is a topological network of the Internet³³; and the Wiki-Vote network is a who-votes-on-whom network from Wikipedia, where the data can be downloaded from http://snap.stanford.edu/data/. With respect to the weighted-directed networks, the data of the P2P and PHD networks can be obtained at http://vlado.fmf.uni-lj.si/pub/networks/data/.

Table 1 The statistical properties of the four kinds of complex networks, where n and m are the total numbers of nodes and edges, respectively. ⟨k⟩ and ⟨d⟩ denote the average degree and the average distance, respectively, and C denotes the clustering coefficient.


We verify the accuracy of our algorithms by computing the proportion of independent parts of the networks after removing different proportions of nodes. The larger the proportion of independent parts is, the more seriously the network is destroyed, and the higher the identification accuracy of vital nodes is. For undirected networks, we selected five other centralities as benchmark indices, namely, betweenness centrality (BC), eccentricity centrality (EC), closeness centrality (CC), structural holes centrality³⁴ (SH) and degree centrality (DC). For simplicity, we call our algorithm for unweighted-undirected networks URank and that for weighted-undirected networks WRank. The X-axis is the proportion of removed nodes, and the Y-axis is the proportion of independent parts of the corresponding network. The results of the different centralities after removing different proportions of ranked nodes in the four different unweighted-undirected networks are shown in Fig. 4, where the advantage of our algorithm is clear. Similarly, Supplementary Fig. S1 shows the experimental results for weighted-undirected networks. In addition, we also remove different proportions of nodes in descending order of importance to test the efficiency of the undirected networks. As is well known, the higher the network efficiency is, the smaller the average distance between the nodes in the network is. If removing a set of nodes causes the network efficiency to decline more, the impact of the removed nodes on the network is greater and the removed nodes are more important. Figure 5 shows the experimental results of the network efficiency curves of undirected networks after removing different proportions of nodes.
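The two node-removal measurements can be sketched in Python as follows. The text does not give formulas for them, so this sketch makes two assumptions that should be read as such: independent parts are counted as connected components, and network efficiency is taken to be the usual global efficiency, that is, the average of the inverse shortest-path lengths over all ordered node pairs. The function names and the star-shaped example graph are hypothetical.

```python
from collections import deque


def remove_nodes(adj, removed):
    """Return a copy of the undirected graph without the nodes in `removed`."""
    removed = set(removed)
    return {u: {v for v in nbrs if v not in removed}
            for u, nbrs in adj.items() if u not in removed}


def bfs_distances(adj, source):
    """Hop distances from `source` to every node reachable from it."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def count_components(adj):
    """Number of connected components (assumed meaning of 'independent parts')."""
    seen, parts = set(), 0
    for s in adj:
        if s not in seen:
            parts += 1
            seen.update(bfs_distances(adj, s))
    return parts


def global_efficiency(adj):
    """Average of 1/d(u, v) over ordered pairs; unreachable pairs contribute 0."""
    n = len(adj)
    if n < 2:
        return 0.0
    total = sum(1.0 / d
                for s in adj
                for d in bfs_distances(adj, s).values() if d > 0)
    return total / (n * (n - 1))


# Hypothetical star graph: hub 0 connected to leaves 1..4.
star = {0: {1, 2, 3, 4}, 1: {0}, 2: {0}, 3: {0}, 4: {0}}
without_hub = remove_nodes(star, [0])
print(count_components(star), count_components(without_hub))
print(global_efficiency(star), global_efficiency(without_hub))
```

Removing the hub splits the star into isolated leaves and drops the efficiency to zero, which is the kind of damage a good ranking is expected to cause with few removals.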

For directed networks, we also selected five other centralities as benchmark indices, namely, PageRank centrality (PR), eigenvector centrality (VC), eccentricity centrality (EC), closeness centrality (CC) and degree centrality (DC). Similarly, for simplicity, we call our algorithm for unweighted-directed networks DRank and that for weighted-directed networks WDRank. Supplementary Figs. S2 and S3 show the independent parts experimental results in unweighted-directed networks and weighted-directed networks, respectively.

To further demonstrate the effectiveness and applicability of the proposed algorithms, we carried out the largest component experiments using the four different types of networks. When some nodes in the network are deleted according to the importance of the nodes, components of different sizes are formed. The smaller the largest remaining component is, the more destructive the removed nodes are to the original network. The largest component experiments can illustrate the accuracy of the vital node identification algorithms from another perspective. The X-axis is the proportion of removed nodes, and the Y-axis is the largest component size of the network after the corresponding proportion of nodes has been removed. Figure 6 shows the experimental results for unweighted-undirected networks and indicates that the URank algorithm performs well for most networks. The results of the same largest component experiments for unweighted-directed networks, weighted-undirected networks and weighted-directed networks are shown in Supplementary Figs. S5–S6, respectively.
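A largest component experiment of this kind can be sketched in Python as below. The normalisation is an assumption of the sketch: the largest component size is divided by the number of nodes in the original network, and the number of removed nodes is the removal proportion times that number, rounded to the nearest integer. The function names, the path graph and its ranking are hypothetical examples.

```python
from collections import deque


def largest_component_size(adj):
    """Size of the largest connected component of an undirected graph."""
    seen, best = set(), 0
    for s in adj:
        if s in seen:
            continue
        seen.add(s)
        queue, size = deque([s]), 0
        while queue:
            u = queue.popleft()
            size += 1
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    queue.append(v)
        best = max(best, size)
    return best


def largest_component_curve(adj, ranking, proportions):
    """Largest component fraction after removing the top-ranked nodes.

    ranking lists the nodes from most to least important; the fraction is
    taken relative to the size of the original network (an assumption).
    """
    n = len(adj)
    curve = []
    for p in proportions:
        removed = set(ranking[:round(p * n)])
        rest = {u: {v for v in nbrs if v not in removed}
                for u, nbrs in adj.items() if u not in removed}
        curve.append(largest_component_size(rest) / n)
    return curve


# Hypothetical path graph 0-1-2-3-4 with a ranking that removes the middle first.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
curve = largest_component_curve(path, [2, 1, 3, 0, 4], [0.0, 0.2, 0.4])
print(curve)
```

Plotting such a curve for each centrality on the same axes gives the kind of comparison described above: the faster the curve falls, the more destructive the removed nodes.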

To verify the applicability of our algorithms to other kinds of networks, we further carry out the three verification experiments described above using spatial networks and classical networks, such as a small world network and scale-free network, respectively. Figures 7, 8 and 9 present the results of the independent parts experiments, the network efficiency experiments and the largest component experiments, respectively. The corresponding statistical properties of the spatial networks and classical networks are listed in Table 2. Euroroad and Minnesota are road networks, and the data can be downloaded from http://networkrepository.com/road.php. Power Grid³⁵ contains an undirected unweighted representation of the topology of the Western States Power Grid of the United States, which was compiled by Duncan Watts and Steven Strogatz. The data are downloaded from the web site of Prof. Duncan Watts at Columbia University, http://cdg.columbia.edu/cdg/datasets. The Scale-Free and Small World networks are generated by the Pajek software. World Cites is a network of 415 cities, and the data can be obtained from http://www-personal.umich.edu/mejn/netdata/.

Table 2 The statistical properties of spatial networks and classical networks, where n and m are the total numbers of nodes and edges, respectively. ⟨k⟩ and ⟨d⟩ denote the average degree and the average distance, respectively, and C denotes the clustering coefficient.


To investigate the relations between our algorithms and other centralities in different networks, we conducted correlation analysis experiments. We use Kendall's Tau to describe the relationship between different centralities. The relevant definitions are as follows³⁶.

Assume that X and Y are two random variables (they can also be regarded as two sets), each with N elements, where X_i and Y_i represent the i-th element of each random variable, respectively. The corresponding elements in X and Y form an element pair set XY, which contains the elements (X_i, Y_i) (1 ≤ i ≤ N). When X_i > X_j and Y_i > Y_j, or X_i < X_j and Y_i < Y_j, the two elements are considered to be concordant. When X_i > X_j and Y_i < Y_j, or X_i < X_j and Y_i > Y_j, the two elements are considered to be discordant. When X_i = X_j or Y_i = Y_j, the two elements are neither concordant nor discordant. Kendall's Tau is defined as

$$\tau =\frac{{N}_{c}-{N}_{d}}{N(N-1)/2}$$

(9)

where N_c and N_d are the number of concordant and discordant pairs, respectively. N is the number of nodes in the network.
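Eq. 9 translates directly into a short Python function. The sketch below is a plain O(N²) pair count that follows the definitions of concordant and discordant pairs given above; the function name `kendall_tau` and the example score lists are hypothetical. Tied pairs are counted in neither N_c nor N_d but still appear in the denominator, exactly as in Eq. 9.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Eq. 9: (N_c - N_d) / (N (N - 1) / 2) for two equally long score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length of at least 2")
    n_c = n_d = 0
    for i, j in combinations(range(n), 2):
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:      # same ordering in X and Y: concordant
            n_c += 1
        elif sign < 0:    # opposite ordering: discordant
            n_d += 1
        # sign == 0 means a tie in X or Y: neither concordant nor discordant
    return (n_c - n_d) / (n * (n - 1) / 2)


# Hypothetical centrality scores of four nodes under two different indices.
scores_a = [1, 2, 3, 4]
scores_b = [1, 3, 2, 4]
tau = kendall_tau(scores_a, scores_b)
print(tau)
```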

In undirected networks, from Figs. 10 and 11, we can see that our centrality index is negatively correlated with EC and SH, because EC considers the node with the largest distance from the node, while SH considers the constraint coefficient of the node. The smaller the constraint coefficient is, the more important the node is, contrary to our centrality index in this paper. In unweighted-undirected networks, we can see from Fig. 10 that there is no obvious correlation between our centrality index and other centralities, but in the Facebook network, our centrality index has a high positive correlation with BC, CC and DC. The reason may be that Facebook is a social network, and the propagation between nodes is similar to that of the adjacency entropy algorithms in this paper. In weighted-undirected networks (Fig. 11) and directed networks (Supplementary Figs. S7 and S8), we can clearly observe a high correlation between our centrality index and DC. The reason may be that our centrality index and DC are designed based on the local properties of nodes. Similarly, we can find that our centrality index has low correlation with BC, CC and EC in the four different types of networks because BC, CC and EC are global metrics. By comparing the correlations between our centrality index and PR, VC and other centralities in directed networks (Supplementary Figs. S7 and S8), we can find that the correlations between our centrality index and PR and VC are greater than those of other centralities (except DC) because PR and VC are both centralities designed for directed networks, while other centralities are applicable to both directed networks and undirected networks.

Computational efficiency

The adjacency information entropy algorithm has two steps: the calculation of the adjacency degree and the calculation of the adjacency information entropy. Since every node's adjacency degree and adjacency information entropy in the network need to be calculated, the computational complexity of the first cycle is O(N), where N is the number of the network nodes. In the calculation of the node adjacency degree, it is also necessary to traverse the neighbouring nodes of the network nodes; thus, the total computational complexity of our algorithm is O(N²). Since the metric used in this paper involves only the first-order neighbours of the node, the algorithmic complexity is lower than those of the global metrics, such as betweenness centrality (BC) with complexity O(MN³) and closeness centrality (CC) with complexity O(MN²), where M is the number of edges in the network. The number of network nodes handled by our algorithm could be further scaled up in a High Performance Computing (HPC) environment.

Conclusion

In this paper, we design two vital node identification algorithms for four different types of networks. By calculating and comparing the adjacency information entropy of nodes, the importance of nodes is ranked. The larger the entropy value is, the more vital the nodes are. The algorithms highlight the different characteristics of the different types of networks. For weighted networks, the strength of the nodes is used to calculate the adjacency information entropy instead of the degree of the nodes. For directed networks, the influence coefficient of a node's in-degree and out-degree value is used, which further refines the influence of a node's in-degree and out-degree on the node's importance. The experimental results show that our proposed algorithms outperform several benchmark methods. In the future, we will consider identifying vital nodes for more realistic network types, including temporal networks.

References

Kitsak, M. et al. Identification of influential spreaders in complex networks. Nature Physics 6, 888â893 (2010).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Freeman, L. C. Centrality in social networks conceptual clarification. Social Networks. 1, 215â239 (1978).
ArticleÂ Google ScholarÂ
Hage, P. & Harary, F. Eccentricity and centrality in networks. Social Networks. 17, 57â63 (1995).
ArticleÂ Google ScholarÂ
Sabidussi, G. The Centrality Index of a Graph. Psychometrika 31, 581â603 (1996).
ArticleÂ MathSciNetÂ Google ScholarÂ
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35â41 (1997).
ArticleÂ Google ScholarÂ
Shimbel, A. Structural parameters of communication networks. Bulletin of Mathematical Biophysics 15, 501â507 (1953).
ArticleÂ MathSciNetÂ Google ScholarÂ
Shaw, M. E. Group Structure and the Behavior of Individuals in Small Groups. Journal of Psychology Interdisciplinary & Applied 38, 139â149 (1954).
ArticleÂ Google ScholarÂ
Bonacich, P. Factoring and weighting approaches to status scores and clique identification. Journal of Mathematical Sociology 2, 113â120 (1972).
ArticleÂ Google ScholarÂ
Brin, S. & Page, L. The anatomy of a large-scale hypertextual Web search engine. International Conference on World Wide Web 1, 107â117 (1998).
Google ScholarÂ
Gino, D. F. et al. Finding influential nodes for integration in brain networks using optimal percolation theory. Nature Communications 9, 2274â2286 (2018).
ArticleÂ ADSÂ Google ScholarÂ
Chen, D., LÃ¼, L., Shang, M. S., Zhang, Y. C. & Zhou, T. Identifying influential nodes in complex networks. Physica A: Statistical Mechanics & its Applications 391, 1777â1787 (2012).
ArticleÂ ADSÂ Google ScholarÂ
Liu, Y., Tang, M., Zhou, T. & Do, Y. Identify influential spreaders in complex networks, the role of neighborhood. Physica A: Statistical Mechanics & its Applications 452, 289â298 (2015).
ArticleÂ ADSÂ Google ScholarÂ
Yu, H., Cao, X., Liu, Z. & Li, Y. Identifying key nodes based on improved structural holes in complex networks. Physica A: Statistical Mechanics & its Applications 486, 318â327 (2017).
ArticleÂ ADSÂ Google ScholarÂ
Zhang, J. X., Chen, D. B., Dong, Q. & Zhao, Z. D. Identifying a set of influential spreaders in complex networks. Scientific Reports 6, 27823â27833 (2016).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Ma, Q. & Ma, J. Identifying and ranking influential spreaders in complex networks with consideration of spreading probability. Physica A: Statistical Mechanics & its Applications 465, 312â330 (2017).
ArticleÂ ADSÂ Google ScholarÂ
LÃ¼, L., Chen, D., Ren, X. L., Zhang, Q. M., Zhang, Y. C. & Zhou, T. Vital nodes identification in complex networks. Phys. Rep. 650, 1â63 (2016).
ArticleÂ ADSÂ MathSciNetÂ Google ScholarÂ
LÃ¼, L., Zhang, Y. C., Yeung, C. H. & Zhou, T. Leaders in Social Networks, the Delicious Case. PLOS One 6, e21202 (2011).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Min, B. Identifying an influential spreader from a single seed in complex networks via a message-passing approach. The European Physical Journal B 91, 18â24 (2018).
ArticleÂ ADSÂ Google ScholarÂ
Liu, J. G., Lin, J. H., Guo, Q. & Zhou, T. Locating influential nodes via dynamics-sensitive centrality. Scientific Reports 6, 21380â21388 (2016).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Zhang, J., Xu, X. K., Li, P., Zhang, K. & Small, M. Node importance for dynamical process on networks: A multiscale characterization. Chaos: An Interdisciplinary. Journal of Nonlinear Science 21, 47â4 (2011).
Google ScholarÂ
Gao, C., Wei, D., Hu, Y., Mahadevan, S. & Deng, Y. A modified evidential methodology of identifying influential nodes in weighted networks. Physica A: Statistical Mechanics & its Applications 392, 5490â5500 (2013).
ArticleÂ ADSÂ MathSciNetÂ Google ScholarÂ
Wei, D., Deng, X., Zhang, X., Deng, Y. & Mahadevan, S. Identifying influential nodes in weighted networks based on evidence theory. Physica A: Statistical Mechanics & its Applications 392, 2564â2575 (2013).
ArticleÂ ADSÂ Google ScholarÂ
Eidsaa, M. & Almaas, E. S-core network decomposition: A generation of k-core analisis to weighted networks. Phys. Rev. E. 88, 062819 (2013).
ArticleÂ ADSÂ Google ScholarÂ
Chen, D. B., Gao, H., LÃ¼, L. & Zhou, T. Identifying Influential Nodes in Large-Scale Directed Networks: The Role of Clustering. Plos One. 8, e77455 (2013).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Yang, Y., Xie, G. & Xie, J. Mining Important Nodes in Directed Weighted Complex Networks. Discrete Dynamics in Nature and Society. 1â7(2017).
Edgar, A. P. W., Sooyeon, Y., Antonio, L. F., Jose, F. F. M. & Alexander, V. G. The central role of peripheral nodes in directed network dynamics. Scientific Reports 9, 2045â2322 (2019).
ArticleÂ Google ScholarÂ
Wang, Y. & Liu, J. G. Evaluation method of node importance in directed-weighted complex network based on multiple influence matrix. Acta Physica Sinica 66, 13â24 (2017).
Google ScholarÂ
Newman, M. E. The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences of the United States of America 98, 404â409 (2001).
ArticleÂ ADSÂ MathSciNetÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Leskovec, J., Kleinberg, J. & Faloutsos, C. Graph evolution: Densification and shrinking diameters. Acm Transactions on Knowledge Discovery from Data 1, 2 (2007).
ArticleÂ Google ScholarÂ
Kunegis, J. Hamsterster full network dataset-KONECT.Available at, http://konect.uni-koblenz.de/networks/petster-hamster (Accessed:01/03/2014).
GuimerÃ , R., Danon, L., DÃaz-Guilera, A., Giralt, F. & Arenas, A. Self-similar community structure in a network of human interactions. Physical Review E 68, 065103 (2004).
ArticleÂ ADSÂ Google ScholarÂ
BoguÃ±Ã¡, M., Pastor-Satorras, R., DÃaz-Guilera, A. & Arenas, A. Models of social networks based on social distance attachment. Physical Review E Statistical Nonlinear & Soft Matter Physics 70, 056122 (2004).
ArticleÂ ADSÂ Google ScholarÂ
Spring, N., Mahajan, R. & Wetherall, D. Measuring ISP topologies with Rocketfuel. IEEE/ACM Trans. Netw. 12, 2 (2004).
Burt, R. S. Structural Holes: The Social Structure of Competition, Harvard University Press (2009).
Watts, D. J. & Strogatz, S. H. Collective dynamics of 'small-world' networks. Nature 393, 440–442 (1998).
Kendall, M. G. The treatment of ties in ranking problems. Biometrika 33, 239–251 (1945).

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (71571186, 61703416), the Natural Science Foundation of Hunan Province, China (2018JJ3614), and the Postgraduate Research Innovation Project of Hunan Province, China (CX20190041).

Author information

Authors and Affiliations

Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, 410072, China
Xiang Xu, Cheng Zhu, Qingyong Wang, Xianqiang Zhu & Yun Zhou

Contributions

X.X., C.Z. and Q.Y.W. devised the research project. X.X. and Q.Y.W. performed the research and analyzed the data. X.X., C.Z., Q.Y.W., Y.Z. and X.Q.Z. wrote the paper.

Corresponding authors

Correspondence to Xiang Xu or Cheng Zhu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xu, X., Zhu, C., Wang, Q. et al. Identifying vital nodes in complex networks by adjacency information entropy. Sci Rep 10, 2691 (2020). https://doi.org/10.1038/s41598-020-59616-w

Received: 30 September 2019
Accepted: 02 February 2020
Published: 14 February 2020
DOI: https://doi.org/10.1038/s41598-020-59616-w
