Abstract
Identifying top-ranked nodes can be performed using different centrality measures, based on their characteristics and influential power. The most basic of all the ranking techniques is based on nodes degree. While finding the degree of a node requires local information, ranking the node based on its degree requires global information, namely the degrees of all the nodes of the network. It is infeasible to collect the global information for some graphs such as (i) the ones emerging from big data, (ii) dynamic networks, and (iii) distributed networks in which the whole graph is not known. In this work, we propose methods to estimate the degree rank of a node, that are faster than the classical method of computing the centrality value of all nodes and then rank a node. The proposed methods are modeled based on the network characteristics and sampling techniques, thus not requiring the entire network. We show that approximately \(1\%\) node samples are adequate to find the rank of a node with high accuracy.










Similar content being viewed by others
Notes
Here, discrete probability values are considered as continuous probability density function, as this introduces a very small error.
References
Backstrom L, Leskovec J (2011) Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the fourth ACM international conference on Web search and data mining, ACM, pp 635–644
Barabási AL, Albert R (1999) Emergence of scaling in random networks. Science 286(5439):509–512
Boldi P, Vigna S (2004) The WebGraph framework I: Compression techniques. In: Proc. of the Thirteenth International World Wide Web Conference (WWW 2004), ACM Press, Manhattan, USA, pp 595–601
Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. In: Seventh international world-wide web conference (www 1998), april 14-18, 1998, brisbane, australia. Brisbane, Australia
Cem E, Sarac K (2015) Estimating the size and average degree of online social networks at the extreme. In: Communications (ICC), 2015 IEEE International Conference on, IEEE, pp 1268–1273
Cem E, Sarac K (2016a) Average degree estimation under ego-centric sampling design. In: Computer Communications Workshops (INFOCOM WKSHPS), 2016 IEEE Conference on, IEEE, pp 152–157
Cem E, Sarac K (2016b) Estimation of structural properties of online social networks at the extreme. Comput Netw 108:323–344
Chen D, Lü L, Shang MS, Zhang YC, Zhou T (2012) Identifying influential nodes in complex networks. Physica Stat Mech Appl 391(4):1777–1787
Chen L, Karbasi A, Crawford FW (2016) Estimating the size of a large network and its communities from a random sample. In: Advances in Neural Information Processing Systems, pp 3072–3080
Cho E, Myers SA, Leskovec J (2011) Friendship and mobility: user movement in location-based social networks. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 1082–1090
Cooper C, Radzik T, Siantos Y (2012) A fast algorithm to find all high degree vertices in power law graphs. In: Proceedings of the 21st International Conference on World Wide Web, ACM, pp 1007–1016
Dasgupta A, Kumar R, Sarlos T (2014) On estimating the average degree. In: Proceedings of the 23rd international conference on World wide web, ACM, pp 795–806
Davis B, Gera R, Lazzaro G, Lim BY, Rye EC (2016) The marginal benefit of monitor placement on networks. In: Cherifi H, Gonçalves B, Menezes R, Sinatra R (eds) Complex networks VII, Springer, Cham, pp 93–104
De Choudhury M, Sundaram H, John A, Seligmann DD (2009) Social synchrony: Predicting mimicry of user actions in online social media. In: Computational Science and Engineering, 2009. CSE’09. International Conference on, IEEE, vol 4, pp 151–158
Eden T, Ron D, Seshadhri C (2016) Sublinear time estimation of degree distribution moments: The arboricity connection. arXiv preprint arXiv:160403661
Erdős P, Rényi A (1960) On the evolution of random graphs. Publ Math Inst Hungar Acad Sci 5:17–61
Even S (2011) Graph algorithms. Cambridge University Press, Cambridge
Fire M, Tenenboim L, Lesser O, Puzis R, Rokach L, Elovici Y (2011) Link prediction in social networks using computationally efficient topological features. In: Privacy, security, risk and trust (PASSAT) and IEEE third international confernece on social computing (SocialCom), IEEE, pp 73–80
Fortunato S, Boguñá M, Flammini A, Menczer F (2006) Approximating pagerank from in-degree. In: Aiello W, Broder A, Janssen J, Milios E (eds) International workshop on algorithms and models for the web-graph. Springer, Berlin, Heidelberg, pp 59–71
Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41
Ghoshal G, Barabási AL (2011) Ranking stability and super-stable nodes in complex networks. Nat Commun 2:394
Gjoka M, Kurant M, Butts CT, Markopoulou A (2010) Walking in Facebook: A case study of unbiased sampling of OSNs. In: INFOCOM, 2010 Proceedings IEEE, IEEE, pp 1–9
Goodman LA (1961) Snowball sampling. Ann Math Stat 32(1):148–170
Hansen MH, Hurwitz WN (1943) On the theory of sampling from finite populations. Ann Math Stat 14(4):333–362
Haralabopoulos G, Anagnostopoulos I (2014) Real time enhanced random sampling of online social networks. J Netw Comput Appl 41:126–134
Hardiman SJ, Katzir L (2013) Estimating clustering coefficients and size of social networks via random walk. In: Proceedings of the 22nd international conference on World Wide Web, International World Wide Web Conferences Steering Committee, pp 539–550
Hogg T, Lerman K (2012) Social dynamics of digg. EPJ Data Sci 1(1):1–26
Hou B, Yao Y, Liao D (2012) Identifying all-around nodes for spreading dynamics in complex networks. Phys A Stat Mech Appl 391(15):4012–4017
Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18(1):39–43
Konstas I, Stathopoulos V, Jose JM (2009) On social networks and collaborative recommendation. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, ACM, pp 195–202
Kurant M, Butts CT, Markopoulou A (2012) Graph size estimation. arXiv preprint arXiv:12100460
Leskovec J, Faloutsos C (2006) Sampling from large graphs. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 631–636
Leskovec J, Kleinberg J, Faloutsos C (2007) Graph evolution: densification and shrinking diameters. ACM Trans Knowl Discov Data (TKDD) 1(1):2
Lovász L (1993) Random walks on graphs: A survey. Comb Paul erdos is eighty 2(1):1–46
Lu J, Li D (2012) Sampling online social networks by random walk. In: Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research, ACM, pp 33–40
Lucchese R, Varagnolo D (2015) Networks cardinality estimation using order statistics. In: American Control Conference (ACC), 2015, IEEE, pp 3810–3817
Marchetti-Spaccamela A (1988) On the estimate of the size of a directed graph. In: International Workshop on Graph-Theoretic Concepts in Computer Science, Springer, pp 317–326
McAuley JJ, Leskovec J (2012) Learning to discover social circles in ego networks. NIPS 2012:548–56
Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations by fast computing machines. J Chem Phys 21(6):1087–1092
Moré JJ (1978) The levenberg-marquardt algorithm: implementation and theory. In: Watson GA (ed) Numerical analysis. Springer, Berlin, Heidelberg, pp 105–116
Musco C, Su HH, Lynch N (2016) Ant-inspired density estimation via random walks. In: Proceedings of the 2016 ACM Symposium on Principles of Distributed Computing, ACM, pp 469–478
Nazi A, Zhou Z, Thirumuruganathan S, Zhang N, Das G (2015) Walk, not wait: faster sampling over online social networks. Proc VLDB Endow 8(6):678–689
Ribeiro B, Towsley D (2010) Estimating and sampling graphs with multidimensional random walks. In: Proceedings of the 10th ACM SIGCOMM conference on Internet measurement, ACM, pp 390–403
Ribeiro B, Towsley D (2012) On the estimation accuracy of degree distributions from graph sampling. In: Decision and Control (CDC), 2012 IEEE 51st Annual Conference on, IEEE, pp 5240–5247
Ribeiro B, Wang P, Murai F, Towsley D (2012) Sampling directed graphs with random walks. In: INFOCOM, 2012 Proceedings IEEE, IEEE, pp 1692–1700
Rossi RA, Ahmed NK (2015) The network data repository with interactive graph analytics and visualization. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, http://networkrepository.com
Sabidussi G (1966) The centrality index of a graph. Psychometrika 31(4):581–603
Salganik MJ, Heckathorn DD (2004) Sampling and estimation in hidden populations using respondent-driven sampling. Sociol Methodol 34(1):193–240
Saxena A, Gera R, Iyengar S (2017) Observe locally rank globally. In: Proceedings of the 2017 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 139–144
Shaw ME (1954) Some effects of unequal distribution of information upon group performance in various communication nets. J Abnorm Soc Psychol 49(4):547–553
Stephenson K, Zelen M (1989) Rethinking centrality: methods and examples. Soc Net 11(1):1–37
Traud AL, Mucha PJ, Porter MA (2012) Social structure of Facebook networks. Phys A 391(16):4165–4180
Voudigari E, Salamanos N, Papageorgiou T, Yannakoudakis EJ (2016) Rank degree: An efficient algorithm for graph sampling. In: Advances in Social Networks Analysis and Mining (ASONAM), 2016 IEEE/ACM International Conference on, IEEE, pp 120–129
Yang J, Leskovec J (2015) Defining and evaluating network communities based on ground-truth. Knowl Inf Syst 42(1):181–213
Ye S, Wu SF (2011) Estimating the size of online social networks. Int J Soc Comput Cyber Phys Syst 1(2):160–179
Yu Y, Fan S (2015) Node importance measurement based on the degree and closeness centrality. J Inf Commput Sci 12(3):1281–1291
Zafarani R, Liu H (2009) Social computing data repository at ASU. http://socialcomputing.asu.edu. Accessed Jan 2017
Zhou Z, Zhang N, Gong Z, Das G (2016) Faster random walks by rewiring online social networks on-the-fly. ACM Trans Database Syst (TODS) 40(4):26
Acknowledgements
Gera would like to thank the DoD for sponsoring this work. Saxena and Iyengar would like to thank IIT Ropar HPC committee for providing the resources to perform the experiments.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix
Results on real-world scale-free networks
See Table 6.
Rights and permissions
About this article
Cite this article
Saxena, A., Gera, R. & Iyengar, S.R.S. Estimating degree rank in complex networks. Soc. Netw. Anal. Min. 8, 42 (2018). https://doi.org/10.1007/s13278-018-0520-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-018-0520-3