
Graph evolution: Densification and shrinking diameters

Published: 01 March 2007

Abstract

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.
Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time, with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time, in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n))).
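Superlinear growth of edges in nodes can be checked on a sequence of graph snapshots. The sketch below assumes the growth follows a power law, e ∝ n^a, and estimates the exponent a as the least-squares slope on a log-log scale; a value above 1 would indicate densification. The function name `densification_exponent` and the snapshot numbers are hypothetical and synthetic, not data from the article.

```python
import math

def densification_exponent(nodes, edges):
    """Least-squares slope of log(edges) against log(nodes).

    Assumes a power-law relation e = c * n^a between snapshots;
    the returned slope is the estimate of a.
    """
    xs = [math.log(n) for n in nodes]
    ys = [math.log(e) for e in edges]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Synthetic snapshots (illustrative only): edges grow as 2 * n^1.5.
nodes = [100, 400, 1600, 6400]
edges = [2 * n ** 1.5 for n in nodes]
a = densification_exponent(nodes, edges)
print(f"estimated exponent: {a:.3f}")
```

A graph whose average degree stays constant would give a slope close to 1 under the same fit, so the slope separates linear growth from densification.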
Existing graph generation models do not exhibit these types of behavior, even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process, that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.
We also notice that the forest fire model exhibits a sharp transition between sparse graphs and graphs that are densifying. Graphs with decreasing distance between the nodes are generated around this transition point.
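The abstract does not give the generator's details, so the following is only a simplified sketch of a forest-fire-style growth process, not the article's exact model. It assumes each new node picks a random existing "ambassador" node, links to it, and then recursively "burns" along out-links, linking to every node it reaches. The single parameter `p` (a forward burning probability) stands in for flammability; the names `forest_fire_graph` and `edge_count` are hypothetical.

```python
import random

def forest_fire_graph(n, p, seed=0):
    """Grow a directed graph with n nodes (simplified forest-fire sketch).

    p is the assumed forward burning probability, 0 <= p < 1.
    Returns a dict mapping each node to the set of nodes it links to.
    """
    if not 0 <= p < 1:
        raise ValueError("p must satisfy 0 <= p < 1")
    rng = random.Random(seed)
    out_links = {0: set()}
    for v in range(1, n):
        ambassador = rng.randrange(v)      # uniform choice among existing nodes
        visited = {ambassador}
        frontier = [ambassador]
        while frontier:
            w = frontier.pop()
            # Number of links to burn from w: count successes before the
            # first failure, which has mean p / (1 - p).
            burn = 0
            while rng.random() < p:
                burn += 1
            candidates = sorted(u for u in out_links[w] if u not in visited)
            rng.shuffle(candidates)
            for u in candidates[:burn]:
                visited.add(u)
                frontier.append(u)
        out_links[v] = visited             # v links to every burned node
    return out_links

def edge_count(graph):
    """Total number of directed edges."""
    return sum(len(nbrs) for nbrs in graph.values())

sparse = forest_fire_graph(500, 0.1)
dense = forest_fire_graph(500, 0.5)
print(edge_count(sparse), edge_count(dense))
```

Running the generator over a range of `p` values and comparing edge counts per node is one way to look for the transition between sparse and densifying graphs that the abstract mentions; this sketch omits backward burning along in-links, so its behavior may differ from the article's model.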
Last, we analyze the connection between the temporal evolution of the degree distribution and densification of a graph. We find that the two are fundamentally related. We also observe that real networks exhibit this type of relation between densification and the degree distribution.

    Information

    Published In

    ACM Transactions on Knowledge Discovery from Data  Volume 1, Issue 1
    March 2007
    161 pages
    ISSN:1556-4681
    EISSN:1556-472X
    DOI:10.1145/1217299

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Densification power laws
    2. graph generators
    3. graph mining
    4. heavy-tailed distributions
    5. small-world phenomena

    Qualifiers

    • Article
