DOI:10.1145/2623330.2623701

Fast influence-based coarsening for large networks

Published: 24 August 2014

Abstract

Given a social network, can we quickly 'zoom out' of the graph? Is there a smaller equivalent representation of the graph that preserves its propagation characteristics? Can we group nodes together based on their influence properties? These are important problems with applications to influence analysis, epidemiology, and viral marketing.
In this paper, we first formulate a novel Graph Coarsening Problem: find a succinct representation of any graph while preserving key characteristics for diffusion processes on that graph. We then provide COARSENET, a fast and effective algorithm for this problem that runs in near-linear time in the number of nodes and edges. Using extensive experiments on multiple real datasets, we demonstrate the quality and scalability of COARSENET, which reduces the graph by 90% in some cases without much loss of information. Finally, we show how our method helps in diverse applications such as influence maximization and detecting patterns of propagation at the level of automatically created groups on real cascade data.
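
To make the idea of coarsening concrete, here is a minimal sketch of merging node pairs while tracking one diffusion-related quantity. It is not COARSENET: the abstract does not state which characteristic is preserved or how merged edges are reweighted. The sketch assumes an undirected weighted graph stored as a dict of dicts, uses the largest adjacency eigenvalue as a stand-in for "propagation characteristics", averages the weights of merged edges, and searches all edges by brute force (so it is far from near-linear time). The names `leading_eigenvalue`, `contract`, and `coarsen` are hypothetical.

```python
def leading_eigenvalue(adj, iters=500):
    """Estimate the largest eigenvalue of a symmetric, non-negative
    weighted graph given as {node: {neighbor: weight}}.

    Power iteration is run on A + I; the shift avoids oscillation on
    bipartite graphs, and 1 is subtracted at the end.
    """
    nodes = list(adj)
    if not nodes:
        return 0.0
    x = {v: 1.0 for v in nodes}
    lam = 1.0
    for _ in range(iters):
        y = {v: x[v] + sum(w * x[u] for u, w in adj[v].items()) for v in nodes}
        lam = max(y.values())            # max-norm; entries stay positive
        x = {v: val / lam for v, val in y.items()}
    return lam - 1.0


def contract(adj, a, b):
    """Return a new graph in which node b is merged into node a.

    ASSUMPTION: the weight from the merged node to a neighbor t is the
    average of the two original weights (a missing edge counts as 0).
    This rule is illustrative only and is not taken from the paper.
    """
    merged = {v: dict(nbrs) for v, nbrs in adj.items() if v != b}
    for nbrs in merged.values():
        nbrs.pop(b, None)                # drop every edge that pointed at b
    for t in set(adj[a]) | set(adj[b]):
        if t in (a, b):
            continue
        w = (adj[a].get(t, 0.0) + adj[b].get(t, 0.0)) / 2.0
        merged[a][t] = w
        merged[t][a] = w                 # keep the graph symmetric
    return merged


def coarsen(adj, target_nodes):
    """Greedily contract the edge whose merge changes the leading
    eigenvalue the least, until target_nodes nodes remain.

    Brute force: every candidate edge is re-evaluated in every round.
    Node labels must be mutually comparable (for example, integers).
    """
    g = {v: dict(nbrs) for v, nbrs in adj.items()}
    while len(g) > target_nodes:
        edges = {tuple(sorted((u, v))) for u in g for v in g[u]}
        if not edges:
            break
        base = leading_eigenvalue(g)
        best = min(
            edges,
            key=lambda e: (abs(leading_eigenvalue(contract(g, *e)) - base), e),
        )
        g = contract(g, *best)
    return g


# A triangle (0, 1, 2) with a two-node tail (3, 4) attached at node 2.
example = {
    0: {1: 1.0, 2: 1.0},
    1: {0: 1.0, 2: 1.0},
    2: {0: 1.0, 1: 1.0, 3: 1.0},
    3: {2: 1.0, 4: 1.0},
    4: {3: 1.0},
}
small = coarsen(example, 3)
print(len(example), "->", len(small), "nodes")
print("eigenvalue before:", leading_eigenvalue(example))
print("eigenvalue after: ", leading_eigenvalue(small))
```

Comparing the two printed eigenvalues gives a rough sense of how much the chosen proxy drifts as the graph shrinks; a different proxy or merge rule can be swapped in by changing `leading_eigenvalue` or `contract`.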

Supplementary Material

MP4 File (p1296-sidebyside.mp4)

    Published In

    KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2014
    2028 pages
    ISBN:9781450329569
    DOI:10.1145/2623330

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. coarsening
    2. diffusion
    3. graph mining
    4. propagation

    Qualifiers

    • Research-article

    Conference

    KDD '14

    Acceptance Rates

    KDD '14 Paper Acceptance Rate 151 of 1,036 submissions, 15%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%