Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Scalable and exact sampling method for probabilistic generative graph models

Published: 01 November 2018 Publication History

Abstract

Interest in modeling complex networks has fueled the development of multiple probabilistic generative graph models (PGGMs). PGGMs are statistical methods that model the network distribution and match common characteristics of real world networks. Recently, scalable sampling algorithms for well known PGGMs, made the analysis of large-scale, sparse networks feasible for the first time. However, it has been demonstrated that these scalable sampling algorithms do not sample from the original underlying distribution, and sometimes produce very unlikely graphs. To address this, we extend the algorithm proposed in Moreno et al. (in: IEEE 14th international conference on data mining, pp 440---449, 2014) for a single model and develop a general solution for a broad class of PGGMs. Our approach exploits the fact that PGGMs are typically parameterized by a small set of unique probability values--this enables fast generation via independent sampling of groups of edges with the same probability value. By sampling within groups, we remove bias due to conditional sampling and probability reallocation. We show that our grouped sampling methods are both provably correct and efficient. Our new algorithm reduces time complexity by avoiding the expensive rejection sampling step previously necessary, and we demonstrate its generality, by outlining implementations for six different PGGMs. We conduct theoretical analysis and empirical evaluation to demonstrate the strengths of our algorithms. We conclude by sampling a network with over a billion edges in 95 s on a single processor.

Cited By

View all
  • (2022)A Survey of Sampling Method for Social Media Embeddedness RelationshipACM Computing Surveys10.1145/352410555:4(1-39)Online publication date: 30-Mar-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Data Mining and Knowledge Discovery
Data Mining and Knowledge Discovery  Volume 32, Issue 6
November 2018
336 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 November 2018

Author Tags

  1. Graph generation
  2. Network analysis
  3. Network models
  4. Scalable sampling
  5. Social networks

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)A Survey of Sampling Method for Social Media Embeddedness RelationshipACM Computing Surveys10.1145/352410555:4(1-39)Online publication date: 30-Mar-2022

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media