Nonparametric Bayesian inference of the microcanonical stochastic block model

Peixoto, Tiago P.

doi:10.1103/PhysRevE.95.012317

Physics > Data Analysis, Statistics and Probability

arXiv:1610.02703 (physics)

[Submitted on 9 Oct 2016 (v1), last revised 22 Aug 2018 (this version, v4)]

Title:Nonparametric Bayesian inference of the microcanonical stochastic block model

Authors:Tiago P. Peixoto

View PDF

Abstract:A principled approach to characterize the hidden structure of networks is to formulate generative models, and then infer their parameters from data. When the desired structure is composed of modules or "communities", a suitable choice for this task is the stochastic block model (SBM), where nodes are divided into groups, and the placement of edges is conditioned on the group memberships. Here, we present a nonparametric Bayesian method to infer the modular structure of empirical networks, including the number of modules and their hierarchical organization. We focus on a microcanonical variant of the SBM, where the structure is imposed via hard constraints, i.e. the generated networks are not allowed to violate the patterns imposed by the model. We show how this simple model variation allows simultaneously for two important improvements over more traditional inference approaches: 1. Deeper Bayesian hierarchies, with noninformative priors replaced by sequences of priors and hyperpriors, that not only remove limitations that seriously degrade the inference on large networks, but also reveal structures at multiple scales; 2. A very efficient inference algorithm that scales well not only for networks with a large number of nodes and edges, but also with an unlimited number of modules. We show also how this approach can be used to sample modular hierarchies from the posterior distribution, as well as to perform model selection. We discuss and analyze the differences between sampling from the posterior and simply finding the single parameter estimate that maximizes it. Furthermore, we expose a direct equivalence between our microcanonical approach and alternative derivations based on the canonical SBM.

Comments:	24 pages, 9 figures, 1 table. Code is freely available as part of graph-tool at this https URL . See also the HOWTO at this https URL . Minor typos fixed in most recent version
Subjects:	Data Analysis, Statistics and Probability (physics.data-an); Physics and Society (physics.soc-ph); Machine Learning (stat.ML)
Cite as:	arXiv:1610.02703 [physics.data-an]
	(or arXiv:1610.02703v4 [physics.data-an] for this version)
	https://doi.org/10.48550/arXiv.1610.02703
Journal reference:	Phys. Rev. E 95, 012317 (2017)
Related DOI:	https://doi.org/10.1103/PhysRevE.95.012317

Submission history

From: Tiago Peixoto [view email]
[v1] Sun, 9 Oct 2016 18:07:07 UTC (5,632 KB)
[v2] Mon, 23 Jan 2017 09:44:18 UTC (5,638 KB)
[v3] Fri, 19 Jan 2018 04:23:54 UTC (5,639 KB)
[v4] Wed, 22 Aug 2018 16:53:03 UTC (5,638 KB)

Physics > Data Analysis, Statistics and Probability

Title:Nonparametric Bayesian inference of the microcanonical stochastic block model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Physics > Data Analysis, Statistics and Probability

Title:Nonparametric Bayesian inference of the microcanonical stochastic block model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators