-
ALAAMEE: Open-source software for fitting autologistic actor attribute models
Authors:
Alex Stivala,
Peng Wang,
Alessandro Lomi
Abstract:
The autologistic actor attribute model (ALAAM) is a model for social influence, derived from the more widely known exponential-family random graph model (ERGM). ALAAMs can be used to estimate parameters corresponding to multiple forms of social contagion associated with network structure and actor covariates. This work introduces ALAAMEE, open-source Python software for estimation, simulation, and…
▽ More
The autologistic actor attribute model (ALAAM) is a model for social influence, derived from the more widely known exponential-family random graph model (ERGM). ALAAMs can be used to estimate parameters corresponding to multiple forms of social contagion associated with network structure and actor covariates. This work introduces ALAAMEE, open-source Python software for estimation, simulation, and goodness-of-fit testing for ALAAM models. ALAAMEE implements both the stochastic approximation and equilibrium expectation (EE) algorithms for ALAAM parameter estimation, including estimation from snowball sampled network data. It implements data structures and statistics for undirected, directed, and bipartite networks. We use a simulation study to assess the accuracy of the EE algorithm for ALAAM parameter estimation and statistical inference, and demonstrate the use of ALAAMEE with empirical examples using both small (fewer than 100 nodes) and large (more than 10 000 nodes) networks.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Relational hyperevent models for the coevolution of coauthoring and citation networks
Authors:
Jürgen Lerner,
Marian-Gabriel Hâncean,
Alessandro Lomi
Abstract:
The development of suitable statistical models for the analysis of bibliographic networks has trailed behind the empirical ambitions expressed by recent studies of science of science. Extant research typically restricts the analytical focus to either paper citation networks, or author collaboration networks. These networks involve not only direct relationships between papers or authors, but also a…
▽ More
The development of suitable statistical models for the analysis of bibliographic networks has trailed behind the empirical ambitions expressed by recent studies of science of science. Extant research typically restricts the analytical focus to either paper citation networks, or author collaboration networks. These networks involve not only direct relationships between papers or authors, but also a broader system of dependencies between the references of papers connected through multiple simultaneous citation links. In this work, we extend recently developed relational hyperevent models (RHEM) to analyze scientific networks - systems of scientific publications connected by citations and authorship. We introduce new covariates that represent theoretically relevant and empirically meaningful sub-network configurations. The new model specification supports testing of hypotheses that align with the polyadic nature of scientific publication events and the multiple interdependencies between authors and references of current and prior papers. We implement the model using open-source software to analyze a large, publicly available scientific network dataset. A significant finding of the study is the tendency for subsets of papers to be repeatedly cited together across publications. This result is crucial as it suggests that the papers' impact may be partly due to endogenous network processes. More broadly, the study shows that models accounting for both the hyperedge structure of publication events and the interconnections between authors and references significantly enhance our understanding of the network mechanisms that drive scientific production, productivity, and impact.
△ Less
Submitted 5 June, 2024; v1 submitted 3 August, 2023;
originally announced August 2023.
-
Relational hyperevent models for polyadic interaction networks
Authors:
Jürgen Lerner,
Alessandro Lomi
Abstract:
Polyadic, or "multicast" social interaction networks arise when one sender addresses multiple receivers simultaneously. Currently available relational event models (REM) are not well suited to the analysis of polyadic interaction networks because they specify event rates for sets of receivers as functions of dyadic covariates associated with the sender and one receiver at a time. Relational hypere…
▽ More
Polyadic, or "multicast" social interaction networks arise when one sender addresses multiple receivers simultaneously. Currently available relational event models (REM) are not well suited to the analysis of polyadic interaction networks because they specify event rates for sets of receivers as functions of dyadic covariates associated with the sender and one receiver at a time. Relational hyperevent models (RHEM) address this problem by specifying event rates as functions of hyperedge covariates associated with the sender and the entire set of receivers. For instance, hyperedge covariates can express the tendency of senders to repeatedly address the same pairs (or larger sets) of receivers - a simple and frequent pattern in polyadic interaction data which, however, cannot be expressed with dyadic covariates. In this article we demonstrate the potential benefits of RHEMs for the analysis of polyadic social interaction. We define and discuss practically relevant effects that are not available for REMs but may be incorporated in empirical specifications of RHEM. We illustrate the empirical value of RHEM, and compare them with related REM, in a reanalysis of the canonical Enron email data.
△ Less
Submitted 2 November, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Reliability of relational event model estimates under sampling: how to fit a relational event model to 360 million dyadic events
Authors:
Jürgen Lerner,
Alessandro Lomi
Abstract:
We assess the reliability of relational event model parameters estimated under two sampling schemes: (1) uniform sampling from the observed events and (2) case-control sampling which samples non-events, or null dyads ("controls"), from a suitably defined risk set. We experimentally determine the variability of estimated parameters as a function of the number of sampled events and controls per even…
▽ More
We assess the reliability of relational event model parameters estimated under two sampling schemes: (1) uniform sampling from the observed events and (2) case-control sampling which samples non-events, or null dyads ("controls"), from a suitably defined risk set. We experimentally determine the variability of estimated parameters as a function of the number of sampled events and controls per event, respectively. Results suggest that relational event models can be reliably fitted to networks with more than 12 million nodes connected by more than 360 million dyadic events by analyzing a sample of some tens of thousands of events and a small number of controls per event. Using data that we collected on the Wikipedia editing network, we illustrate how network effects commonly included in empirical studies based on relational event models need widely different sample sizes to be estimated reliably. For our analysis we use an open-source software which implements the two sampling schemes, allowing analysts to fit and analyze relational event models to the same or other data that may be collected in different empirical settings, varying sample parameters or model specification.
△ Less
Submitted 13 November, 2019; v1 submitted 2 May, 2019;
originally announced May 2019.