-
Optimal Matrix-Mimetic Tensor Algebras via Variable Projection
Authors:
Elizabeth Newman,
Katherine Keegan
Abstract:
Recent advances in {matrix-mimetic} tensor frameworks have made it possible to preserve linear algebraic properties for multilinear data analysis and, as a result, to obtain optimal representations of multiway data. Matrix mimeticity arises from interpreting tensors as operators that can be multiplied, factorized, and analyzed analogous to matrices. Underlying the tensor operation is an algebraic…
▽ More
Recent advances in {matrix-mimetic} tensor frameworks have made it possible to preserve linear algebraic properties for multilinear data analysis and, as a result, to obtain optimal representations of multiway data. Matrix mimeticity arises from interpreting tensors as operators that can be multiplied, factorized, and analyzed analogous to matrices. Underlying the tensor operation is an algebraic framework parameterized by an invertible linear transformation. The choice of linear mapping is crucial to representation quality and, in practice, is made heuristically based on expected correlations in the data. However, in many cases, these correlations are unknown and common heuristics lead to suboptimal performance. In this work, we simultaneously learn optimal linear mappings and corresponding tensor representations without relying on prior knowledge of the data. Our new framework explicitly captures the coupling between the transformation and representation using variable projection. We preserve the invertibility of the linear mapping by learning orthogonal transformations with Riemannian optimization. We provide original theory of uniqueness of the transformation and convergence analysis of our variable-projection-based algorithm. We demonstrate the generality of our framework through numerical experiments on a wide range of applications, including financial index tracking, image compression, and reduced order modeling. We have published all the code related to this work at https://github.com/elizabethnewman/star-M-opt.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Mutual information and the encoding of contingency tables
Authors:
Maximilian Jerdee,
Alec Kirkley,
M. E. J. Newman
Abstract:
Mutual information is commonly used as a measure of similarity between competing labelings of a given set of objects, for example to quantify performance in classification and community detection tasks. As argued recently, however, the mutual information as conventionally defined can return biased results because it neglects the information cost of the so-called contingency table, a crucial compon…
▽ More
Mutual information is commonly used as a measure of similarity between competing labelings of a given set of objects, for example to quantify performance in classification and community detection tasks. As argued recently, however, the mutual information as conventionally defined can return biased results because it neglects the information cost of the so-called contingency table, a crucial component of the similarity calculation. In principle the bias can be rectified by subtracting the appropriate information cost, leading to the modified measure known as the reduced mutual information, but in practice one can only ever compute an upper bound on this information cost, and the value of the reduced mutual information depends crucially on how good a bound is established. In this paper we describe an improved method for encoding contingency tables that gives a substantially better bound in typical use cases, and approaches the ideal value in the common case where the labelings are closely similar, as we demonstrate with extensive numerical results.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Variability in Titan's Mesospheric HCN and Temperature Structure as Observed by ALMA
Authors:
A. E. Thelen,
C. A. Nixon,
R. Cosentino,
M. A. Cordiner,
N. A. Teanby,
C. E. Newman,
P. G. J. Irwin,
S. B. Charnley
Abstract:
The temperature structure of Titan's upper atmosphere exhibits large variability resulting from numerous spatially and temporally irregular external energy sources, seasonal changes, and the influence of molecular species produced via photochemistry. In particular, Titan's relatively abundant HCN is thought to provide substantial cooling to the upper atmosphere through rotational emission, balanci…
▽ More
The temperature structure of Titan's upper atmosphere exhibits large variability resulting from numerous spatially and temporally irregular external energy sources, seasonal changes, and the influence of molecular species produced via photochemistry. In particular, Titan's relatively abundant HCN is thought to provide substantial cooling to the upper atmosphere through rotational emission, balancing UV/EUV heating and thermal conduction. Here, we present the analysis of ALMA observations of Titan from 2012, 2014, 2015, and 2017, corresponding to planetocentric solar longitudes of ~34-89$^{\circ}$, including vertical HCN and temperature profiles retrieved from the lower mesosphere through the thermosphere (~350-1200 km; $3\times10^{-2}$-$2\times10^{-8}$ mbar). Throughout the atmosphere, temperature profiles differ by 10 to 30 K between observations approximately one Earth year apart, particularly from 600-900 km. We find evidence for a large imbalance in Titan's upper atmospheric energy budget between 2014 and 2015, where the mesospheric thermal structure changes significantly and marks the transition between a mesopause located at ~600 km ($2\times10^{-4}$ mbar) and at ~800 km ($3\times10^{-6}$ mbar). The retrieved HCN abundances vary dramatically during the 2012 to 2017 time period as well, showing close to 2 orders of magnitude difference in abundance at 1000 km. However, the change in HCN abundance does not appear to fully account for the variation in mesospheric temperatures over the $L_S\sim$34-89$^{\circ}$ period. These measurements provide additional insight into the variability of Titan's mesospheric composition and thermal structure following its 2009 vernal equinox, and motivate continued investigation of the origins of such rapid changes in Titan's atmosphere throughout its seasonal cycle.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Luck, skill, and depth of competition in games and social hierarchies
Authors:
Maximilian Jerdee,
M. E. J. Newman
Abstract:
Patterns of wins and losses in pairwise contests, such as occur in sports and games, consumer research and paired comparison studies, and human and animal social hierarchies, are commonly analyzed using probabilistic models that allow one to quantify the strength of competitors or predict the outcome of future contests. Here we generalize this approach to incorporate two additional features: an el…
▽ More
Patterns of wins and losses in pairwise contests, such as occur in sports and games, consumer research and paired comparison studies, and human and animal social hierarchies, are commonly analyzed using probabilistic models that allow one to quantify the strength of competitors or predict the outcome of future contests. Here we generalize this approach to incorporate two additional features: an element of randomness or luck that leads to upset wins, and a "depth of competition" variable that measures the complexity of a game or hierarchy. Fitting the resulting model to a large collection of data sets we estimate depth and luck in a range of games, sports, and social situations. In general, we find that social competition tends to be "deep," meaning it has a pronounced hierarchy with many distinct levels, but also that there is often a nonzero chance of an upset victory, meaning that dominance challenges can be won even by significant underdogs. Competition in sports and games, by contrast, tends to be shallow and in most cases there is little evidence of upset wins, beyond those already implied by the shallowness of the hierarchy.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Image reconstructions using sparse dictionary representations and implicit, non-negative mappings
Authors:
Elizabeth Newman,
Jack Michael Solomon,
Matthias Chung
Abstract:
Many imaging science tasks can be modeled as a discrete linear inverse problem. Solving linear inverse problems is often challenging, with ill-conditioned operators and potentially non-unique solutions. Embedding prior knowledge, such as smoothness, into the solution can overcome these challenges. In this work, we encode prior knowledge using a non-negative patch dictionary, which effectively lear…
▽ More
Many imaging science tasks can be modeled as a discrete linear inverse problem. Solving linear inverse problems is often challenging, with ill-conditioned operators and potentially non-unique solutions. Embedding prior knowledge, such as smoothness, into the solution can overcome these challenges. In this work, we encode prior knowledge using a non-negative patch dictionary, which effectively learns a basis from a training set of natural images. In this dictionary basis, we desire solutions that are non-negative and sparse (i.e., contain many zero entries). With these constraints, standard methods for solving discrete linear inverse problems are not directly applicable. One such approach is the modified residual norm steepest descent (MRNSD), which produces non-negative solutions but does not induce sparsity. In this paper, we provide two methods based on MRNSD that promote sparsity. In our first method, we add an $\ell_1$-regularization term with a new, optimal step size. In our second method, we propose a new non-negative, sparsity-promoting mapping of the solution. We compare the performance of our proposed methods on a number of numerical experiments, including deblurring, image completion, computer tomography, and superresolution. Our results show that these methods effectively solve discrete linear inverse problems with non-negativity and sparsity constraints.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Normalized mutual information is a biased measure for classification and community detection
Authors:
Maximilian Jerdee,
Alec Kirkley,
M. E. J. Newman
Abstract:
Normalized mutual information is widely used as a similarity measure for evaluating the performance of clustering and classification algorithms. In this paper, we show that results returned by the normalized mutual information are biased for two reasons: first, because they ignore the information content of the contingency table and, second, because their symmetric normalization introduces spuriou…
▽ More
Normalized mutual information is widely used as a similarity measure for evaluating the performance of clustering and classification algorithms. In this paper, we show that results returned by the normalized mutual information are biased for two reasons: first, because they ignore the information content of the contingency table and, second, because their symmetric normalization introduces spurious dependence on algorithm output. We introduce a modified version of the mutual information that remedies both of these shortcomings. As a practical demonstration of the importance of using an unbiased measure, we perform extensive numerical tests on a basket of popular algorithms for network community detection and show that one's conclusions about which algorithm is best are significantly affected by the biases in the traditional mutual information.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Hierarchical core-periphery structure in networks
Authors:
Austin Polanco,
M. E. J. Newman
Abstract:
We study core-periphery structure in networks using inference methods based on a flexible network model that allows for traditional onion-like cores within cores, but also for hierarchical tree-like structures and more general non-nested types of structure. We propose an efficient Monte Carlo scheme for fitting the model to observed networks and report results for a selection of real-world data se…
▽ More
We study core-periphery structure in networks using inference methods based on a flexible network model that allows for traditional onion-like cores within cores, but also for hierarchical tree-like structures and more general non-nested types of structure. We propose an efficient Monte Carlo scheme for fitting the model to observed networks and report results for a selection of real-world data sets. Among other things, we observe an empirical distinction between networks showing traditional core-periphery structure with a dense core weakly connected to a sparse periphery, and an alternative structure in which the core is strongly connected both within itself and to the periphery. Networks vary in whether they are better represented by one type of structure or the other. We also observe structures that are a hybrid between core-periphery structure and community structure, in which networks have a set of non-overlapping cores that correspond roughly to communities, surrounded by a single undifferentiated periphery. Computer code implementing our methods is available.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Message passing methods on complex networks
Authors:
M. E. J. Newman
Abstract:
Networks and network computations have become a primary mathematical tool for analyzing the structure of many kinds of complex systems, ranging from the Internet and transportation networks to biochemical interactions and social networks. A common task in network analysis is the calculation of quantities that reside on the nodes of a network, such as centrality measures, probabilities, or model st…
▽ More
Networks and network computations have become a primary mathematical tool for analyzing the structure of many kinds of complex systems, ranging from the Internet and transportation networks to biochemical interactions and social networks. A common task in network analysis is the calculation of quantities that reside on the nodes of a network, such as centrality measures, probabilities, or model states. In this review article we discuss message passing methods, a family of techniques for performing such calculations, based on the propagation of information between the nodes of a network. We introduce the message passing approach with a series of examples, give some illustrative applications and results, and discuss the deep connections between message passing and phase transitions in networks. We also point out some limitations of the message passing approach and describe some recently-introduced methods that address these limitations.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Improved estimates for the number of non-negative integer matrices with given row and column sums
Authors:
Maximilian Jerdee,
Alec Kirkley,
M. E. J. Newman
Abstract:
The number of non-negative integer matrices with given row and column sums appears in a variety of problems in mathematics and statistics but no closed-form expression for it is known, so we rely on approximations of various kinds. Here we describe a new such approximation, motivated by consideration of the statistics of matrices with non-integer numbers of columns. This estimate can be evaluated…
▽ More
The number of non-negative integer matrices with given row and column sums appears in a variety of problems in mathematics and statistics but no closed-form expression for it is known, so we rely on approximations of various kinds. Here we describe a new such approximation, motivated by consideration of the statistics of matrices with non-integer numbers of columns. This estimate can be evaluated in time linear in the size of the matrix and returns results of accuracy as good as or better than existing linear-time approximations across a wide range of settings. We also use this new estimate as the starting point for an improved numerical method for either counting or sampling matrices using sequential importance sampling. Code implementing our methods is provided.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
20 years of network community detection
Authors:
Santo Fortunato,
M. E. J. Newman
Abstract:
A fundamental technical challenge in the analysis of network data is the automated discovery of communities - groups of nodes that are strongly connected or that share similar features or roles. In this commentary we review progress in the field over the last 20 years.
A fundamental technical challenge in the analysis of network data is the automated discovery of communities - groups of nodes that are strongly connected or that share similar features or roles. In this commentary we review progress in the field over the last 20 years.
△ Less
Submitted 2 August, 2022; v1 submitted 29 July, 2022;
originally announced August 2022.
-
Efficient computation of rankings from pairwise comparisons
Authors:
M. E. J. Newman
Abstract:
We study the ranking of individuals, teams, or objects, based on pairwise comparisons between them, using the Bradley-Terry model. Estimates of rankings within this model are commonly made using a simple iterative algorithm first introduced by Zermelo almost a century ago. Here we describe an alternative and similarly simple iteration that provably returns identical results but does so much faster…
▽ More
We study the ranking of individuals, teams, or objects, based on pairwise comparisons between them, using the Bradley-Terry model. Estimates of rankings within this model are commonly made using a simple iterative algorithm first introduced by Zermelo almost a century ago. Here we describe an alternative and similarly simple iteration that provably returns identical results but does so much faster -- over a hundred times faster in some cases. We demonstrate this algorithm with applications to a range of example data sets and derive a number of results regarding its convergence.
△ Less
Submitted 7 June, 2023; v1 submitted 30 June, 2022;
originally announced July 2022.
-
Ranking with multiple types of pairwise comparisons
Authors:
M. E. J. Newman
Abstract:
The task of ranking individuals or teams, based on a set of comparisons between pairs, arises in various contexts, including sporting competitions and the analysis of dominance hierarchies among animals and humans. Given data on which competitors beat which others, the challenge is to rank the competitors from best to worst. Here we study the problem of computing rankings when there are multiple,…
▽ More
The task of ranking individuals or teams, based on a set of comparisons between pairs, arises in various contexts, including sporting competitions and the analysis of dominance hierarchies among animals and humans. Given data on which competitors beat which others, the challenge is to rank the competitors from best to worst. Here we study the problem of computing rankings when there are multiple, potentially conflicting modes of comparison, such as multiple types of dominance behaviors among animals. We assume that we do not know a priori what information each behavior conveys about the ranking, or even whether they convey any information at all. Nonetheless we show that it is possible to compute a ranking in this situation and present a fast method for doing so, based on a combination of an expectation-maximization algorithm and a modified Bradley-Terry model. We give a selection of example applications to both animal and human competition.
△ Less
Submitted 19 October, 2022; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Cutting Through the Noise to Infer Autonomous System Topology
Authors:
Kirtus G. Leyba,
Joshua J. Daymude,
Jean-Gabriel Young,
M. E. J. Newman,
Jennifer Rexford,
Stephanie Forrest
Abstract:
The Border Gateway Protocol (BGP) is a distributed protocol that manages interdomain routing without requiring a centralized record of which autonomous systems (ASes) connect to which others. Many methods have been devised to infer the AS topology from publicly available BGP data, but none provide a general way to handle the fact that the data are notoriously incomplete and subject to error. This…
▽ More
The Border Gateway Protocol (BGP) is a distributed protocol that manages interdomain routing without requiring a centralized record of which autonomous systems (ASes) connect to which others. Many methods have been devised to infer the AS topology from publicly available BGP data, but none provide a general way to handle the fact that the data are notoriously incomplete and subject to error. This paper describes a method for reliably inferring AS-level connectivity in the presence of measurement error using Bayesian statistical inference acting on BGP routing tables from multiple vantage points. We employ a novel approach for counting AS adjacency observations in the AS-PATH attribute data from public route collectors, along with a Bayesian algorithm to generate a statistical estimate of the AS-level network. Our approach also gives us a way to evaluate the accuracy of existing reconstruction methods and to identify advantageous locations for new route collectors or vantage points.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
Science goals and new mission concepts for future exploration of Titan's atmosphere geology and habitability: Titan POlar Scout/orbitEr and In situ lake lander and DrONe explorer (POSEIDON)
Authors:
Sébastien Rodriguez,
Sandrine Vinatier,
Daniel Cordier,
Gabriel Tobie,
Richard K. Achterberg,
Carrie M. Anderson,
Sarah V. Badman,
Jason W. Barnes,
Erika L. Barth,
Bruno Bézard,
Nathalie Carrasco,
Benjamin Charnay,
Roger N. Clark,
Patrice Coll,
Thomas Cornet,
Athena Coustenis,
Isabelle Couturier-Tamburelli,
Michel Dobrijevic,
F. Michael Flasar,
Remco de Kok,
Caroline Freissinet,
Marina Galand,
Thomas Gautier,
Wolf D. Geppert,
Caitlin A. Griffith
, et al. (39 additional authors not shown)
Abstract:
In response to ESA Voyage 2050 announcement of opportunity, we propose an ambitious L-class mission to explore one of the most exciting bodies in the Solar System, Saturn largest moon Titan. Titan, a "world with two oceans", is an organic-rich body with interior-surface-atmosphere interactions that are comparable in complexity to the Earth. Titan is also one of the few places in the Solar System w…
▽ More
In response to ESA Voyage 2050 announcement of opportunity, we propose an ambitious L-class mission to explore one of the most exciting bodies in the Solar System, Saturn largest moon Titan. Titan, a "world with two oceans", is an organic-rich body with interior-surface-atmosphere interactions that are comparable in complexity to the Earth. Titan is also one of the few places in the Solar System with habitability potential. Titan remarkable nature was only partly revealed by the Cassini-Huygens mission and still holds mysteries requiring a complete exploration using a variety of vehicles and instruments. The proposed mission concept POSEIDON (Titan POlar Scout/orbitEr and In situ lake lander DrONe explorer) would perform joint orbital and in situ investigations of Titan. It is designed to build on and exceed the scope and scientific/technological accomplishments of Cassini-Huygens, exploring Titan in ways that were not previously possible, in particular through full close-up and in situ coverage over long periods of time. In the proposed mission architecture, POSEIDON consists of two major elements: a spacecraft with a large set of instruments that would orbit Titan, preferably in a low-eccentricity polar orbit, and a suite of in situ investigation components, i.e. a lake lander, a "heavy" drone (possibly amphibious) and/or a fleet of mini-drones, dedicated to the exploration of the polar regions. The ideal arrival time at Titan would be slightly before the next northern Spring equinox (2039), as equinoxes are the most active periods to monitor still largely unknown atmospheric and surface seasonal changes. The exploration of Titan northern latitudes with an orbiter and in situ element(s) would be highly complementary with the upcoming NASA New Frontiers Dragonfly mission that will provide in situ exploration of Titan equatorial regions in the mid-2030s.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
slimTrain -- A Stochastic Approximation Method for Training Separable Deep Neural Networks
Authors:
Elizabeth Newman,
Julianne Chung,
Matthias Chung,
Lars Ruthotto
Abstract:
Deep neural networks (DNNs) have shown their success as high-dimensional function approximators in many applications; however, training DNNs can be challenging in general. DNN training is commonly phrased as a stochastic optimization problem whose challenges include non-convexity, non-smoothness, insufficient regularization, and complicated data distributions. Hence, the performance of DNNs on a g…
▽ More
Deep neural networks (DNNs) have shown their success as high-dimensional function approximators in many applications; however, training DNNs can be challenging in general. DNN training is commonly phrased as a stochastic optimization problem whose challenges include non-convexity, non-smoothness, insufficient regularization, and complicated data distributions. Hence, the performance of DNNs on a given task depends crucially on tuning hyperparameters, especially learning rates and regularization parameters. In the absence of theoretical guidelines or prior experience on similar tasks, this requires solving many training problems, which can be time-consuming and demanding on computational resources. This can limit the applicability of DNNs to problems with non-standard, complex, and scarce datasets, e.g., those arising in many scientific applications. To remedy the challenges of DNN training, we propose slimTrain, a stochastic optimization method for training DNNs with reduced sensitivity to the choice hyperparameters and fast initial convergence. The central idea of slimTrain is to exploit the separability inherent in many DNN architectures; that is, we separate the DNN into a nonlinear feature extractor followed by a linear model. This separability allows us to leverage recent advances made for solving large-scale, linear, ill-posed inverse problems. Crucially, for the linear weights, slimTrain does not require a learning rate and automatically adapts the regularization parameter. Since our method operates on mini-batches, its computational overhead per iteration is modest. In our numerical experiments, slimTrain outperforms existing DNN training methods with the recommended hyperparameter settings and reduces the sensitivity of DNN training to the remaining hyperparameters.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
Clustering of heterogeneous populations of networks
Authors:
Jean-Gabriel Young,
Alec Kirkley,
M. E. J. Newman
Abstract:
Statistical methods for reconstructing networks from repeated measurements typically assume that all measurements are generated from the same underlying network structure. This need not be the case, however. People's social networks might be different on weekdays and weekends, for instance. Brain networks may differ between healthy patients and those with dementia or other conditions. Here we desc…
▽ More
Statistical methods for reconstructing networks from repeated measurements typically assume that all measurements are generated from the same underlying network structure. This need not be the case, however. People's social networks might be different on weekdays and weekends, for instance. Brain networks may differ between healthy patients and those with dementia or other conditions. Here we describe a Bayesian analysis framework for such data that allows for the fact that network measurements may be reflective of multiple possible structures. We define a finite mixture model of the measurement process and derive a fast Gibbs sampling procedure that samples exactly from the full posterior distribution of model parameters. The end result is a clustering of the measured networks into groups with similar structure. We demonstrate the method on both real and synthetic network populations.
△ Less
Submitted 23 January, 2022; v1 submitted 15 July, 2021;
originally announced July 2021.
-
Detection of Dynamical Instability in Titan's Thermospheric Jet
Authors:
M. A. Cordiner,
E. Garcia-Berrios,
R. G. Cosentino,
N. A. Teanby,
C. E. Newman,
C. A. Nixon,
A. E. Thelen,
S. B. Charnley
Abstract:
Similar to Earth, Saturn's largest moon, Titan, possesses a system of high-altitude zonal winds (or jets) that encircle the globe. Using the Atacama Large Millimeter/submillimeter Array (ALMA) in August 2016, Lellouch et al. (2019) discovered an equatorial jet at much higher altitudes than previously known, with a surprisingly fast speed of up to ~340 m/s, but the origin of such high velocities is…
▽ More
Similar to Earth, Saturn's largest moon, Titan, possesses a system of high-altitude zonal winds (or jets) that encircle the globe. Using the Atacama Large Millimeter/submillimeter Array (ALMA) in August 2016, Lellouch et al. (2019) discovered an equatorial jet at much higher altitudes than previously known, with a surprisingly fast speed of up to ~340 m/s, but the origin of such high velocities is not yet understood. We obtained spectrally and spatially resolved ALMA observations in May 2017 to map Titan's 3D global wind field and compare our results with a reanalysis of the August 2016 data. Doppler wind velocity maps were derived in the altitude range ~300-1000 km (from the upper stratosphere to the thermosphere). At the highest, thermospheric altitudes, a 47% reduction in the equatorial zonal wind speed was measured over the 9-month period (corresponding to L_s = 82-90 degrees on Titan). This is interpreted as due to a dramatic slowing and loss of confinement (broadening) of the recently-discovered thermospheric equatorial jet, as a result of dynamical instability. These unexpectedly-rapid changes in the upper-atmospheric dynamics are consistent with strong variability of the jet's primary driving mechanism.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Representative community divisions of networks
Authors:
Alec Kirkley,
M. E. J. Newman
Abstract:
Methods for detecting community structure in networks typically aim to identify a single best partition of network nodes into communities, often by optimizing some objective function, but in real-world applications there may be many competitive partitions with objective scores close to the global optimum and one can obtain a more informative picture of the community structure by examining a repres…
▽ More
Methods for detecting community structure in networks typically aim to identify a single best partition of network nodes into communities, often by optimizing some objective function, but in real-world applications there may be many competitive partitions with objective scores close to the global optimum and one can obtain a more informative picture of the community structure by examining a representative set of such high-scoring partitions than by looking at just the single optimum. However, such a set can be difficult to interpret since its size can easily run to hundreds or thousands of partitions. In this paper we present a method for analyzing large partition sets by dividing them into groups of similar partitions and then identifying an archetypal partition as a representative of each group. The resulting set of archetypal partitions provides a succinct, interpretable summary of the form and variety of community structure in any network. We demonstrate the method on a range of example networks.
△ Less
Submitted 17 February, 2022; v1 submitted 10 May, 2021;
originally announced May 2021.
-
The friendship paradox in real and model networks
Authors:
George T. Cantwell,
Alec Kirkley,
M. E. J. Newman
Abstract:
The friendship paradox is the observation that the degrees of the neighbors of a node in any network will, on average, be greater than the degree of the node itself. In common parlance, your friends have more friends than you do. In this paper we develop the mathematical theory of the friendship paradox, both in general as well as for specific model networks, focusing not only on average behavior…
▽ More
The friendship paradox is the observation that the degrees of the neighbors of a node in any network will, on average, be greater than the degree of the node itself. In common parlance, your friends have more friends than you do. In this paper we develop the mathematical theory of the friendship paradox, both in general as well as for specific model networks, focusing not only on average behavior but also on variation about the average and using generating function methods to calculate full distributions of quantities of interest. We compare the predictions of our theory with measurements on a large number of real-world network data sets and find remarkably good agreement. We also develop equivalent theory for the generalized friendship paradox, which compares characteristics of nodes other than degree to those of their neighbors.
△ Less
Submitted 7 December, 2020;
originally announced December 2020.
-
FPRAS Approximation of the Matrix Permanent in Practice
Authors:
James E. Newman,
Moshe Y. Vardi
Abstract:
The matrix permanent belongs to the complexity class #P-Complete. It is generally believed to be computationally infeasible for large problem sizes, and significant research has been done on approximation algorithms for the matrix permanent. We present an implementation and detailed runtime analysis of one such Markov Chain Monte Carlo (MCMC) based Fully Polynomial Randomized Approximation Scheme…
▽ More
The matrix permanent belongs to the complexity class #P-Complete. It is generally believed to be computationally infeasible for large problem sizes, and significant research has been done on approximation algorithms for the matrix permanent. We present an implementation and detailed runtime analysis of one such Markov Chain Monte Carlo (MCMC) based Fully Polynomial Randomized Approximation Scheme (FPRAS) for the matrix permanent, which has previously only been described theoretically and with big-Oh runtime analysis. We demonstrate by analysis and experiment that the constant factors hidden by previous big-Oh analyses result in computational infeasibility.
△ Less
Submitted 6 December, 2020;
originally announced December 2020.
-
Belief propagation for networks with loops
Authors:
Alec Kirkley,
George T. Cantwell,
M. E. J. Newman
Abstract:
Belief propagation is a widely used message passing method for the solution of probabilistic models on networks such as epidemic models, spin models, and Bayesian graphical models, but it suffers from the serious shortcoming that it works poorly in the common case of networks that contain short loops. Here we provide a solution to this long-standing problem, deriving a belief propagation method th…
▽ More
Belief propagation is a widely used message passing method for the solution of probabilistic models on networks such as epidemic models, spin models, and Bayesian graphical models, but it suffers from the serious shortcoming that it works poorly in the common case of networks that contain short loops. Here we provide a solution to this long-standing problem, deriving a belief propagation method that allows for fast calculation of probability distributions in systems with short loops, potentially with high density, as well as giving expressions for the entropy and partition function, which are notoriously difficult quantities to compute. Using the Ising model as an example, we show that our approach gives excellent results on both real and synthetic networks, improving significantly on standard message passing methods. We also discuss potential applications of our method to a variety of other problems.
△ Less
Submitted 24 April, 2021; v1 submitted 23 September, 2020;
originally announced September 2020.
-
Bayesian inference of network structure from unreliable data
Authors:
Jean-Gabriel Young,
George T. Cantwell,
M. E. J. Newman
Abstract:
Most empirical studies of complex networks do not return direct, error-free measurements of network structure. Instead, they typically rely on indirect measurements that are often error-prone and unreliable. A fundamental problem in empirical network science is how to make the best possible estimates of network structure given such unreliable data. In this paper we describe a fully Bayesian method…
▽ More
Most empirical studies of complex networks do not return direct, error-free measurements of network structure. Instead, they typically rely on indirect measurements that are often error-prone and unreliable. A fundamental problem in empirical network science is how to make the best possible estimates of network structure given such unreliable data. In this paper we describe a fully Bayesian method for reconstructing networks from observational data in any format, even when the data contain substantial measurement error and when the nature and magnitude of that error is unknown. The method is introduced through pedagogical case studies using real-world example networks, and specifically tailored to allow straightforward, computationally efficient implementation with a minimum of technical input. Computer code implementing the method is publicly available.
△ Less
Submitted 9 March, 2021; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Train Like a (Var)Pro: Efficient Training of Neural Networks with Variable Projection
Authors:
Elizabeth Newman,
Lars Ruthotto,
Joseph Hart,
Bart van Bloemen Waanders
Abstract:
Deep neural networks (DNNs) have achieved state-of-the-art performance across a variety of traditional machine learning tasks, e.g., speech recognition, image classification, and segmentation. The ability of DNNs to efficiently approximate high-dimensional functions has also motivated their use in scientific applications, e.g., to solve partial differential equations (PDE) and to generate surrogat…
▽ More
Deep neural networks (DNNs) have achieved state-of-the-art performance across a variety of traditional machine learning tasks, e.g., speech recognition, image classification, and segmentation. The ability of DNNs to efficiently approximate high-dimensional functions has also motivated their use in scientific applications, e.g., to solve partial differential equations (PDE) and to generate surrogate models. In this paper, we consider the supervised training of DNNs, which arises in many of the above applications. We focus on the central problem of optimizing the weights of the given DNN such that it accurately approximates the relation between observed input and target data. Devising effective solvers for this optimization problem is notoriously challenging due to the large number of weights, non-convexity, data-sparsity, and non-trivial choice of hyperparameters. To solve the optimization problem more efficiently, we propose the use of variable projection (VarPro), a method originally designed for separable nonlinear least-squares problems. Our main contribution is the Gauss-Newton VarPro method (GNvpro) that extends the reach of the VarPro idea to non-quadratic objective functions, most notably, cross-entropy loss functions arising in classification. These extensions make GNvpro applicable to all training problems that involve a DNN whose last layer is an affine mapping, which is common in many state-of-the-art architectures. In our four numerical experiments from surrogate modeling, segmentation, and classification GNvpro solves the optimization problem more efficiently than commonly-used stochastic gradient descent (SGD) schemes. Also, GNvpro finds solutions that generalize well, and in all but one example better than well-tuned SGD methods, to unseen data points.
△ Less
Submitted 19 April, 2021; v1 submitted 26 July, 2020;
originally announced July 2020.
-
Measuring Mars Atmospheric Winds From Orbit
Authors:
Scott Guzewich J. B. Abshire. M. M. Baker,
J. M. Battalio,
T. Bertrand,
A. J. Brown,
A. Colaprete,
A. M. Cook,
D. R. Cremons,
M. M. Crismani,
A. I. Dave,
M. Day,
M. -C. Desjean,
M. Elrod,
L. K. Fenton,
J. Fisher,
L. L. Gordley,
P. O. Hayne,
N. G. Heavens,
J. L. Hollingsworth,
D. Jha,
V. Jha,
M. A. Kahre,
A. SJ. Khayat,
A. M. Kling,
S. R. Lewis,
B. T. Marshall
, et al. (16 additional authors not shown)
Abstract:
Wind is the process that connects Mars' climate system. Measurements of Mars atmospheric winds from orbit would dramatically advance our understanding of Mars and help prepare for human exploration of the Red Planet. Multiple instrument candidates are in development and will be ready for flight in the next decade. We urge the Decadal Survey to make these measurements a priority for 2023-2032.
Wind is the process that connects Mars' climate system. Measurements of Mars atmospheric winds from orbit would dramatically advance our understanding of Mars and help prepare for human exploration of the Red Planet. Multiple instrument candidates are in development and will be ready for flight in the next decade. We urge the Decadal Survey to make these measurements a priority for 2023-2032.
△ Less
Submitted 10 July, 2020;
originally announced July 2020.
-
Tensor-Tensor Products for Optimal Representation and Compression
Authors:
Misha Kilmer,
Lior Horesh,
Haim Avron,
Elizabeth Newman
Abstract:
In this era of big data, data analytics and machine learning, it is imperative to find ways to compress large data sets such that intrinsic features necessary for subsequent analysis are not lost. The traditional workhorse for data dimensionality reduction and feature extraction has been the matrix SVD, which presupposes that the data has been arranged in matrix format. Our main goal in this study…
▽ More
In this era of big data, data analytics and machine learning, it is imperative to find ways to compress large data sets such that intrinsic features necessary for subsequent analysis are not lost. The traditional workhorse for data dimensionality reduction and feature extraction has been the matrix SVD, which presupposes that the data has been arranged in matrix format. Our main goal in this study is to show that high-dimensional data sets are more compressible when treated as tensors (aka multiway arrays) and compressed via tensor-SVDs under the tensor-tensor product structures in (Kilmer and Martin, 2011; Kernfeld et al., 2015). We begin by proving Eckart Young optimality results for families of tensor-SVDs under two different truncation strategies. As such optimality properties can be proven in both matrix and tensor-based algebras, a fundamental question arises: does the tensor construct subsume the matrix construct in terms of representation efficiency? The answer is yes, as shown when we prove that a tensor-tensor representation of an equal dimensional spanning space can be superior to its matrix counterpart. We then investigate how the compressed representation provided by the truncated tensor-SVD is related both theoretically and in compression performance to its closest tensor-based analogue, truncated HOSVD (De Lathauwer et al., 2000; De Lathauwer and Vandewalle, 2004), thereby showing the potential advantages of our tensor-based algorithms. Finally, we propose new tensor truncated SVD variants, namely multi-way tensor SVDs, provide further approximated representation efficiency and discuss under which conditions they are considered optimal. We conclude with a numerical study demonstrating the utility of the theory.
△ Less
Submitted 31 December, 2019;
originally announced January 2020.
-
Non-negative Tensor Patch Dictionary Approaches for Image Compression and Deblurring Applications
Authors:
Elizabeth Newman,
Misha E. Kilmer
Abstract:
In recent work (Soltani, Kilmer, Hansen, BIT 2016), an algorithm for non-negative tensor patch dictionary learning in the context of X-ray CT imaging and based on a tensor-tensor product called the $t$-product (Kilmer and Martin, 2011) was presented. Building on that work, in this paper, we use of non-negative tensor patch-based dictionaries trained on other data, such as facial image data, for th…
▽ More
In recent work (Soltani, Kilmer, Hansen, BIT 2016), an algorithm for non-negative tensor patch dictionary learning in the context of X-ray CT imaging and based on a tensor-tensor product called the $t$-product (Kilmer and Martin, 2011) was presented. Building on that work, in this paper, we use of non-negative tensor patch-based dictionaries trained on other data, such as facial image data, for the purposes of either compression or image deblurring. We begin with an analysis in which we address issues such as suitability of the tensor-based approach relative to a matrix-based approach, dictionary size and patch size to balance computational efficiency and qualitative representations. Next, we develop an algorithm that is capable of recovering non-negative tensor coefficients given a non-negative tensor dictionary. The algorithm is based on a variant of the Modified Residual Norm Steepest Descent method. We show how to augment the algorithm to enforce sparsity in the tensor coefficients, and note that the approach has broader applicability since it can be applied to the matrix case as well. We illustrate the surprising result that dictionaries trained on image data from one class can be successfully used to represent and compress image data from different classes and across different resolutions. Finally, we address the use of non-negative tensor dictionaries in image deblurring. We show that tensor treatment of the deblurring problem coupled with non-negative tensor patch dictionaries can give superior restorations as compared to standard treatment of the non-negativity constrained deblurring problem.
△ Less
Submitted 25 September, 2019;
originally announced October 2019.
-
Consistency of community structure in complex networks
Authors:
Maria A. Riolo,
M. E. J. Newman
Abstract:
The most widely used techniques for community detection in networks, including methods based on modularity, statistical inference, and information theoretic arguments, all work by optimizing objective functions that measure the quality of network partitions. There is a good case to be made, however, that one should not look solely at the single optimal community structure under such an objective f…
▽ More
The most widely used techniques for community detection in networks, including methods based on modularity, statistical inference, and information theoretic arguments, all work by optimizing objective functions that measure the quality of network partitions. There is a good case to be made, however, that one should not look solely at the single optimal community structure under such an objective function, but rather at a selection of high-scoring structures. If one does this one typically finds that the resulting structures show considerable variation, and this has been taken as evidence that these community detection methods are unreliable, since they do not appear to give consistent answers. Here we argue that, upon closer inspection, the structures found are in fact consistent in a certain way. Specifically, we show that they can all be assembled from a set of underlying "building blocks", groups of network nodes that are usually found together in the same community. Different community structures correspond to different arrangements of blocks, but the blocks themselves are largely invariant. We propose an information theoretic method for discovering the building blocks in specific networks and demonstrate it with several example applications. We conclude that traditional community detection is not the failure some have suggested it is, and that in fact it gives a significant amount of insight into network structure, although perhaps not in exactly the way previously imagined.
△ Less
Submitted 26 August, 2019;
originally announced August 2019.
-
Improved mutual information measure for classification and community detection
Authors:
M. E. J. Newman,
George T. Cantwell,
Jean-Gabriel Young
Abstract:
The information theoretic quantity known as mutual information finds wide use in classification and community detection analyses to compare two classifications of the same set of objects into groups. In the context of classification algorithms, for instance, it is often used to compare discovered classes to known ground truth and hence to quantify algorithm performance. Here we argue that the stan…
▽ More
The information theoretic quantity known as mutual information finds wide use in classification and community detection analyses to compare two classifications of the same set of objects into groups. In the context of classification algorithms, for instance, it is often used to compare discovered classes to known ground truth and hence to quantify algorithm performance. Here we argue that the standard mutual information, as commonly defined, omits a crucial term which can become large under real-world conditions, producing results that can be substantially in error. We demonstrate how to correct this error and define a mutual information that works in all cases. We discuss practical implementation of the new measure and give some example applications.
△ Less
Submitted 29 July, 2019;
originally announced July 2019.
-
Message passing on networks with loops
Authors:
George T. Cantwell,
M. E. J. Newman
Abstract:
In this paper we offer a solution to a long-standing problem in the study of networks. Message passing is a fundamental technique for calculations on networks and graphs. The first versions of the method appeared in the 1930s and over the decades it has been applied to a wide range of foundational problems in mathematics, physics, computer science, statistics, and machine learning, including Bayes…
▽ More
In this paper we offer a solution to a long-standing problem in the study of networks. Message passing is a fundamental technique for calculations on networks and graphs. The first versions of the method appeared in the 1930s and over the decades it has been applied to a wide range of foundational problems in mathematics, physics, computer science, statistics, and machine learning, including Bayesian inference, spin models, coloring, satisfiability, graph partitioning, network epidemiology, and the calculation of matrix eigenvalues. Despite its wide use, however, it has long been recognized that the method has a fundamental flaw: it only works on networks that are free of short loops. Loops introduce correlations that cause the method to give inaccurate answers at best, and to fail completely in the worst cases. Unfortunately, almost all real-world networks contain many short loops, which limits the usefulness of the message passing approach. In this paper we demonstrate how to rectify this shortcoming and create message passing methods that work on any network. We give two example applications, one to the percolation properties of networks and the other to the calculation of the spectra of sparse matrices.
△ Less
Submitted 18 July, 2019;
originally announced July 2019.
-
Structure of online dating markets in US cities
Authors:
Elizabeth E. Bruch,
M. E. J. Newman
Abstract:
We study the structure of heterosexual dating markets in the United States through an analysis of the interactions of several million users of a large online dating web site, applying recently developed network analysis methods to the pattern of messages exchanged among users. Our analysis shows that the strongest driver of romantic interaction at the national level is simple geographic proximity,…
▽ More
We study the structure of heterosexual dating markets in the United States through an analysis of the interactions of several million users of a large online dating web site, applying recently developed network analysis methods to the pattern of messages exchanged among users. Our analysis shows that the strongest driver of romantic interaction at the national level is simple geographic proximity, but at the local level other demographic factors come into play. We find that dating markets in each city are partitioned into submarkets along lines of age and ethnicity. Sex ratio varies widely between submarkets, with younger submarkets having more men and fewer women than older ones. There is also a noticeable tendency for minorities, especially women, to be younger than the average in older submarkets, and our analysis reveals how this kind of racial stratification arises through the messaging decisions of both men and women. Our study illustrates how network techniques applied to online interactions can reveal the aggregate effects of individual behavior on social structure.
△ Less
Submitted 3 April, 2019; v1 submitted 1 April, 2019;
originally announced April 2019.
-
Polarizable Potentials For Metals: The Density Readjusting Embedded Atom Method (DR-EAM)
Authors:
Hemanta Bhattarai,
Kathie E. Newman,
J. Daniel Gezelter
Abstract:
In simulations of metallic interfaces, a critical aspect of metallic behavior is missing from the some of the most widely used classical molecular dynamics force fields. We present a modification of the embedded atom method (EAM) which allows for electronic polarization of the metal by treating the valence density around each atom as a fluctuating dynamical quantity. The densities are represented…
▽ More
In simulations of metallic interfaces, a critical aspect of metallic behavior is missing from the some of the most widely used classical molecular dynamics force fields. We present a modification of the embedded atom method (EAM) which allows for electronic polarization of the metal by treating the valence density around each atom as a fluctuating dynamical quantity. The densities are represented by a set of additional fluctuating variables (and their conjugate momenta) which are propagated along with the nuclear coordinates. This ``density readjusting EAM'' (DR-EAM) preserves nearly all of the useful qualities of traditional EAM, including bulk elastic properties and surface energies. However, it also allows valence electron density to migrate through the metal in response to external perturbations. We show that DR-EAM can successfully model polarization in response to external charges, capturing the image charge effect in atomistic simulations. DR-EAM also captures some of the behavior of metals in the presence of uniform electric fields, predicting surface charging and shielding internal to the metal. We further show that it predicts charge transfer between the constituent atoms in alloys, leading to novel predictions about unit cell geometries in layered L$1_0$ structures.
△ Less
Submitted 30 March, 2019;
originally announced April 2019.
-
Explaining Reinforcement Learning to Mere Mortals: An Empirical Study
Authors:
Andrew Anderson,
Jonathan Dodge,
Amrita Sadarangani,
Zoe Juozapaitis,
Evan Newman,
Jed Irvine,
Souti Chattopadhyay,
Alan Fern,
Margaret Burnett
Abstract:
We present a user study to investigate the impact of explanations on non-experts' understanding of reinforcement learning (RL) agents. We investigate both a common RL visualization, saliency maps (the focus of attention), and a more recent explanation type, reward-decomposition bars (predictions of future types of rewards). We designed a 124 participant, four-treatment experiment to compare partic…
▽ More
We present a user study to investigate the impact of explanations on non-experts' understanding of reinforcement learning (RL) agents. We investigate both a common RL visualization, saliency maps (the focus of attention), and a more recent explanation type, reward-decomposition bars (predictions of future types of rewards). We designed a 124 participant, four-treatment experiment to compare participants' mental models of an RL agent in a simple Real-Time Strategy (RTS) game. Our results show that the combination of both saliency and reward bars were needed to achieve a statistically significant improvement in mental model score over the control. In addition, our qualitative analysis of the data reveals a number of effects for further study.
△ Less
Submitted 18 June, 2019; v1 submitted 22 March, 2019;
originally announced March 2019.
-
Spectra of networks containing short loops
Authors:
M. E. J. Newman
Abstract:
The spectrum of the adjacency matrix plays several important roles in the mathematical theory of networks and in network data analysis, for example in percolation theory, community detection, centrality measures, and the theory of dynamical systems on networks. A number of methods have been developed for the analytic computation of network spectra, but they typically assume that networks are local…
▽ More
The spectrum of the adjacency matrix plays several important roles in the mathematical theory of networks and in network data analysis, for example in percolation theory, community detection, centrality measures, and the theory of dynamical systems on networks. A number of methods have been developed for the analytic computation of network spectra, but they typically assume that networks are locally tree-like, meaning that the local neighborhood of any node takes the form of a tree, free of short loops. Empirically observed networks, by contrast, often have many short loops. Here we develop an approach for calculating the spectra of networks with short loops using a message passing method. We give example applications to some previously studied classes of networks.
△ Less
Submitted 12 February, 2019;
originally announced February 2019.
-
Spectra of random networks with arbitrary degrees
Authors:
M. E. J. Newman,
Xiao Zhang,
Raj Rao Nadakuditi
Abstract:
We derive a message passing method for computing the spectra of locally tree-like networks and an approximation to it that allows us to compute closed-form expressions or fast numerical approximates for the spectral density of random graphs with arbitrary node degrees -- the so-called configuration model. We find the latter approximation to work well for all but the sparsest of networks. We also d…
▽ More
We derive a message passing method for computing the spectra of locally tree-like networks and an approximation to it that allows us to compute closed-form expressions or fast numerical approximates for the spectral density of random graphs with arbitrary node degrees -- the so-called configuration model. We find the latter approximation to work well for all but the sparsest of networks. We also derive bounds on the position of the band edges of the spectrum, which are important for identifying structural phase transitions in networks.
△ Less
Submitted 7 January, 2019;
originally announced January 2019.
-
Stable Tensor Neural Networks for Rapid Deep Learning
Authors:
Elizabeth Newman,
Lior Horesh,
Haim Avron,
Misha Kilmer
Abstract:
We propose a tensor neural network ($t$-NN) framework that offers an exciting new paradigm for designing neural networks with multidimensional (tensor) data. Our network architecture is based on the $t$-product (Kilmer and Martin, 2011), an algebraic formulation to multiply tensors via circulant convolution. In this $t$-product algebra, we interpret tensors as $t$-linear operators analogous to mat…
▽ More
We propose a tensor neural network ($t$-NN) framework that offers an exciting new paradigm for designing neural networks with multidimensional (tensor) data. Our network architecture is based on the $t$-product (Kilmer and Martin, 2011), an algebraic formulation to multiply tensors via circulant convolution. In this $t$-product algebra, we interpret tensors as $t$-linear operators analogous to matrices as linear operators, and hence our framework inherits mimetic matrix properties. To exemplify the elegant, matrix-mimetic algebraic structure of our $t$-NNs, we expand on recent work (Haber and Ruthotto, 2017) which interprets deep neural networks as discretizations of non-linear differential equations and introduces stable neural networks which promote superior generalization. Motivated by this dynamic framework, we introduce a stable $t$-NN which facilitates more rapid learning because of its reduced, more powerful parameterization. Through our high-dimensional design, we create a more compact parameter space and extract multidimensional correlations otherwise latent in traditional algorithms. We further generalize our $t$-NN framework to a family of tensor-tensor products (Kernfeld, Kilmer, and Aeron, 2015) which still induce a matrix-mimetic algebraic structure. Through numerical experiments on the MNIST and CIFAR-10 datasets, we demonstrate the more powerful parameterizations and improved generalizability of stable $t$-NNs.
△ Less
Submitted 15 November, 2018;
originally announced November 2018.
-
Mixing patterns and individual differences in networks
Authors:
George T. Cantwell,
M. E. J. Newman
Abstract:
We study mixing patterns in networks, meaning the propensity for nodes of different kinds to connect to one another. The phenomenon of assortative mixing, whereby nodes prefer to connect to others that are similar to themselves, has been widely studied, but here we go further and examine how and to what extent nodes that are otherwise similar can have different preferences. Many individuals in a f…
▽ More
We study mixing patterns in networks, meaning the propensity for nodes of different kinds to connect to one another. The phenomenon of assortative mixing, whereby nodes prefer to connect to others that are similar to themselves, has been widely studied, but here we go further and examine how and to what extent nodes that are otherwise similar can have different preferences. Many individuals in a friendship network, for instance, may prefer friends who are roughly the same age as themselves, but some may display a preference for older or younger friends. We introduce a network model that captures this behavior and a method for fitting it to empirical network data. We propose metrics to characterize the mean and variation of mixing patterns and show how to infer their values from the fitted model, either using maximum-likelihood estimates of model parameters or in a Bayesian framework that does not require fixing any parameters.
△ Less
Submitted 17 April, 2019; v1 submitted 2 October, 2018;
originally announced October 2018.
-
Balance in signed networks
Authors:
Alec Kirkley,
George T. Cantwell,
M. E. J. Newman
Abstract:
We consider signed networks in which connections or edges can be either positive (friendship, trust, alliance) or negative (dislike, distrust, conflict). Early literature in graph theory theorized that such networks should display "structural balance," meaning that certain configurations of positive and negative edges are favored and others are disfavored. Here we propose two measures of balance i…
▽ More
We consider signed networks in which connections or edges can be either positive (friendship, trust, alliance) or negative (dislike, distrust, conflict). Early literature in graph theory theorized that such networks should display "structural balance," meaning that certain configurations of positive and negative edges are favored and others are disfavored. Here we propose two measures of balance in signed networks based on the established notions of weak and strong balance, and compare their performance on a range of tasks with each other and with previously proposed measures. In particular, we ask whether real-world signed networks are significantly balanced by these measures compared to an appropriate null model, finding that indeed they are, by all the measures studied. We also test our ability to predict unknown signs in otherwise known networks by maximizing balance. In a series of cross-validation tests we find that our measures are able to predict signs substantially better than chance.
△ Less
Submitted 13 September, 2018;
originally announced September 2018.
-
Aspirational pursuit of mates in online dating markets
Authors:
Elizabeth E. Bruch,
M. E. J. Newman
Abstract:
Romantic courtship is often described as taking place in a dating market where men and women compete for mates, but the detailed structure and dynamics of dating markets have historically been difficult to quantify for lack of suitable data. In recent years, however, the advent and vigorous growth of the online dating industry has provided a rich new source of information on mate pursuit. Here we…
▽ More
Romantic courtship is often described as taking place in a dating market where men and women compete for mates, but the detailed structure and dynamics of dating markets have historically been difficult to quantify for lack of suitable data. In recent years, however, the advent and vigorous growth of the online dating industry has provided a rich new source of information on mate pursuit. Here we present an empirical analysis of heterosexual dating markets in four large US cities using data from a popular, free online dating service. We show that competition for mates creates a pronounced hierarchy of desirability that correlates strongly with user demographics and is remarkably consistent across cities. We find that both men and women pursue partners who are on average about 25% more desirable than themselves by our measures and that they use different messaging strategies with partners of different desirability. We also find that the probability of receiving a response to an advance drops markedly with increasing difference in desirability between the pursuer and the pursued. Strategic behaviors can improve one's chances of attracting a more desirable mate, though the effects are modest.
△ Less
Submitted 14 August, 2018;
originally announced August 2018.
-
Estimating network structure from unreliable measurements
Authors:
M. E. J. Newman
Abstract:
Most empirical studies of networks assume that the network data we are given represent a complete and accurate picture of the nodes and edges in the system of interest, but in real-world situations this is rarely the case. More often the data only specify the network structure imperfectly -- like data in essentially every other area of empirical science, network data are prone to measurement error…
▽ More
Most empirical studies of networks assume that the network data we are given represent a complete and accurate picture of the nodes and edges in the system of interest, but in real-world situations this is rarely the case. More often the data only specify the network structure imperfectly -- like data in essentially every other area of empirical science, network data are prone to measurement error and noise. At the same time, the data may be richer than simple network measurements, incorporating multiple measurements, weights, lengths or strengths of edges, node or edge labels, or annotations of various kinds. Here we develop a general method for making estimates of network structure and properties using any form of network data, simple or complex, when the data are unreliable, and give example applications to a selection of social and biological networks.
△ Less
Submitted 18 December, 2018; v1 submitted 6 March, 2018;
originally announced March 2018.
-
Unfolding Neutron Spectrum with Markov Chain Monte Carlo at MIT Research Reactor with He-3 Neutral Current Detectors
Authors:
A. F. Leder,
A. J. Anderson,
J. Billard,
E. Figueroa-Feliciano,
J. A. Formaggio,
C. Hasselkus,
E. Newman,
K. Palladino,
M. Phuthi,
L. Winslow,
L. Zhang
Abstract:
The Ricochet experiment seeks to measure Coherent (neutral-current) Elastic Neutrino-Nucleus Scattering using dark-matter-style detectors with sub-keV thresholds placed near a neutrino source, such as the MIT (research) Reactor (MITR), which operates at 5.5 MW generating approximately 2.2e18 neutrinos/second in its core. Currently, Ricochet is characterizing the backgrounds at MITR, the main compo…
▽ More
The Ricochet experiment seeks to measure Coherent (neutral-current) Elastic Neutrino-Nucleus Scattering using dark-matter-style detectors with sub-keV thresholds placed near a neutrino source, such as the MIT (research) Reactor (MITR), which operates at 5.5 MW generating approximately 2.2e18 neutrinos/second in its core. Currently, Ricochet is characterizing the backgrounds at MITR, the main component of which comes in the form of neutrons emitted from the core simultaneous with the neutrino signal. To characterize this background, we wrapped Bonner cylinders around a He-3 thermal neutron detector, whose data was then unfolded via a Markov Chain Monte Carlo (MCMC) to produce a neutron energy spectrum across several orders of magnitude. We discuss the resulting spectrum and its implications for deploying Ricochet at the MITR site as well as the feasibility of reducing this background level via the addition of polyethylene shielding around the detector setup.
△ Less
Submitted 7 February, 2018; v1 submitted 2 October, 2017;
originally announced October 2017.
-
Revealing biases in the sampling of ecological interaction networks
Authors:
Marcus A. M. de Aguiar,
Erica A. Newman,
Mathias M. Pires,
Justin D. Yeakel,
David H. Hembry,
Laura Burkle,
Dominique Gravel,
Paulo R. Guimaraes Jr,
Jimmy O'Donnell,
Timothee Poisot,
Marie-Josee Fortin
Abstract:
The structure of ecological interactions is commonly understood through analyses of interaction networks. However, these analyses may be sensitive to sampling biases in both the interactors (the nodes of the network) and interactions (the links between nodes). These issues may affect the accuracy of empirically constructed ecological networks. We explore the properties of sampled ecological networ…
▽ More
The structure of ecological interactions is commonly understood through analyses of interaction networks. However, these analyses may be sensitive to sampling biases in both the interactors (the nodes of the network) and interactions (the links between nodes). These issues may affect the accuracy of empirically constructed ecological networks. We explore the properties of sampled ecological networks by simulating large-scale ecological networks with predetermined topologies, and sampling them with different mathematical procedures. Several types of modular networks were generated, intended to represent a wide variety of communities that vary in size and types of ecological interactions. We sampled these networks with different sampling designs that may be encountered in field experiments. The observed networks generated by each sampling process were analyzed with respect to number and size of components. We show that the sampling effort needed to estimate underlying network properties depends both on the sampling design and on network topology. Networks with random or scale-free modules require more complete sampling compared to networks whose modules are nested or bipartite. Overall, the structure of nested modules was the easiest to detect, regardless of sampling design. Sampling according to species degree was consistently found to be the most accurate strategy to estimate network structure. Conversely, sampling according to module results in an accurate view of certain modules, but fails to provide a global picture of the underlying network. We recommend that these findings are incorporated into the design of projects aiming to characterize large networks of species interactions in the field, to reduce sampling biases. The software scripts developed to construct and sample networks are provided for further explorations of network structure and comparisons to real interaction data.
△ Less
Submitted 3 August, 2017;
originally announced August 2017.
-
Image classification using local tensor singular value decompositions
Authors:
Elizabeth Newman,
Misha Kilmer,
Lior Horesh
Abstract:
From linear classifiers to neural networks, image classification has been a widely explored topic in mathematics, and many algorithms have proven to be effective classifiers. However, the most accurate classifiers typically have significantly high storage costs, or require complicated procedures that may be computationally expensive. We present a novel (nonlinear) classification approach using tru…
▽ More
From linear classifiers to neural networks, image classification has been a widely explored topic in mathematics, and many algorithms have proven to be effective classifiers. However, the most accurate classifiers typically have significantly high storage costs, or require complicated procedures that may be computationally expensive. We present a novel (nonlinear) classification approach using truncation of local tensor singular value decompositions (tSVD) that robustly offers accurate results, while maintaining manageable storage costs. Our approach takes advantage of the optimality of the representation under the tensor algebra described to determine to which class an image belongs. We extend our approach to a method that can determine specific pairwise match scores, which could be useful in, for example, object recognition problems where pose/position are different. We demonstrate the promise of our new techniques on the MNIST data set.
△ Less
Submitted 29 June, 2017;
originally announced June 2017.
-
Efficient method for estimating the number of communities in a network
Authors:
Maria A. Riolo,
George T. Cantwell,
Gesine Reinert,
M. E. J. Newman
Abstract:
While there exist a wide range of effective methods for community detection in networks, most of them require one to know in advance how many communities one is looking for. Here we present a method for estimating the number of communities in a network using a combination of Bayesian inference with a novel prior and an efficient Monte Carlo sampling scheme. We test the method extensively on both r…
▽ More
While there exist a wide range of effective methods for community detection in networks, most of them require one to know in advance how many communities one is looking for. Here we present a method for estimating the number of communities in a network using a combination of Bayesian inference with a novel prior and an efficient Monte Carlo sampling scheme. We test the method extensively on both real and computer-generated networks, showing that it performs accurately and consistently, even in cases where groups are widely varying in size or structure.
△ Less
Submitted 7 June, 2017;
originally announced June 2017.
-
Light-Cones, Almost Light-Cones and Almost-Complex Light-Cones
Authors:
Ezra T. Newman
Abstract:
We point out an unusual relationship among a variety of null geodesic congruences; (a) the generators of ordinary light-cones and (b) certain (related) shear-free but twisting congruences in Minkowski Space-time as well as (c) asymptotically shear-free null geodesic congruences that exist in the neighborhood of Penrose's Scri in Einstein or Einstein-Maxwell asymptotically flat-space-times. We refe…
▽ More
We point out an unusual relationship among a variety of null geodesic congruences; (a) the generators of ordinary light-cones and (b) certain (related) shear-free but twisting congruences in Minkowski Space-time as well as (c) asymptotically shear-free null geodesic congruences that exist in the neighborhood of Penrose's Scri in Einstein or Einstein-Maxwell asymptotically flat-space-times. We refer to these geodesic congruences respectively as: Lignt-Cones (LCs), as Almost-Complex- Light-Cones, (ACLCs), [though they are real they resemble complex light-cones in complex Minkowski space] and finally to a family of congruences in asymptotically flat-spaces as `Almost Light-Cones', (ALC). The two essential points of resemblance among the three families are: (1) they are all either shear-free or asymptotically shear-free and (2) in each family the individual members of the family can be labeled by the points in a real or complex four-dimensional manifold. As an example, the Minkowski space LCs are labeled by the (real) coordinate value of their apex. In the case of (ACLCs) (complex coordinate values), the congruences will have non-vanishing twist whose magnitude is determined by the imaginary part of the complex coordinate values.
In studies of gravitational radiation, Bondi-type of null surfaces and their associated Bondi coordinates have been almost exclusively used for calculations. Some surprising relations arise if, instead of the Bondi coordinates, one uses ALCs and their associated coordinate systems in the analysis of the Einstein-Maxwell equations near Scri. More explicitly and surprisingly, the asymptotic Bianchi Identities expressed in the coordinates of the ALCs, turn directly into many of the standard definitions and relations of classical mechanics.
△ Less
Submitted 17 April, 2017;
originally announced April 2017.
-
Network structure from rich but noisy data
Authors:
M. E. J. Newman
Abstract:
Driven by growing interest in the sciences, industry, and among the broader public, a large number of empirical studies have been conducted in recent years of the structure of networks ranging from the internet and the world wide web to biological networks and social networks. The data produced by these experiments are often rich and multimodal, yet at the same time they may contain substantial me…
▽ More
Driven by growing interest in the sciences, industry, and among the broader public, a large number of empirical studies have been conducted in recent years of the structure of networks ranging from the internet and the world wide web to biological networks and social networks. The data produced by these experiments are often rich and multimodal, yet at the same time they may contain substantial measurement error. In practice, this means that the true network structure can differ greatly from naive estimates made from the raw data, and hence that conclusions drawn from those naive estimates may be significantly in error. In this paper we describe a technique that circumvents this problem and allows us to make optimal estimates of the true structure of networks in the presence of both richly textured data and significant measurement uncertainty. We give example applications to two different social networks, one derived from face-to-face interactions and one from self-reported friendships.
△ Less
Submitted 6 February, 2018; v1 submitted 21 March, 2017;
originally announced March 2017.
-
Surprising Structures Hiding at Penrose's Future Null Infinity
Authors:
Ezra T. Newman
Abstract:
Since the late1950s, almost all discussions of Asymptotically Flat (Einstein-Maxwell) Space-Times have taken place in the context of Penrose's Null Infinity, $\mathcal{I}^{+}.$\ $\ $In addition,\ almost all calculations have used the Bondi coordinate and tetrad systems. \ We show - first, that there are other natural coordinate systems, near $\mathcal{I}^{+},$ (analogous to light-cones in flat-spa…
▽ More
Since the late1950s, almost all discussions of Asymptotically Flat (Einstein-Maxwell) Space-Times have taken place in the context of Penrose's Null Infinity, $\mathcal{I}^{+}.$\ $\ $In addition,\ almost all calculations have used the Bondi coordinate and tetrad systems. \ We show - first, that there are other natural coordinate systems, near $\mathcal{I}^{+},$ (analogous to light-cones in flat-space) that are based on (asymptotically) shear-free null geodesic congruences (analogous to the flat-space case). \ Using these new coordinates and their associated tetrad, we \textit{define the complex dipole moment, i.e., as the mass dipole plus i times angular momentum,} from the $l=1,\ $harmonic coefficient of a component of the asymptotic$\ $Weyl tensor. Second, from this definition, from the Bianchi Identities and from the Bondi mass and linear momentum, we show that \ there exists a large number of results - identifications and dynamics - identical to those of classical mechanics and electrodynamics. They include, among many others, \textbf{P}=M\textbf{v}+..., \textbf{L}=\textbf{r}x\textbf{P}, \ spin, Newtons 2nd Law with the Rocket force term (M\textbf{v}) and radiation reaction, angular momentum conservation and others. \ All these relations take place in the rather mysterious H-Space.
△ Less
Submitted 29 January, 2017;
originally announced January 2017.
-
Real space electrostatics for multipoles. III. Dielectric Properties
Authors:
Madan Lamichhane,
Thomas Parsons,
Kathie E. Newman,
J. Daniel Gezelter
Abstract:
In the first two papers in this series, we developed new shifted potential (SP), gradient shifted force (GSF), and Taylor shifted force (TSF) real-space methods for multipole interactions in condensed phase simulations. Here, we discuss the dielectric properties of fluids that emerge from simulations using these methods. Most electrostatic methods (including the Ewald sum) require correction to th…
▽ More
In the first two papers in this series, we developed new shifted potential (SP), gradient shifted force (GSF), and Taylor shifted force (TSF) real-space methods for multipole interactions in condensed phase simulations. Here, we discuss the dielectric properties of fluids that emerge from simulations using these methods. Most electrostatic methods (including the Ewald sum) require correction to the conducting boundary fluctuation formula for the static dielectric constants, and we discuss the derivation of these corrections for the new real space methods. For quadrupolar fluids, the analogous material property is the quadrupolar susceptibility. As in the dipolar case, the fluctuation formula for the quadrupolar susceptibility has corrections that depend on the electrostatic method being utilized. One of the most important effects measured by both the static dielectric and quadrupolar susceptibility is the ability to screen charges embedded in the fluid. We use potentials of mean force between solvated ions to discuss how geometric factors can lead to distance-dependent screening in both quadrupolar and dipolar fluids.
△ Less
Submitted 17 August, 2016;
originally announced August 2016.
-
Random graph models for dynamic networks
Authors:
Xiao Zhang,
Cristopher Moore,
M. E. J. Newman
Abstract:
We propose generalizations of a number of standard network models, including the classic random graph, the configuration model, and the stochastic block model, to the case of time-varying networks. We assume that the presence and absence of edges are governed by continuous-time Markov processes with rate parameters that can depend on properties of the nodes. In addition to computing equilibrium pr…
▽ More
We propose generalizations of a number of standard network models, including the classic random graph, the configuration model, and the stochastic block model, to the case of time-varying networks. We assume that the presence and absence of edges are governed by continuous-time Markov processes with rate parameters that can depend on properties of the nodes. In addition to computing equilibrium properties of these models, we demonstrate their use in data analysis and statistical inference, giving efficient algorithms for fitting them to observed network data. This allows us, for instance, to estimate the time constants of network evolution or infer community structure from temporal network data using cues embedded both in the probabilities over time that node pairs are connected by edges and in the characteristic dynamics of edge appearance and disappearance. We illustrate our methods with a selection of applications, both to computer-generated test networks and real-world examples.
△ Less
Submitted 26 July, 2016;
originally announced July 2016.
-
Community detection in networks: Modularity optimization and maximum likelihood are equivalent
Authors:
M. E. J. Newman
Abstract:
We demonstrate an exact equivalence between two widely used methods of community detection in networks, the method of modularity maximization in its generalized form which incorporates a resolution parameter controlling the size of the communities discovered, and the method of maximum likelihood applied to the special case of the stochastic block model known as the planted partition model, in whic…
▽ More
We demonstrate an exact equivalence between two widely used methods of community detection in networks, the method of modularity maximization in its generalized form which incorporates a resolution parameter controlling the size of the communities discovered, and the method of maximum likelihood applied to the special case of the stochastic block model known as the planted partition model, in which all communities in a network are assumed to have statistically similar properties. Among other things, this equivalence provides a mathematically principled derivation of the modularity function, clarifies the conditions and assumptions of its use, and gives an explicit formula for the optimal value of the resolution parameter.
△ Less
Submitted 7 June, 2016;
originally announced June 2016.
-
Estimating the number of communities in a network
Authors:
M. E. J. Newman,
Gesine Reinert
Abstract:
Community detection, the division of a network into dense subnetworks with only sparse connections between them, has been a topic of vigorous study in recent years. However, while there exist a range of powerful and flexible methods for dividing a network into a specified number of communities, it is an open question how to determine exactly how many communities one should use. Here we describe a…
▽ More
Community detection, the division of a network into dense subnetworks with only sparse connections between them, has been a topic of vigorous study in recent years. However, while there exist a range of powerful and flexible methods for dividing a network into a specified number of communities, it is an open question how to determine exactly how many communities one should use. Here we describe a mathematically principled approach for finding the number of communities in a network using a maximum-likelihood method. We demonstrate the approach on a range of real-world examples with known community structure, finding that it is able to determine the number of communities correctly in every case.
△ Less
Submitted 23 August, 2016; v1 submitted 9 May, 2016;
originally announced May 2016.