Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 55 results for author: Lyzinski, V

.
  1. arXiv:2404.07462  [pdf, other

    physics.soc-ph stat.AP

    ACRONYM: Augmented degree corrected, Community Reticulated Organized Network Yielding Model

    Authors: Benjamin Leinwand, Vince Lyzinski

    Abstract: Modeling networks can serve as a means of summarizing high-dimensional complex systems. Adapting an approach devised for dense, weighted networks, we propose a new method for generating and estimating unweighted networks. This approach can describe a broader class of potential networks than existing models, including those where nodes in different subnetworks connect to one another via various att… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 29 pages, 11 figures

  2. arXiv:2312.11054  [pdf, other

    stat.ME stat.ML

    Detection of Model-based Planted Pseudo-cliques in Random Dot Product Graphs by the Adjacency Spectral Embedding and the Graph Encoder Embedding

    Authors: Tong Qi, Vince Lyzinski

    Abstract: In this paper, we explore the capability of both the Adjacency Spectral Embedding (ASE) and the Graph Encoder Embedding (GEE) for capturing an embedded pseudo-clique structure in the random dot product graph setting. In both theory and experiments, we demonstrate that this pairing of model and methods can yield worse results than the best existing spectral clique detection methods, demonstrating a… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2310.18533  [pdf, other

    stat.ME q-bio.NC q-bio.QM stat.CO

    Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based vector-on-matrix regression

    Authors: Tong Lu, Yuan Zhang, Vince Lyzinski, Chuan Bi, Peter Kochunov, Elliot Hong, Shuo Chen

    Abstract: The joint analysis of multimodal neuroimaging data is critical in the field of brain research because it reveals complex interactive relationships between neurobiological structures and functions. In this study, we focus on investigating the effects of structural imaging (SI) features, including white matter micro-structure integrity (WMMI) and cortical thickness, on the whole brain functional con… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 20 pages, 5 figures, 2 tables

  4. arXiv:2308.13451  [pdf, other

    stat.ML cs.LG math.CO stat.AP stat.ME

    Gotta match 'em all: Solution diversification in graph matching matched filters

    Authors: Zhirui Li, Ben Johnson, Daniel L. Sussman, Carey E. Priebe, Vince Lyzinski

    Abstract: We present a novel approach for finding multiple noisily embedded template graphs in a very large background graph. Our method builds upon the graph-matching-matched-filter technique proposed in Sussman et al., with the discovery of multiple diverse matchings being achieved by iteratively penalizing a suitable node-pair similarity matrix in the matched filter algorithm. In addition, we propose alg… ▽ More

    Submitted 4 July, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 27 pages, 12 figures, 3 tables

  5. arXiv:2306.04016  [pdf, other

    math.OC

    On seeded subgraph-to-subgraph matching: The ssSGM Algorithm and matchability information theory

    Authors: Lingyao Meng, Mengqi Lou, Jianyu Lin, Vince Lyzinski, Donniell E. Fishkind

    Abstract: The subgraph-subgraph matching problem is, given a pair of graphs and a positive integer $K$, to find $K$ vertices in the first graph, $K$ vertices in the second graph, and a bijection between them, so as to minimize the number of adjacency disagreements across the bijection; it is ``seeded" if some of this bijection is fixed. The problem is intractable, and we present the ssSGM algorithm, which u… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 27 pages, 16 figures

    MSC Class: 05C60; 05C80; 90C35

  6. arXiv:2208.09710  [pdf, other

    stat.ML cs.IR cs.LG

    Adversarial contamination of networks in the setting of vertex nomination: a new trimming method

    Authors: Sheyda Peyman, Minh Tang, Vince Lyzinski

    Abstract: As graph data becomes more ubiquitous, the need for robust inferential graph algorithms to operate in these complex data domains is crucial. In many cases of interest, inference is further complicated by the presence of adversarial data contamination. The effect of the adversary is frequently to change the data distribution in ways that negatively affect statistical and algorithmic performance. We… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

  7. arXiv:2208.08638  [pdf, other

    stat.ME stat.ML

    Lost in the Shuffle: Testing Power in the Presence of Errorful Network Vertex Labels

    Authors: Ayushi Saxena, Vince Lyzinski

    Abstract: Two-sample network hypothesis testing is an important inference task with applications across diverse fields such as medicine, neuroscience, and sociology. Many of these testing methodologies operate under the implicit assumption that the vertex correspondence across networks is a priori known. This assumption is often untrue, and the power of the subsequent test can degrade when there are misalig… ▽ More

    Submitted 26 May, 2024; v1 submitted 18 August, 2022; originally announced August 2022.

  8. arXiv:2205.03486  [pdf, other

    stat.ML cs.LG stat.ME

    Clustered Graph Matching for Label Recovery and Graph Classification

    Authors: Zhirui Li, Jesus Arroyo, Konstantinos Pantazis, Vince Lyzinski

    Abstract: Given a collection of vertex-aligned networks and an additional label-shuffled network, we propose procedures for leveraging the signal in the vertex-aligned collection to recover the labels of the shuffled network. We consider matching the shuffled network to averages of the networks in the vertex-aligned collection at different levels of granularity. We demonstrate both in theory and practice th… ▽ More

    Submitted 29 March, 2023; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: 22 pages, 8 figures, 5 tables

  9. arXiv:2112.12316  [pdf, ps, other

    cs.IT

    Signed and Unsigned Partial Information Decompositions of Continuous Network Interactions

    Authors: Jesse Milzman, Vince Lyzinski

    Abstract: We investigate the partial information decomposition (PID) framework as a tool for edge nomination. We consider both the $I_{\cap}^{\text{min}}$ and $I_{\cap}^{\text{PM}}$ PIDs, from arXiv:1004.2515 and arXiv:1801.09010 respectively, and we both numerically and analytically investigate the utility of these frameworks for discovering significant edge interactions. In the course of our work, we exte… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  10. arXiv:2106.12621  [pdf, other

    cs.LG cs.IR stat.ME

    Leveraging semantically similar queries for ranking via combining representations

    Authors: Hayden S. Helm, Marah Abdin, Benjamin D. Pedigo, Shweti Mahajan, Vince Lyzinski, Youngser Park, Amitabh Basu, Piali~Choudhury, Christopher M. White, Weiwei Yang, Carey E. Priebe

    Abstract: In modern ranking problems, different and disparate representations of the items to be ranked are often available. It is sensible, then, to try to combine these representations to improve ranking. Indeed, learning to rank via combining representations is both principled and practical for learning a ranking function for a particular query. In extremely data-scarce settings, however, the amount of l… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  11. The Phantom Alignment Strength Conjecture: Practical use of graph matching alignment strength to indicate a meaningful graph match

    Authors: Donniell E. Fishkind, Felix Parker, Hamilton Sawczuk, Lingyao Meng, Eric Bridgeford, Avanti Athreya, Carey E. Priebe, Vince Lyzinski

    Abstract: The alignment strength of a graph matching is a quantity that gives the practitioner a measure of the correlation of the two graphs, and it can also give the practitioner a sense for whether the graph matching algorithm found the true matching. Unfortunately, when a graph matching algorithm fails to find the truth because of weak signal, there may be "phantom alignment strength" from meaningless m… ▽ More

    Submitted 23 August, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

  12. arXiv:2101.12430  [pdf, other

    cs.LG cs.IR cs.SI stat.ML

    Subgraph nomination: Query by Example Subgraph Retrieval in Networks

    Authors: Al-Fahad M. Al-Qadhi, Carey E. Priebe, Hayden S. Helm, Vince Lyzinski

    Abstract: This paper introduces the subgraph nomination inference task, in which example subgraphs of interest are used to query a network for similarly interesting subgraphs. This type of problem appears time and again in real world problems connected to, for example, user recommendation systems and structural retrieval tasks in social and biological/connectomic networks. We formally define the subgraph no… ▽ More

    Submitted 19 December, 2022; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: 37 pages, 11 figures

  13. arXiv:2010.14622  [pdf, other

    cs.SI stat.ME

    Vertex nomination between graphs via spectral embedding and quadratic programming

    Authors: Runbing Zheng, Vince Lyzinski, Carey E. Priebe, Minh Tang

    Abstract: Given a network and a subset of interesting vertices whose identities are only partially known, the vertex nomination problem seeks to rank the remaining vertices in such a way that the interesting vertices are ranked at the top of the list. An important variant of this problem is vertex nomination in the multi-graphs setting. Given two graphs $G_1, G_2$ with common vertices and a vertex of intere… ▽ More

    Submitted 27 March, 2022; v1 submitted 24 October, 2020; originally announced October 2020.

  14. arXiv:2008.00163  [pdf, other

    stat.ME stat.ML

    The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks

    Authors: Konstantinos Pantazis, Avanti Athreya, Jesús Arroyo, William N. Frost, Evan S. Hill, Vince Lyzinski

    Abstract: Spectral inference on multiple networks is a rapidly-developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same networks. Such inference procedures typically rely heavily on independence assumptions across the multipl… ▽ More

    Submitted 17 June, 2021; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: 44 pages, 13 figures

    MSC Class: 62H12; 62E20; 05C80

  15. arXiv:2005.02151  [pdf, other

    cs.IR cs.LG math.ST stat.ML

    Vertex Nomination in Richly Attributed Networks

    Authors: Keith Levin, Carey E. Priebe, Vince Lyzinski

    Abstract: Vertex nomination is a lightly-supervised network information retrieval task in which vertices of interest in one graph are used to query a second graph to discover vertices of interest in the second graph. Similar to other information retrieval tasks, the output of a vertex nomination scheme is a ranked list of the vertices in the second graph, with the heretofore unknown vertices of interest ide… ▽ More

    Submitted 4 May, 2023; v1 submitted 29 April, 2020; originally announced May 2020.

    Comments: 46 pages, 5 figures

  16. arXiv:2002.09976  [pdf, ps, other

    math.ST stat.ME

    On a complete and sufficient statistic for the correlated Bernoulli random graph model

    Authors: Donniell E. Fishkind, Avanti Athreya, Lingyao Meng, Vince Lyzinski, Carey E. Priebe

    Abstract: Inference on vertex-aligned graphs is of wide theoretical and practical importance.There are, however, few flexible and tractable statistical models for correlated graphs, and even fewer comprehensive approaches to parametric inference on data arising from such graphs. In this paper, we consider the correlated Bernoulli random graph model (allowing different Bernoulli coefficients and edge correla… ▽ More

    Submitted 30 March, 2021; v1 submitted 23 February, 2020; originally announced February 2020.

  17. arXiv:2002.01648  [pdf, other

    stat.ML cs.LG stat.ME

    Graph matching between bipartite and unipartite networks: to collapse, or not to collapse, that is the question

    Authors: Jesús Arroyo, Carey E. Priebe, Vince Lyzinski

    Abstract: Graph matching consists of aligning the vertices of two unlabeled graphs in order to maximize the shared structure across networks; when the graphs are unipartite, this is commonly formulated as minimizing their edge disagreements. In this paper, we address the common setting in which one of the graphs to match is a bipartite network and one is unipartite. Commonly, the bipartite networks are coll… ▽ More

    Submitted 12 April, 2021; v1 submitted 5 February, 2020; originally announced February 2020.

  18. arXiv:1908.02572  [pdf, other

    cs.SI math.CO

    Multiplex graph matching matched filters

    Authors: Konstantinos Pantazis, Daniel L. Sussman, Youngser Park, Zhirui Li, Carey E. Priebe, Vince Lyzinski

    Abstract: We consider the problem of detecting a noisy induced multiplex template network in a larger multiplex background network. Our approach, which extends the framework of Sussman et al. (2019) to the multiplex setting, leverages a multiplex analogue of the classical graph matching problem to use the template as a matched filter for efficiently searching the background for candidate template matches. T… ▽ More

    Submitted 3 December, 2021; v1 submitted 22 July, 2019; originally announced August 2019.

    Comments: 27 pages, 10 figures

  19. arXiv:1905.01776  [pdf, other

    stat.ML cs.LG cs.SI stat.CO

    Vertex Nomination, Consistent Estimation, and Adversarial Modification

    Authors: Joshua Agterberg, Youngser Park, Jonathan Larson, Christopher White, Carey E. Priebe, Vince Lyzinski

    Abstract: Given a pair of graphs $G_1$ and $G_2$ and a vertex set of interest in $G_1$, the vertex nomination (VN) problem seeks to find the corresponding vertices of interest in $G_2$ (if they exist) and produce a rank list of the vertices in $G_2$, with the corresponding vertices of interest in $G_2$ concentrating, ideally, at the top of the rank list. In this paper, we define and derive the analogue of B… ▽ More

    Submitted 14 April, 2020; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: 34 pages, 8 figures

  20. arXiv:1812.10519  [pdf, other

    stat.ML cs.LG math.ST

    Maximum Likelihood Estimation and Graph Matching in Errorfully Observed Networks

    Authors: Jesús Arroyo, Daniel L. Sussman, Carey E. Priebe, Vince Lyzinski

    Abstract: Given a pair of graphs with the same number of vertices, the inexact graph matching problem consists in finding a correspondence between the vertices of these graphs that minimizes the total number of induced edge disagreements. We study this problem from a statistical framework in which one of the graphs is an errorfully observed copy of the other. We introduce a corrupting channel model, and sho… ▽ More

    Submitted 2 July, 2020; v1 submitted 26 December, 2018; originally announced December 2018.

  21. arXiv:1808.08502  [pdf, other

    math.CO

    Alignment Strength and Correlation for Graphs

    Authors: Donniell E. Fishkind, Lingyao Meng, Ao Sun, Carey E. Priebe, Vince Lyzinski

    Abstract: When two graphs have a correlated Bernoulli distribution, we prove that the alignment strength of their natural bijection strongly converges to a novel measure of graph correlation $ρ_T$ that neatly combines intergraph with intragraph distribution parameters. Within broad families of the random graph parameter settings, we illustrate that exact graph matching runtime and also matchability are both… ▽ More

    Submitted 17 January, 2020; v1 submitted 25 August, 2018; originally announced August 2018.

    MSC Class: 05C80; 05C60; 90C35

  22. On a 'Two Truths' Phenomenon in Spectral Graph Clustering

    Authors: Carey E. Priebe, Youngser Park, Joshua T. Vogelstein, John M. Conroy, Vince Lyzinski, Minh Tang, Avanti Athreya, Joshua Cape, Eric Bridgeford

    Abstract: Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical resu… ▽ More

    Submitted 11 February, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

    Journal ref: PNAS 116 (2019) 5995-6000

  23. arXiv:1807.09299  [pdf, other

    stat.CO

    Tractable Graph Matching via Soft Seeding

    Authors: Fei Fang, Daniel L. Sussman, Vince Lyzinski

    Abstract: The graph matching problem aims to discover a latent correspondence between the vertex sets of two observed graphs. This problem has proven to be quite challenging, with few satisfying methods that are computationally tractable and widely applicable. The FAQ algorithm has proven to have good performance on benchmark problems and works with a indefinite relaxation of the problem. Due to the indefin… ▽ More

    Submitted 24 July, 2018; originally announced July 2018.

    Comments: 26 pages, 3 figures

  24. arXiv:1803.02423  [pdf, other

    stat.ML cs.DS

    Matched Filters for Noisy Induced Subgraph Detection

    Authors: Daniel L. Sussman, Youngser Park, Carey E. Priebe, Vince Lyzinski

    Abstract: The problem of finding the vertex correspondence between two noisy graphs with different number of vertices where the smaller graph is still large has many applications in social networks, neuroscience, and computer vision. We propose a solution to this problem via a graph matching matched filter: centering and padding the smaller adjacency matrix and applying graph matching methods to align it to… ▽ More

    Submitted 1 July, 2019; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: 41 pages, 7 figures

  25. arXiv:1802.04960  [pdf, other

    stat.ML

    Vertex nomination: The canonical sampling and the extended spectral nomination schemes

    Authors: Jordan Yoder, Li Chen, Henry Pao, Eric Bridgeford, Keith Levin, Donniell Fishkind, Carey Priebe, Vince Lyzinski

    Abstract: Suppose that one particular block in a stochastic block model is of interest, but block labels are only observed for a few of the vertices in the network. Utilizing a graph realized from the model and the observed block labels, the vertex nomination task is to order the vertices with unobserved block labels into a ranked nomination list with the goal of having an abundance of interesting vertices… ▽ More

    Submitted 22 January, 2020; v1 submitted 14 February, 2018; originally announced February 2018.

  26. arXiv:1711.05610  [pdf, other

    stat.ML

    On consistent vertex nomination schemes

    Authors: Vince Lyzinski, Keith Levin, Carey E. Priebe

    Abstract: Given a vertex of interest in a network $G_1$, the vertex nomination problem seeks to find the corresponding vertex of interest (if it exists) in a second network $G_2$. A vertex nomination scheme produces a list of the vertices in $G_2$, ranked according to how likely they are judged to be the corresponding vertex of interest in $G_2$. The vertex nomination problem and related information retriev… ▽ More

    Submitted 9 December, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: 32 pages, 4 figures

  27. arXiv:1709.05454  [pdf, other

    stat.ME math.ST stat.ML

    Statistical inference on random dot product graphs: a survey

    Authors: Avanti Athreya, Donniell E. Fishkind, Keith Levin, Vince Lyzinski, Youngser Park, Yichen Qin, Daniel L. Sussman, Minh Tang, Joshua T. Vogelstein, Carey E. Priebe

    Abstract: The random dot product graph (RDPG) is an independent-edge random graph that is analytically tractable and, simultaneously, either encompasses or can successfully approximate a wide range of random graphs, from relatively simple stochastic block models to complex latent position graphs. In this survey paper, we describe a comprehensive paradigm for statistical inference on random dot product graph… ▽ More

    Submitted 16 September, 2017; originally announced September 2017.

    Comments: An expository survey paper on a comprehensive paradigm for inference for random dot product graphs, centered on graph adjacency and Laplacian spectral embeddings. Paper outlines requisite background; summarizes theory, methodology, and applications from previous and ongoing work; and closes with a discussion of several open problems

    MSC Class: 62FXX; 62GXX; 62HXX; 05CXX

    Journal ref: Journal of Machine Learning Research, 2018

  28. arXiv:1705.09355  [pdf, other

    stat.ME

    A central limit theorem for an omnibus embedding of multiple random graphs and implications for multiscale network inference

    Authors: Keith Levin, Avanti Athreya, Minh Tang, Vince Lyzinski, Youngser Park, Carey E. Priebe

    Abstract: Performing statistical analyses on collections of graphs is of import to many disciplines, but principled, scalable methods for multi-sample graph inference are few. Here we describe an "omnibus" embedding in which multiple graphs on the same vertex set are jointly embedded into a single space with a distinct representation for each graph. We prove a central limit theorem for this embedding and de… ▽ More

    Submitted 25 June, 2019; v1 submitted 25 May, 2017; originally announced May 2017.

    MSC Class: 62H12; 62H15; 05C80

  29. arXiv:1705.03297  [pdf, other

    stat.ML

    Semiparametric spectral modeling of the Drosophila connectome

    Authors: Carey E. Priebe, Youngser Park, Minh Tang, Avanti Athreya, Vince Lyzinski, Joshua T. Vogelstein, Yichen Qin, Ben Cocanougher, Katharina Eichler, Marta Zlatic, Albert Cardona

    Abstract: We present semiparametric spectral modeling of the complete larval Drosophila mushroom body connectome. Motivated by a thorough exploratory data analysis of the network via Gaussian mixture modeling (GMM) in the adjacency spectral embedding (ASE) representation space, we introduce the latent structure model (LSM) for network modeling and inference. LSM is a generalization of the stochastic block m… ▽ More

    Submitted 9 May, 2017; originally announced May 2017.

  30. arXiv:1705.02294  [pdf, other

    math.ST cs.SI

    Matchability of heterogeneous networks pairs

    Authors: Vince Lyzinski, Daniel L. Sussman

    Abstract: We consider the problem of graph matchability in non-identically distributed networks. In a general class of edge-independent networks, we demonstrate that graph matchability can be lost with high probability when matching the networks directly. We further demonstrate that under mild model assumptions, matchability is almost perfectly recovered by centering the networks using Universal Singular Va… ▽ More

    Submitted 20 March, 2019; v1 submitted 5 May, 2017; originally announced May 2017.

    Comments: 44 pages, 10 figures

  31. arXiv:1705.00674  [pdf, ps, other

    stat.ML

    Vertex Nomination Via Seeded Graph Matching

    Authors: Heather G. Patsolic, Youngser Park, Vince Lyzinski, Carey E. Priebe

    Abstract: Consider two networks on overlapping, non-identical vertex sets. Given vertices of interest in the first network, we seek to identify the corresponding vertices, if any exist, in the second network. While in moderately sized networks graph matching methods can be applied directly to recover the missing correspondences, herein we present a principled methodology appropriate for situations in which… ▽ More

    Submitted 5 November, 2019; v1 submitted 1 May, 2017; originally announced May 2017.

    Comments: 19 pages, 14 (sub)figures, edits: removed investigation of the impact of seeds and moved the material to a supplement that will be available on the webpage indicated in the article, and did some word-smithing to make the article cleaner

  32. arXiv:1608.00451  [pdf, other

    stat.CO math.NA

    Numerical tolerance for spectral decompositions of random matrices

    Authors: Avanti Athreya, Michael Kane, Bryan Lewis, Zachary Lubberts, Vince Lyzinski, Youngser Park, Carey E. Priebe, Minh Tang

    Abstract: We precisely quantify the impact of statistical error in the quality of a numerical approximation to a random matrix eigendecomposition, and under mild conditions, we use this to introduce an optimal numerical tolerance for residual error in spectral decompositions of random matrices. We demonstrate that terminating an eigendecomposition algorithm when the numerical error and statistical error are… ▽ More

    Submitted 30 January, 2020; v1 submitted 1 August, 2016; originally announced August 2016.

    Comments: 20 pages, 2 figures

    MSC Class: 15; 62; 65

  33. arXiv:1607.01369  [pdf, other

    stat.ML

    On the Consistency of the Likelihood Maximization Vertex Nomination Scheme: Bridging the Gap Between Maximum Likelihood Estimation and Graph Matching

    Authors: Vince Lyzinski, Keith Levin, Donniell E. Fishkind, Carey E. Priebe

    Abstract: Given a graph in which a few vertices are deemed interesting a priori, the vertex nomination task is to order the remaining vertices into a nomination list such that there is a concentration of interesting vertices at the top of the list. Previous work has yielded several approaches to this problem, with theoretical results in the setting where the graph is drawn from a stochastic block model (SBM… ▽ More

    Submitted 27 August, 2016; v1 submitted 5 July, 2016; originally announced July 2016.

  34. arXiv:1605.02315  [pdf, other

    stat.ML cs.IT math.CO

    Information Recovery in Shuffled Graphs via Graph Matching

    Authors: Vince Lyzinski

    Abstract: While many multiple graph inference methodologies operate under the implicit assumption that an explicit vertex correspondence is known across the vertex sets of the graphs, in practice these correspondences may only be partially or errorfully known. Herein, we provide an information theoretic foundation for understanding the practical impact that errorfully observed vertex correspondences can hav… ▽ More

    Submitted 27 September, 2017; v1 submitted 8 May, 2016; originally announced May 2016.

    Comments: 55 pages, 6 figures

  35. Laplacian Eigenmaps from Sparse, Noisy Similarity Measurements

    Authors: Keith Levin, Vince Lyzinski

    Abstract: Manifold learning and dimensionality reduction techniques are ubiquitous in science and engineering, but can be computationally expensive procedures when applied to large data sets or when similarities are expensive to compute. To date, little work has been done to investigate the tradeoff between computational resources and the quality of learned representations. We present both theoretical and e… ▽ More

    Submitted 16 August, 2016; v1 submitted 12 March, 2016; originally announced March 2016.

  36. Semi-External Memory Sparse Matrix Multiplication for Billion-Node Graphs

    Authors: Da Zheng, Disa Mhembere, Vince Lyzinski, Joshua Vogelstein, Carey E. Priebe, Randal Burns

    Abstract: Sparse matrix multiplication is traditionally performed in memory and scales to large matrices using the distributed memory of multiple nodes. In contrast, we scale sparse matrix multiplication beyond memory capacity by implementing sparse matrix dense matrix multiplication (SpMM) in a semi-external memory (SEM) fashion; i.e., we keep the sparse matrix on commodity SSDs and dense matrices in memor… ▽ More

    Submitted 14 October, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: published in IEEE Transactions on Parallel and Distributed Systems

  37. arXiv:1508.04422  [pdf, other

    stat.ML cs.LG cs.NE stat.ME

    Scalable Out-of-Sample Extension of Graph Embeddings Using Deep Neural Networks

    Authors: Aren Jansen, Gregory Sell, Vince Lyzinski

    Abstract: Several popular graph embedding techniques for representation learning and dimensionality reduction rely on performing computationally expensive eigendecompositions to derive a nonlinear transformation of the input data space. The resulting eigenvectors encode the embedding coordinates for the training samples only, and so the embedding of novel data samples requires further costly computation. In… ▽ More

    Submitted 14 June, 2016; v1 submitted 18 August, 2015; originally announced August 2015.

    Comments: 10 pages, 2 figures, 1 table, this paper is under consideration for publication in Pattern Recognition Letters

  38. arXiv:1507.08376  [pdf, other

    stat.AP q-bio.NC

    A Joint Graph Inference Case Study: the C.elegans Chemical and Electrical Connectomes

    Authors: Li Chen, Joshua T. Vogelstein, Vince Lyzinski, Carey E. Priebe

    Abstract: We investigate joint graph inference for the chemical and electrical connectomes of the \textit{Caenorhabditis elegans} roundworm. The \textit{C.elegans} connectomes consist of $253$ non-isolated neurons with known functional attributes, and there are two types of synaptic connectomes, resulting in a pair of graphs. We formulate our joint graph inference from the perspectives of seeded graph match… ▽ More

    Submitted 5 August, 2015; v1 submitted 30 July, 2015; originally announced July 2015.

  39. arXiv:1503.02115  [pdf, other

    stat.ML stat.AP

    Community Detection and Classification in Hierarchical Stochastic Blockmodels

    Authors: Vince Lyzinski, Minh Tang, Avanti Athreya, Youngser Park, Carey E. Priebe

    Abstract: We propose a robust, scalable, integrated methodology for community detection and community comparison in graphs. In our procedure, we first embed a graph into an appropriate Euclidean space to obtain a low-dimensional representation, and then cluster the vertices into communities. We next employ nonparametric graph inference techniques to identify structural similarity among these communities. Th… ▽ More

    Submitted 25 August, 2016; v1 submitted 6 March, 2015; originally announced March 2015.

    Comments: 17 pages, 7 figures

  40. arXiv:1502.03391  [pdf, other

    stat.ML stat.ME

    Fast Embedding for JOFC Using the Raw Stress Criterion

    Authors: Vince Lyzinski, Youngser Park, Carey E. Priebe, Michael W. Trosset

    Abstract: The Joint Optimization of Fidelity and Commensurability (JOFC) manifold matching methodology embeds an omnibus dissimilarity matrix consisting of multiple dissimilarities on the same set of objects. One approach to this embedding optimizes the preservation of fidelity to each individual dissimilarity matrix together with commensurability of each given observation across modalities via iterative ma… ▽ More

    Submitted 31 October, 2016; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 43 pages, 10 figures, 3 tables

  41. arXiv:1409.2344  [pdf, other

    math.ST

    A nonparametric two-sample hypothesis testing problem for random dot product graphs

    Authors: Minh Tang, Avanti Athreya, Daniel L. Sussman, Vince Lyzinski, Carey E. Priebe

    Abstract: We consider the problem of testing whether two finite-dimensional random dot product graphs have generating latent positions that are independently drawn from the same distribution, or distributions that are related via scaling or projection. We propose a test statistic that is a kernel-based function of the adjacency spectral embedding for each graph. We obtain a limiting distribution for our tes… ▽ More

    Submitted 12 November, 2015; v1 submitted 8 September, 2014; originally announced September 2014.

    Comments: 24 pages, 1 figures

    MSC Class: 62G10; 62H12; 05C80; 60B20

  42. arXiv:1405.3133  [pdf, other

    stat.ML math.OC

    Graph Matching: Relax at Your Own Risk

    Authors: Vince Lyzinski, Donniell Fishkind, Marcelo Fiori, Joshua T. Vogelstein, Carey E. Priebe, Guillermo Sapiro

    Abstract: Graph matching---aligning a pair of graphs to minimize their edge disagreements---has received wide-spread attention from both theoretical and applied communities over the past several decades, including combinatorics, computer vision, and connectomics. Its attention can be partially attributed to its computational difficulty. Although many heuristics have previously been proposed in the literatur… ▽ More

    Submitted 9 January, 2015; v1 submitted 13 May, 2014; originally announced May 2014.

    Comments: 14 pages, 11 figures, 3 tables

  43. arXiv:1403.7249  [pdf, other

    stat.ME

    A semiparametric two-sample hypothesis testing problem for random dot product graphs

    Authors: Minh Tang, Avanti Athreya, Daniel L. Sussman, Vince Lyzinski, Carey E. Priebe

    Abstract: Two-sample hypothesis testing for random graphs arises naturally in neuroscience, social networks, and machine learning. In this paper, we consider a semiparametric problem of two-sample hypothesis testing for a class of latent position random graphs. We formulate a notion of consistency in this context and propose a valid test for the hypothesis that two finite-dimensional random dot product grap… ▽ More

    Submitted 18 June, 2015; v1 submitted 27 March, 2014; originally announced March 2014.

    Comments: 44 pages

    MSC Class: 62G10; 62H12; 05C80

  44. arXiv:1402.5706  [pdf, ps, other

    math.PR

    Strong Stationary Duality for Diffusion Processes

    Authors: James Allen Fill, Vince Lyzinski

    Abstract: We develop the theory of strong stationary duality for diffusion processes on compact intervals. We analytically derive the generator and boundary behavior of the dual process and recover a central tenet of the classical Markov chain theory in the diffusion setting by linking the separation distance in the primal diffusion to the absorption time in the dual diffusion. We also exhibit our strong st… ▽ More

    Submitted 17 April, 2015; v1 submitted 23 February, 2014; originally announced February 2014.

    Comments: 34 pages

  45. arXiv:1401.3813  [pdf, other

    stat.ML stat.AP stat.ME

    Seeded Graph Matching Via Joint Optimization of Fidelity and Commensurability

    Authors: Heather Patsolic, Sancar Adali, Joshua T. Vogelstein, Youngser Park, Carey E. Friebe, Gongkai Li, Vince Lyzinski

    Abstract: We present a novel approximate graph matching algorithm that incorporates seeded data into the graph matching paradigm. Our Joint Optimization of Fidelity and Commensurability (JOFC) algorithm embeds two graphs into a common Euclidean space where the matching inference task can be performed. Through real and simulated data examples, we demonstrate the versatility of our algorithm in matching graph… ▽ More

    Submitted 8 December, 2019; v1 submitted 15 January, 2014; originally announced January 2014.

    Comments: 26 pages, 7 figures. Updated content and added application of simultaneous matching for several time-steps for zebrafish connectomes

  46. arXiv:1312.2638  [pdf, ps, other

    stat.ML math.OC stat.AP

    Vertex nomination schemes for membership prediction

    Authors: D. E. Fishkind, V. Lyzinski, H. Pao, L. Chen, C. E. Priebe

    Abstract: Suppose that a graph is realized from a stochastic block model where one of the blocks is of interest, but many or all of the vertices' block labels are unobserved. The task is to order the vertices with unobserved block labels into a ``nomination list'' such that, with high probability, vertices from the interesting block are concentrated near the list's beginning. We propose several vertex nomin… ▽ More

    Submitted 17 November, 2015; v1 submitted 9 December, 2013; originally announced December 2013.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS834 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS834

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 3, 1510-1532

  47. arXiv:1310.1297  [pdf, other

    stat.ML math.OC stat.CO

    Spectral Clustering for Divide-and-Conquer Graph Matching

    Authors: Vince Lyzinski, Daniel L. Sussman, Donniell E. Fishkind, Henry Pao, Li Chen, Joshua T. Vogelstein, Youngser Park, Carey E. Priebe

    Abstract: We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures. We justify our approach by proving that modestly correlated, large stochastic block model random graphs are correctly matched utilizing very few seeds through ou… ▽ More

    Submitted 12 March, 2015; v1 submitted 4 October, 2013; originally announced October 2013.

    Comments: 32 pages, 8 figures

  48. arXiv:1310.0532  [pdf, other

    stat.ML

    Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding

    Authors: Vince Lyzinski, Daniel Sussman, Minh Tang, Avanti Athreya, Carey Priebe

    Abstract: Vertex clustering in a stochastic blockmodel graph has wide applicability and has been the subject of extensive research. In thispaper, we provide a short proof that the adjacency spectral embedding can be used to obtain perfect clustering for the stochastic blockmodel and the degree-corrected stochastic blockmodel. We also show an analogous result for the more general random dot product graph mod… ▽ More

    Submitted 15 January, 2015; v1 submitted 1 October, 2013; originally announced October 2013.

    Comments: 22 pages, including references; 2 figures

    Journal ref: Electronic Journal of Statistics, 8 (2014) 2905--2922

  49. arXiv:1305.7388  [pdf, other

    math.ST stat.ML

    A central limit theorem for scaled eigenvectors of random dot product graphs

    Authors: Avanti Athreya, Vince Lyzinski, David J. Marchette, Carey E. Priebe, Daniel L. Sussman, Minh Tang

    Abstract: We prove a central limit theorem for the components of the largest eigenvectors of the adjacency matrix of a finite-dimensional random dot product graph whose true latent positions are unknown. In particular, we follow the methodology outlined in \citet{sussman2012universally} to construct consistent estimates for the latent positions, and we show that the appropriately scaled differences between… ▽ More

    Submitted 23 December, 2013; v1 submitted 31 May, 2013; originally announced May 2013.

    Comments: 24 pages, 2 figures

  50. arXiv:1304.7844  [pdf, other

    math.OC math.CO math.PR

    Seeded graph matching for correlated Erdős-Rényi graphs

    Authors: Vince Lyzinski, Donniell E. Fishkind, Carey E. Priebe

    Abstract: Graph matching is an important problem in machine learning and pattern recognition. Herein, we present theoretical and practical results on the consistency of graph matching for estimating a latent alignment function between the vertex sets of two graphs, as well as subsequent algorithmic implications when the latent alignment is partially observed. In the correlated Erdős-Rényi graph setting, we… ▽ More

    Submitted 1 August, 2014; v1 submitted 29 April, 2013; originally announced April 2013.

    Comments: 28 pages, 5 figures