Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–27 of 27 results for author: Burkholz, R

.
  1. arXiv:2406.02773  [pdf, other

    cs.LG cs.CV

    Cyclic Sparse Training: Is it Enough?

    Authors: Advait Gadhikar, Sree Harsha Nelaturu, Rebekka Burkholz

    Abstract: The success of iterative pruning methods in achieving state-of-the-art sparse networks has largely been attributed to improved mask identification and an implicit regularization induced by pruning. We challenge this hypothesis and instead posit that their repeated cyclic training schedules enable improved optimization. To verify this, we show that pruning at initialization is significantly boosted… ▽ More

    Submitted 7 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2406.00418  [pdf, other

    cs.LG

    GATE: How to Keep Out Intrusive Neighbors

    Authors: Nimrah Mustafa, Rebekka Burkholz

    Abstract: Graph Attention Networks (GATs) are designed to provide flexible neighborhood aggregation that assigns weights to neighbors according to their importance. In practice, however, GATs are often unable to switch off task-irrelevant neighborhood aggregation, as we show experimentally and analytically. To address this challenge, we propose GATE, a GAT extension that holds three major advantages: i) It… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 26 pages. To be published at the International Conference on Machine Learning (ICML), 2024

  3. arXiv:2405.18655  [pdf, other

    cs.LG cs.AI q-bio.GN

    CAVACHON: a hierarchical variational autoencoder to integrate multi-modal single-cell data

    Authors: Ping-Han Hsieh, Ru-Xiu Hsiao, Katalin Ferenc, Anthony Mathelier, Rebekka Burkholz, Chien-Yu Chen, Geir Kjetil Sandve, Tatiana Belova, Marieke Lydia Kuijjer

    Abstract: Paired single-cell sequencing technologies enable the simultaneous measurement of complementary modalities of molecular data at single-cell resolution. Along with the advances in these technologies, many methods based on variational autoencoders have been developed to integrate these data. However, these methods do not explicitly incorporate prior biological relationships between the data modaliti… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2404.04612  [pdf, other

    cs.LG eess.SP stat.ML

    Spectral Graph Pruning Against Over-Squashing and Over-Smoothing

    Authors: Adarsh Jamadandi, Celia Rubio-Madrigal, Rebekka Burkholz

    Abstract: Message Passing Graph Neural Networks are known to suffer from two problems that are sometimes believed to be diametrically opposed: over-squashing and over-smoothing. The former results from topological bottlenecks that hamper the information flow from distant nodes and are mitigated by spectral gap maximization, primarily, by means of edge additions. However, such additions often promote over-sm… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  5. arXiv:2403.04805  [pdf, other

    cs.LG q-bio.QM stat.AP stat.ML

    Not all tickets are equal and we know it: Guiding pruning with domain-specific knowledge

    Authors: Intekhab Hossain, Jonas Fischer, Rebekka Burkholz, John Quackenbush

    Abstract: Neural structure learning is of paramount importance for scientific discovery and interpretability. Yet, contemporary pruning algorithms that focus on computational resource efficiency face algorithmic barriers to select a meaningful model that aligns with domain expertise. To mitigate this challenge, we propose DASH, which guides pruning by available domain-specific structural information. In the… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  6. arXiv:2402.19262  [pdf, other

    cs.LG

    Masks, Signs, And Learning Rate Rewinding

    Authors: Advait Gadhikar, Rebekka Burkholz

    Abstract: Learning Rate Rewinding (LRR) has been established as a strong variant of Iterative Magnitude Pruning (IMP) to find lottery tickets in deep overparameterized neural networks. While both iterative pruning schemes couple structure and parameter learning, understanding how LRR excels in both aspects can bring us closer to the design of more flexible deep learning algorithms that can optimize diverse… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted for publishing at ICLR 2024

  7. arXiv:2310.07235  [pdf, other

    cs.LG

    Are GATs Out of Balance?

    Authors: Nimrah Mustafa, Aleksandar Bojchevski, Rebekka Burkholz

    Abstract: While the expressive power and computational capabilities of graph neural networks (GNNs) have been theoretically studied, their optimization and learning dynamics, in general, remain largely unexplored. Our study undertakes the Graph Attention Network (GAT), a popular GNN architecture in which a node's neighborhood aggregation is weighted by parameterized attention coefficients. We derive a conse… ▽ More

    Submitted 25 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 25 pages. To be published in Advances in Neural Information Processing Systems (NeurIPS), 2023

  8. arXiv:2301.13732  [pdf, other

    cs.LG stat.ML

    Preserving local densities in low-dimensional embeddings

    Authors: Jonas Fischer, Rebekka Burkholz, Jilles Vreeken

    Abstract: Low-dimensional embeddings and visualizations are an indispensable tool for analysis of high-dimensional data. State-of-the-art methods, such as tSNE and UMAP, excel in unveiling local structures hidden in high-dimensional data and are therefore routinely applied in standard analysis pipelines in biology. We show, however, that these methods fail to reconstruct local properties, such as relative d… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  9. arXiv:2210.02412  [pdf, other

    cs.LG

    Why Random Pruning Is All We Need to Start Sparse

    Authors: Advait Gadhikar, Sohom Mukherjee, Rebekka Burkholz

    Abstract: Random masks define surprisingly effective sparse neural network models, as has been shown empirically. The resulting sparse networks can often compete with dense architectures and state-of-the-art lottery ticket pruning algorithms, even though they do not rely on computationally expensive prune-train iterations and can be drawn initially without significant computational overhead. We offer a theo… ▽ More

    Submitted 31 May, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at ICML, 2023

  10. arXiv:2210.02411  [pdf, other

    cs.LG

    Dynamical Isometry for Residual Networks

    Authors: Advait Gadhikar, Rebekka Burkholz

    Abstract: The training success, training speed and generalization ability of neural networks rely crucially on the choice of random parameter initialization. It has been shown for multiple architectures that initial dynamical isometry is particularly advantageous. Known initialization schemes for residual blocks, however, miss this property and suffer from degrading separability of different inputs for incr… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: 22 pages, 5 figures

  11. arXiv:2205.02343  [pdf, other

    cs.LG cs.AI

    Convolutional and Residual Networks Provably Contain Lottery Tickets

    Authors: Rebekka Burkholz

    Abstract: The Lottery Ticket Hypothesis continues to have a profound practical impact on the quest for small scale deep neural networks that solve modern deep learning tasks at competitive performance. These lottery tickets are identified by pruning large randomly initialized neural networks with architectures that are as diverse as their applications. Yet, theoretical insights that attest their existence h… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  12. arXiv:2205.02321  [pdf, other

    cs.LG cs.AI

    Most Activation Functions Can Win the Lottery Without Excessive Depth

    Authors: Rebekka Burkholz

    Abstract: The strong lottery ticket hypothesis has highlighted the potential for training deep neural networks by pruning, which has inspired interesting practical and theoretical insights into how neural networks can represent functions. For networks with ReLU activation functions, it has been proven that a target network with depth $L$ can be approximated by the subnetwork of a randomly initialized neural… ▽ More

    Submitted 8 January, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at NeurIPS 2022

  13. arXiv:2111.11153  [pdf, other

    cs.LG cs.AI stat.ML

    Plant 'n' Seek: Can You Find the Winning Ticket?

    Authors: Jonas Fischer, Rebekka Burkholz

    Abstract: The lottery ticket hypothesis has sparked the rapid development of pruning algorithms that aim to reduce the computational costs associated with deep learning during training and model deployment. Currently, such algorithms are primarily evaluated on imaging data, for which we lack ground truth information and thus the understanding of how sparse lottery tickets could be. To fill this gap, we deve… ▽ More

    Submitted 7 June, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  14. arXiv:2111.11146  [pdf, other

    cs.LG cs.AI stat.ML

    On the Existence of Universal Lottery Tickets

    Authors: Rebekka Burkholz, Nilanjana Laha, Rajarshi Mukherjee, Alkis Gotovos

    Abstract: The lottery ticket hypothesis conjectures the existence of sparse subnetworks of large randomly initialized deep neural networks that can be successfully trained in isolation. Recent work has experimentally observed that some of these tickets can be practically reused across a variety of tasks, hinting at some form of universality. We formalize this concept and theoretically prove that not only do… ▽ More

    Submitted 16 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted for publication at The Tenth International Conference on Learning Representations (ICLR 2022)

  15. arXiv:2110.11150  [pdf, ps, other

    cs.LG cs.AI

    Lottery Tickets with Nonzero Biases

    Authors: Jonas Fischer, Advait Gadhikar, Rebekka Burkholz

    Abstract: The strong lottery ticket hypothesis holds the promise that pruning randomly initialized deep neural networks could offer a computationally efficient alternative to deep learning with stochastic gradient descent. Common parameter initialization schemes and existence proofs, however, are focused on networks with zero biases, thus foregoing the potential universal approximation property of pruning.… ▽ More

    Submitted 7 June, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  16. arXiv:2107.02911  [pdf, other

    cs.LG stat.ML

    Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification

    Authors: Alkis Gotovos, Rebekka Burkholz, John Quackenbush, Stefanie Jegelka

    Abstract: Modeling the time evolution of discrete sets of items (e.g., genetic mutations) is a fundamental problem in many biomedical applications. We approach this problem through the lens of continuous-time Markov chains, and show that the resulting learning task is generally underspecified in the usual setting of cross-sectional data. We explore a perhaps surprising remedy: including a number of addition… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  17. arXiv:2104.01690  [pdf, other

    q-bio.MN

    DRAGON: Determining Regulatory Associations using Graphical models on multi-Omic Networks

    Authors: Katherine H. Shutta, Deborah Weighill, Rebekka Burkholz, Marouen Ben Guebila, Dawn L. DeMeo, Helena U. Zacharias, John Quackenbush, Michael Altenbuchinger

    Abstract: The increasing quantity of multi-omics data, such as methylomic and transcriptomic profiles, collected on the same specimen, or even on the same cell, provide a unique opportunity to explore the complex interactions that define cell phenotype and govern cellular responses to perturbations. We propose a network approach based on Gaussian Graphical Models (GGMs) that facilitates the joint analysis o… ▽ More

    Submitted 21 September, 2022; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: 24 pages, 8 figures

  18. arXiv:1909.05416  [pdf, other

    cs.AI cs.SI physics.soc-ph

    Cascade Size Distributions: Why They Matter and How to Compute Them Efficiently

    Authors: Rebekka Burkholz, John Quackenbush

    Abstract: Cascade models are central to understanding, predicting, and controlling epidemic spreading and information propagation. Related optimization, including influence maximization, model parameter inference, or the development of vaccination strategies, relies heavily on sampling from a model. This is either inefficient or inaccurate. As alternative, we present an efficient message passing algorithm t… ▽ More

    Submitted 16 December, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted at AAAI 2021

  19. arXiv:1901.05872  [pdf, other

    physics.soc-ph nlin.AO q-fin.TR

    International crop trade networks: The impact of shocks and cascades

    Authors: Rebekka Burkholz, Frank Schweitzer

    Abstract: Analyzing available FAO data from 176 countries over 21 years, we observe an increase of complexity in the international trade of maize, rice, soy, and wheat. A larger number of countries play a role as producers or intermediaries, either for trade or food processing. In consequence, we find that the trade networks become more prone to failure cascades caused by exogenous shocks. In our model, cou… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

  20. arXiv:1811.06872  [pdf, other

    physics.soc-ph nlin.CD

    Efficient message passing for cascade size distributions on finite trees

    Authors: Rebekka Burkholz

    Abstract: How big is the risk that a few initial failures of networked nodes amplify to large cascades that endanger the functioning of the system? Common answers refer to the average final cascade size. Two analytic approaches allow its computation: a) (heterogeneous) mean field approximation and b) belief propagation. The former applies to (infinitely) large locally tree-like networks, while the latter is… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  21. arXiv:1806.06362  [pdf, other

    stat.ML cs.LG

    Initialization of ReLUs for Dynamical Isometry

    Authors: Rebekka Burkholz, Alina Dubatovka

    Abstract: Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and sometimes also generalization ability of an instance. In addition, such ensembles provide theoretical insights into the space of candidate models of which one is sel… ▽ More

    Submitted 24 October, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: NeurIPS 2019

  22. arXiv:1802.03286  [pdf, other

    physics.soc-ph math.ST q-fin.RM

    Explicit size distributions of failure cascades redefine systemic risk on finite networks

    Authors: Rebekka Burkholz, Hans J. Herrmann, Frank Schweitzer

    Abstract: How big is the risk that a few initial failures of nodes in a network amplify to large cascades that span a substantial share of all nodes? Predicting the final cascade size is critical to ensure the functioning of a system as a whole. Yet, this task is hampered by uncertain or changing parameters and missing information. In infinitely large networks, the average cascade size can often be well est… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

    Comments: systemic risk, finite size effects, cascades, networks

  23. arXiv:1712.01755  [pdf, other

    physics.soc-ph cs.MA nlin.AO

    Modeling the formation of R\&D alliances: An agent-based model with empirical validation

    Authors: Mario V. Tomasello, Rebekka Burkholz, Frank Schweitzer

    Abstract: We develop an agent-based model to reproduce the size distribution of R\&D alliances of firms. Agents are uniformly selected to initiate an alliance and to invite collaboration partners. These decide about acceptance based on an individual threshold that is compared with the utility expected from joining the current alliance. The benefit of alliances results from the fitness of the agents involved… ▽ More

    Submitted 5 December, 2017; originally announced December 2017.

  24. arXiv:1706.04451  [pdf, other

    nlin.AO cond-mat.stat-mech nlin.CD nlin.SI physics.soc-ph

    Correlations between thresholds and degrees: An analytic approach to model attacks and failure cascades

    Authors: Rebekka Burkholz, Frank Schweitzer

    Abstract: Two node variables determine the evolution of cascades in random networks: a node's degree and threshold. Correlations between both fundamentally change the robustness of a network, yet, they are disregarded in standard analytic methods as local tree or heterogeneous mean field approximations because of the bad tractability of order statistics. We show how they become tractable in the thermodynami… ▽ More

    Submitted 16 June, 2017; v1 submitted 14 June, 2017; originally announced June 2017.

    MSC Class: 60K35

    Journal ref: Phys. Rev. E 98, 022306 (2018)

  25. arXiv:1701.06970  [pdf, other

    physics.soc-ph cs.SI

    A framework for cascade size calculations on random networks

    Authors: Rebekka Burkholz, Frank Schweitzer

    Abstract: We present a framework to calculate the cascade size evolution for a large class of cascade models on random network ensembles in the limit of infinite network size. Our method is exact and applies to network ensembles with almost arbitrary degree distribution, degree-degree correlations and, in case of threshold models, with arbitrary threshold distribution. With our approach, we shift the perspe… ▽ More

    Submitted 18 January, 2017; originally announced January 2017.

    Journal ref: Phys. Rev. E 97, 042312 (2018)

  26. arXiv:1506.06664  [pdf, other

    physics.soc-ph q-fin.RM

    Systemic risk in multiplex networks with asymmetric coupling and threshold feedback

    Authors: Rebekka Burkholz, Matt V. Leduc, Antonios Garas, Frank Schweitzer

    Abstract: We study cascades on a two-layer multiplex network, with asymmetric feedback that depends on the coupling strength between the layers. Based on an analytical branching process approximation, we calculate the systemic risk measured by the final fraction of failed nodes on a reference layer. The results are compared with the case of a single layer network that is an aggregated representation of the… ▽ More

    Submitted 22 June, 2015; originally announced June 2015.

    Comments: 18 pages, 5 figures

    Journal ref: Physica D: Nonlinear Phenomena, Vol. 323--324, 64--72 (2016)

  27. How Damage Diversification Can Reduce Systemic Risk

    Authors: Rebekka Burkholz, Antonios Garas, Frank Schweitzer

    Abstract: We consider the problem of risk diversification in complex networks. Nodes represent e.g. financial actors, whereas weighted links represent e.g. financial obligations (credits/debts). Each node has a risk to fail because of losses resulting from defaulting neighbors, which may lead to large failure cascades. Classical risk diversification strategies usually neglect network effects and therefore s… ▽ More

    Submitted 3 March, 2015; originally announced March 2015.

    Journal ref: Phys. Rev. E 93, 042313 (2016)