Showing 1–32 of 32 results for author: Karrer, B

  1. arXiv:2402.14017  [pdf, other]

    cs.LG

    D-Flow: Differentiating through Flows for Controlled Generation

    Authors: Heli Ben-Hamu, Omri Puny, Itai Gat, Brian Karrer, Uriel Singer, Yaron Lipman

    Abstract: Taming the generation outcome of state of the art Diffusion and Flow-Matching (FM) models without having to re-train a task-specific model unlocks a powerful tool for solving inverse problems, conditional generation, and controlled generation in general. In this work we introduce D-Flow, a simple framework for controlling the generation process by differentiating through the flow, optimizing for t…

    Submitted 21 February, 2024; originally announced February 2024.

  2. arXiv:2402.11070  [pdf, other]

    stat.ME

    Scalable Analysis of Bipartite Experiments

    Authors: Liang Shi, Edvard Bakhitov, Kenneth Hung, Brian Karrer, Charlie Walker, Monica Bhole, Okke Schrijvers

    Abstract: Bipartite Experiments are randomized experiments where the treatment is applied to a set of units (randomization units) that is different from the units of analysis, and randomization units and analysis units are connected through a bipartite graph. The scale of experimentation at large online platforms necessitates both accurate inference in the presence of a large bipartite interference graph, a…

    Submitted 16 February, 2024; originally announced February 2024.

  3. arXiv:2310.14983  [pdf, other]

    econ.EM math.ST stat.ME

    Causal clustering: design of cluster experiments under network interference

    Authors: Davide Viviano, Lihua Lei, Guido Imbens, Brian Karrer, Okke Schrijvers, Liang Shi

    Abstract: This paper studies the design of cluster experiments to estimate the global treatment effect in the presence of network spillovers. We provide a framework to choose the clustering that minimizes the worst-case mean-squared error of the estimated global effect. We show that optimal clustering solves a novel penalized min-cut optimization problem computed via off-the-shelf semi-definite programming…

    Submitted 13 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  4. arXiv:2310.04432  [pdf, other]

    cs.CV cs.AI cs.LG

    Training-free Linear Image Inverses via Flows

    Authors: Ashwini Pokle, Matthew J. Muckley, Ricky T. Q. Chen, Brian Karrer

    Abstract: Solving inverse problems without any training involves using a pretrained generative model and making appropriate modifications to the generation process to avoid finetuning of the generative model. While recent methods have explored the use of diffusion models, they still require the manual tuning of many hyperparameters for different inverse problems. In this work, we propose a training-free met…

    Submitted 10 March, 2024; v1 submitted 25 September, 2023; originally announced October 2023.

    Comments: 40 pages, 30 figures. Added additional qualitative results in the appendix

  5. arXiv:2310.02233  [pdf, other]

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  6. arXiv:2309.02591  [pdf, other]

    cs.LG cs.CL cs.CV

    Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

    Authors: Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, et al. (2 additional authors not shown)

    Abstract: We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows the extreme benefits of scaling up and tuning on more diverse instruction-style data. It is the first multi-modal model trained with a recipe adapted fr…

    Submitted 5 September, 2023; originally announced September 2023.

  7. arXiv:2306.15687  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

    Authors: Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu

    Abstract: Large-scale generative models such as GPT and DALL-E have revolutionized the research community. These models not only generate high fidelity outputs, but are also generalists which can solve tasks not explicitly taught. In contrast, speech generative models are still primitive in terms of scale and task generalization. In this paper, we present Voicebox, the most versatile text-guided generative…

    Submitted 19 October, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023

  8. arXiv:2306.04803  [pdf, other]

    cs.LG cs.CL cs.CR

    Privately generating tabular data using language models

    Authors: Alexandre Sablayrolles, Yue Wang, Brian Karrer

    Abstract: Privately generating synthetic data from a table is an important brick of a privacy-first world. We propose and investigate a simple approach of treating each row in a table as a sentence and training a language model with differential privacy. We show this approach obtains competitive results in modelling tabular data across multiple datasets, even at small scales that favor alternative methods b…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages, 3 figures

  9. arXiv:2202.01100  [pdf, other]

    cs.CR cs.DB

    Exact Privacy Analysis of the Gaussian Sparse Histogram Mechanism

    Authors: Brian Karrer, Daniel Kifer, Arjun Wilkins, Danfeng Zhang

    Abstract: Sparse histogram methods can be useful for returning differentially private counts of items in large or infinite histograms, large group-by queries, and more generally, releasing a set of statistics with sufficient item counts. We consider the Gaussian version of the sparse histogram mechanism and study the exact $ε,δ$ differential privacy guarantees satisfied by this mechanism. We compare these e…

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: 22 pages, 1 figure

  10. arXiv:2201.12383  [pdf, other]

    cs.LG cs.CR

    Bounding Training Data Reconstruction in Private (Deep) Learning

    Authors: Chuan Guo, Brian Karrer, Kamalika Chaudhuri, Laurens van der Maaten

    Abstract: Differential privacy is widely accepted as the de facto method for preventing data leakage in ML, and conventional wisdom suggests that it offers strong protection against privacy attacks. However, existing semantic guarantees for DP focus on membership inference, which may overestimate the adversary's capabilities and is not applicable when membership status itself is non-sensitive. In this paper…

    Submitted 23 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  11. arXiv:2012.08591  [pdf, other]

    cs.SI stat.AP stat.ME

    Network experimentation at scale

    Authors: Brian Karrer, Liang Shi, Monica Bhole, Matt Goldman, Tyrone Palmer, Charlie Gelman, Mikael Konutgan, Feng Sun

    Abstract: We describe our framework, deployed at Facebook, that accounts for interference between experimental units through cluster-randomized experiments. We document this system, including the design and estimation procedures, and detail insights we have gained from the many experiments that have used this system at scale. We introduce a cluster-based regression adjustment that substantially improves pre…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 12 pages, 8 figures

  12. arXiv:2006.15779  [pdf, other]

    cs.LG math.NA stat.ML

    Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees

    Authors: Shali Jiang, Daniel R. Jiang, Maximilian Balandat, Brian Karrer, Jacob R. Gardner, Roman Garnett

    Abstract: Bayesian optimization is a sequential decision making framework for optimizing expensive-to-evaluate black-box functions. Computing a full lookahead policy amounts to solving a highly intractable stochastic dynamic program. Myopic approaches, such as expected improvement, are often adopted in practice, but they ignore the long-term impact of the immediate decision. Existing nonmyopic approaches ar…

    Submitted 28 June, 2020; originally announced June 2020.

  13. arXiv:1910.06403  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization

    Authors: Maximilian Balandat, Brian Karrer, Daniel R. Jiang, Samuel Daulton, Benjamin Letham, Andrew Gordon Wilson, Eytan Bakshy

    Abstract: Bayesian optimization provides sample-efficient global optimization for a broad range of applications, including automatic machine learning, engineering, physics, and experimental design. We introduce BoTorch, a modern programming framework for Bayesian optimization that combines Monte-Carlo (MC) acquisition functions, a novel sample average approximation optimization approach, auto-differentiatio…

    Submitted 8 December, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Journal ref: Advances in Neural Information Processing Systems 33, 2020

  14. arXiv:1806.09976  [pdf, other]

    stat.ML cs.LG

    The decoupled extended Kalman filter for dynamic exponential-family factorization models

    Authors: Carlos Alberto Gomez-Uribe, Brian Karrer

    Abstract: Motivated by the needs of online large-scale recommender systems, we specialize the decoupled extended Kalman filter (DEKF) to factorization models, including factorization machines, matrix and tensor factorization, and illustrate the effectiveness of the approach through numerical experiments on synthetic and on real-world data. Online learning of model parameters through the DEKF makes factoriza…

    Submitted 24 February, 2021; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: 29 pages, 4 figures

    Journal ref: Journal of Machine Learning Research (JMLR), 22(5):1-25, 2021

  15. arXiv:1707.06665  [pdf, other]

    cs.DS cs.DC

    Social Hash Partitioner: A Scalable Distributed Hypergraph Partitioner

    Authors: Igor Kabiljo, Brian Karrer, Mayank Pundir, Sergey Pupyrev, Alon Shalita, Alessandro Presta, Yaroslav Akhremtsev

    Abstract: We design and implement a distributed algorithm for balanced $k$-way hypergraph partitioning that minimizes fanout, a fundamental hypergraph quantity also known as the communication volume and ($k-1$)-cut metric, by optimizing a novel objective called probabilistic fanout. This choice allows a simple local search heuristic to achieve comparable solution quality to the best existing hypergraph part…

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: Proceedings of the VLDB Endowment 2017

  16. arXiv:1706.07094  [pdf, other]

    stat.ML cs.LG stat.AP

    Constrained Bayesian Optimization with Noisy Experiments

    Authors: Benjamin Letham, Brian Karrer, Guilherme Ottoni, Eytan Bakshy

    Abstract: Randomized experiments are the gold standard for evaluating the effects of changes to real-world systems. Data in these tests may be difficult to collect and outcomes may have high variance, resulting in potentially large measurement error. Bayesian optimization is a promising technique for efficiently optimizing multiple continuous parameters, but existing approaches degrade in performance when t…

    Submitted 26 June, 2018; v1 submitted 21 June, 2017; originally announced June 2017.

  17. arXiv:1705.07249  [pdf, other]

    cs.NI cs.CV math.OC

    End-to-end Planning of Fixed Millimeter-Wave Networks

    Authors: Tim Danford, Onur Filiz, Jing Huang, Brian Karrer, Manohar Paluri, Guan Pang, Vish Ponnampalam, Nicolas Stier-Moses, Birce Tezel

    Abstract: This article discusses a framework to support the design and end-to-end planning of fixed millimeter-wave networks. Compared to traditional techniques, the framework allows an organization to quickly plan a deployment in a cost-effective way. We start by using LiDAR data---basically, a 3D point cloud captured from a city---to estimate potential sites to deploy antennas and whether there is line-of…

    Submitted 19 May, 2017; originally announced May 2017.

  18. Compressing Graphs and Indexes with Recursive Graph Bisection

    Authors: Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Sergey Pupyrev, Alon Shalita

    Abstract: Graph reordering is a powerful technique to increase the locality of the representations of graphs, which can be helpful in several applications. We study how the technique can be used to improve compression of graphs and inverted indexes. We extend the recent theoretical model of Chierichetti et al. (KDD 2009) for graph compression, and show how it can be employed for compression-friendly reord…

    Submitted 28 February, 2016; originally announced February 2016.

  19. arXiv:1405.0483  [pdf, ps, other]

    cond-mat.stat-mech cs.SI physics.soc-ph

    Percolation on sparse networks

    Authors: Brian Karrer, M. E. J. Newman, Lenka Zdeborová

    Abstract: We study percolation on networks, which is used as a model of the resilience of networked systems such as the Internet to attack or failure and as a simple model of the spread of disease over human contact networks. We reformulate percolation as a message passing process and demonstrate how the resulting equations can be used to calculate, among other things, the size of the percolating cluster an…

    Submitted 7 October, 2014; v1 submitted 2 May, 2014; originally announced May 2014.

    Comments: 6 pages, 1 figure, 1 table. This version includes a Supplemental Information section and some changes to the proofs and results in the main paper

    Journal ref: Phys. Rev. Lett. 113, 208702 (2014)

  20. arXiv:1404.7530  [pdf, other]

    stat.ME cs.SI physics.soc-ph

    Design and analysis of experiments in networks: Reducing bias from interference

    Authors: Dean Eckles, Brian Karrer, Johan Ugander

    Abstract: Estimating the effects of interventions in networks is complicated when the units are interacting, such that the outcomes for one unit may depend on the treatment assignment and behavior of many or all other units (i.e., there is interference). When most or all units are in a single connected component, it is impossible to directly experimentally compare outcomes under two or more global treatment…

    Submitted 13 August, 2014; v1 submitted 29 April, 2014; originally announced April 2014.

    Comments: 32 pages, 7 figures

  21. arXiv:1305.6979  [pdf, other]

    cs.SI physics.soc-ph stat.ME

    Graph cluster randomization: network exposure to multiple universes

    Authors: Johan Ugander, Brian Karrer, Lars Backstrom, Jon Kleinberg

    Abstract: A/B testing is a standard approach for evaluating the effect of online experiments; the goal is to estimate the `average treatment effect' of a new feature or condition by exposing a sample of the overall population to it. A drawback with A/B testing is that it is poorly suited for experiments involving social interference, when the treatment of individuals spills over to neighboring individuals a…

    Submitted 29 May, 2013; originally announced May 2013.

    Comments: 9 pages, 2 figures

  22. arXiv:1304.0473  [pdf, ps, other]

    cs.DL cs.SI physics.soc-ph

    Coauthorship and citation in scientific publishing

    Authors: Travis Martin, Brian Ball, Brian Karrer, M. E. J. Newman

    Abstract: A large number of published studies have examined the properties of either networks of citation among scientific papers or networks of coauthorship among scientists. Here, using an extensive data set covering more than a century of physics papers published in the Physical Review, we study a hybrid coauthorship/citation network that combines the two, which we analyze to gain insight into the correl…

    Submitted 1 April, 2013; originally announced April 2013.

    Comments: 10 pages, 11 figures, 3 tables

  23. arXiv:1111.4503  [pdf, ps, other]

    cs.SI physics.soc-ph

    The Anatomy of the Facebook Social Graph

    Authors: Johan Ugander, Brian Karrer, Lars Backstrom, Cameron Marlow

    Abstract: We study the structure of the social graph of active Facebook users, the largest social network ever analyzed. We compute numerous features of the graph including the number of users and friendships, the degree distribution, path lengths, clustering, and mixing patterns. Our results center around three main observations. First, we characterize the global structure of the graph, determining that th…

    Submitted 18 November, 2011; originally announced November 2011.

    Comments: 17 pages, 9 figures, 1 table

  24. arXiv:1105.3424  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech cs.SI

    Competing epidemics on complex networks

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: Human diseases spread over networks of contacts between individuals and a substantial body of recent research has focused on the dynamics of the spreading process. Here we examine a model of two competing diseases spreading over the same network at the same time, where infection with either disease gives an individual subsequent immunity to both. Using a combination of analytic and numerical metho…

    Submitted 17 May, 2011; originally announced May 2011.

    Comments: 14 pages, 5 figures

    Journal ref: Phys. Rev. E 84, 036106 (2011)

  25. arXiv:1104.3590  [pdf, ps, other]

    cs.SI cond-mat.stat-mech physics.soc-ph

    An efficient and principled method for detecting communities in networks

    Authors: Brian Ball, Brian Karrer, M. E. J. Newman

    Abstract: A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlapping or disjoint. Here we describe a method for finding overlapping communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximizat…

    Submitted 18 April, 2011; originally announced April 2011.

    Comments: 14 pages, 5 figures, 1 table

    Journal ref: Phys. Rev. E 84, 036103 (2011)

  26. arXiv:1008.3926  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech cs.SI physics.data-an

    Stochastic blockmodels and community structure in networks

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: Stochastic blockmodels have been proposed as a tool for detecting community structure in networks as well as for generating synthetic networks for use as benchmarks. Most blockmodels, however, ignore variation in vertex degree, making them unsuitable for applications to real-world networks, which typically display broad degree distributions that can significantly distort the results. Here we demon…

    Submitted 23 August, 2010; originally announced August 2010.

    Comments: 11 pages, 3 figures

    Journal ref: Phys. Rev. E 83, 016107 (2011)

  27. arXiv:1005.1659  [pdf, ps, other]

    cond-mat.stat-mech cs.DM physics.soc-ph

    Random graphs containing arbitrary distributions of subgraphs

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: Traditional random graph models of networks generate networks that are locally tree-like, meaning that all local neighborhoods take the form of trees. In this respect such models are highly unrealistic, most real networks having strongly non-tree-like neighborhoods that contain short loops, cliques, or other biconnected subgraphs. In this paper we propose and analyze a new class of random graph…

    Submitted 10 May, 2010; originally announced May 2010.

    Comments: 12 pages, 6 figures, 1 table

    Journal ref: Phys. Rev. E 82, 066118 (2010)

  28. arXiv:1003.5673  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech q-bio.PE

    A message passing approach for general epidemic models

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: In most models of the spread of disease over contact networks it is assumed that the probabilities per unit time of disease transmission and recovery from disease are constant, implying exponential distributions of the time intervals for transmission and recovery. Time intervals for real diseases, however, have distributions that in most cases are far from exponential, which leads to disagreements…

    Submitted 22 July, 2010; v1 submitted 29 March, 2010; originally announced March 2010.

    Comments: 10 pages, 3 figures

    Journal ref: Phys. Rev. E 82, 016101 (2010)

  29. arXiv:0907.4346  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech physics.data-an

    Random graph models for directed acyclic networks

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: We study random graph models for directed acyclic graphs, an important class of networks that includes citation networks, food webs, and feed-forward neural networks among others. We propose two specific models, roughly analogous to the fixed edge number and fixed edge probability variants of traditional undirected random graphs. We calculate a number of properties of these models, including par…

    Submitted 24 July, 2009; originally announced July 2009.

    Comments: 14 pages, 5 figures

    Journal ref: Phys. Rev. E 80, 046110 (2009)

  30. arXiv:0902.4013  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech physics.data-an

    Random acyclic networks

    Authors: Brian Karrer, M. E. J. Newman

    Abstract: Directed acyclic graphs are a fundamental class of networks that includes citation networks, food webs, and family trees, among others. Here we define a random graph model for directed acyclic graphs and give solutions for a number of the model's properties, including connection probabilities and component sizes, as well as a fast algorithm for simulating the model on a computer. We compare the…

    Submitted 23 February, 2009; originally announced February 2009.

    Comments: 4 pages, 2 figures

    Journal ref: Phys. Rev. Lett. 102, 128701 (2009)

  31. arXiv:0711.1602  [pdf, ps, other]

    cond-mat.stat-mech cond-mat.dis-nn

    Preservation of network Degree Distributions from non-uniform failures

    Authors: Brian Karrer, Gourab Ghoshal

    Abstract: There has been a considerable amount of interest in recent years on the robustness of networks to failures. Many previous studies have concentrated on the effects of node and edge removals on the connectivity structure of a static network; the networks are considered to be static in the sense that no compensatory measures are allowed for recovery of the original structure. Real world networks su…

    Submitted 12 March, 2008; v1 submitted 10 November, 2007; originally announced November 2007.

    Comments: 8 pages, 4 figures. Additional content. Added references. Fixed typos

    Journal ref: Eur. Phys. J. B 62, 239-245 (2008)

  32. arXiv:0709.2108  [pdf, ps, other]

    physics.data-an cond-mat.stat-mech physics.soc-ph

    Robustness of community structure in networks

    Authors: Brian Karrer, Elizaveta Levina, M. E. J. Newman

    Abstract: The discovery of community structure is a common challenge in the analysis of network data. Many methods have been proposed for finding community structure, but few have been proposed for determining whether the structure found is statistically significant or whether, conversely, it could have arisen purely as a result of chance. In this paper we show that the significance of community structure…

    Submitted 13 September, 2007; originally announced September 2007.

    Comments: 10 pages, 2 figures

    Journal ref: Phys. Rev. E 77, 046119 (2008)