Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 62 results for author: Willett, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14064  [pdf, other

    stat.ML cs.LG math.ST

    Building a stable classifier with the inflated argmax

    Authors: Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: We propose a new framework for algorithmic stability in the context of multiclass classification. In practice, classification algorithms often operate by first assigning a continuous score (for instance, an estimated probability) to each possible label, then taking the maximizer -- i.e., selecting the class that has the highest score. A drawback of this type of approach is that it is inherently un… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2405.13180  [pdf, other

    eess.SP cs.LG nlin.CD physics.ao-ph stat.AP

    Data Assimilation with Machine Learning Surrogate Models: A Case Study with FourCastNet

    Authors: Melissa Adrian, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: Modern data-driven surrogate models for weather forecasting provide accurate short-term predictions but inaccurate and nonphysical long-term forecasts. This paper investigates online weather prediction using machine learning surrogates supplemented with partial and noisy observations. We empirically demonstrate and theoretically justify that, despite the long-time instability of the surrogates and… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2404.10947  [pdf, other

    cs.CV

    Residual Connections Harm Abstract Feature Learning in Masked Autoencoders

    Authors: Xiao Zhang, Ruoxi Jiang, William Gao, Rebecca Willett, Michael Maire

    Abstract: We demonstrate that adding a weighting factor to decay the strength of identity shortcuts within residual networks substantially improves semantic feature learning in the state-of-the-art self-supervised masked autoencoding (MAE) paradigm. Our modification to the identity shortcuts within a VIT-B/16 backbone of an MAE boosts linear probing accuracy on ImageNet from 67.8% to 72.7%. This significant… ▽ More

    Submitted 20 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  4. arXiv:2403.05583  [pdf, other

    cs.HC cs.AI cs.SD eess.AS

    A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition

    Authors: Tyler Benster, Guy Wilson, Reshef Elisha, Francis R Willett, Shaul Druckmann

    Abstract: Silent Speech Interfaces (SSIs) offer a noninvasive alternative to brain-computer interfaces for soundless verbal communication. We introduce Multimodal Orofacial Neural Audio (MONA), a system that leverages cross-modal alignment through novel loss functions--cross-contrast (crossCon) and supervised temporal contrast (supTcon)--to train a multimodal model with a shared latent representation. This… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  5. arXiv:2402.08808  [pdf, other

    cs.LG stat.ML

    Depth Separation in Norm-Bounded Infinite-Width Neural Networks

    Authors: Suzanna Parkinson, Greg Ongie, Rebecca Willett, Ohad Shamir, Nathan Srebro

    Abstract: We study depth separation in infinite-width neural networks, where complexity is controlled by the overall squared $\ell_2$-norm of the weights (sum of squares of all weights in the network). Whereas previous depth separation results focused on separation in terms of width, such results do not give insight into whether depth determines if it is possible to learn a network that generalizes well eve… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2311.03611  [pdf, other

    cs.HC cs.LG q-bio.NC

    Plug-and-Play Stability for Intracortical Brain-Computer Interfaces: A One-Year Demonstration of Seamless Brain-to-Text Communication

    Authors: Chaofei Fan, Nick Hahn, Foram Kamdar, Donald Avansino, Guy H. Wilson, Leigh Hochberg, Krishna V. Shenoy, Jaimie M. Henderson, Francis R. Willett

    Abstract: Intracortical brain-computer interfaces (iBCIs) have shown promise for restoring rapid communication to people with neurological disorders such as amyotrophic lateral sclerosis (ALS). However, to maintain high performance over time, iBCIs typically need frequent recalibration to combat changes in the neural recordings that accrue over days. This requires iBCI users to stop using the iBCI and engag… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  7. arXiv:2308.06271  [pdf, other

    cs.CV cs.AI cs.LG

    Rotation-Invariant Random Features Provide a Strong Baseline for Machine Learning on 3D Point Clouds

    Authors: Owen Melia, Eric Jonas, Rebecca Willett

    Abstract: Rotational invariance is a popular inductive bias used by many fields in machine learning, such as computer vision and machine learning for quantum chemistry. Rotation-invariant machine learning methods set the state of the art for many tasks, including molecular property prediction and 3D shape classification. These methods generally either rely on task-specific rotation-invariant features, or th… ▽ More

    Submitted 27 July, 2023; originally announced August 2023.

  8. arXiv:2306.01187  [pdf, other

    cs.LG math.DS

    Training neural operators to preserve invariant measures of chaotic attractors

    Authors: Ruoxi Jiang, Peter Y. Lu, Elena Orlova, Rebecca Willett

    Abstract: Chaotic systems make long-horizon forecasts difficult because small perturbations in initial conditions cause trajectories to diverge at an exponential rate. In this setting, neural operators trained to minimize squared error losses, while capable of accurate short-term forecasts, often fail to reproduce statistical or structural properties of the dynamics over longer time horizons and can yield d… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023

  9. arXiv:2305.19685  [pdf, other

    cs.LG quant-ph stat.ML

    Deep Stochastic Mechanics

    Authors: Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett

    Abstract: This paper introduces a novel deep-learning-based approach for numerical simulation of a time-evolving Schrödinger equation inspired by stochastic mechanics and generative diffusion models. Unlike existing approaches, which exhibit computational complexity that scales exponentially in the problem dimension, our method allows us to adapt to the latent low-dimensional structure of the wave function… ▽ More

    Submitted 7 July, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ICML 2024

  10. arXiv:2305.15598  [pdf, other

    cs.LG stat.ML

    ReLU Neural Networks with Linear Layers are Biased Towards Single- and Multi-Index Models

    Authors: Suzanna Parkinson, Greg Ongie, Rebecca Willett

    Abstract: Neural networks often operate in the overparameterized regime, in which there are far more parameters than training samples, allowing the training data to be fit perfectly. That is, training the network effectively learns an interpolating function, and properties of the interpolant affect predictions the network will make on new samples. This manuscript explores how properties of such functions le… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  11. arXiv:2301.12600  [pdf, other

    stat.ML cs.LG math.ST

    Bagging Provides Assumption-free Stability

    Authors: Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: Bagging is an important technique for stabilizing machine learning models. In this paper, we derive a finite-sample guarantee on the stability of bagging for any model. Our result places no assumptions on the distribution of the data, on the properties of the base algorithm, or on the dimensionality of the covariates. Our guarantee applies to many variants of bagging and is optimal up to a constan… ▽ More

    Submitted 25 April, 2024; v1 submitted 29 January, 2023; originally announced January 2023.

  12. arXiv:2301.11961  [pdf, other

    stat.ML cs.LG math.DS stat.CO

    Reduced-Order Autodifferentiable Ensemble Kalman Filters

    Authors: Yuming Chen, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: This paper introduces a computational framework to reconstruct and forecast a partially observed state that evolves according to an unknown or expensive-to-simulate dynamical system. Our reduced-order autodifferentiable ensemble Kalman filters (ROAD-EnKFs) learn a latent low-dimensional surrogate model for the dynamics and a decoder that maps from the latent space to the state space. The learned d… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  13. arXiv:2211.15856  [pdf, other

    cs.LG physics.ao-ph

    Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting

    Authors: Elena Orlova, Haokun Liu, Raphael Rossellini, Benjamin A. Cash, Rebecca Willett

    Abstract: Producing high-quality forecasts of key climate variables, such as temperature and precipitation, on subseasonal time scales has long been a gap in operational forecasting. This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting. Lagged numerical ensemble forecasts (i.e., an ensemble where the members have different initialization dat… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  14. arXiv:2211.01554  [pdf, other

    cs.LG math.NA

    Embed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantification

    Authors: Ruoxi Jiang, Rebecca Willett

    Abstract: This paper explores learning emulators for parameter estimation with uncertainty estimation of high-dimensional dynamical systems. We assume access to a computationally complex simulator that inputs a candidate parameter and outputs a corresponding multichannel time series. Our task is to accurately estimate a range of likely values of the underlying parameters. Standard iterative approaches neces… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022

  15. arXiv:2209.15585  [pdf, other

    physics.ao-ph cs.LG

    Cloud Classification with Unsupervised Deep Learning

    Authors: Takuya Kurihana, Ian Foster, Rebecca Willett, Sydney Jenkins, Kathryn Koenig, Ruby Werman, Ricardo Barros Lourenco, Casper Neo, Elisabeth Moyer

    Abstract: We present a framework for cloud characterization that leverages modern unsupervised deep learning technologies. While previous neural network-based cloud classification models have used supervised learning methods, unsupervised learning allows us to avoid restricting the model to artificial categories based on historical cloud classification schemes and enables the discovery of novel, more detail… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 5 pages, 6 figures, Proceedings for Climate Informatics Workshop 2019 Paris

  16. arXiv:2203.08339  [pdf, other

    cs.LG cs.DC cs.PF

    NURD: Negative-Unlabeled Learning for Online Datacenter Straggler Prediction

    Authors: Yi Ding, Avinash Rao, Hyebin Song, Rebecca Willett, Henry Hoffmann

    Abstract: Datacenters execute large computational jobs, which are composed of smaller tasks. A job completes when all its tasks finish, so stragglers -- rare, yet extremely slow tasks -- are a major impediment to datacenter performance. Accurately predicting stragglers would enable proactive intervention, allowing datacenter operators to mitigate stragglers before they delay a job. While much prior work app… ▽ More

    Submitted 13 August, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  17. arXiv:2202.00856  [pdf, other

    cs.LG

    The Role of Linear Layers in Nonlinear Interpolating Networks

    Authors: Greg Ongie, Rebecca Willett

    Abstract: This paper explores the implicit bias of overparameterized neural networks of depth greater than two layers. Our framework considers a family of networks of varying depth that all have the same capacity but different implicitly defined representation costs. The representation cost of a function induced by a neural network architecture is the minimum sum of squared weights needed for the network to… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

  18. arXiv:2110.07435  [pdf, other

    cs.LG eess.IV math.OC stat.ML

    Adaptive Differentially Private Empirical Risk Minimization

    Authors: Xiaoxia Wu, Lingxiao Wang, Irina Cristali, Quanquan Gu, Rebecca Willett

    Abstract: We propose an adaptive (stochastic) gradient perturbation method for differentially private empirical risk minimization. At each iteration, the random noise added to the gradient is optimally adapted to the stepsize; we name this process adaptive differentially private (ADP) learning. Given the same privacy budget, we prove that the ADP method considerably improves the utility guarantee compared t… ▽ More

    Submitted 24 October, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

  19. arXiv:2107.07687  [pdf, other

    stat.ML cs.LG stat.CO

    Auto-differentiable Ensemble Kalman Filters

    Authors: Yuming Chen, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: Data assimilation is concerned with sequentially estimating a temporally-evolving state. This task, which arises in a wide range of scientific and engineering applications, is particularly challenging when the state is high-dimensional and the state-space dynamics are unknown. This paper introduces a machine learning framework for learning dynamical systems in data assimilation. Our auto-different… ▽ More

    Submitted 19 July, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

  20. arXiv:2106.12034  [pdf, other

    stat.ML cs.LG

    Pure Exploration in Kernel and Neural Bandits

    Authors: Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett, Robert Nowak

    Abstract: We study pure exploration in bandits, where the dimension of the feature representation can be much larger than the number of arms. To overcome the curse of dimensionality, we propose to adaptively embed the feature representation of each arm into a lower-dimensional space and carefully deal with the induced model misspecification. Our approach is conceptually very different from existing works th… ▽ More

    Submitted 17 March, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

  21. arXiv:2103.13555  [pdf, other

    stat.ML cs.LG

    Prediction in the presence of response-dependent missing labels

    Authors: Hyebin Song, Garvesh Raskutti, Rebecca Willett

    Abstract: In a variety of settings, limitations of sensing technologies or other sampling mechanisms result in missing labels, where the likelihood of a missing label in the training set is an unknown function of the data. For example, satellites used to detect forest fires cannot sense fires below a certain size threshold. In such cases, training datasets consist of positive and pseudo-negative observation… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  22. arXiv:2103.04885  [pdf, other

    cs.CV physics.ao-ph

    Data-driven Cloud Clustering via a Rotationally Invariant Autoencoder

    Authors: Takuya Kurihana, Elisabeth Moyer, Rebecca Willett, Davis Gilton, Ian Foster

    Abstract: Advanced satellite-born remote sensing instruments produce high-resolution multi-spectral data for much of the globe at a daily cadence. These datasets open up the possibility of improved understanding of cloud dynamics and feedback, which remain the biggest source of uncertainty in global climate model projections. As a step towards answering these questions, we describe an automated rotation-inv… ▽ More

    Submitted 28 October, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 25 pages. Accepted by IEEE Transactions on Geoscience and Remote Sensing (TGRS)

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2021

  23. arXiv:2102.07944  [pdf, other

    eess.IV cs.CV

    Deep Equilibrium Architectures for Inverse Problems in Imaging

    Authors: Davis Gilton, Gregory Ongie, Rebecca Willett

    Abstract: Recent efforts on solving inverse problems in imaging via deep neural networks use architectures inspired by a fixed number of iterations of an optimization method. The number of iterations is typically quite small due to difficulties in training networks corresponding to more iterations; the resulting solvers cannot be run for more iterations at test time without incurring significant errors. Thi… ▽ More

    Submitted 2 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  24. arXiv:2012.00139  [pdf, other

    eess.IV cs.CV

    Model Adaptation for Inverse Problems in Imaging

    Authors: Davis Gilton, Gregory Ongie, Rebecca Willett

    Abstract: Deep neural networks have been applied successfully to a wide variety of inverse problems arising in computational imaging. These networks are typically trained using a forward model that describes the measurement process to be inverted, which is often incorporated directly into the network itself. However, these approaches are sensitive to changes in the forward model: if at test time the forward… ▽ More

    Submitted 12 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

  25. arXiv:2005.06001  [pdf, other

    eess.IV cs.LG stat.ML

    Deep Learning Techniques for Inverse Problems in Imaging

    Authors: Gregory Ongie, Ajil Jalal, Christopher A. Metzler, Richard G. Baraniuk, Alexandros G. Dimakis, Rebecca Willett

    Abstract: Recent work in machine learning shows that deep neural networks can be used to solve a wide variety of inverse problems arising in computational imaging. We explore the central prevailing themes of this emerging area and present a taxonomy that can be used to categorize different problems and reconstruction methods. Our taxonomy is organized along two central axes: (1) whether or not a forward mod… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  26. arXiv:2003.12633  [pdf, other

    cs.CV

    Detection and Description of Change in Visual Streams

    Authors: Davis Gilton, Ruotian Luo, Rebecca Willett, Greg Shakhnarovich

    Abstract: This paper presents a framework for the analysis of changes in visual streams: ordered sequences of images, possibly separated by significant time gaps. We propose a new approach to incorporating unlabeled data into training to generate natural language descriptions of change. We also develop a framework for estimating the time of change in visual stream. We use learned representations for change… ▽ More

    Submitted 9 April, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

  27. arXiv:2003.07429  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Context-dependent self-exciting point processes: models, methods, and risk bounds in high dimensions

    Authors: Lili Zheng, Garvesh Raskutti, Rebecca Willett, Benjamin Mark

    Abstract: High-dimensional autoregressive point processes model how current events trigger or inhibit future events, such as activity by one member of a social network can affect the future activity of his or her neighbors. While past work has focused on estimating the underlying network structure based solely on the times at which events occur on each node of the network, this paper examines the more nuanc… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

  28. arXiv:2002.11255  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    An Optimal Statistical and Computational Framework for Generalized Tensor Estimation

    Authors: Rungang Han, Rebecca Willett, Anru R. Zhang

    Abstract: This paper describes a flexible framework for generalized low-rank tensor estimation problems that includes many important instances arising from applications in computational imaging, genomics, and network analysis. The proposed estimator consists of finding a low-rank tensor fit to the data under generalized parametric models. To overcome the difficulty of non-convexity in these problems, we int… ▽ More

    Submitted 4 February, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

  29. arXiv:1910.01635  [pdf, other

    cs.LG stat.ML

    A Function Space View of Bounded Norm Infinite Width ReLU Nets: The Multivariate Case

    Authors: Greg Ongie, Rebecca Willett, Daniel Soudry, Nathan Srebro

    Abstract: A key element of understanding the efficacy of overparameterized neural networks is characterizing how they represent functions as the number of weights in the network approaches infinity. In this paper, we characterize the norm required to realize a function $f:\mathbb{R}^d\rightarrow\mathbb{R}$ as a single hidden-layer ReLU network with an unbounded number of units (infinite width), but where th… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

  30. arXiv:1901.03707  [pdf, other

    cs.CV cs.LG stat.ML

    Neumann Networks for Inverse Problems in Imaging

    Authors: Davis Gilton, Greg Ongie, Rebecca Willett

    Abstract: Many challenging image processing tasks can be described by an ill-posed linear inverse problem: deblurring, deconvolution, inpainting, compressed sensing, and superresolution all lie in this framework. Traditional inverse problem solvers minimize a cost function consisting of a data-fit term, which measures how well an image matches the observations, and a regularizer, which reflects prior knowle… ▽ More

    Submitted 3 June, 2019; v1 submitted 13 January, 2019; originally announced January 2019.

    Comments: Added further experiments, reorganized proof section, added further references and supporting figures

  31. arXiv:1901.02470  [pdf, other

    cs.LG stat.ML

    Bilinear Bandits with Low-rank Structure

    Authors: Kwang-Sung Jun, Rebecca Willett, Stephen Wright, Robert Nowak

    Abstract: We introduce the bilinear bandit problem with low-rank structure in which an action takes the form of a pair of arms from two different entity types, and the reward is a bilinear function of the known feature vectors of the arms. The unknown in the problem is a $d_1$ by $d_2$ matrix $\mathbfΘ^*$ that defines the reward, and has low rank $r \ll \min\{d_1,d_2\}$. Determination of $\mathbfΘ^*$ with t… ▽ More

    Submitted 9 June, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: Accepted to ICML'19

  32. arXiv:1811.02979  [pdf, other

    stat.ML cs.LG

    Estimating Network Structure from Incomplete Event Data

    Authors: Benjamin Mark, Garvesh Raskutti, Rebecca Willett

    Abstract: Multivariate Bernoulli autoregressive (BAR) processes model time series of events in which the likelihood of current events is determined by the times and locations of past events. These processes can be used to model nonlinear dynamical systems corresponding to criminal activity, responses of patients to different medical treatment plans, opinion dynamics across social networks, epidemic spread,… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

  33. arXiv:1804.10266  [pdf, other

    stat.ML cs.LG

    Tensor Methods for Nonlinear Matrix Completion

    Authors: Greg Ongie, Daniel Pimentel-Alarcón, Laura Balzano, Rebecca Willett, Robert D. Nowak

    Abstract: In the low-rank matrix completion (LRMC) problem, the low-rank assumption means that the columns (or rows) of the matrix to be completed are points on a low-dimensional linear algebraic variety. This paper extends this thinking to cases where the columns are points on a low-dimensional nonlinear algebraic variety, a problem we call Low Algebraic Dimension Matrix Completion (LADMC). Matrices whose… ▽ More

    Submitted 4 September, 2020; v1 submitted 26 April, 2018; originally announced April 2018.

  34. arXiv:1803.07658  [pdf, other

    stat.ML cs.LG

    Graph-based regularization for regression problems with alignment and highly-correlated designs

    Authors: Yuan Li, Benjamin Mark, Garvesh Raskutti, Rebecca Willett, Hyebin Song, David Neiman

    Abstract: Sparse models for high-dimensional linear regression and machine learning have received substantial attention over the past two decades. Model selection, or determining which features or covariates are the best explanatory variables, is critical to the interpretability of a learned model. Much of the current literature assumes that covariates are only mildly correlated. However, in many modern app… ▽ More

    Submitted 13 October, 2019; v1 submitted 20 March, 2018; originally announced March 2018.

  35. arXiv:1802.09511  [pdf, ps, other

    stat.ML cs.LG

    Missing Data in Sparse Transition Matrix Estimation for Sub-Gaussian Vector Autoregressive Processes

    Authors: Amin Jalali, Rebecca Willett

    Abstract: High-dimensional time series data exist in numerous areas such as finance, genomics, healthcare, and neuroscience. An unavoidable aspect of all such datasets is missing data, and dealing with this issue has been an important focus in statistics, control, and machine learning. In this work, we consider a high-dimensional estimation problem where a dynamical system, governed by a stable vector autor… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

  36. arXiv:1802.04838  [pdf, other

    stat.ML cs.IT math.ST

    Network Estimation from Point Process Data

    Authors: Benjamin Mark, Garvesh Raskutti, Rebecca Willett

    Abstract: Consider observing a collection of discrete events within a network that reflect how network nodes influence one another. Such data are common in spike trains recorded from biological neural networks, interactions within a social network, and a variety of other settings. Data of this form may be modeled as self-exciting point processes, in which the likelihood of future events depends on the past… ▽ More

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: Submitted to IEEE Transactions on Information Theory

  37. arXiv:1711.02545  [pdf, other

    stat.ML cs.LG

    Online Learning for Changing Environments using Coin Betting

    Authors: Kwang-Sung Jun, Francesco Orabona, Stephen Wright, Rebecca Willett

    Abstract: A key challenge in online learning is that classical algorithms can be slow to adapt to changing environments. Recent studies have proposed "meta" algorithms that convert any online learning algorithm to one that is adaptive to changing environments, where the adaptivity is analyzed in a quantity called the strongly-adaptive regret. This paper describes a new meta algorithm that has a strongly-ada… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: submitted to a journal. arXiv admin note: substantial text overlap with arXiv:1610.04578

  38. arXiv:1706.00136  [pdf, other

    stat.ML cs.LG

    Scalable Generalized Linear Bandits: Online Computation and Hashing

    Authors: Kwang-Sung Jun, Aniruddha Bhargava, Robert Nowak, Rebecca Willett

    Abstract: Generalized Linear Bandits (GLBs), a natural extension of the stochastic linear bandits, has been popular and successful in recent years. However, existing GLBs scale poorly with the number of rounds and the number of arms, limiting their utility in practice. This paper proposes new, scalable solutions to the GLB problem in two respects. First, unlike existing GLBs, whose per-time-step space and t… ▽ More

    Submitted 21 October, 2017; v1 submitted 31 May, 2017; originally announced June 2017.

    Comments: accepted to NIPS'17 (typos fixed)

  39. arXiv:1610.04578  [pdf, other

    stat.ML cs.LG

    Improved Strongly Adaptive Online Learning using Coin Betting

    Authors: Kwang-Sung Jun, Francesco Orabona, Rebecca Willett, Stephen Wright

    Abstract: This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least $\sqrt{\log(T)}$ better, where $T$ is the time horizon. Empirical results show that our algorithm outperforms state-of-the-art methods in learning with expert adv… ▽ More

    Submitted 7 August, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

    Comments: fixed a few typos

  40. arXiv:1609.03544  [pdf, other

    stat.ML cs.LG

    Online Data Thinning via Multi-Subspace Tracking

    Authors: Xin Jiang, Rebecca Willett

    Abstract: In an era of ubiquitous large-scale streaming data, the availability of data far exceeds the capacity of expert human analysts. In many settings, such data is either discarded or stored unprocessed in datacenters. This paper proposes a method of online data thinning, in which large-scale streaming datasets are winnowed to preserve unique, anomalous, or salient elements for timely expert analysis.… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: 32 pages, 10 figures

  41. arXiv:1605.02693  [pdf, other

    stat.ML cs.IT math.ST

    Inference of High-dimensional Autoregressive Generalized Linear Models

    Authors: Eric C. Hall, Garvesh Raskutti, Rebecca Willett

    Abstract: Vector autoregressive models characterize a variety of time series in which linear combinations of current and past observations can be used to accurately predict future observations. For instance, each element of an observation vector could correspond to a different node in a network, and the parameters of an autoregressive model would correspond to the impact of the network structure on the time… ▽ More

    Submitted 24 June, 2017; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: Submitted to IEEE Transactions on Information Theory

  42. arXiv:1603.03980  [pdf, ps, other

    stat.ML cs.AI cs.LG

    On Learning High Dimensional Structured Single Index Models

    Authors: Nikhil Rao, Ravi Ganti, Laura Balzano, Rebecca Willett, Robert Nowak

    Abstract: Single Index Models (SIMs) are simple yet flexible semi-parametric models for machine learning, where the response variable is modeled as a monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights and the nonlinear function that relates features to observations. While methods have been described to learn SIMs in the low dimension… ▽ More

    Submitted 29 November, 2016; v1 submitted 12 March, 2016; originally announced March 2016.

    Comments: 7 pages, 3 tables, 1 Figure, substantial text overlap with arXiv:1506.08910; Accepted for publication at AAAI 2017; added new experimental results comparing our method to a single layer neural network

  43. arXiv:1512.08787  [pdf, other

    stat.ML cs.LG

    Matrix Completion Under Monotonic Single Index Models

    Authors: Ravi Ganti, Laura Balzano, Rebecca Willett

    Abstract: Most recent results in matrix completion assume that the matrix under consideration is low-rank or that the columns are in a union of low-rank subspaces. In real-world settings, however, the linear structure underlying these models is distorted by a (typically unknown) nonlinear transformation. This paper addresses the challenge of matrix completion in the face of such nonlinearities. Given a few… ▽ More

    Submitted 29 December, 2015; originally announced December 2015.

    Comments: 21 pages, 5 figures, 1 table. Accepted for publication at NIPS 2015

  44. arXiv:1509.08892  [pdf, other

    math.ST cs.IT

    A data-dependent weighted LASSO under Poisson noise

    Authors: Xin Jiang, Patricia Reynaud-Bouret, Vincent Rivoirard, Laure Sansonnet, Rebecca Willett

    Abstract: Sparse linear inverse problems appear in a variety of settings, but often the noise contaminating observations cannot accurately be described as bounded by or arising from a Gaussian distribution. Poisson observations in particular are a feature of several real-world applications. Previous work on sparse Poisson inverse problems encountered several limiting technical hurdles. This paper describes… ▽ More

    Submitted 13 February, 2018; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: 25 pages (48 pages with appendix), 3 figures

  45. arXiv:1506.08910  [pdf, other

    stat.ML cs.LG stat.ME

    Learning Single Index Models in High Dimensions

    Authors: Ravi Ganti, Nikhil Rao, Rebecca M. Willett, Robert Nowak

    Abstract: Single Index Models (SIMs) are simple yet flexible semi-parametric models for classification and regression. Response variables are modeled as a nonlinear, monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights, and the nonlinear function. While methods have been described to learn SIMs in the low dimensional regime, a method t… ▽ More

    Submitted 29 June, 2015; originally announced June 2015.

    Comments: 16 pages, 2 figures, 1 table

  46. arXiv:1503.08348  [pdf, other

    stat.ML cs.LG stat.ME

    Sparse Linear Regression With Missing Data

    Authors: Ravi Ganti, Rebecca M. Willett

    Abstract: This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the regression coefficients, and the proposed algorithm jointly learns the low-dimensional structure of the data and a linear regressor with sparse coefficients. The propos… ▽ More

    Submitted 28 March, 2015; originally announced March 2015.

    Comments: 14 pages, 7 figures

  47. arXiv:1409.0031  [pdf, other

    stat.ML cs.IT cs.SI

    Tracking Dynamic Point Processes on Networks

    Authors: Eric C. Hall, Rebecca M. Willett

    Abstract: Cascading chains of events are a salient feature of many real-world social, biological, and financial networks. In social networks, social reciprocity accounts for retaliations in gang interactions, proxy wars in nation-state conflicts, or Internet memes shared via social media. Neuron spikes stimulate or inhibit spike activity in other neurons. Stock market shocks can trigger a contagion of volat… ▽ More

    Submitted 1 July, 2016; v1 submitted 29 August, 2014; originally announced September 2014.

    Journal ref: IEEE Transaction on Information Theory, Vol. 62, No. 7, 2016

  48. arXiv:1401.3198  [pdf, other

    math.OC cs.LG eess.SY

    Online Markov decision processes with Kullback-Leibler control cost

    Authors: Peng Guan, Maxim Raginsky, Rebecca Willett

    Abstract: This paper considers an online (real-time) control problem that involves an agent performing a discrete-time random walk over a finite state space. The agent's action at each time step is to specify the probability distribution for the next state given the current state. Following the set-up of Todorov, the state-action cost at each time step is a sum of a state cost and a control cost given by th… ▽ More

    Submitted 14 January, 2014; originally announced January 2014.

    Comments: to appear in IEEE Transactions on Automatic Control

  49. arXiv:1310.7300  [pdf, ps, other

    cs.LG math.OC stat.ML

    Relax but stay in control: from value to algorithms for online Markov decision processes

    Authors: Peng Guan, Maxim Raginsky, Rebecca Willett

    Abstract: Online learning algorithms are designed to perform in non-stationary environments, but generally there is no notion of a dynamic state to model constraints on current and future actions as a function of past actions. State-based models are common in stochastic control settings, but commonly used frameworks such as Markov Decision Processes (MDPs) assume a known stationary environment. In recent ye… ▽ More

    Submitted 31 August, 2015; v1 submitted 27 October, 2013; originally announced October 2013.

    Comments: 40 pages; additional results in the convex-analytic framework

  50. arXiv:1307.5944  [pdf, other

    stat.ML cs.LG math.OC

    Online Optimization in Dynamic Environments

    Authors: Eric C. Hall, Rebecca M. Willett

    Abstract: High-velocity streams of high-dimensional data pose significant "big data" analysis challenges across a range of applications and settings. Online learning and online convex programming play a significant role in the rapid recovery of important or anomalous information from these large datastreams. While recent advances in online learning have led to novel and rapidly converging algorithms, these… ▽ More

    Submitted 19 January, 2016; v1 submitted 23 July, 2013; originally announced July 2013.

    Comments: arXiv admin note: text overlap with arXiv:1301.1254

    Journal ref: IEEE Journal of Selected Topics in Signal Processing - Signal Processing for Big Data, vol. 9, no 4. 2015