Showing 1–50 of 101 results for author: Tewari, A

Searching in archive stat.
  1. arXiv:2408.09004  [pdf, ps, other]

    stat.ML cs.LG math.NA

    Error Bounds for Learning Fourier Linear Operators

    Authors: Unique Subedi, Ambuj Tewari

    Abstract: We investigate the problem of learning operators between function spaces, focusing on the linear layer of the Fourier Neural Operator. First, we identify three main errors that occur during the learning process: statistical error due to finite sample size, truncation error from finite rank approximation of the operator, and discretization error from handling functional data on a finite grid of dom…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 30 pages

  2. arXiv:2405.17324  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Offline Data in Linear Latent Bandits

    Authors: Chinmaya Kausik, Kevin Tan, Ambuj Tewari

    Abstract: Sequential decision-making domains such as recommender systems, healthcare and education often have unobserved heterogeneity in the population that can be modeled using latent bandits $-$ a framework where an unobserved latent state determines the model for a trajectory. While the latent bandit framework is compelling, the extent of its generality is unclear. We first address this by establishing…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 40 pages. 14 pages for main paper, 26 pages for references + appendix

  3. arXiv:2405.16250  [pdf, other]

    eess.SY stat.ME

    Conformal Robust Control of Linear Systems

    Authors: Yash Patel, Sahana Rayan, Ambuj Tewari

    Abstract: End-to-end engineering design pipelines, in which designs are evaluated using concurrently defined optimal controllers, are becoming increasingly common in practice. To discover designs that perform well even under the misspecification of system dynamics, such end-to-end pipelines have now begun evaluating designs with a robust control objective in place of the nominal optimal control setup. Curre…

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2405.16246  [pdf, other]

    stat.ME stat.ML

    Conformalized Late Fusion Multi-View Learning

    Authors: Eduardo Ochoa Rivera, Yash Patel, Ambuj Tewari

    Abstract: Uncertainty quantification for multi-view learning is motivated by the increasing use of multi-view data in scientific problems. A common variant of multi-view learning is late fusion: train separate predictors on individual views and combine them after single-view predictions are available. Existing methods for uncertainty quantification for late fusion often rely on undesirable distributional as…

    Submitted 25 May, 2024; originally announced May 2024.

  5. arXiv:2405.15050  [pdf, ps, other]

    stat.ML cs.LG

    Provably Efficient Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs

    Authors: Kihyuk Hong, Yufan Zhang, Ambuj Tewari

    Abstract: We resolve the open problem of designing a computationally efficient algorithm for infinite-horizon average-reward linear Markov Decision Processes (MDPs) with $\widetilde{O}(\sqrt{T})$ regret. Previous approaches with $\widetilde{O}(\sqrt{T})$ regret either suffer from computational inefficiency or require strong assumptions on dynamics, such as ergodicity. In this paper, we approximate the avera…

    Submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2405.14066  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Online Classification with Predictions

    Authors: Vinod Raman, Ambuj Tewari

    Abstract: We study online classification when the learner has access to predictions about future examples. We design an online learner whose expected regret is never worse than the worst-case regret, gracefully improves with the quality of the predictions, and can be significantly better than the worst-case regret when the predictions of future examples are accurate. As a corollary, we show that if the lear…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 24 pages

  7. arXiv:2403.01636  [pdf, other]

    stat.ML cs.LG

    Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

    Authors: Ziping Xu, Zifan Xu, Runxuan Jiang, Peter Stone, Ambuj Tewari

    Abstract: Multitask Reinforcement Learning (MTRL) approaches have gained increasing attention for their wide applications in many important Reinforcement Learning (RL) tasks. However, while recent advancements in MTRL theory have focused on the improved statistical efficiency by assuming a shared structure across tasks, exploration--a crucial aspect of RL--has been largely overlooked. This paper addresses thi…

    Submitted 5 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  8. arXiv:2402.09467  [pdf, other]

    stat.ML cs.LG

    Optimal Thresholding Linear Bandit

    Authors: Eduardo Ochoa Rivera, Ambuj Tewari

    Abstract: We study a novel pure exploration problem: the $ε$-Thresholding Bandit Problem (TBP) with fixed confidence in stochastic linear bandits. We prove a lower bound for the sample complexity and extend an algorithm designed for Best Arm Identification in the linear case to TBP that is asymptotically optimal.

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2006.16073 by other authors

  9. arXiv:2402.06614  [pdf, other]

    cs.LG stat.ML

    The Complexity of Sequential Prediction in Dynamical Systems

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study the problem of learning to predict the next state of a dynamical system when the underlying evolution function is unknown. Unlike previous work, we place no parametric assumptions on the dynamical system, and study the problem from a learning theory perspective. We define new combinatorial measures and dimensions and show that they quantify the optimal mistake and regret bounds in the rea…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 35 pages

  10. arXiv:2402.04493  [pdf, ps, other]

    stat.ML cs.LG

    A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs

    Authors: Kihyuk Hong, Ambuj Tewari

    Abstract: We study offline reinforcement learning (RL) with linear MDPs under the infinite-horizon discounted setting which aims to learn a policy that maximizes the expected discounted cumulative reward using a pre-collected dataset. Existing algorithms for this setting either require a uniform data coverage assumption or are computationally inefficient for finding an $ε$-optimal policy with $O(ε^{-2})$ s…

    Submitted 2 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2402.03282  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    A Theoretical Framework for Partially Observed Reward-States in RLHF

    Authors: Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

    Abstract: The growing deployment of reinforcement learning from human feedback (RLHF) calls for a deeper theoretical investigation of its underlying models. The prevalent models of RLHF do not account for neuroscience-backed, partially-observed "internal states" that can affect human feedback, nor do they accommodate intermediate feedback during an interaction. Both of these can be instrumental in speeding…

    Submitted 27 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 58 pages. 12 pages for main paper, 46 pages for references + appendix

  12. arXiv:2310.19064  [pdf, other]

    cs.LG stat.ML

    Apple Tasting: Combinatorial Dimensions and Minimax Rates

    Authors: Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari

    Abstract: In online binary classification under apple tasting feedback, the learner only observes the true label if it predicts "1". First studied by \cite{helmbold2000apple}, we revisit this classical partial-feedback setting and study online learnability from a combinatorial perspective. We show that the Littlestone dimension continues to provide a tight quantitative characterization of apple tast…

    Submitted 18 June, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 21 pages, COLT 2024 Camera Ready

  13. arXiv:2310.13088  [pdf, other]

    stat.ML cs.LG

    Sequence Length Independent Norm-Based Generalization Bounds for Transformers

    Authors: Jacob Trauger, Ambuj Tewari

    Abstract: This paper provides norm-based generalization bounds for the Transformer architecture that do not depend on the input sequence length. We employ a covering number based approach to prove our bounds. We use three novel covering number bounds for the function class of bounded linear transformations to upper bound the Rademacher complexity of the Transformer. Furthermore, we show this generalization…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 18 pages

  14. arXiv:2310.10003  [pdf, other]

    stat.ME cs.LG stat.ML

    Conformal Contextual Robust Optimization

    Authors: Yash Patel, Sahana Rayan, Ambuj Tewari

    Abstract: Data-driven approaches to predict-then-optimize decision-making problems seek to mitigate the risk of uncertainty region misspecification in safety-critical settings. Current approaches, however, suffer from considering overly conservative uncertainty regions, often resulting in suboptimal decision-making. To this end, we propose Conformal-Predict-Then-Optimize (CPO), a framework for leveraging hig…

    Submitted 15 October, 2023; originally announced October 2023.

  15. arXiv:2310.07852  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    On the Computational Complexity of Private High-dimensional Model Selection

    Authors: Saptarshi Roy, Zehua Wang, Ambuj Tewari

    Abstract: We consider the problem of model selection in a high-dimensional sparse linear regression model under privacy constraints. We propose a differentially private best subset selection method with strong utility properties by adopting the well-known exponential mechanism for selecting the best model. We propose an efficient Metropolis-Hastings algorithm and establish that it enjoys polynomial mixing t…

    Submitted 23 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 27 pages, 2 figures

  16. arXiv:2309.06548  [pdf, ps, other]

    stat.ML cs.LG

    Online Infinite-Dimensional Regression: Learning Linear Operators

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We consider the problem of learning linear operators under squared loss between two infinite-dimensional Hilbert spaces in the online setting. We show that the class of linear operators with uniformly bounded $p$-Schatten norm is online learnable for any $p \in [1, \infty)$. On the other hand, we prove an impossibility result by showing that the class of uniformly bounded linear operators with res…

    Submitted 24 January, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: 21 pages, ALT 2024 Camera Ready

  17. arXiv:2309.02425  [pdf, ps, other]

    cs.LG stat.ML

    On the Minimax Regret in Online Ranking with Top-k Feedback

    Authors: Mingyuan Zhang, Ambuj Tewari

    Abstract: In online ranking, a learning algorithm sequentially ranks a set of items and receives feedback on its ranking in the form of relevance scores. Since obtaining relevance scores typically involves human annotation, it is of great interest to consider a partial feedback setting where feedback is restricted to the top-$k$ items in the rankings. Chaudhuri and Tewari [2017] developed a framework to ana…

    Submitted 12 April, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

  18. arXiv:2308.04620  [pdf, other]

    cs.LG stat.ML

    Multiclass Online Learnability under Bandit Feedback

    Authors: Ananth Raman, Vinod Raman, Unique Subedi, Idan Mehalel, Ambuj Tewari

    Abstract: We study online multiclass classification under bandit feedback. We extend the results of Daniely and Helbertal [2013] by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online learnability even when the label space is unbounded. Moreover, we show that, unlike the full-information setting, sequential uniform convergence is necessary but not su…

    Submitted 20 January, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 16 pages, ALT 2024 Camera Ready

  19. arXiv:2306.07818  [pdf, other]

    cs.LG stat.ML

    A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning

    Authors: Kihyuk Hong, Yuhang Li, Ambuj Tewari

    Abstract: Offline constrained reinforcement learning (RL) aims to learn a policy that maximizes the expected cumulative reward subject to constraints on expected cumulative cost using an existing dataset. In this paper, we propose Primal-Dual-Critic Algorithm (PDCA), a novel algorithm for offline constrained RL with general function approximation. PDCA runs a primal-dual algorithm on the Lagrangian function…

    Submitted 19 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  20. arXiv:2306.06247  [pdf, ps, other]

    cs.LG stat.ML

    Online Learning with Set-Valued Feedback

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study a variant of online multiclass classification where the learner predicts a single label but receives a set of labels as feedback. In this model, the learner is penalized for not outputting a label contained in the revealed set. We show that unlike online multiclass learning with single-label feedback, deterministic and randomized online learnability are not equivalent ev…

    Submitted 18 June, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to COLT 2024

  21. arXiv:2305.14275  [pdf, other]

    stat.ME cs.LG

    Variational Inference with Coverage Guarantees in Simulation-Based Inference

    Authors: Yash Patel, Declan McNamara, Jackson Loper, Jeffrey Regier, Ambuj Tewari

    Abstract: Amortized variational inference is an often employed framework in simulation-based inference that produces a posterior approximation that can be rapidly computed given any new observation. Unfortunately, there are few guarantees about the quality of these approximate posteriors. We propose Conformalized Amortized Neural Variational Inference (CANVI), a procedure that is scalable, easily implemente…

    Submitted 25 July, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  22. arXiv:2304.03337  [pdf, ps, other]

    cs.LG stat.ML

    On the Learnability of Multilabel Ranking

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: Multilabel ranking is a central task in machine learning. However, the most fundamental question of learnability in a multilabel ranking setting with relevance-score feedback remains unanswered. In this work, we characterize the learnability of multilabel ranking problems in both batch and online settings for a large family of ranking losses. Along the way, we give two equivalence classes of ranki…

    Submitted 25 May, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 28 pages

  23. arXiv:2303.17716  [pdf, ps, other]

    cs.LG stat.ML

    Multiclass Online Learning and Uniform Convergence

    Authors: Steve Hanneke, Shay Moran, Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study multiclass classification in the agnostic adversarial online learning setting. As our main result, we prove that any multiclass concept class is agnostically learnable if and only if its Littlestone dimension is finite. This solves an open problem studied by Daniely, Sabato, Ben-David, and Shalev-Shwartz (2011,2015) who handled the case when the number of classes (or labels) is bounded. W…

    Submitted 7 July, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: COLT Camera-Ready, 15 pages

  24. arXiv:2302.07409  [pdf, other]

    cs.LG cs.CC quant-ph stat.ML

    Quantum Learning Theory Beyond Batch Binary Classification

    Authors: Preetham Mohan, Ambuj Tewari

    Abstract: Arunachalam and de Wolf (2018) showed that the sample complexity of quantum batch learning of boolean functions, in the realizable and agnostic settings, has the same form and order as the corresponding classical sample complexities. In this paper, we extend this, ostensibly surprising, message to batch multiclass learning, online boolean learning, and online multiclass learning. For our online le…

    Submitted 26 December, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 30 pages, 2 figures, 2 tables; v4: entirely reorganized paper with more detailed proofs; handles the adversary-provides-a-distribution model independently;

  25. arXiv:2302.02033  [pdf, other]

    stat.ML cs.LG

    An Asymptotically Optimal Algorithm for the Convex Hull Membership Problem

    Authors: Gang Qiao, Ambuj Tewari

    Abstract: We study the convex hull membership (CHM) problem in the pure exploration setting where one aims to efficiently and accurately determine if a given point lies in the convex hull of means of a finite set of distributions. We give a complete characterization of the sample complexity of the CHM problem in the one-dimensional case. We present the first asymptotically optimal algorithm called Thompson-…

    Submitted 23 May, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

  26. arXiv:2301.06259  [pdf, other]

    math.ST stat.ML

    Understanding Best Subset Selection: A Tale of Two C(omplex)ities

    Authors: Saptarshi Roy, Ambuj Tewari, Ziwei Zhu

    Abstract: For decades, best subset selection (BSS) has eluded statisticians mainly due to its computational bottleneck. However, until recently, modern computational breakthroughs have rekindled theoretical interest in BSS and have led to new findings. Recently, \cite{guo2020best} showed that the model selection performance of BSS is governed by a margin quantity that is robust to the design dependence, unl…

    Submitted 17 July, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 46 pages, 2 Figures

  27. arXiv:2301.02729  [pdf, ps, other]

    cs.LG stat.ML

    A Characterization of Multioutput Learnability

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We consider the problem of learning multioutput function classes in batch and online settings. In both settings, we show that a multioutput function class is learnable if and only if each single-output restriction of the function class is learnable. This provides a complete characterization of the learnability of multilabel classification and multioutput regression in both batch and online setting…

    Submitted 22 October, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 37 pages

  28. arXiv:2211.16583  [pdf, other]

    stat.ML cs.LG

    Offline Policy Evaluation and Optimization under Confounding

    Authors: Chinmaya Kausik, Yangyi Lu, Kevin Tan, Maggie Makar, Yixin Wang, Ambuj Tewari

    Abstract: Evaluating and optimizing policies in the presence of unobserved confounders is a problem of growing interest in offline reinforcement learning. Using conventional methods for offline RL in the presence of confounding can not only lead to poor decisions and poor policies, but also have disastrous effects in critical applications such as healthcare and education. We map out the landscape of offline…

    Submitted 6 November, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Overhauled terminology and presentation, strengthened presentation of results

  29. arXiv:2211.10771  [pdf, other]

    q-bio.QM stat.AP

    RL Boltzmann Generators for Conformer Generation in Data-Sparse Environments

    Authors: Yash Patel, Ambuj Tewari

    Abstract: The generation of conformers has been a long-standing interest to structural chemists and biologists alike. A subset of proteins known as intrinsically disordered proteins (IDPs) fail to exhibit a fixed structure and, therefore, must also be studied in this light of conformer generation. Unlike in the small molecule setting, ground truth data are sparse in the IDP setting, undermining many existin…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted to the NeurIPS 2022 Workshop on Machine Learning in Structural Biology

  30. arXiv:2211.09403  [pdf, other]

    stat.ML cs.LG

    Learning Mixtures of Markov Chains and MDPs

    Authors: Chinmaya Kausik, Kevin Tan, Ambuj Tewari

    Abstract: We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories. Specifically, our method handles mixtures of Markov chains with optional control input by going through a multi-step process, involving (1) a subspace estimation step, (2) spectral clustering of trajectories using "pairwise distance estimators," along with refineme…

    Submitted 6 February, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 51 pages (13 page paper, 38 page appendix). Paper restructured and refined, corrections made to proofs, experiments added

  31. arXiv:2211.05964  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits

    Authors: Sunrit Chakraborty, Saptarshi Roy, Ambuj Tewari

    Abstract: We consider the stochastic linear contextual bandit problem with high-dimensional features. We analyze the Thompson sampling algorithm using special classes of sparsity-inducing priors (e.g., spike-and-slab) to model the unknown parameter and provide a nearly optimal upper bound on the expected cumulative regret. To the best of our knowledge, this is the first work that provides theoretical guaran…

    Submitted 28 January, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: 38 pages, 4 figures

  32. arXiv:2211.05656  [pdf, other]

    cs.LG stat.ML

    On Proper Learnability between Average- and Worst-case Robustness

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: Recently, Montasser et al. [2019] showed that finite VC dimension is not sufficient for proper adversarially robust PAC learning. In light of this hardness, there is a growing effort to study what type of relaxations to the adversarially robust PAC learning setup can enable proper learnability. In this work, we initiate the study of proper learning under relaxations of the worst-case robust loss.…

    Submitted 25 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: 19 pages

  33. arXiv:2205.15113  [pdf, other]

    cs.LG stat.ML

    Online Agnostic Multiclass Boosting

    Authors: Vinod Raman, Ambuj Tewari

    Abstract: Boosting is a fundamental approach in machine learning that enjoys both strong theoretical and practical guarantees. At a high level, boosting algorithms cleverly aggregate weak learners to generate predictions with arbitrarily high accuracy. In this way, boosting algorithms convert weak learners into strong ones. Recently, Brukhim et al. extended boosting to the online agnostic binary classificat…

    Submitted 17 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Camera-Ready Version

  34. arXiv:2205.14829  [pdf, other]

    stat.ML cs.LG

    Adaptive Sampling for Discovery

    Authors: Ziping Xu, Eunjae Shim, Ambuj Tewari, Paul Zimmerman

    Abstract: In this paper, we study a sequential decision-making problem, called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal to maximize the sum of responses. This problem has wide applications to real-world discovery problems, for example drug discovery with the help of machine learning models. ASD algorithms f…

    Submitted 2 January, 2023; v1 submitted 29 May, 2022; originally announced May 2022.

  35. arXiv:2205.14775  [pdf, other]

    stat.ML cs.LG

    An Optimization-based Algorithm for Non-stationary Kernel Bandits without Prior Knowledge

    Authors: Kihyuk Hong, Yuhang Li, Ambuj Tewari

    Abstract: We propose an algorithm for non-stationary kernel bandits that does not require prior knowledge of the degree of non-stationarity. The algorithm follows randomized strategies obtained by solving optimization problems that balance exploration and exploitation. It adapts to non-stationarity by restarting when a change in the reward function is detected. Our algorithm enjoys a tighter dynamic regret…

    Submitted 19 February, 2023; v1 submitted 29 May, 2022; originally announced May 2022.

  36. arXiv:2205.00894  [pdf, other]

    stat.AP cs.LG

    Modeling and mitigation of occupational safety risks in dynamic industrial environments

    Authors: Ashutosh Tewari, Antonio R. Paiva

    Abstract: Identifying and mitigating safety risks is paramount in a number of industries. In addition to guidelines and best practices, many industries already have safety management systems (SMSs) designed to monitor and reinforce good safety behaviors. The analytic capabilities to analyze the data acquired through such systems, however, are still lacking in terms of their ability to robustly quantify risk…

    Submitted 2 May, 2022; originally announced May 2022.

  37. arXiv:2204.06664  [pdf, other]

    stat.ML cs.LG

    Achieving Representative Data via Convex Hull Feasibility Sampling Algorithms

    Authors: Laura Niss, Yuekai Sun, Ambuj Tewari

    Abstract: Sampling biases in training data are a major source of algorithmic biases in machine learning systems. Although there are many methods that attempt to mitigate such algorithmic biases during training, the most direct and obvious way is simply collecting more representative training data. In this paper, we consider the task of assembling a training dataset in which minority groups are adequately re…

    Submitted 13 April, 2022; originally announced April 2022.

  38. Methodology for Testing and Evaluation of Safety Analytics Approaches

    Authors: Antonio R. Paiva, Ashutosh Tewari

    Abstract: There has been a significant increase in the development of data-driven safety analytics approaches in recent years. In light of these advances it has become imperative to evaluate such approaches in a principled way to determine their merits and limitations. To that end, we propose an evaluation methodology underpinned by a simulated environment that allows for a comprehensive assessment of safet…

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted to Safety Science

  39. arXiv:2112.10955  [pdf, other]

    stat.ML cs.LG eess.SY math.DS

    Joint Learning of Linear Time-Invariant Dynamical Systems

    Authors: Aditya Modi, Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

    Abstract: Linear time-invariant systems are very popular models in system theory and applications. A fundamental problem in system identification that remains rather unaddressed in extant literature is to leverage commonalities amongst related linear systems to estimate their transition matrices more accurately. To address this problem, the current paper investigates methods for jointly estimating the trans…

    Submitted 2 January, 2024; v1 submitted 20 December, 2021; originally announced December 2021.

  40. arXiv:2112.10314  [pdf, other]

    cs.GT stat.ML

    Balancing Adaptability and Non-exploitability in Repeated Games

    Authors: Anthony DiGiovanni, Ambuj Tewari

    Abstract: We study the problem of guaranteeing low regret in repeated games against an opponent with unknown membership in one of several classes. We add the constraint that our algorithm is non-exploitable, in that the opponent lacks an incentive to use an algorithm against which we cannot achieve rewards exceeding some "fair" value. Our solution is an expert algorithm (LAFF) that searches within a set of…

    Submitted 2 July, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted at Uncertainty in Artificial Intelligence 2022

  41. arXiv:2111.07126  [pdf, other]

    stat.ML cs.LG

    On the Statistical Benefits of Curriculum Learning

    Authors: Ziping Xu, Ambuj Tewari

    Abstract: Curriculum learning (CL) is a commonly used machine learning training strategy. However, we still lack a clear theoretical understanding of CL's benefits. In this paper, we study the benefits of CL in the multitask linear regression problem under both structured and unstructured settings. For both settings, we derive the minimax rates for CL with the oracle that provides the optimal curriculum and…

    Submitted 13 November, 2021; originally announced November 2021.

  42. arXiv:2108.04782  [pdf, ps, other]

    stat.ML cs.LG

    Bandit Algorithms for Precision Medicine

    Authors: Yangyi Lu, Ziping Xu, Ambuj Tewari

    Abstract: The Oxford English Dictionary defines precision medicine as "medical care designed to optimize efficiency or therapeutic benefit for particular groups of patients, especially by using genetic or molecular profiling." It is not an entirely new idea: physicians from ancient times have recognized that medical treatment needs to consider individual variations in patient characteristics. However, the m…

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: To appear as a chapter in the Handbook of Statistical Methods for Precision Medicine edited by Tianxi Cai, Bibhas Chakraborty, Eric Laber, Erica Moodie, and Mark van der Laan

  43. arXiv:2106.02988  [pdf, other]

    stat.ML cs.LG

    Causal Bandits with Unknown Graph Structure

    Authors: Yangyi Lu, Amirhossein Meisami, Ambuj Tewari

    Abstract: In causal bandit problems, the action set consists of interventions on variables of a causal graph. Several researchers have recently studied such bandit problems and pointed out their practical applications. However, all existing works rely on a restrictive and impractical assumption that the learner is given full knowledge of the causal graph structure upfront. In this paper, we develop novel ca…

    Submitted 9 November, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: Accepted to NeurIPS 2021

  44. arXiv:2105.14989  [pdf, other]

    stat.ML cs.LG

    Representation Learning Beyond Linear Prediction Functions

    Authors: Ziping Xu, Ambuj Tewari

    Abstract: Recent papers on the theory of representation learning have shown the importance of a quantity called diversity when generalizing from a set of source tasks to a target task. Most of these papers assume that the function mapping shared representations to predictions is linear, for both source and target tasks. In practice, researchers in deep learning use different numbers of extra layers following…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: 1 Figure

  45. arXiv:2102.07663  [pdf, other]

    stat.ML cs.LG

    Causal Markov Decision Processes: Learning Good Interventions Efficiently

    Authors: Yangyi Lu, Amirhossein Meisami, Ambuj Tewari

    Abstract: We introduce causal Markov Decision Processes (C-MDPs), a new formalism for sequential decision making which combines the standard MDP formulation with causal structures over state transition and reward functions. Many contemporary and emerging application areas such as digital healthcare and digital marketing can benefit from modeling with C-MDPs due to the causal mechanisms underlying the relati…

    Submitted 15 February, 2021; originally announced February 2021.

  46. arXiv:2010.08048  [pdf, other]

    stat.ML cs.LG

    Decision Making Problems with Funnel Structure: A Multi-Task Learning Approach with Application to Email Marketing Campaigns

    Authors: Ziping Xu, Amirhossein Meisami, Ambuj Tewari

    Abstract: This paper studies decision-making problems with funnel structure. Funnel structure, a well-known concept in the marketing field, occurs in systems where the decision maker interacts with the environment in a layered manner, receiving far fewer observations from deep layers than from shallow ones. For example, in the email marketing campaign application, the layers correspond to Open, Click and…

    Submitted 31 January, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

  47. arXiv:2008.04489  [pdf, other]

    cs.LG stat.ML

    Federated Learning via Synthetic Data

    Authors: Jack Goetz, Ambuj Tewari

    Abstract: Federated learning allows for the training of a model using data on multiple clients without the clients transmitting that raw data. However, the standard method is to transmit model parameters (or updates), which for modern neural networks can be on the scale of millions of parameters, inflicting significant computational costs on the clients. We propose a method for federated learning where inste…

    Submitted 26 September, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

  48. arXiv:2006.07078  [pdf, other]

    cs.LG stat.ML

    TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

    Authors: Tarun Gogineni, Ziping Xu, Exequiel Punzalan, Runxuan Jiang, Joshua Kammeraad, Ambuj Tewari, Paul Zimmerman

    Abstract: Molecular geometry prediction of flexible molecules, or conformer search, is a long-standing challenge in computational chemistry. This task is of great importance for predicting structure-activity relationships for a wide variety of substances ranging from biomolecules to ubiquitous materials. Substantial computational resources are invested in Monte Carlo and Molecular Dynamics methods to genera…

    Submitted 12 June, 2020; originally announced June 2020.

  49. arXiv:2006.02948  [pdf, other]

    stat.ML cs.LG

    Low-Rank Generalized Linear Bandit Problems

    Authors: Yangyi Lu, Amirhossein Meisami, Ambuj Tewari

    Abstract: In a low-rank linear bandit problem, the reward of an action (represented by a matrix of size $d_1 \times d_2$) is the inner product between the action and an unknown low-rank matrix $Θ^*$. We propose an algorithm based on a novel combination of online-to-confidence-set conversion~\citep{abbasi2012online} and the exponentially weighted average forecaster constructed by a covering of low-rank matri…

    Submitted 19 October, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

  50. arXiv:2006.01980  [pdf, other]

    stat.ML cs.CR cs.LG

    On the Equivalence between Online and Private Learnability beyond Binary Classification

    Authors: Young Hun Jung, Baekjin Kim, Ambuj Tewari

    Abstract: Alon et al. [2019] and Bun et al. [2020] recently showed that online learnability and private PAC learnability are equivalent in binary classification. We investigate whether this equivalence extends to multi-class classification and regression. First, we show that private learnability implies online learnability in both settings. Our extension involves studying a novel variant of the Littlestone…

    Submitted 8 October, 2021; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: An earlier version of this manuscript claimed an upper bound over the sample complexity that is exponential in the Littlestone dimension. The argument contained a technical mistake, and the current version presents a correction that deteriorates the dependence on the Littlestone dimension from exponential to doubly exponential. arXiv admin note: text overlap with arXiv:2003.00563 by other authors