Showing 1–24 of 24 results for author: Kontonis, V

Searching in archive cs.
  1. arXiv:2407.00966  [pdf, ps, other]

    cs.LG cs.CC

    Smoothed Analysis for Learning Concepts with Low Intrinsic Dimension

    Authors: Gautam Chandrasekaran, Adam Klivans, Vasilis Kontonis, Raghu Meka, Konstantinos Stavropoulos

    Abstract: In traditional models of supervised learning, the goal of a learner -- given examples from an arbitrary joint distribution on $\mathbb{R}^d \times \{\pm 1\}$ -- is to output a hypothesis that is competitive (to within $ε$) of the best fitting concept from some class. In order to escape strong hardness results for learning even simple concept classes, we introduce a smoothed-analysis framework that…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: To appear in COLT 2024

  2. arXiv:2406.09373  [pdf, other]

    cs.DS cs.LG

    Efficient Discrepancy Testing for Learning with Distribution Shift

    Authors: Gautam Chandrasekaran, Adam R. Klivans, Vasilis Kontonis, Konstantinos Stavropoulos, Arsen Vasilyan

    Abstract: A fundamental notion of distance between train and test distributions from the field of domain adaptation is discrepancy distance. While in general hard to compute, here we provide the first set of provably efficient algorithms for testing localized discrepancy distance, where discrepancy is computed with respect to a fixed output classifier. These results imply a broad set of new, efficient learn…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 45 pages, 3 figures

  3. arXiv:2405.12958  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Online Learning of Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the task of online learning in the presence of Massart noise. Instead of assuming that the online adversary chooses an arbitrary sequence of labels, we assume that the context $\mathbf{x}$ is selected adversarially but the label $y$ presented to the learner disagrees with the ground-truth label of $\mathbf{x}$ with unknown probability at most $η$. We study the fundamental class of $γ$-mar…

    Submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2405.07937  [pdf, other]

    cs.LG cs.DS

    Active Learning with Simple Questions

    Authors: Vasilis Kontonis, Mingchen Ma, Christos Tzamos

    Abstract: We consider an active learning setting where a learner is presented with a pool $S$ of $n$ unlabeled examples belonging to a domain $X$ and asks queries to find the underlying labeling that agrees with a target concept $h^* \in H$. In contrast to traditional active learning that queries a single example for its label, we study more general region queries that allow the learner to pick a subset of the do…

    Submitted 10 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: To appear at COLT 2024

  5. arXiv:2404.18893  [pdf, other]

    cs.DS cs.LG stat.ML

    Learning general Gaussian mixtures with efficient score matching

    Authors: Sitan Chen, Vasilis Kontonis, Kulin Shah

    Abstract: We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the means and covariances lie in a ball of bounded radius. We give an algorithm that draws $d^{\mathrm{poly}(k/\varepsilon)}$ samples from the target mixture, runs in…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 57 pages

  6. arXiv:2404.00529  [pdf, other]

    cs.DS cs.LG

    Super Non-singular Decompositions of Polynomials and their Application to Robustly Learning Low-degree PTFs

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Sihan Liu, Nikos Zarifis

    Abstract: We study the efficient learnability of low-degree polynomial threshold functions (PTFs) in the presence of a constant fraction of adversarial corruptions. Our main algorithmic result is a polynomial-time PAC learning algorithm for this concept class in the strong contamination model under the Gaussian distribution with error guarantee $O_{d, c}(\text{opt}^{1-c})$, for any desired constant $c>0$, w…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: To appear in STOC 2024

  7. arXiv:2312.16616  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Agnostically Learning Multi-index Models with Queries

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the power of query access for the task of agnostic learning under the Gaussian distribution. In the agnostic model, no assumptions are made on the labels and the goal is to compute a hypothesis that is competitive with the best-fit function in a known class, i.e., it achieves error $\mathrm{opt}+ε$, where $\mathrm{opt}$ is the error of the best function in the class. We focus on a g…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: abstract shortened due to arxiv requirements

  8. arXiv:2310.05309  [pdf, other]

    cs.LG cs.AI cs.DS stat.ML

    Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods

    Authors: Constantine Caramanis, Dimitris Fotakis, Alkis Kalavasis, Vasilis Kontonis, Christos Tzamos

    Abstract: Deep Neural Networks and Reinforcement Learning methods have empirically shown great promise in tackling challenging combinatorial problems. In those methods a deep neural network is used as a solution generator which is then trained by gradient-based methods (e.g., policy gradient) to successively obtain better solution distributions. In this work we introduce a novel theoretical framework for an…

    Submitted 6 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  9. arXiv:2308.03142  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Self-Directed Linear Classification

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: In online classification, a learner is presented with a sequence of examples and aims to predict their labels in an online fashion so as to minimize the total number of mistakes. In the self-directed variant, the learner knows in advance the pool of examples and can adaptively choose the order in which predictions are made. Here we study the power of choosing the prediction order and establish the…

    Submitted 6 August, 2023; originally announced August 2023.

  10. arXiv:2303.05485  [pdf, ps, other]

    cs.LG stat.ML

    Efficient Testable Learning of Halfspaces with Adversarial Label Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Sihan Liu, Nikos Zarifis

    Abstract: We give the first polynomial-time algorithm for the testable learning of halfspaces in the presence of adversarial label noise under the Gaussian distribution. In the recently introduced testable learning model, one is required to produce a tester-learner such that if the data passes the tester, then one can trust the output of the robust learner on the data. Our tester-learner runs in time…

    Submitted 9 March, 2023; originally announced March 2023.

  11. arXiv:2302.03806  [pdf, other]

    cs.LG

    SLaM: Student-Label Mixing for Distillation with Unlabeled Examples

    Authors: Vasilis Kontonis, Fotis Iliopoulos, Khoa Trinh, Cenk Baykal, Gaurav Menghani, Erik Vee

    Abstract: Knowledge distillation with unlabeled examples is a powerful training paradigm for generating compact and lightweight student models in applications where the amount of labeled data is limited but one has access to a large pool of unlabeled data. In this setting, a large teacher model generates "soft" pseudo-labels for the unlabeled dataset which are then used for training the student model. Des…

    Submitted 8 June, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  12. arXiv:2210.06711  [pdf, other]

    cs.LG cs.AI

    Weighted Distillation with Unlabeled Examples

    Authors: Fotis Iliopoulos, Vasilis Kontonis, Cenk Baykal, Gaurav Menghani, Khoa Trinh, Erik Vee

    Abstract: Distillation with unlabeled examples is a popular and powerful method for training deep neural networks in settings where the amount of labeled data is limited: A large "teacher" neural network is trained on the labeled data available, and then it is used to generate labels on an unlabeled dataset (typically much larger in size). These labels are then utilized to train the smaller "student" mo…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: To appear in NeurIPS 2022

  13. arXiv:2206.08918  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning a Single Neuron with Adversarial Label Noise via Gradient Descent

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the fundamental problem of learning a single neuron, i.e., a function of the form $\mathbf{x}\mapsto σ(\mathbf{w}\cdot\mathbf{x})$ for monotone activations $σ:\mathbb{R}\mapsto\mathbb{R}$, with respect to the $L_2^2$-loss in the presence of adversarial label noise. Specifically, we are given labeled examples from a distribution $D$ on $(\mathbf{x}, y)\in\mathbb{R}^d \times \mathbb{R}$ such…

    Submitted 17 June, 2022; originally announced June 2022.

  14. arXiv:2108.09805  [pdf, other]

    cs.LG cs.DS stat.ML

    Efficient Algorithms for Learning from Coarse Labels

    Authors: Dimitris Fotakis, Alkis Kalavasis, Vasilis Kontonis, Christos Tzamos

    Abstract: For many learning problems one may not have access to fine grained label information; e.g., an image can be labeled as husky, dog, or even animal depending on the expertise of the annotator. In this work, we formalize these settings and study the problem of learning from such coarse data. Instead of observing the actual labels from a set $\mathcal{Z}$, we observe coarse labels corresponding to a p…

    Submitted 24 March, 2023; v1 submitted 22 August, 2021; originally announced August 2021.

  15. arXiv:2108.08767  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning General Halfspaces with General Massart Noise under the Gaussian Distribution

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning halfspaces on $\mathbb{R}^d$ with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point $\mathbf{x}$ with unknown probability $η(\mathbf{x}) \leq η$, for some parameter $η\in [0,1/2]$. The goal is to find a hypothesis with misclassification error of $\mathrm{OPT} + ε$, where $\mathrm{OPT}$ i…

    Submitted 8 November, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Revised presentation

  16. arXiv:2102.05629  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Agnostic Proper Learning of Halfspaces under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning halfspaces under the Gaussian distribution. Our main result is the first proper learning algorithm for this problem whose sample complexity and computational complexity qualitatively match those of the best known improper agnostic learner. Building on this result, we also obtain the first proper polynomial-time approximation scheme (PTAS) for agn…

    Submitted 10 February, 2021; originally announced February 2021.

  17. arXiv:2012.00732  [pdf, other]

    cs.LG math.ST

    Convergence and Sample Complexity of SGD in GANs

    Authors: Vasilis Kontonis, Sihan Liu, Christos Tzamos

    Abstract: We provide theoretical convergence guarantees on training Generative Adversarial Networks (GANs) via SGD. We consider learning a target distribution modeled by a 1-layer Generator network with a non-linear activation function $φ(\cdot)$ parametrized by a $d \times d$ weight matrix $\mathbf W_*$, i.e., $f_*(\mathbf x) = φ(\mathbf W_* \mathbf x)$. Our main result is that by training the Generator…

    Submitted 1 December, 2020; originally announced December 2020.

  18. arXiv:2010.01705  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    A Polynomial Time Algorithm for Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning homogeneous halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, the label of every sample is independently flipped with an adversarially controlled probability that can be arbitrarily close to $1/2$ for a fraction of the samples. We give the first polynomial-time algorithm for this fundamental learning problem. Our algorithm learns…

    Submitted 4 October, 2020; originally announced October 2020.

  19. arXiv:2006.12476  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Nikos Zarifis

    Abstract: We study the problem of PAC learning one-hidden-layer ReLU networks with $k$ hidden units on $\mathbb{R}^d$ under Gaussian marginals in the presence of additive label noise. For the case of positive coefficients, we give the first polynomial-time algorithm for this learning problem for $k$ up to $\tilde{O}(\sqrt{\log d})$. Previously, no polynomial time algorithm was known, even for $k=3$. This an…

    Submitted 22 June, 2020; originally announced June 2020.

  20. arXiv:2006.06742  [pdf, ps, other]

    cs.LG stat.ML

    Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning homogeneous halfspaces in the distribution-specific PAC model. For a broad family of structured distributions, including log-concave distributions, we show that non-convex SGD efficiently converges to a solution with misclassification error $O(\mathrm{opt})+ε$, where $\mathrm{opt}$ is the misclassification error of the best-fitting halfspace. In sharp contrast, we…

    Submitted 11 June, 2020; originally announced June 2020.

  21. arXiv:2006.06467  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the efficient PAC learnability of halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, each label is independently flipped with some probability which is controlled by an adversary. This noise model significantly generalizes the Massart noise model, by allowing the flipping probabilities to be arbitrarily close to $1/2$ for a fraction of the samples. Our main result…

    Submitted 11 June, 2020; originally announced June 2020.

  22. arXiv:2002.05632  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Massart Noise Under Structured Distributions

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of learning halfspaces with Massart noise in the distribution-specific PAC model. We give the first computationally efficient algorithm for this problem with respect to a broad family of distributions, including log-concave distributions. This resolves an open question posed in a number of prior works. Our approach is extremely simple: We identify a smooth non-convex sur…

    Submitted 13 February, 2020; originally announced February 2020.

  23. arXiv:1908.01034  [pdf, other]

    math.ST cs.DS cs.LG stat.CO stat.ML

    Efficient Truncated Statistics with Unknown Truncation

    Authors: Vasilis Kontonis, Christos Tzamos, Manolis Zampetakis

    Abstract: We study the problem of estimating the parameters of a Gaussian distribution when samples are only shown if they fall in some (unknown) subset $S \subseteq \mathbb{R}^d$. This core problem in truncated statistics has a long history going back to Galton, Lee, Pearson and Fisher. Recent work by Daskalakis et al. (FOCS'18) provides the first efficient algorithm that works for arbitrary sets in high dimension…

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: to appear at 60th Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2019

  24. arXiv:1707.05662  [pdf, ps, other]

    cs.DS cs.LG math.ST

    Learning Powers of Poisson Binomial Distributions

    Authors: Dimitris Fotakis, Vasilis Kontonis, Piotr Krysta, Paul Spirakis

    Abstract: We introduce the problem of simultaneously learning all powers of a Poisson Binomial Distribution (PBD). A PBD of order $n$ is the distribution of a sum of $n$ mutually independent Bernoulli random variables $X_i$, where $\mathbb{E}[X_i] = p_i$. The $k$'th power of this distribution, for $k$ in a range $[m]$, is the distribution of $P_k = \sum_{i=1}^n X_i^{(k)}$, where each Bernoulli random variab…

    Submitted 18 July, 2017; originally announced July 2017.