Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–29 of 29 results for author: Ailon, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.04807  [pdf, ps, other

    cs.LG

    Efficient NTK using Dimensionality Reduction

    Authors: Nir Ailon, Supratim Shit

    Abstract: Recently, neural tangent kernel (NTK) has been used to explain the dynamics of learning parameters of neural networks, at the large width limit. Quantitative analyses of NTK give rise to network widths that are often impractical and incur high costs in time and energy in both training and deployment. Using a matrix factorization technique, we show how to obtain similar guarantees to those obtained… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  2. arXiv:2007.08864  [pdf, other

    cs.LG stat.ML

    Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice

    Authors: Nir Ailon, Omer Leibovich, Vineet Nair

    Abstract: A butterfly network consists of logarithmically many layers, each with a linear number of non-zero weights (pre-specified). The fast Johnson-Lindenstrauss transform (FJLT) can be represented as a butterfly network followed by a projection onto a random subset of the coordinates. Moreover, a random matrix based on FJLT with high probability approximates the action of any matrix on a vector. Motivat… ▽ More

    Submitted 4 July, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted to UAI 2021

  3. arXiv:1907.07471  [pdf, ps, other

    cs.CC math.RT

    Interesting Open Problem Related to Complexity of Computing the Fourier Transform and Group Theory

    Authors: Nir Ailon

    Abstract: The Fourier Transform is one of the most important linear transformations used in science and engineering. Cooley and Tukey's Fast Fourier Transform (FFT) from 1964 is a method for computing this transformation in time $O(n\log n)$. From a lower bound perspective, relatively little is known. Ailon shows in 2013 an $Ω(n\log n)$ bound for computing the normalized Fourier Transform assuming only unit… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

  4. arXiv:1712.06865  [pdf, ps, other

    cs.DS

    Approximate Correlation Clustering Using Same-Cluster Queries

    Authors: Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal

    Abstract: Ashtiani et al. (NIPS 2016) introduced a semi-supervised framework for clustering (SSAC) where a learner is allowed to make same-cluster queries. More specifically, in their model, there is a query oracle that answers queries of the form given any two vertices, do they belong to the same optimal cluster?. Ashtiani et al. showed the usefulness of such a query framework by giving a polynomial time a… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: To appear in LATIN 2018

  5. arXiv:1704.01862  [pdf, ps, other

    cs.DS

    Approximate Clustering with Same-Cluster Queries

    Authors: Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal, Amit Kumar

    Abstract: Ashtiani et al. proposed a Semi-Supervised Active Clustering framework (SSAC), where the learner is allowed to make adaptive queries to a domain expert. The queries are of the kind "do two given points belong to the same optimal cluster?" There are many clustering contexts where such same-cluster queries are feasible. Ashtiani et al. exhibited the power of such queries by showing that any instance… ▽ More

    Submitted 4 October, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: Updated version has results for faulty queries

  6. arXiv:1611.06996  [pdf, ps, other

    stat.ML cs.LG

    Spatial contrasting for deep unsupervised learning

    Authors: Elad Hoffer, Itay Hubara, Nir Ailon

    Abstract: Convolutional networks have marked their place over the last few years as the best performing model for various visual tasks. They are, however, most suited for supervised learning from large amounts of labeled data. Previous attempts have been made to use unlabeled data to improve model performance by applying unsupervised techniques. These attempts require different architectures and training me… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  7. arXiv:1611.01449  [pdf, other

    cs.LG

    Semi-supervised deep learning by metric embedding

    Authors: Elad Hoffer, Nir Ailon

    Abstract: Deep networks are successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples. These models, however, are usually much less suited for semi-supervised problems because of their tendency to overfit easily when trained on small amounts of data. In this work we will explore a new training objective that is targeting a semi-supervised… ▽ More

    Submitted 4 December, 2018; v1 submitted 4 November, 2016; originally announced November 2016.

  8. arXiv:1610.00243  [pdf, other

    cs.LG cs.AI stat.ML

    Deep unsupervised learning through spatial contrasting

    Authors: Elad Hoffer, Itay Hubara, Nir Ailon

    Abstract: Convolutional networks have marked their place over the last few years as the best performing model for various visual tasks. They are, however, most suited for supervised learning from large amounts of labeled data. Previous attempts have been made to use unlabeled data to improve model performance by applying unsupervised techniques. These attempts require different architectures and training me… ▽ More

    Submitted 4 December, 2018; v1 submitted 2 October, 2016; originally announced October 2016.

  9. arXiv:1609.03278  [pdf, ps, other

    cs.CC

    Paraunitary Matrices, Entropy, Algebraic Condition Number and Fourier Computation

    Authors: Nir Ailon

    Abstract: The Fourier Transform is one of the most important linear transformations used in science and engineering. Cooley and Tukey's Fast Fourier Transform (FFT) from 1964 is a method for computing this transformation in time $O(n\log n)$. From a lower bound perspective, relatively little is known. Ailon shows in 2013 an $Ω(n\log n)$ bound for computing the normalized Fourier Transform assuming only unit… ▽ More

    Submitted 8 November, 2018; v1 submitted 12 September, 2016; originally announced September 2016.

    Comments: arXiv admin note: text overlap with arXiv:1404.1741

    ACM Class: F.2.1

  10. arXiv:1604.02557  [pdf, ps, other

    cs.CC

    The Complexity of Computing (Almost) Unitary Matrices With $\eps$-Copies of the Fourier Transform

    Authors: Nir Ailon, Gal Yehuda

    Abstract: The complexity of computing the Fourier transform is a longstanding open problem. Very recently, Ailon (2013, 2014, 2015) showed in a collection of papers that, roughly speaking, a speedup of the Fourier transform computation implies numerical ill-condition. The papers also quantify this tradeoff. The main method for proving these results is via a potential function called quasi-entropy, reminisce… ▽ More

    Submitted 17 April, 2019; v1 submitted 9 April, 2016; originally announced April 2016.

  11. arXiv:1412.6622  [pdf, other

    cs.LG cs.CV stat.ML

    Deep metric learning using Triplet network

    Authors: Elad Hoffer, Nir Ailon

    Abstract: Deep learning has proven itself as a successful set of models for learning useful semantic representations of data. These, however, are mostly implicitly learned as part of a classification task. In this paper we propose the triplet network model, which aims to learn useful representations by distance comparisons. A similar model was defined by Wang et al. (2014), tailor made for learning a rankin… ▽ More

    Submitted 4 December, 2018; v1 submitted 20 December, 2014; originally announced December 2014.

  12. arXiv:1405.3396  [pdf, other

    cs.LG

    Reducing Dueling Bandits to Cardinal Bandits

    Authors: Nir Ailon, Thorsten Joachims, Zohar Karnin

    Abstract: We present algorithms for reducing the Dueling Bandits problem to the conventional (stochastic) Multi-Armed Bandits problem. The Dueling Bandits problem is an online model of learning with ordinal feedback of the form "A is preferred to B" (as opposed to cardinal feedback like "A has value 2.5"), giving it wide applicability in learning from implicit user feedback and revealed and stated preferenc… ▽ More

    Submitted 14 May, 2014; originally announced May 2014.

  13. arXiv:1404.1741  [pdf, ps, other

    cs.CC

    Tighter Fourier Transform Complexity Tradeoffs

    Authors: Nir Ailon

    Abstract: The Fourier Transform is one of the most important linear transformations used in science and engineering. Cooley and Tukey's Fast Fourier Transform (FFT) from 1964 is a method for computing this transformation in time $O(n\log n)$. Achieving a matching lower bound in a reasonable computational model is one of the most important open problems in theoretical computer science. In 2014, improving o… ▽ More

    Submitted 15 April, 2015; v1 submitted 7 April, 2014; originally announced April 2014.

  14. arXiv:1403.1307  [pdf, ps, other

    cs.CC

    An n\log n Lower Bound for Fourier Transform Computation in the Well Conditioned Model

    Authors: Nir Ailon

    Abstract: Obtaining a non-trivial (super-linear) lower bound for computation of the Fourier transform in the linear circuit model has been a long standing open problem for over 40 years. An early result by Morgenstern from 1973, provides an $Ω(n \log n)$ lower bound for the unnormalized Fourier transform when the constants used in the computation are bounded. The proof uses a potential function related to… ▽ More

    Submitted 24 July, 2014; v1 submitted 5 March, 2014; originally announced March 2014.

  15. arXiv:1401.2912  [pdf, other

    cs.DS

    A tight lower bound instance for k-means++ in constant dimension

    Authors: Anup Bhattacharya, Ragesh Jaiswal, Nir Ailon

    Abstract: The k-means++ seeding algorithm is one of the most popular algorithms that is used for finding the initial $k$ centers when using the k-means heuristic. The algorithm is a simple sampling procedure and can be described as follows: Pick the first center randomly from the given points. For $i > 1$, pick a point to be the $i^{th}$ center with probability proportional to the square of the Euclidean di… ▽ More

    Submitted 13 January, 2014; v1 submitted 13 January, 2014; originally announced January 2014.

    Comments: To appear in TAMC 2014. arXiv admin note: text overlap with arXiv:1306.4207

  16. arXiv:1312.1530  [pdf, ps, other

    cs.LG

    Bandit Online Optimization Over the Permutahedron

    Authors: Nir Ailon, Kohei Hatano, Eiji Takimoto

    Abstract: The permutahedron is the convex polytope with vertex set consisting of the vectors $(π(1),\dots, π(n))$ for all permutations (bijections) $π$ over $\{1,\dots, n\}$. We study a bandit game in which, at each step $t$, an adversary chooses a hidden weight weight vector $s_t$, a player chooses a vertex $π_t$ of the permutahedron and suffers an observed loss of $\sum_{i=1}^n π(i) s_t(i)$. A previous… ▽ More

    Submitted 6 July, 2014; v1 submitted 5 December, 2013; originally announced December 2013.

  17. arXiv:1308.6797  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Online Ranking: Discrete Choice, Spearman Correlation and Other Feedback

    Authors: Nir Ailon

    Abstract: Given a set $V$ of $n$ objects, an online ranking system outputs at each time step a full ranking of the set, observes a feedback of some form and suffers a loss. We study the setting in which the (adversarial) feedback is an element in $V$, and the loss is the position (0th, 1st, 2nd...) of the item in the outputted ranking. More generally, we study a setting in which the feedback is a subset… ▽ More

    Submitted 14 October, 2013; v1 submitted 30 August, 2013; originally announced August 2013.

  18. arXiv:1305.4745  [pdf, ps, other

    cs.CC

    A Lower Bound for Fourier Transform Computation in a Linear Model Over 2x2 Unitary Gates Using Matrix Entropy

    Authors: Nir Ailon

    Abstract: Obtaining a non-trivial (super-linear) lower bound for computation of the Fourier transform in the linear circuit model has been a long standing open problem. All lower bounds so far have made strong restrictions on the computational model. One of the most well known results, by Morgenstern from 1973, provides an $Ω(n \log n)$ lower bound for the \emph{unnormalized} FFT when the constants used in… ▽ More

    Submitted 21 May, 2013; originally announced May 2013.

  19. arXiv:1302.4549  [pdf, other

    cs.LG stat.ML

    Breaking the Small Cluster Barrier of Graph Clustering

    Authors: Nir Ailon, Yudong Chen, Xu Huan

    Abstract: This paper investigates graph clustering in the planted cluster model in the presence of {\em small clusters}. Traditional results dictate that for an algorithm to provably correctly recover the clusters, {\em all} clusters must be sufficiently large (in particular, $\tildeΩ(\sqrt{n})$ where $n$ is the number of nodes of the graph). We show that this is not really a restriction: by a more refined… ▽ More

    Submitted 20 February, 2013; v1 submitted 19 February, 2013; originally announced February 2013.

  20. arXiv:1301.0878  [pdf, ps, other

    math.NA cs.IT

    Fast and RIP-optimal transforms

    Authors: Nir Ailon, Holger Rauhut

    Abstract: We study constructions of $k \times n$ matrices $A$ that both (1) satisfy the restricted isometry property (RIP) at sparsity $s$ with optimal parameters, and (2) are efficient in the sense that only $O(n\log n)$ operations are required to compute $Ax$ given a vector $x$. Our construction is based on repeated application of independent transformations of the form $DH$, where $H$ is a Hadamard or Fo… ▽ More

    Submitted 17 February, 2013; v1 submitted 5 January, 2013; originally announced January 2013.

    ACM Class: F.2.1

  21. arXiv:1204.6588  [pdf, ps, other

    cs.CC

    A note on: No need to choose: How to get both a PTAS and Sublinear Query Complexity

    Authors: Nir Ailon, Zohar Karnin

    Abstract: We revisit various PTAS's (Polynomial Time Approximation Schemes) for minimization versions of dense problems, and show that they can be performed with sublinear query complexity. This means that not only do we obtain a (1+eps)-approximation to the NP-Hard problems in polynomial time, but also avoid reading the entire input. This setting is particularly advantageous when the price of reading parts… ▽ More

    Submitted 30 April, 2012; originally announced April 2012.

  22. arXiv:1201.6462  [pdf, ps, other

    cs.LG

    Active Learning of Custering with Side Information Using $\eps$-Smooth Relative Regret Approximations

    Authors: Nir Ailon, Ron Begleiter

    Abstract: Clustering is considered a non-supervised learning setting, in which the goal is to partition a collection of data points into disjoint clusters. Often a bound $k$ on the number of clusters is given or assumed by the practitioner. Many versions of this problem have been defined, most notably $k$-means and $k$-median. An underlying problem with the unsupervised nature of clustering it that of det… ▽ More

    Submitted 31 January, 2012; originally announced January 2012.

  23. arXiv:1110.2136  [pdf, ps, other

    cs.LG

    Active Learning Using Smooth Relative Regret Approximations with Applications

    Authors: Nir Ailon, Ron Begleiter, Esther Ezra

    Abstract: The disagreement coefficient of Hanneke has become a central data independent invariant in proving active learning rates. It has been shown in various ways that a concept class with low complexity together with a bound on the disagreement coefficient at an optimal solution allows active learning rates that are superior to passive learning ones. We present a different tool for pool based active l… ▽ More

    Submitted 20 June, 2012; v1 submitted 10 October, 2011; originally announced October 2011.

  24. arXiv:1012.3011  [pdf, other

    cs.DS

    An Improved Algorithm for Bipartite Correlation Clustering

    Authors: Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty

    Abstract: Bipartite Correlation clustering is the problem of generating a set of disjoint bi-cliques on a set of nodes while minimizing the symmetric difference to a bipartite input graph. The number or size of the output clusters is not constrained in any way. The best known approximation algorithm for this problem gives a factor of 11. This result and all previous ones involve solving large linear or semi… ▽ More

    Submitted 14 December, 2010; originally announced December 2010.

  25. arXiv:1011.0108  [pdf, ps, other

    cs.DS

    An Active Learning Algorithm for Ranking from Pairwise Preferences with an Almost Optimal Query Complexity

    Authors: Nir Ailon

    Abstract: We study the problem of learning to rank from pairwise preferences, and solve a long-standing open problem that has led to development of many heuristics but no provable results for our particular problem. Given a set $V$ of $n$ elements, we wish to linearly order them given pairwise preference labels. A pairwise preference label is obtained as a response, typically from a human, to the question "… ▽ More

    Submitted 17 May, 2011; v1 submitted 30 October, 2010; originally announced November 2010.

    Comments: Fixed a tiny error in theorem 3.1 statement

  26. arXiv:1005.5513  [pdf, ps, other

    cs.DS

    Almost Optimal Unrestricted Fast Johnson-Lindenstrauss Transform

    Authors: Nir Ailon, Edo Liberty

    Abstract: The problems of random projections and sparse reconstruction have much in common and individually received much attention. Surprisingly, until now they progressed in parallel and remained mostly separate. Here, we employ new tools from probability in Banach spaces that were successfully used in the context of sparse reconstruction to advance on an open problem in random pojection. In particular, w… ▽ More

    Submitted 30 May, 2010; originally announced May 2010.

  27. arXiv:0907.0884  [pdf, ps, other

    cs.DS cs.CG

    Self-Improving Algorithms

    Authors: Nir Ailon, Bernard Chazelle, Kenneth L. Clarkson, Ding Liu, Wolfgang Mulzer, C. Seshadhri

    Abstract: We investigate ways in which an algorithm can improve its expected performance by fine-tuning itself automatically with respect to an unknown input distribution D. We assume here that D is of product type. More precisely, suppose that we need to process a sequence I_1, I_2, ... of inputs I = (x_1, x_2, ..., x_n) of some fixed length n, where each x_i is drawn independently from some arbitrary, unk… ▽ More

    Submitted 18 October, 2010; v1 submitted 5 July, 2009; originally announced July 2009.

    Comments: 26 pages, 8 figures, preliminary versions appeared at SODA 2006 and SoCG 2008. Thorough revision to improve the presentation of the paper

    ACM Class: F.2.2; D.1; F.1.1; I.2.6

    Journal ref: SIAM Journal on Computing (SICOMP), 40(2), 2011, pp. 350-375

  28. arXiv:0810.2764  [pdf, ps, other

    cs.IR cs.LG

    A Simple Linear Ranking Algorithm Using Query Dependent Intercept Variables

    Authors: Nir Ailon

    Abstract: The LETOR website contains three information retrieval datasets used as a benchmark for testing machine learning ideas for ranking. Algorithms participating in the challenge are required to assign score values to search results for a collection of queries, and are measured using standard IR ranking measures (NDCG, precision, MAP) that depend only the relative score-induced order of the results.… ▽ More

    Submitted 15 October, 2008; originally announced October 2008.

    Comments: 5 pages

  29. arXiv:0710.2889  [pdf, ps, other

    cs.LG cs.IR

    An efficient reduction of ranking to classification

    Authors: Nir Ailon, Mehryar Mohri

    Abstract: This paper describes an efficient reduction of the learning problem of ranking to binary classification. The reduction guarantees an average pairwise misranking regret of at most that of the binary classifier regret, improving a recent result of Balcan et al which only guarantees a factor of 2. Moreover, our reduction applies to a broader class of ranking loss functions, admits a simpler proof,… ▽ More

    Submitted 6 December, 2007; v1 submitted 15 October, 2007; originally announced October 2007.

    Comments: Revised paper: Improved results: Upper bounds for regret (constant down to 1 for bipartite case) and also lower bound on deterministic algorithms for bipartite case. Total number of pages 22

    ACM Class: K.3.2