
Showing 1–21 of 21 results for author: Rakhuba, M

  1. arXiv:2406.10019  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV math.NA

    Group and Shuffle: Efficient Structured Orthogonal Parametrization

    Authors: Mikhail Gorbunov, Nikolay Yudin, Vera Soboleva, Aibek Alanov, Alexey Naumov, Maxim Rakhuba

    Abstract: The increasing size of neural networks has led to a growing demand for methods of efficient fine-tuning. Recently, an orthogonal fine-tuning paradigm was introduced that uses orthogonal matrices for adapting the weights of a pretrained model. In this paper, we introduce a new class of structured matrices, which unifies and generalizes structured classes from previous works. We examine properties o…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2402.10032  [pdf, ps, other]

    math.ST eess.SP

    Dimension-free Structured Covariance Estimation

    Authors: Nikita Puchkin, Maxim Rakhuba

    Abstract: Given a sample of i.i.d. high-dimensional centered random vectors, we consider a problem of estimation of their covariance matrix $Σ$ with an additional assumption that $Σ$ can be represented as a sum of a few Kronecker products of smaller matrices. Under mild conditions, we derive the first non-asymptotic dimension-free high-probability bound on the Frobenius distance between $Σ$ and a widely use…

    Submitted 15 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted for presentation at the 37th Annual Conference on Learning Theory (COLT 2024)

  3. arXiv:2211.13771  [pdf, other]

    cs.LG cs.CV

    Towards Practical Control of Singular Values of Convolutional Layers

    Authors: Alexandra Senderovich, Ekaterina Bulatova, Anton Obukhov, Maxim Rakhuba

    Abstract: In general, convolutional neural networks (CNNs) are easy to train, but their essential properties, such as generalization error and adversarial robustness, are hard to control. Recent research demonstrated that singular values of convolutional layers significantly affect such elusive properties and offered several methods for controlling them. Nevertheless, these methods present an intractable co…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Published as a conference paper at NeurIPS 2022

  4. arXiv:2205.04335  [pdf, other]

    math.NA

    Tensor rank bounds and explicit QTT representations for the inverses of circulant matrices

    Authors: Lev Vysotsky, Maxim Rakhuba

    Abstract: In this paper, we are concerned with the inversion of circulant matrices and their quantized tensor-train (QTT) structure. In particular, we show that the inverse of a complex circulant matrix $A$, generated by the first column of the form $(a_0,\dots,a_{m-1},0,\dots,0,a_{-n},\dots, a_{-1})^\top$ admits a QTT representation with the QTT ranks bounded by $(m+n)$. Under certain assumptions on the en…

    Submitted 9 May, 2022; originally announced May 2022.

    MSC Class: 15B05; 65F55

  5. arXiv:2111.14758  [pdf, other]

    math.NA math.OC

    Local convergence of alternating low-rank optimization methods with overrelaxation

    Authors: Ivan V. Oseledets, Maxim V. Rakhuba, André Uschmajew

    Abstract: The local convergence of alternating optimization methods with overrelaxation for low-rank matrix and tensor problems is established. The analysis is based on the linearization of the method which takes the form of an SOR iteration for a positive semidefinite Hessian and can be studied in the corresponding quotient geometry of equivalent low-rank representations. In the matrix case, the optimal re…

    Submitted 28 June, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  6. arXiv:2105.14250  [pdf, other]

    cs.CV cs.LG

    Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation

    Authors: Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim Rakhuba, Andreas Krause, Konrad Schindler

    Abstract: We propose an end-to-end trainable framework that processes large-scale visual data tensors by looking at a fraction of their entries only. Our method combines a neural network encoder with a tensor train decomposition to learn a low-rank latent encoding, coupled with cross-approximation (CA) to learn the representation through a subset of the original samples. CA is an adaptive sampling algorithm…

    Submitted 12 November, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

    Journal ref: Proc. International Conference on Computer Vision (ICCV) 2021

  7. arXiv:2103.14974  [pdf, other]

    math.OC cs.LG cs.MS math.NA

    Automatic differentiation for Riemannian optimization on low-rank matrix and tensor-train manifolds

    Authors: Alexander Novikov, Maxim Rakhuba, Ivan Oseledets

    Abstract: In scientific computing and machine learning applications, matrices and more general multidimensional arrays (tensors) can often be approximated with the help of low-rank decompositions. Since matrices and tensors of fixed rank form smooth Riemannian manifolds, one of the popular tools for finding low-rank approximations is to use Riemannian optimization. Nevertheless, efficient implementation of…

    Submitted 23 October, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

  8. arXiv:2103.04217  [pdf, other]

    cs.LG cs.CV stat.ML

    Spectral Tensor Train Parameterization of Deep Learning Layers

    Authors: Anton Obukhov, Maxim Rakhuba, Alexander Liniger, Zhiwu Huang, Stamatios Georgoulis, Dengxin Dai, Luc Van Gool

    Abstract: We study low-rank parameterizations of weight matrices with embedded spectral properties in the Deep Learning context. The low-rank property leads to parameter efficiency and permits taking computational shortcuts when computing mappings. Spectral properties are often subject to constraints in optimization problems, leading to better models and stability of optimization. We start by looking at the…

    Submitted 13 July, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted at AISTATS 2021

  9. arXiv:2010.06919  [pdf, other]

    math.NA

    Low rank tensor approximation of singularly perturbed partial differential equations in one dimension

    Authors: Carlo Marcati, Maxim Rakhuba, Johan E. M. Ulander

    Abstract: We derive rank bounds on the quantized tensor train (QTT) compressed approximation of singularly perturbed reaction diffusion partial differential equations (PDEs) in one dimension. Specifically, we show that, independently of the scale of the singular perturbation parameter, a numerical solution with accuracy $0<ε<1$ can be represented in QTT format with a number of parameters that depends only p…

    Submitted 14 October, 2020; originally announced October 2020.

    MSC Class: 15A69; 35A35; 35J25; 41A25; 65N30

  10. arXiv:2007.06631  [pdf, other]

    cs.LG cs.CV stat.ML

    T-Basis: a Compact Representation for Neural Networks

    Authors: Anton Obukhov, Maxim Rakhuba, Stamatios Georgoulis, Menelaos Kanakis, Dengxin Dai, Luc Van Gool

    Abstract: We introduce T-Basis, a novel concept for a compact representation of a set of tensors, each of an arbitrary shape, which is often seen in Neural Networks. Each of the tensors in the set is modeled using Tensor Rings, though the concept applies to other Tensor Networks. Owing its name to the T-shape of nodes in diagram notation of Tensor Rings, T-Basis is simply a list of equally shaped three-dime…

    Submitted 13 July, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML 2020

  11. arXiv:2006.01455  [pdf, other]

    math.NA

    Quantized tensor FEM for multiscale problems: diffusion problems in two and three dimensions

    Authors: V. Kazeev, I. Oseledets, M. Rakhuba, Ch. Schwab

    Abstract: Homogenization in terms of multiscale limits transforms a multiscale problem with $n+1$ asymptotically separated microscales posed on a physical domain $D \subset \mathbb{R}^d$ into a one-scale problem posed on a product domain of dimension $(n+1)d$ by introducing $n$ so-called "fast variables". This procedure allows to convert $n+1$ scales in $d$ physical dimensions into a single-scale structure…

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: 31 pages, 8 figures

    MSC Class: 15A69; 35B27; 65N15; 65N30

  12. arXiv:1912.07996  [pdf, other]

    math.NA

    Tensor Rank bounds for Point Singularities in $\mathbb{R}^3$

    Authors: Carlo Marcati, Maxim Rakhuba, Christoph Schwab

    Abstract: We analyze rates of approximation by quantized, tensor-structured representations of functions with isolated point singularities in ${\mathbb R}^3$. We consider functions in countably normed Sobolev spaces with radial weights and analytic- or Gevrey-type control of weighted semi-norms. Several classes of boundary value and eigenvalue problems from science and engineering are discussed whose soluti…

    Submitted 17 December, 2019; originally announced December 2019.

    MSC Class: 35A35 (Primary); 15A69; 35J15; 41A25; 41A46; 65N30 (Secondary)

  13. Low-rank Riemannian eigensolver for high-dimensional Hamiltonians

    Authors: Maxim Rakhuba, Alexander Novikov, Ivan Oseledets

    Abstract: Such problems as computation of spectra of spin chains and vibrational spectra of molecules can be written as high-dimensional eigenvalue problems, i.e., when the eigenvector can be naturally represented as a multidimensional tensor. Tensor methods have proven to be an efficient tool for the approximation of solutions of high-dimensional eigenvalue problems, however, their performance deteriorates…

    Submitted 27 November, 2018; originally announced November 2018.

    MSC Class: 65Z05; 15A69; 65F15

  14. arXiv:1709.07286  [pdf, other]

    math.NA math.OC

    Alternating least squares as moving subspace correction

    Authors: Ivan Oseledets, Maxim Rakhuba, André Uschmajew

    Abstract: In this note we take a new look at the local convergence of alternating optimization methods for low-rank matrices and tensors. Our abstract interpretation as sequential optimization on moving subspaces yields insightful reformulations of some known convergence conditions that focus on the interplay between the contractivity of classical multiplicative Schwarz methods with overlapping subspaces an…

    Submitted 11 January, 2019; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: 20 pages, 4 figures

    MSC Class: 15A69; 65K10; 53B21

  15. arXiv:1704.01669  [pdf, other]

    math.NA

    Vico-Greengard-Ferrando quadratures in the tensor solver for integral equations

    Authors: Valentin Khrulkov, Maxim Rakhuba, Ivan Oseledets

    Abstract: Convolution with Green's function of a differential operator appears in a lot of applications e.g. Lippmann-Schwinger integral equation. Algorithms for computing such are usually non-trivial and require non-uniform mesh. However, recently Vico, Greengard and Ferrando developed method for computing convolution with smooth functions with compact support with spectral accuracy, requiring nothing more…

    Submitted 5 April, 2017; originally announced April 2017.

  16. arXiv:1703.09096  [pdf, other]

    math.NA

    Jacobi-Davidson method on low-rank matrix manifolds

    Authors: Maxim Rakhuba, Ivan Oseledets

    Abstract: In this work we generalize the Jacobi-Davidson method to the case when eigenvector can be reshaped into a low-rank matrix. In this setting the proposed method inherits advantages of the original Jacobi-Davidson method, has lower complexity and requires less storage. We also introduce low-rank version of the Rayleigh quotient iteration which naturally arises in the Jacobi-Davidson method.

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: 18 pages, 7 figures

    MSC Class: 15A18; 15A69; 65F15; 53B21

  17. arXiv:1612.01166  [pdf, other]

    math.NA

    Robust discretization in quantized tensor train format for elliptic problems in two dimensions

    Authors: A. V. Chertkov, I. V. Oseledets, M. V. Rakhuba

    Abstract: In this work we propose an efficient black-box solver for two-dimensional stationary diffusion equations, which is based on a new robust discretization scheme. The idea is to formulate an equation in a certain form without derivatives with a non-local stencil, which leads us to a linear system of equations with dense matrix. This matrix and a right-hand side are represented in a low-rank parametri…

    Submitted 21 December, 2016; v1 submitted 4 December, 2016; originally announced December 2016.

  18. Calculating vibrational spectra of molecules using tensor train decomposition

    Authors: Maxim Rakhuba, Ivan Oseledets

    Abstract: We propose a new algorithm for calculation of vibrational spectra of molecules using tensor train decomposition. Under the assumption that eigenfunctions lie on a low-parametric manifold of low-rank tensors we suggest using well-known iterative methods that utilize matrix inversion (LOBPCG, inverse iteration) and solve corresponding linear systems inexactly along this manifold. As an application,…

    Submitted 6 September, 2016; v1 submitted 26 May, 2016; originally announced May 2016.

    Comments: 11 pages, 4 figures, 2 tables

    MSC Class: 65Z05; 15A69; 65F15

  19. Grid-based electronic structure calculations: the tensor decomposition approach

    Authors: Maxim Rakhuba, Ivan Oseledets

    Abstract: We present a fully grid-based approach for solving Hartree-Fock and all-electron Kohn-Sham equations based on low-rank approximation of three-dimensional electron orbitals. Due to the low-rank structure the total complexity of the algorithm depends linearly with respect to the one-dimensional grid size. Linear complexity allows for the usage of fine grids, e.g. $8192^3$ and, thus, cheap extrapolat…

    Submitted 30 August, 2015; originally announced August 2015.

    Comments: 15 pages, 3 figures

    MSC Class: 65Z05; 15A69; 15B05; 44A35; 65F99

  20. arXiv:1412.6553  [pdf, other]

    cs.CV cs.LG

    Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition

    Authors: Vadim Lebedev, Yaroslav Ganin, Maksim Rakhuba, Ivan Oseledets, Victor Lempitsky

    Abstract: We propose a simple two-step approach for speeding up convolution layers within large convolutional neural networks based on tensor decomposition and discriminative fine-tuning. Given a layer, we use non-linear least squares to compute a low-rank CP-decomposition of the 4D convolution kernel tensor into a sum of a small number of rank-one tensors. At the second step, this decomposition is used to…

    Submitted 24 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  21. arXiv:1402.5649  [pdf, other]

    math.NA

    Fast multidimensional convolution in low-rank formats via cross approximation

    Authors: M. V. Rakhuba, I. V. Oseledets

    Abstract: We propose a new cross-conv algorithm for approximate computation of convolution in different low-rank tensor formats (tensor train, Tucker, Hierarchical Tucker). It has better complexity with respect to the tensor rank than previous approaches. The new algorithm has a high potential impact in different applications. The key idea is based on applying cross approximation in the "frequency domain",…

    Submitted 6 September, 2016; v1 submitted 23 February, 2014; originally announced February 2014.

    Comments: 14 pages, 2 figures

    MSC Class: 15A69; 15B05; 44A35; 65F99