Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–38 of 38 results for author: Kossaifi, J

.
  1. arXiv:2403.12553  [pdf, other

    cs.LG

    Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs

    Authors: Md Ashiqur Rahman, Robert Joseph George, Mogab Elleithy, Daniel Leibovici, Zongyi Li, Boris Bonev, Colin White, Julius Berner, Raymond A. Yeh, Jean Kossaifi, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Existing neural operator architectures face challenges when solving multiphysics problems with coupled partial differential equations (PDEs), due to complex geometries, interactions between physical variables, and the lack of large amounts of high-resolution training data. To address these issues, we propose Codomain Attention Neural Operator (CoDA-NO), which tokenizes functions along the codomain… ▽ More

    Submitted 5 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  2. arXiv:2401.11037  [pdf, other

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the… ▽ More

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  3. arXiv:2310.00120  [pdf, other

    cs.LG

    Multi-Grid Tensorized Fourier Neural Operator for High-Resolution PDEs

    Authors: Jean Kossaifi, Nikola Kovachki, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Memory complexity and data scarcity have so far prohibited learning solution operators of partial differential equations (PDEs) at high resolutions. We address these limitations by introducing a new data efficient and highly parallelizable operator learning approach with reduced memory requirement and better generalization, called multi-grid tensorized neural operator (MG-TFNO). MG-TFNO scales to… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  4. arXiv:2309.15325  [pdf, other

    cs.LG physics.comp-ph

    Neural Operators for Accelerating Scientific Simulations and Design

    Authors: Kamyar Azizzadenesheli, Nikola Kovachki, Zongyi Li, Miguel Liu-Schiaffini, Jean Kossaifi, Anima Anandkumar

    Abstract: Scientific discovery and engineering design are currently limited by the time and cost of physical experiments, selected mostly through trial-and-error and intuition that require deep domain expertise. Numerical simulations present an alternative to physical experiments but are usually infeasible for complex real-world domains due to the computational requirements of existing numerical methods. Ar… ▽ More

    Submitted 4 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  5. arXiv:2309.00583  [pdf, other

    cs.LG math.NA

    Geometry-Informed Neural Operator for Large-Scale 3D PDEs

    Authors: Zongyi Li, Nikola Borislavov Kovachki, Chris Choy, Boyi Li, Jean Kossaifi, Shourya Prakash Otta, Mohammad Amin Nabian, Maximilian Stadler, Christian Hundt, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose the geometry-informed neural operator (GINO), a highly efficient approach to learning the solution operator of large-scale partial differential equations with varying geometries. GINO uses a signed distance function and point-cloud representations of the input shape and neural operators based on graph and Fourier architectures to learn the solution operator. The graph neural operator ha… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  6. arXiv:2307.15034  [pdf, other

    cs.LG math.NA

    Guaranteed Approximation Bounds for Mixed-Precision Neural Operators

    Authors: Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Nikola Kovachki, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators, such as Fourier Neural Operators (FNO), form a principled approach for learning solution operators for PDEs and other mappings between function spaces. However, many real-world problems require high-resolution training data, and the training time and limited GPU memory pose big barriers. One solution is to train neural operators in mixed precision to reduce the memory requirement… ▽ More

    Submitted 5 May, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  7. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  8. arXiv:2211.16749  [pdf, other

    cs.LG cs.AI cs.AR

    HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression

    Authors: Jiaqi Gu, Ben Keller, Jean Kossaifi, Anima Anandkumar, Brucek Khailany, David Z. Pan

    Abstract: Transformers have attained superior performance in natural language processing and computer vision. Their self-attention and feedforward layers are overparameterized, limiting inference speed and energy efficiency. Tensor decomposition is a promising technique to reduce parameter redundancy by leveraging tensor algebraic properties to express the parameters in a factorized form. Prior efforts used… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: 9 pages. Accepted to NeurIPS ML for System Workshop 2022 (Spotlight)

  9. arXiv:2211.15188  [pdf, other

    cs.LG

    Incremental Spatial and Spectral Learning of Neural Operators for Solving Large-Scale PDEs

    Authors: Robert Joseph George, Jiawei Zhao, Jean Kossaifi, Zongyi Li, Anima Anandkumar

    Abstract: Fourier Neural Operators (FNO) offer a principled approach to solving challenging partial differential equations (PDE) such as turbulent flows. At the core of FNO is a spectral layer that leverages a discretization-convergent representation in the Fourier domain, and learns weights over a fixed set of frequencies. However, training FNO presents two significant challenges, particularly in large-sca… ▽ More

    Submitted 4 March, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  10. arXiv:2209.13993  [pdf, other

    quant-ph

    Towards a scalable discrete quantum generative adversarial neural network

    Authors: Smit Chaudhary, Patrick Huembeli, Ian MacCormack, Taylor L. Patti, Jean Kossaifi, Alexey Galda

    Abstract: We introduce a fully quantum generative adversarial network intended for use with binary data. The architecture incorporates several features found in other classical and quantum machine learning models, which up to this point had not been used in conjunction. In particular, we incorporate noise reuploading in the generator, auxiliary qubits in the discriminator to enhance expressivity, and a dire… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 11 pages, 11 figures, GitLab repository

  11. Quantum Goemans-Williamson Algorithm with the Hadamard Test and Approximate Amplitude Constraints

    Authors: Taylor L. Patti, Jean Kossaifi, Anima Anandkumar, Susanne F. Yelin

    Abstract: Semidefinite programs are optimization methods with a wide array of applications, such as approximating difficult combinatorial problems. One such semidefinite program is the Goemans-Williamson algorithm, a popular integer relaxation technique. We introduce a variational quantum algorithm for the Goemans-Williamson algorithm that uses only $n{+}1$ qubits, a constant number of circuit preparations,… ▽ More

    Submitted 8 July, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: 21 pages, 6 figures. Updated files to the version of manuscript accepted by Quantum

    Journal ref: Quantum 7, 1057 (2023)

  12. arXiv:2112.10239  [pdf, other

    quant-ph

    TensorLy-Quantum: Quantum Machine Learning with Tensor Methods

    Authors: Taylor L. Patti, Jean Kossaifi, Susanne F. Yelin, Anima Anandkumar

    Abstract: Simulation is essential for developing quantum hardware and algorithms. However, simulating quantum circuits on classical hardware is challenging due to the exponential scaling of quantum state space. While factorized tensors can greatly reduce this overhead, tensor network-based simulators are relatively few and often lack crucial functionalities. To address this deficiency, we created TensorLy-Q… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: 6 pages, 2 figures

  13. arXiv:2110.14538  [pdf, other

    cs.LG cs.MA

    Reinforcement Learning in Factored Action Spaces using Tensor Decompositions

    Authors: Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

    Abstract: We present an extended abstract for the previously published work TESSERACT [Mahajan et al., 2021], which proposes a novel solution for Reinforcement Learning (RL) in large, factored action spaces using tensor decompositions. The goal of this abstract is twofold: (1) To garner greater interest amongst the tensor research community for creating methods and analysis for approximate RL, (2) To elucid… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Journal ref: 2nd Workshop on Quantum Tensor Networks in Machine Learning (NeurIPS 2021)

  14. arXiv:2110.13859  [pdf, other

    cs.LG cs.AI cs.CV

    Defensive Tensorization

    Authors: Adrian Bulat, Jean Kossaifi, Sourav Bhattacharya, Yannis Panagakis, Timothy Hospedales, Georgios Tzimiropoulos, Nicholas D Lane, Maja Pantic

    Abstract: We propose defensive tensorization, an adversarial defence technique that leverages a latent high-order factorization of the network. The layers of a network are first expressed as factorized tensor layers. Tensor dropout is then applied in the latent subspace, therefore resulting in dense reconstructed weights, without the sparsity or perturbations typically induced by the randomization.Our appro… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: To be presented at BMVC 2021

  15. arXiv:2110.13771  [pdf, other

    cs.CV cs.LG

    AugMax: Adversarial Composition of Random Augmentations for Robust Training

    Authors: Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Anima Anandkumar, Zhangyang Wang

    Abstract: Data augmentation is a simple yet effective way to improve the robustness of deep neural networks (DNNs). Diversity and hardness are two complementary dimensions of data augmentation to achieve robustness. For example, AugMix explores random compositions of a diverse set of augmentations to enhance broader coverage, while adversarial training generates adversarially hard samples to spot the weakne… ▽ More

    Submitted 1 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: NeurIPS, 2021

  16. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  17. arXiv:2106.13304  [pdf, ps, other

    quant-ph

    Variational Quantum Optimization with Multi-Basis Encodings

    Authors: Taylor L. Patti, Jean Kossaifi, Anima Anandkumar, Susanne F. Yelin

    Abstract: Despite extensive research efforts, few quantum algorithms for classical optimization demonstrate realizable quantum advantage. The utility of many quantum algorithms is limited by high requisite circuit depth and nonconvex optimization landscapes. We tackle these challenges by introducing a new variational quantum algorithm that benefits from two innovations: multi-basis graph encodings and nonli… ▽ More

    Submitted 26 January, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 10 pages, 4 figures. Corrected circuit structure, added citations, clarified key points. Updated title, method nomenclature, manuscript order, and format

  18. arXiv:2106.00136  [pdf, other

    cs.LG

    Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

    Authors: Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

    Abstract: Reinforcement Learning in large action spaces is a challenging problem. Cooperative multi-agent reinforcement learning (MARL) exacerbates matters by imposing various constraints on communication and observability. In this work, we consider the fundamental hurdle affecting both value-based and policy-gradient approaches: an exponential blowup of the action space with the number of agents. For value… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: 38th International Conference on Machine Learning, PMLR 139, 2021

  19. arXiv:2104.07916  [pdf, other

    cs.CV

    Augmenting Deep Classifiers with Polynomial Neural Networks

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar

    Abstract: Deep neural networks have been the driving force behind the success in classification tasks, e.g., object and audio recognition. Impressive results and generalization have been achieved by a variety of recently proposed architectures, the majority of which are seemingly disconnected. In this work, we cast the study of deep classifiers under a unifying framework. In particular, we express state-of-… ▽ More

    Submitted 11 August, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at ECCV'22

  20. arXiv:2007.09250  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Controllable Generation with Self-Training

    Authors: Grigorios G Chrysos, Jean Kossaifi, Zhiding Yu, Anima Anandkumar

    Abstract: Recent generative adversarial networks (GANs) are able to generate impressive photo-realistic images. However, controllable generation with GANs remains a challenging research problem. Achieving controllable generation requires semantically interpretable and disentangled factors of variation. It is challenging to achieve this goal using simple fixed distributions such as Gaussian distribution. Ins… ▽ More

    Submitted 2 May, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted in IJCNN 2021

  21. arXiv:2004.07984  [pdf, other

    cs.LG stat.ML

    Spectral Learning on Matrices and Tensors

    Authors: Majid Janzamin, Rong Ge, Jean Kossaifi, Anima Anandkumar

    Abstract: Spectral methods have been the mainstay in several domains such as machine learning and scientific computing. They involve finding a certain kind of spectral decomposition to obtain basis functions that can capture important structures for the problem at hand. The most common spectral method is the principal component analysis (PCA). It utilizes the top eigenvectors of the data covariance matrix,… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Journal ref: Foundations and Trends in Machine Learning: Vol. 12: No. 5-6, pp 393-536 (2019)

  22. arXiv:2002.11098  [pdf, other

    cs.CV

    Toward fast and accurate human pose estimation via soft-gated skip connections

    Authors: Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

    Abstract: This paper is on highly accurate and highly efficient human pose estimation. Recent works based on Fully Convolutional Networks (FCNs) have demonstrated excellent results for this difficult problem. While residual connections within FCNs have proved to be quintessential for achieving high accuracy, we re-analyze this design choice in the context of improving both the accuracy and the efficiency ov… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted to FG 2020 (oral)

  23. arXiv:2002.09131  [pdf, other

    cs.LG cs.CV stat.ML

    Convolutional Tensor-Train LSTM for Spatio-temporal Learning

    Authors: Jiahao Su, Wonmin Byeon, Jean Kossaifi, Furong Huang, Jan Kautz, Animashree Anandkumar

    Abstract: Learning from spatio-temporal data has numerous applications such as human-behavior analysis, object tracking, video compression, and physics simulation.However, existing methods still perform poorly on challenging video tasks such as long-term forecasting. This is because these kinds of challenging tasks require learning long-term spatio-temporal correlations in the video sequence. In this paper,… ▽ More

    Submitted 4 October, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: Jiahao Su and Wonmin Byeon contributed equally to this work. 22 pages, 14 figures, NeurIPS 2020

  24. arXiv:1912.05833  [pdf, other

    cs.LG eess.AS stat.ML

    Speech-driven facial animation using polynomial fusion of features

    Authors: Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic

    Abstract: Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces. Recent deep learning approaches to facial synthesis rely on extracting low-dimensional representations and concatenating them, followed by a decoding step of the concatenated vector. This accounts for only first-order interactions of the features and ignores higher-order interactions. In th… ▽ More

    Submitted 19 February, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  25. arXiv:1906.06196  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Factorized Higher-Order CNNs with an Application to Spatio-Temporal Emotion Estimation

    Authors: Jean Kossaifi, Antoine Toisoul, Adrian Bulat, Yannis Panagakis, Timothy Hospedales, Maja Pantic

    Abstract: Training deep neural networks with spatio-temporal (i.e., 3D) or multidimensional convolutions of higher-order is computationally challenging due to millions of unknown parameters across dozens of layers. To alleviate this, one approach is to apply low-rank tensor decompositions to convolution kernels in order to compress the network and reduce its number of parameters. Alternatively, new convolut… ▽ More

    Submitted 31 March, 2020; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: IEEE CVPR 2020

  26. arXiv:1904.07852  [pdf, other

    cs.CV cs.AI cs.LG

    Matrix and tensor decompositions for training binary neural networks

    Authors: Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

    Abstract: This paper is on improving the training of binary neural networks in which both activations and weights are binary. While prior methods for neural network binarization binarize each filter independently, we propose to instead parametrize the weight tensor of each layer using matrix or tensor decomposition. The binarization process is then performed using this latent parametrization, via a quantiza… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  27. arXiv:1904.06345  [pdf, other

    cs.CV cs.AI cs.LG

    Incremental multi-domain learning with network latent tensor factorization

    Authors: Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

    Abstract: The prominence of deep learning, large amount of annotated data and increasingly powerful hardware made it possible to reach remarkable performance for supervised classification tasks, in many cases saturating the training sets. However the resulting models are specialized to a single very specific task and domain. Adapting the learned classification to new domains is a hard problem due to at leas… ▽ More

    Submitted 22 November, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: AAAI20

  28. arXiv:1904.05868  [pdf, other

    cs.CV

    Improved training of binary networks for human pose estimation and image recognition

    Authors: Adrian Bulat, Georgios Tzimiropoulos, Jean Kossaifi, Maja Pantic

    Abstract: Big neural networks trained on large datasets have advanced the state-of-the-art for a large variety of challenging problems, improving performance by a large margin. However, under low memory and limited computational power constraints, the accuracy on the same problems drops considerable. In this paper, we propose a series of techniques that significantly improve the accuracy of binarized neural… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

  29. arXiv:1904.02698  [pdf, other

    cs.CV cs.AI cs.LG

    T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor

    Authors: Jean Kossaifi, Adrian Bulat, Georgios Tzimiropoulos, Maja Pantic

    Abstract: Recent findings indicate that over-parametrization, while crucial for successfully training deep neural networks, also introduces large amounts of redundancy. Tensor methods have the potential to efficiently parametrize over-complete representations by leveraging this redundancy. In this paper, we propose to fully parametrize Convolutional Neural Networks (CNNs) with a single high-order, low-rank… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: CVPR 2019

  30. arXiv:1902.10758  [pdf, other

    cs.LG stat.ML

    Tensor Dropout for Robust Learning

    Authors: Arinbjörn Kolbeinsson, Jean Kossaifi, Yannis Panagakis, Adrian Bulat, Anima Anandkumar, Ioanna Tzoulaki, Paul Matthews

    Abstract: CNNs achieve remarkable performance by leveraging deep, over-parametrized architectures, trained on large datasets. However, they have limited generalization ability to data outside the training domain, and a lack of robustness to noise and adversarial attacks. By building better inductive biases, we can improve robustness and also obtain smaller networks that are more memory and computationally e… ▽ More

    Submitted 11 December, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

  31. SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

    Authors: Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, Jing Han, Vedhas Pandit, Antoine Toisoul, Bjorn Schuller, Kam Star, Elnar Hajiyev, Maja Pantic

    Abstract: Natural human-computer interaction and audio-visual human behaviour sensing systems, which would achieve robust performance in-the-wild are more needed than ever as digital devices are increasingly becoming an indispensable part of our life. Accurately annotated real-world data are the crux in devising such systems. However, existing databases usually consider controlled settings, low demographic… ▽ More

    Submitted 18 November, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019

  32. arXiv:1805.08657  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Robust Conditional Generative Adversarial Networks

    Authors: Grigorios G. Chrysos, Jean Kossaifi, Stefanos Zafeiriou

    Abstract: Conditional generative adversarial networks (cGAN) have led to large improvements in the task of conditional image generation, which lies at the heart of computer vision. The major focus so far has been on performance improvement, while there has been little effort in making cGAN more robust to noise. The regression (of the generator) might lead to arbitrarily large errors in the output, which mak… ▽ More

    Submitted 13 March, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: To appear in ICLR 2019

  33. arXiv:1803.01442  [pdf, other

    cs.LG stat.ML

    Stochastic Activation Pruning for Robust Adversarial Defense

    Authors: Guneet S. Dhillon, Kamyar Azizzadenesheli, Zachary C. Lipton, Jeremy Bernstein, Jean Kossaifi, Aran Khanna, Anima Anandkumar

    Abstract: Neural networks are known to be vulnerable to adversarial examples. Carefully chosen perturbations to real images, while imperceptible to humans, induce misclassification and threaten the reliability of deep learning systems in the wild. To guard against adversarial examples, we take inspiration from game theory and cast the problem as a minimax zero-sum game between the adversary and the model. I… ▽ More

    Submitted 4 March, 2018; originally announced March 2018.

    Comments: ICLR 2018

  34. arXiv:1712.00684  [pdf, other

    cs.CV

    GAGAN: Geometry-Aware Generative Adversarial Networks

    Authors: Jean Kossaifi, Linh Tran, Yannis Panagakis, Maja Pantic

    Abstract: Deep generative models learned through adversarial training have become increasingly popular for their ability to generate naturalistic image textures. However, aside from their texture, the visual appearance of objects is significantly influenced by their shape geometry; information which is not taken into account by existing generative models. This paper introduces the Geometry-Aware Generative… ▽ More

    Submitted 27 March, 2018; v1 submitted 2 December, 2017; originally announced December 2017.

  35. arXiv:1707.08308  [pdf, other

    cs.LG

    Tensor Regression Networks

    Authors: Jean Kossaifi, Zachary C. Lipton, Arinbjorn Kolbeinsson, Aran Khanna, Tommaso Furlanello, Anima Anandkumar

    Abstract: Convolutional neural networks typically consist of many convolutional layers followed by one or more fully connected layers. While convolutional layers map between high-order activation tensors, the fully connected layers operate on flattened activation vectors. Despite empirical success, this approach has notable drawbacks. Flattening followed by fully connected layers discards multilinear struct… ▽ More

    Submitted 20 July, 2020; v1 submitted 26 July, 2017; originally announced July 2017.

  36. arXiv:1706.00439  [pdf, other

    cs.LG

    Tensor Contraction Layers for Parsimonious Deep Nets

    Authors: Jean Kossaifi, Aran Khanna, Zachary C. Lipton, Tommaso Furlanello, Anima Anandkumar

    Abstract: Tensors offer a natural representation for many kinds of data frequently encountered in machine learning. Images, for example, are naturally represented as third order tensors, where the modes correspond to height, width, and channels. Tensor methods are noted for their ability to discover multi-dimensional dependencies, and tensor decompositions in particular, have been used to produce compact lo… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

  37. arXiv:1610.09555  [pdf, other

    cs.LG

    TensorLy: Tensor Learning in Python

    Authors: Jean Kossaifi, Yannis Panagakis, Anima Anandkumar, Maja Pantic

    Abstract: Tensors are higher-order extensions of matrices. While matrix methods form the cornerstone of machine learning and data analysis, tensor methods have been gaining increasing traction. However, software support for tensor operations is not on the same footing. In order to bridge this gap, we have developed \emph{TensorLy}, a high-level API for tensor methods and deep tensorized neural networks in P… ▽ More

    Submitted 9 May, 2018; v1 submitted 29 October, 2016; originally announced October 2016.

  38. arXiv:1412.3919  [pdf, other

    cs.LG cs.CV stat.ML

    Machine Learning for Neuroimaging with Scikit-Learn

    Authors: Alexandre Abraham, Fabian Pedregosa, Michael Eickenberg, Philippe Gervais, Andreas Muller, Jean Kossaifi, Alexandre Gramfort, Bertrand Thirion, Gäel Varoquaux

    Abstract: Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g. multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learnin… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

    Comments: Frontiers in neuroscience, Frontiers Research Foundation, 2013, pp.15