
Showing 1–6 of 6 results for author: Baradad, M

Searching in archive cs.
  1. arXiv:2401.01862  [pdf, other]

    cs.CV cs.CL cs.LG

    A Vision Check-up for Language Models

    Authors: Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba

    Abstract: What does learning to model relationships between strings teach large language models (LLMs) about the visual world? We systematically evaluate LLMs' abilities to generate and recognize an assortment of visual concepts of increasing complexity and then demonstrate how a preliminary visual representation learning system can be trained using models of text. As language models lack the ability to con…

    Submitted 3 January, 2024; originally announced January 2024.

  2. arXiv:2306.05428  [pdf, other]

    cs.CV

    Background Prompting for Improved Object Depth

    Authors: Manel Baradad, Yuanzhen Li, Forrester Cole, Michael Rubinstein, Antonio Torralba, William T. Freeman, Varun Jampani

    Abstract: Estimating the depth of objects from a single image is a valuable task for many vision, robotics, and graphics applications. However, current methods often fail to produce accurate depth for objects in diverse scenes. In this work, we propose a simple yet effective Background Prompting strategy that adapts the input object image with a learned background. We learn the background prompts only using…

    Submitted 8 June, 2023; originally announced June 2023.

  3. arXiv:2303.14537  [pdf, other]

    cs.LG cs.CL cs.CV

    Deep Augmentation: Self-Supervised Learning with Transformations in Activation Space

    Authors: Rickard Brüel-Gabrielsson, Tongzhou Wang, Manel Baradad, Justin Solomon

    Abstract: We introduce Deep Augmentation, an approach to implicit data augmentation using dropout or PCA to transform a targeted layer within a neural network to improve performance and generalization. We demonstrate Deep Augmentation through extensive experiments on contrastive learning tasks in NLP, computer vision, and graph learning. We observe substantial performance gains with Transformers, ResNets, a…

    Submitted 26 February, 2024; v1 submitted 25 March, 2023; originally announced March 2023.

  4. arXiv:2211.16412  [pdf, other]

    cs.CV cs.LG

    Procedural Image Programs for Representation Learning

    Authors: Manel Baradad, Chun-Fu Chen, Jonas Wulff, Tongzhou Wang, Rogerio Feris, Antonio Torralba, Phillip Isola

    Abstract: Learning image representations using synthetic data allows training neural networks without some of the concerns associated with real images, such as privacy and bias. Existing work focuses on a handful of curated generative processes which require expert knowledge to design, making it hard to scale up. To overcome this, we propose training with a large dataset of twenty-one thousand programs, eac…

    Submitted 6 November, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 29 pages. Accepted at the Conference on Neural Information Processing Systems 2022 (NeurIPS 2022)

    Journal ref: NeurIPS 2022

  5. arXiv:2106.05963  [pdf, other]

    cs.CV cs.AI

    Learning to See by Looking at Noise

    Authors: Manel Baradad, Jonas Wulff, Tongzhou Wang, Phillip Isola, Antonio Torralba

    Abstract: Current vision systems are trained on huge datasets, and these datasets come with costs: curation is expensive, they inherit human biases, and there are concerns over privacy and usage rights. To counter these costs, interest has surged in learning from cheaper data sources, such as unlabeled images. In this paper we go a step further and ask if we can do away with real image datasets entirely, in…

    Submitted 28 April, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

  6. arXiv:1712.00617  [pdf, other]

    cs.CV

    Recurrent Neural Networks for Semantic Instance Segmentation

    Authors: Amaia Salvador, Miriam Bellver, Victor Campos, Manel Baradad, Ferran Marques, Jordi Torres, Xavier Giro-i-Nieto

    Abstract: We present a recurrent model for semantic instance segmentation that sequentially generates binary masks and their associated class probabilities for every object in an image. Our proposed system is trainable end-to-end from an input image to a sequence of labeled masks and, compared to methods relying on object proposals, does not require post-processing steps on its output. We study the suitabil…

    Submitted 12 April, 2019; v1 submitted 2 December, 2017; originally announced December 2017.