Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–9 of 9 results for author: Pokle, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14548  [pdf, other

    cs.LG cs.CV

    Consistency Models Made Easy

    Authors: Zhengyang Geng, Ashwini Pokle, William Luo, Justin Lin, J. Zico Kolter

    Abstract: Consistency models (CMs) are an emerging class of generative models that offer faster sampling than traditional diffusion models. CMs enforce that all points along a sampling trajectory are mapped to the same initial point. But this target leads to resource-intensive training: for example, as of 2024, training a SoTA CM on CIFAR-10 takes one week on 8 GPUs. In this work, we propose an alternative… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2401.08639  [pdf, other

    cs.CV cs.LG

    One-Step Diffusion Distillation via Deep Equilibrium Models

    Authors: Zhengyang Geng, Ashwini Pokle, J. Zico Kolter

    Abstract: Diffusion models excel at producing high-quality samples but naively require hundreds of iterations, prompting multiple attempts to distill the generation process into a faster network. However, many existing approaches suffer from a variety of challenges: the process for distillation training can be complex, often requiring multiple training stages, and the resulting models perform poorly when ut… ▽ More

    Submitted 12 December, 2023; originally announced January 2024.

    Comments: NeurIPS 2023

  3. arXiv:2312.00234  [pdf, other

    cs.LG math.NA stat.ML

    Deep Equilibrium Based Neural Operators for Steady-State PDEs

    Authors: Tanya Marwah, Ashwini Pokle, J. Zico Kolter, Zachary C. Lipton, Jianfeng Lu, Andrej Risteski

    Abstract: Data-driven machine learning approaches are being increasingly used to solve partial differential equations (PDEs). They have shown particularly striking successes when training an operator, which takes as input a PDE in some family, and outputs its solution. However, the architectural design space, especially given structural knowledge of the PDE family of interest, is still poorly understood. We… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  4. arXiv:2310.04432  [pdf, other

    cs.CV cs.AI cs.LG

    Training-free Linear Image Inverses via Flows

    Authors: Ashwini Pokle, Matthew J. Muckley, Ricky T. Q. Chen, Brian Karrer

    Abstract: Solving inverse problems without any training involves using a pretrained generative model and making appropriate modifications to the generation process to avoid finetuning of the generative model. While recent methods have explored the use of diffusion models, they still require the manual tuning of many hyperparameters for different inverse problems. In this work, we propose a training-free met… ▽ More

    Submitted 10 March, 2024; v1 submitted 25 September, 2023; originally announced October 2023.

    Comments: 40 pages, 30 figures. Added additional qualitative results in the appendix

  5. arXiv:2211.09961  [pdf, other

    cs.LG stat.ML

    Path Independent Equilibrium Models Can Better Exploit Test-Time Computation

    Authors: Cem Anil, Ashwini Pokle, Kaiqu Liang, Johannes Treutlein, Yuhuai Wu, Shaojie Bai, Zico Kolter, Roger Grosse

    Abstract: Designing networks capable of attaining better performance with an increased inference budget is important to facilitate generalization to harder problem instances. Recent efforts have shown promising results in this direction by making use of depth-wise recurrent networks. We show that a broad class of architectures named equilibrium models display strong upwards generalization, and find that str… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  6. arXiv:2210.12867  [pdf, other

    cs.LG cs.CV

    Deep Equilibrium Approaches to Diffusion Models

    Authors: Ashwini Pokle, Zhengyang Geng, Zico Kolter

    Abstract: Diffusion-based generative models are extremely effective in generating high-quality images, with generated samples often surpassing the quality of those produced by other models under several metrics. One distinguishing feature of these models, however, is that they typically require long sampling chains to produce high-fidelity images. This presents a challenge not only from the lenses of sampli… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  7. arXiv:2203.15702  [pdf, other

    cs.LG cs.CV stat.ML

    Contrasting the landscape of contrastive and non-contrastive learning

    Authors: Ashwini Pokle, Jinjin Tian, Yuchen Li, Andrej Risteski

    Abstract: A lot of recent advances in unsupervised feature learning are based on designing features which are invariant under semantic data augmentations. A common way to do this is contrastive learning, which uses positive and negative samples. Some recent works however have shown promising results for non-contrastive learning, which does not require negative samples. However, the non-contrastive losses ha… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in the AISTATS 2022 conference (http://aistats.org/aistats2022/accepted.html)

  8. Deep Local Trajectory Replanning and Control for Robot Navigation

    Authors: Ashwini Pokle, Roberto Martín-Martín, Patrick Goebel, Vincent Chow, Hans M. Ewald, Junwei Yang, Zhenkai Wang, Amir Sadeghian, Dorsa Sadigh, Silvio Savarese, Marynel Vázquez

    Abstract: We present a navigation system that combines ideas from hierarchical planning and machine learning. The system uses a traditional global planner to compute optimal paths towards a goal, and a deep local trajectory planner and velocity controller to compute motion commands. The latter components of the system adjust the behavior of the robot through attention mechanisms such that it moves towards t… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

    Report number: 18904288

    Journal ref: 2019 International Conference on Robotics and Automation (ICRA)

  9. arXiv:1810.00663  [pdf, other

    cs.CL cs.AI

    Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation

    Authors: Xiaoxue Zang, Ashwini Pokle, Marynel Vázquez, Kevin Chen, Juan Carlos Niebles, Alvaro Soto, Silvio Savarese

    Abstract: We propose an end-to-end deep learning model for translating free-form natural language instructions to a high-level plan for behavioral robot navigation. We use attention models to connect information from both the user instructions and a topological representation of the environment. We evaluate our model's performance on a new dataset containing 10,050 pairs of navigation instructions. Our mode… ▽ More

    Submitted 24 September, 2018; originally announced October 2018.