Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Feigelis, K

.
  1. arXiv:2312.06721  [pdf, other

    cs.CV

    Understanding Physical Dynamics with Counterfactual World Modeling

    Authors: Rahul Venkatesh, Honglin Chen, Kevin Feigelis, Daniel M. Bear, Khaled Jedoui, Klemen Kotar, Felix Binder, Wanhee Lee, Sherry Liu, Kevin A. Smith, Judith E. Fan, Daniel L. K. Yamins

    Abstract: The ability to understand physical dynamics is critical for agents to act in the world. Here, we use Counterfactual World Modeling (CWM) to extract vision structures for dynamics understanding. CWM uses a temporally-factored masking policy for masked prediction of video data without annotations. This policy enables highly effective "counterfactual prompting" of the predictor, allowing a spectrum o… ▽ More

    Submitted 22 July, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: ECCV 2024. Project page at: https://neuroailab.github.io/cwm-physics/

  2. arXiv:2306.01828  [pdf, other

    cs.CV cs.AI

    Unifying (Machine) Vision via Counterfactual World Modeling

    Authors: Daniel M. Bear, Kevin Feigelis, Honglin Chen, Wanhee Lee, Rahul Venkatesh, Klemen Kotar, Alex Durango, Daniel L. K. Yamins

    Abstract: Leading approaches in machine vision employ different architectures for different tasks, trained on costly task-specific labeled datasets. This complexity has held back progress in areas, such as robotics, where robust task-general perception remains a bottleneck. In contrast, "foundation models" of natural language have shown how large pre-trained neural networks can provide zero-shot solutions t… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    ACM Class: I.2.10; I.4.8

  3. arXiv:2007.04954  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

    Authors: Chuang Gan, Jeremy Schwartz, Seth Alter, Damian Mrowca, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Michael Lingelbach, Aidan Curtis, Kevin Feigelis, Daniel M. Bear, Dan Gutfreund, David Cox, Antonio Torralba, James J. DiCarlo, Joshua B. Tenenbaum, Josh H. McDermott, Daniel L. K. Yamins

    Abstract: We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Unique properties include: real-time near-photo-realistic image rendering; a library of objects and environments, and routines for their customization; generative procedu… ▽ More

    Submitted 28 December, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Oral Presentation at NeurIPS 21 Datasets and Benchmarks Track. Project page: http://www.threedworld.org

  4. arXiv:2004.10876  [pdf, other

    cs.AI cs.RO

    Flexible and Efficient Long-Range Planning Through Curious Exploration

    Authors: Aidan Curtis, Minjian Xin, Dilip Arumugam, Kevin Feigelis, Daniel Yamins

    Abstract: Identifying algorithms that flexibly and efficiently discover temporally-extended multi-phase plans is an essential step for the advancement of robotics and model-based reinforcement learning. The core problem of long-range planning is finding an efficient way to search through the tree of possible action sequences. Existing non-learned planning solutions from the Task and Motion Planning (TAMP) l… ▽ More

    Submitted 8 July, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

  5. arXiv:1711.07425  [pdf, other

    cs.LG cs.AI q-bio.NC stat.ML

    Modular Continual Learning in a Unified Visual Environment

    Authors: Kevin T. Feigelis, Blue Sheffer, Daniel L. K. Yamins

    Abstract: A core aspect of human intelligence is the ability to learn new tasks quickly and switch between them flexibly. Here, we describe a modular continual reinforcement learning paradigm inspired by these abilities. We first introduce a visual interaction environment that allows many types of tasks to be unified in a single framework. We then describe a reward map prediction scheme that learns new task… ▽ More

    Submitted 11 December, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

  6. arXiv:1706.07147  [pdf, other

    cs.LG cs.AI q-bio.NC stat.ML

    A Useful Motif for Flexible Task Learning in an Embodied Two-Dimensional Visual Environment

    Authors: Kevin T. Feigelis, Daniel L. K. Yamins

    Abstract: Animals (especially humans) have an amazing ability to learn new tasks quickly, and switch between them flexibly. How brains support this ability is largely unknown, both neuroscientifically and algorithmically. One reasonable supposition is that modules drawing on an underlying general-purpose sensory representation are dynamically allocated on a per-task basis. Recent results from neuroscience a… ▽ More

    Submitted 21 June, 2017; originally announced June 2017.