Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation
SA '24: SIGGRAPH Asia 2024 Conference PapersArticle No.: 136, Pages 1–11https://doi.org/10.1145/3680528.3687679We present a method for automatically producing human-like vocal imitations of sounds: the equivalent of “sketching,” but for auditory rather than visual representation. Starting with a simulated model of the human vocal tract, we first try generating ...
- research-articleJuly 2024
ContPhy: continuum physical concept learning and reasoning from videos
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 2545, Pages 61526–61558We introduce the Continuum Physical Dataset (ContPhy), a novel benchmark for assessing machine physical commonsense. ContPhy complements existing physical reasoning benchmarks by encompassing the inference of diverse physical properties, such as mass and ...
- research-articleJuly 2024
Compositional image decomposition with diffusion models
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 1905, Pages 46823–46842Given an image of a natural scene, we are able to quickly decompose it into a set of components such as objects, lighting, shadows, and foreground. We can then envision a scene where we combine certain components with those from other images, for ...
- research-articleJuly 2024
LLM and simulation as bilevel optimizers: a new paradigm to advance physical scientific discovery
- Pingchuan Ma,
- Tsun-Hsuan Wang,
- Minghao Guo,
- Zhiqing Sun,
- Joshua B. Tenenbaum,
- Daniela Rus,
- Chuang Gan,
- Wojciech Matusik
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 1381, Pages 33940–33962Large Language Models have recently gained significant attention in scientific discovery for their extensive knowledge and advanced reasoning capabilities. However, they encounter challenges in effectively simulating observational feedback and grounding ...
- research-articleJuly 2024
Pot potential based diffusion motion planning ential based diffusion motion planning
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 1360, Pages 33486–33510Effective motion planning in high dimensional spaces is a long-standing open problem in robotics. One class of traditional motion planning algorithms corresponds to potential-based motion planning. An advantage of potential based motion planning is ...
-
- research-articleJuly 2024
Learning iterative reasoning through energy diffusion
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 468, Pages 11764–11776We introduce iterative reasoning through energy diffusion (IRED), a novel framework for learning to reason for a variety of tasks by formulating reasoning and decision-making problems with energy-based optimization. IRED learns energy functions to ...
- research-articleJuly 2024
Improving factuality and reasoning in language models through multiagent debate
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 467, Pages 11733–11763Large language models (LLMs) have demonstrated remarkable capabilities in language generation, understanding, and few-shot learning in recent years. An extensive body of work has explored how their performance may be further improved through the tools of ...
- research-articleJuly 2024
When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data
- Edmond Awad,
- Sydney Levine,
- Andrea Loreggia,
- Nicholas Mattei,
- Iyad Rahwan,
- Francesca Rossi,
- Kartik Talamadupula,
- Joshua Tenenbaum,
- Max Kleiman-Weiner
Autonomous Agents and Multi-Agent Systems (KLU-AGNT), Volume 38, Issue 2https://doi.org/10.1007/s10458-024-09667-4AbstractConstraining the actions of AI systems is one promising way to ensure that these systems behave in a way that is morally acceptable to humans. But constraints alone come with drawbacks as in many AI systems, they are not flexible. If these ...
- research-articleMay 2024
Pragmatic Instruction Following and Goal Assistance via Cooperative Language-Guided Inverse Planning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2094–2103People often give instructions whose meaning is ambiguous without further context, expecting that their actions or goals will disambiguate their intentions. How can we build assistive agents that follow such instructions in a flexible, context-sensitive ...
- research-articleFebruary 2024
Generalized planning in PDDL domains with pretrained large language models
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 2258, Pages 20256–20264https://doi.org/10.1609/aaai.v38i18.30006Recent work has considered whether large language models (LLMs) can function as planners: given a task, generate a plan. We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently ...
- research-articleFebruary 2024
Neural amortized inference for nested multi-agent reasoning
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 60, Pages 530–537https://doi.org/10.1609/aaai.v38i1.27808Multi-agent interactions, such as communication, teaching, and bluffing, often rely on higher-order social inference, i.e., understanding how others infer oneself. Such intricate reasoning can be effectively modeled through nested multi-agent reasoning. ...
- research-articleDecember 2023
Physion++: evaluating physical scene understanding that requires online inference of different physical properties
- Hsiao-Yu Tung,
- Mingyu Ding,
- Zhenfang Chen,
- Daniel M. Bear,
- Chuang Gan,
- Joshua B. Tenenbaum,
- Daniel L. K. Yamins,
- Judith Fan,
- Kevin A. Smith
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2929, Pages 67048–67068General physical scene understanding requires more than simply localizing and recognizing objects -it requires knowledge that objects can have different latent properties (e.g., mass or elasticity), and that those properties affect the outcome of ...
- research-articleDecember 2023
What planning problems can a relational neural network solve?
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2601, Pages 59522–59542Goal-conditioned policies are generally understood to be "feed-forward" circuits, in the form of neural networks that map from the current state and the goal specification to the next action to take. However, under what circumstances such a policy can be ...
- research-articleDecember 2023
Human spatiotemporal pattern learning as probabilistic program synthesis
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2367, Pages 54354–54365People are adept at learning a wide variety of structured patterns from small amounts of data, presenting a conundrum from the standpoint of the bias-variance tradeoff: what kinds of representations and algorithms support the joint flexibility and data-...
- research-articleDecember 2023
DiffuseBot: breeding soft robots with physics-augmented generative diffusion models
- Tsun-Hsuan Wang,
- Juntian Zheng,
- Pingchuan Ma,
- Yilun Du,
- Byungchul Kim,
- Andrew Spielberg,
- Joshua B. Tenenbaum,
- Chuang Gan,
- Daniela Rus
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1922, Pages 44398–44423Nature evolves creatures with a high complexity of morphological and behavioral intelligence, meanwhile computational methods lag in approaching that diversity and efficacy. Co-optimization of artificial creatures' morphology and control in silico shows ...
- research-articleDecember 2023
What's Left? concept grounding with logic-enhanced foundation models
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1684, Pages 38798–38814Recent works such as VisProg and ViperGPT have smartly composed foundation models for visual reasoning—using large language models (LLMs) to produce programs that can be executed by pre-trained vision-language models. However, they operate in limited ...
- research-articleDecember 2023
Compositional foundation models for hierarchical planning
- Anurag Ajay,
- Seungwook Han,
- Yilun Du,
- Shuang Li,
- Abhi Gupta,
- Tommi Jaakkola,
- Josh Tenenbaum,
- Leslie Kaelbling,
- Akash Srivastava,
- Pulkit Agrawal
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 979, Pages 22304–22325To make effective decisions in novel environments with long-horizon goals, it is crucial to engage in hierarchical reasoning across spatial and temporal scales. This entails planning abstract subgoal sequences, visually reasoning about the underlying ...
- research-articleDecember 2023
Inferring the future by imagining the past
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 926, Pages 21196–21216A single panel of a comic book can say a lot: it can depict not only where the characters currently are, but also their motions, their motivations, their emotions, and what they might do next. More generally, humans routinely infer complex sequences of ...
- research-articleDecember 2023
Diffusion with forward models: solving stochastic inverse problems without direct supervision
- Ayush Tewari,
- Tianwei Yin,
- George Cazenavette,
- Semon Rezchikov,
- Joshua B. Tenenbaum,
- Frédo Durand,
- William T. Freeman,
- Vincent Sitzmann
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 542, Pages 12349–12362Denoising diffusion models have emerged as a powerful class of generative models capable of capturing the distributions of complex, real-world signals. However, current approaches can only model distributions for which training samples are directly ...
- research-articleDecember 2023
Learning universal policies via text-guided video generation
- Yilun Du,
- Mengjiao Yang,
- Bo Dai,
- Hanjun Dai,
- Ofir Nachum,
- Joshua B. Tenenbaum,
- Dale Schuurmans,
- Pieter Abbeel
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 403, Pages 9156–9172A goal of artificial intelligence is to construct an agent that can solve a wide variety of tasks. Recent progress in text-guided image synthesis has yielded models with an impressive ability to generate complex novel images, exhibiting combinatorial ...