Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 74 results for author: Goodman, N D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.08202  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    What Makes a Maze Look Like a Maze?

    Authors: Joy Hsu, Jiayuan Mao, Joshua B. Tenenbaum, Noah D. Goodman, Jiajun Wu

    Abstract: A unique aspect of human visual understanding is the ability to flexibly interpret abstract concepts: acquiring lifted rules explaining what they symbolize, grounding them across familiar and unfamiliar contexts, and making predictions or reasoning about them. While off-the-shelf vision-language models excel at making literal interpretations of images (e.g., recognizing object categories such as t… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  2. arXiv:2408.03617  [pdf, other

    cs.CL cs.AI cs.LG

    Is Child-Directed Speech Effective Training Data for Language Models?

    Authors: Steven Y. Feng, Noah D. Goodman, Michael C. Frank

    Abstract: While high-performing language models are typically trained on hundreds of billions of words, human children become fluent language users with a much smaller amount of data. What are the features of the data they receive, and how do these features support language modeling objectives? To investigate this question, we train GPT-2 models on 29M words of English-language child-directed speech and a n… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Preprint. Code and data will be released soon

  3. arXiv:2407.15645  [pdf, other

    cs.CL cs.AI

    Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models

    Authors: Joy He-Yueya, Wanjing Anya Ma, Kanishk Gandhi, Benjamin W. Domingue, Emma Brunskill, Noah D. Goodman

    Abstract: Language models (LMs) are increasingly used to simulate human-like responses in scenarios where accurately mimicking a population's behavior can guide decision-making, such as in developing educational materials and designing public policies. The objective of these simulations is for LMs to capture the variations in human responses, rather than merely providing the expected correct answers. Prior… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Code and data: https://github.com/joyheyueya/psychometric-alignment

  4. arXiv:2407.04622  [pdf, other

    cs.LG

    On scalable oversight with weak LLMs judging strong LLMs

    Authors: Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah

    Abstract: Scalable oversight protocols aim to enable humans to accurately supervise superhuman AI. In this paper we study debate, where two AI's compete to convince a judge; consultancy, where a single AI tries to convince a judge that asks questions; and compare to a baseline of direct question-answering, where the judge just answers outright without the AI. We use large language models (LLMs) as both AI a… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 15 pages (53 including appendices). V2: minor correction to Figure 3; add Figure A.9 comparing open vs assigned consultancy; add a reference

  5. arXiv:2407.00900  [pdf, other

    cs.AI cs.CL

    MathCAMPS: Fine-grained Synthesis of Mathematical Problems From Human Curricula

    Authors: Shubhra Mishra, Gabriel Poesia, Belinda Mo, Noah D. Goodman

    Abstract: Mathematical problem solving is an important skill for Large Language Models (LLMs), both as an important capability and a proxy for a range of reasoning abilities. Existing benchmarks probe a diverse set of skills, but they yield aggregate accuracy metrics, obscuring specific abilities or weaknesses. Furthermore, they are difficult to extend with new problems, risking data contamination over time… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Dataset and code: https://github.com/gpoesia/mathcamps/

  6. arXiv:2407.00695  [pdf, other

    cs.AI cs.LO

    Learning Formal Mathematics From Intrinsic Motivation

    Authors: Gabriel Poesia, David Broman, Nick Haber, Noah D. Goodman

    Abstract: How did humanity coax mathematics from the aether? We explore the Platonic view that mathematics can be discovered from its axioms - a game of conjecture and proof. We describe Minimo (Mathematics from Intrinsic Motivation): an agent that jointly learns to pose challenging problems for itself (conjecturing) and solve them (theorem proving). Given a mathematical domain axiomatized in dependent type… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  7. arXiv:2404.14313  [pdf, other

    cs.CL

    Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels

    Authors: Jan-Philipp Fränken, Eric Zelikman, Rafael Rafailov, Kanishk Gandhi, Tobias Gerstenberg, Noah D. Goodman

    Abstract: When prompting a language model (LM), users often expect the model to adhere to a set of behavioral principles across diverse tasks, such as producing insightful content while avoiding harmful or biased language. Instilling such principles (i.e., a constitution) into a model is resource-intensive, technically challenging, and generally requires human preference labels or examples. We introduce SAM… ▽ More

    Submitted 21 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.10975  [pdf, other

    cs.CL

    Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models

    Authors: Jan-Philipp Fränken, Kanishk Gandhi, Tori Qiu, Ayesha Khawaja, Noah D. Goodman, Tobias Gerstenberg

    Abstract: As AI systems like language models are increasingly integrated into decision-making processes affecting people's lives, it's critical to ensure that these systems have sound moral reasoning. To test whether they do, we need to develop systematic evaluations. We provide a framework that uses a language model to translate causal graphs that capture key aspects of moral dilemmas into prompt templates… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: CogSci 2024

  9. arXiv:2404.03683  [pdf, other

    cs.LG cs.AI cs.CL

    Stream of Search (SoS): Learning to Search in Language

    Authors: Kanishk Gandhi, Denise Lee, Gabriel Grand, Muxin Liu, Winson Cheng, Archit Sharma, Noah D. Goodman

    Abstract: Language models are rarely shown fruitful mistakes while training. They then struggle to look beyond the next token, suffering from a snowballing of errors and struggling to predict the consequence of their actions several steps ahead. In this paper, we show how language models can be taught to search by representing the process of search in language, as a flattened string -- a stream of search (S… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2403.19154  [pdf, other

    cs.CL cs.AI

    STaR-GATE: Teaching Language Models to Ask Clarifying Questions

    Authors: Chinmaya Andukuri, Jan-Philipp Fränken, Tobias Gerstenberg, Noah D. Goodman

    Abstract: When prompting language models to complete a task, users often leave important aspects unsaid. While asking questions could resolve this ambiguity (GATE; Li et al., 2023), models often struggle to ask good questions. We explore a language model's ability to self-improve (STaR; Zelikman et al., 2022) by rewarding the model for generating useful questions-a simple method we dub STaR-GATE. We generat… ▽ More

    Submitted 7 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  11. arXiv:2403.09629  [pdf, other

    cs.CL cs.AI cs.LG

    Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

    Authors: Eric Zelikman, Georges Harik, Yijia Shao, Varuna Jayasiri, Nick Haber, Noah D. Goodman

    Abstract: When writing and talking, people sometimes pause to think. Although reasoning-focused works have often framed reasoning as a method of answering questions or completing agentic tasks, reasoning is implicit in almost all written text. For example, this applies to the steps not stated between the lines of a proof or to the theory of mind underlying a conversation. In the Self-Taught Reasoner (STaR,… ▽ More

    Submitted 18 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  12. arXiv:2403.07809  [pdf, other

    cs.LG cs.CL

    pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

    Authors: Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

    Abstract: Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability. To facilitate such research, we introduce $\textbf{pyvene}$, an open-source Python library that supports customizable interventions on a range of different PyTorch modules. $\textbf{pyvene}$ supports complex intervention schemes with an intuiti… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  13. arXiv:2403.02795  [pdf, other

    cs.AI cs.CL

    Evaluating and Optimizing Educational Content with Large Language Model Judgments

    Authors: Joy He-Yueya, Noah D. Goodman, Emma Brunskill

    Abstract: Creating effective educational materials generally requires expensive and time-consuming studies of student learning outcomes. To overcome this barrier, one idea is to build computational models of student learning and use them to optimize instructional materials. However, it is difficult to model the cognitive processes of learning dynamics. We propose an alternative approach that uses Language M… ▽ More

    Submitted 6 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 11 pages

  14. arXiv:2402.17879  [pdf, other

    cs.LG cs.CL

    Automated Statistical Model Discovery with Language Models

    Authors: Michael Y. Li, Emily B. Fox, Noah D. Goodman

    Abstract: Statistical model discovery is a challenging search over a vast space of models subject to domain-specific constraints. Efficiently searching over this space requires expertise in modeling and the problem domain. Motivated by the domain knowledge and programming capabilities of large language models (LMs), we introduce a method for language model driven automated statistical model discovery. We ca… ▽ More

    Submitted 22 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  15. arXiv:2401.12631  [pdf, other

    cs.LG cs.AI cs.CL

    A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments

    Authors: Zhengxuan Wu, Atticus Geiger, Jing Huang, Aryaman Arora, Thomas Icard, Christopher Potts, Noah D. Goodman

    Abstract: We respond to the recent paper by Makelov et al. (2023), which reviews subspace interchange intervention methods like distributed alignment search (DAS; Geiger et al. 2023) and claims that these methods potentially cause "interpretability illusions". We first review Makelov et al. (2023)'s technical notion of what an "interpretability illusion" is, and then we show that even intuitive and desirabl… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 20 pages, 14 figures

  16. arXiv:2310.17769  [pdf, other

    cs.CL cs.AI

    Social Contract AI: Aligning AI Assistants with Implicit Group Norms

    Authors: Jan-Philipp Fränken, Sam Kwok, Peixuan Ye, Kanishk Gandhi, Dilip Arumugam, Jared Moore, Alex Tamkin, Tobias Gerstenberg, Noah D. Goodman

    Abstract: We explore the idea of aligning an AI assistant by inverting a model of users' (unknown) preferences from observed interactions. To validate our proposal, we run proof-of-concept simulations in the economic ultimatum game, formalizing user preferences as policies that guide the actions of simulated players. We find that the AI assistant accurately aligns its behavior to match standard policies fro… ▽ More

    Submitted 3 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: SoLaR NeurIPS 2023 Workshop (https://solar-neurips.github.io/)

  17. arXiv:2310.17230  [pdf, other

    cs.LG cs.CL

    Codebook Features: Sparse and Discrete Interpretability for Neural Networks

    Authors: Alex Tamkin, Mohammad Taufeeque, Noah D. Goodman

    Abstract: Understanding neural networks is challenging in part because of the dense, continuous nature of their hidden states. We explore whether we can train neural networks to have hidden states that are sparse, discrete, and more interpretable by quantizing their continuous features into what we call codebook features. Codebook features are produced by finetuning neural networks with vector quantization… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  18. arXiv:2310.03635  [pdf, other

    cs.AI cs.CL cs.CV cs.LG stat.ML

    CLEVRER-Humans: Describing Physical and Causal Events the Human Way

    Authors: Jiayuan Mao, Xuelin Yang, Xikun Zhang, Noah D. Goodman, Jiajun Wu

    Abstract: Building machines that can reason about physical events and their causal relationships is crucial for flexible interaction with the physical world. However, most existing physical and causal reasoning benchmarks are exclusively based on synthetically generated events and synthetic natural language descriptions of causal relationships. This design brings up two issues. First, there is a lack of div… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2022 (Dataset and Benchmark Track). First two authors contributed equally. Project page: https://sites.google.com/stanford.edu/clevrer-humans/home

  19. arXiv:2309.05660  [pdf, other

    cs.LG cs.AI cs.CL

    Hypothesis Search: Inductive Reasoning with Language Models

    Authors: Ruocheng Wang, Eric Zelikman, Gabriel Poesia, Yewen Pu, Nick Haber, Noah D. Goodman

    Abstract: Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles from a few examples, which robustly generalize to novel scenarios. Recent work evaluates large language models (LLMs) on inductive reasoning tasks by directly prompting them yielding "in context learning." This works well for straightforward inductive tasks but performs poorly on complex tasks such as… ▽ More

    Submitted 30 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: ICLR 2024. The first two authors contributed equally. Code: https://github.com/Relento/hypothesis_search

  20. arXiv:2306.15448  [pdf, other

    cs.CL cs.AI cs.HC

    Understanding Social Reasoning in Language Models with Language Models

    Authors: Kanishk Gandhi, Jan-Philipp Fränken, Tobias Gerstenberg, Noah D. Goodman

    Abstract: As Large Language Models (LLMs) become increasingly integrated into our everyday lives, understanding their ability to comprehend human mental states becomes critical for ensuring effective interactions. However, despite the recent attempts to assess the Theory-of-Mind (ToM) reasoning capabilities of LLMs, the degree to which these models can align with human ToM remains a nuanced topic of explora… ▽ More

    Submitted 4 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  21. arXiv:2306.12672  [pdf, other

    cs.CL cs.AI cs.SC

    From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

    Authors: Lionel Wong, Gabriel Grand, Alexander K. Lew, Noah D. Goodman, Vikash K. Mansinghka, Jacob Andreas, Joshua B. Tenenbaum

    Abstract: How does language inform our downstream thinking? In particular, how do humans make meaning from language--and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose rational meaning construction, a computational framework for language-informed thinking that combines neural language models with probabilistic models for rat… ▽ More

    Submitted 23 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  22. arXiv:2306.10015  [pdf, other

    cs.LG cs.CL cs.DC

    Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness

    Authors: Eric Zelikman, Qian Huang, Percy Liang, Nick Haber, Noah D. Goodman

    Abstract: Language model training in distributed settings is limited by the communication cost of gradient exchanges. In this short note, we extend recent work from Malladi et al. (2023), using shared randomness to perform distributed fine-tuning with low bandwidth. The method is a natural decentralized extension of memory-efficient Simultaneous Perturbation Stochastic Approximation (SPSA). Each iteration,… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  23. arXiv:2306.04031  [pdf, other

    cs.AI

    Certified Deductive Reasoning with Language Models

    Authors: Gabriel Poesia, Kanishk Gandhi, Eric Zelikman, Noah D. Goodman

    Abstract: Language models often achieve higher accuracy when reasoning step-by-step in complex tasks. However, even when arriving at a correct final answer, their rationales are often logically unsound or inconsistent. This is a major issue when reliable reasoning traces are needed, such when fine-tuning on model-generated reasoning for self-improvement. To tackle these issues, we introduce a class of tools… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  24. arXiv:2305.19165  [pdf, other

    cs.AI cs.CL cs.GT cs.HC

    Strategic Reasoning with Language Models

    Authors: Kanishk Gandhi, Dorsa Sadigh, Noah D. Goodman

    Abstract: Strategic reasoning enables agents to cooperate, communicate, and compete with other agents in diverse situations. Existing approaches to solving strategic games rely on extensive training, yielding strategies that do not generalize to new scenarios or games without retraining. Large Language Models (LLMs), with their ability to comprehend and generate complex, context-rich language, could prove p… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  25. arXiv:2305.11374  [pdf, other

    cs.CL

    Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems

    Authors: Dhara Yu, Noah D. Goodman, Jesse Mu

    Abstract: Humans teach others about the world through language and demonstration. When might one of these modalities be more effective than the other? In this work, we study the factors that modulate the effectiveness of language vs. demonstration using multi-agent systems to model human communication. Specifically, we train neural network agents to teach via language or demonstration in a grounded communic… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 7 pages, 6 figures, to appear in Proceedings of the 45th Annual Conference of the Cognitive Science Society

  26. arXiv:2305.08809  [pdf, other

    cs.CL

    Interpretability at Scale: Identifying Causal Mechanisms in Alpaca

    Authors: Zhengxuan Wu, Atticus Geiger, Thomas Icard, Christopher Potts, Noah D. Goodman

    Abstract: Obtaining human-interpretable explanations of large, general-purpose language models is an urgent goal for AI safety. However, it is just as important that our interpretability methods are faithful to the causal dynamics underlying model behavior and able to robustly generalize to unseen inputs. Distributed Alignment Search (DAS) is a powerful gradient descent method grounded in a theory of causal… ▽ More

    Submitted 6 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 with Author Corrections

  27. arXiv:2305.07151  [pdf, other

    cs.CL

    Overinformative Question Answering by Humans and Machines

    Authors: Polina Tsvilodub, Michael Franke, Robert D. Hawkins, Noah D. Goodman

    Abstract: When faced with a polar question, speakers often provide overinformative answers going beyond a simple "yes" or "no". But what principles guide the selection of additional information? In this paper, we provide experimental evidence from two studies suggesting that overinformativeness in human answering is driven by considerations of relevance to the questioner's goals which they flexibly adjust g… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures, to appear in the Proceedings of the 45th Annual Conference of the Cognitive Science Society (2023)

  28. arXiv:2305.03263  [pdf, other

    cs.LG cs.AI

    Bayesian Reinforcement Learning with Limited Cognitive Load

    Authors: Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

    Abstract: All biological and artificial agents must learn and make decisions given limits on their ability to process information. As such, a general theory of adaptive behavior should be able to account for the complex interactions between an agent's learning history, decisions, and capacity constraints. Recent work in computer science has begun to clarify the principles that shape these dynamics by bridgi… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  29. arXiv:2304.09102  [pdf, other

    cs.CL cs.AI

    Solving Math Word Problems by Combining Language Models With Symbolic Solvers

    Authors: Joy He-Yueya, Gabriel Poesia, Rose E. Wang, Noah D. Goodman

    Abstract: Automatically generating high-quality step-by-step solutions to math word problems has many applications in education. Recently, combining large language models (LLMs) with external tools to perform complex reasoning and calculation has emerged as a promising direction for solving math word problems, but prior approaches such as Program-Aided Language model (PAL) are biased towards simple procedur… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  30. arXiv:2304.03843  [pdf, other

    cs.AI cs.CL cs.LG

    Why think step by step? Reasoning emerges from the locality of experience

    Authors: Ben Prystawski, Michael Y. Li, Noah D. Goodman

    Abstract: Humans have a powerful and mysterious capacity to reason. Working through a set of mental steps enables us to make inferences we would not be capable of making directly even though we get no additional data from the world. Similarly, when large language models generate intermediate steps (a chain of thought) before answering a question, they often produce better answers than they would directly. W… ▽ More

    Submitted 2 November, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: 22 pages, 6 figures

  31. arXiv:2303.02536  [pdf, other

    cs.AI

    Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

    Authors: Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, Noah D. Goodman

    Abstract: Causal abstraction is a promising theoretical framework for explainable artificial intelligence that defines when an interpretable high-level causal model is a faithful simplification of a low-level deep learning system. However, existing causal abstraction methods have two major limitations: they require a brute-force search over alignments between the high-level model and the low-level one, and… ▽ More

    Submitted 21 February, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

  32. arXiv:2212.10561  [pdf, other

    cs.CL cs.AI cs.LG

    Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions

    Authors: Eric Zelikman, Qian Huang, Gabriel Poesia, Noah D. Goodman, Nick Haber

    Abstract: Despite recent success in large language model (LLM) reasoning, LLMs struggle with hierarchical multi-step reasoning tasks like generating complex programs. For these tasks, humans often start with a high-level algorithmic design and implement each part gradually. We introduce Parsel, a framework enabling automatic implementation and validation of complex algorithms with code LLMs. With Parsel, we… ▽ More

    Submitted 28 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: humaneval results, clarity

  33. arXiv:2212.00869  [pdf, other

    cs.MA cs.CY

    Flexible social inference facilitates targeted social learning when rewards are not observable

    Authors: Robert D. Hawkins, Andrew M. Berdahl, Alex "Sandy" Pentland, Joshua B. Tenenbaum, Noah D. Goodman, P. M. Krafft

    Abstract: Groups coordinate more effectively when individuals are able to learn from others' successes. But acquiring such knowledge is not always easy, especially in real-world environments where success is hidden from public view. We suggest that social inference capacities may help bridge this gap, allowing individuals to update their beliefs about others' underlying knowledge and success from observable… ▽ More

    Submitted 5 August, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Nature Human Behaviour

  34. arXiv:2211.16663  [pdf, other

    cs.CV

    Geoclidean: Few-Shot Generalization in Euclidean Geometry

    Authors: Joy Hsu, Jiajun Wu, Noah D. Goodman

    Abstract: Euclidean geometry is among the earliest forms of mathematical thinking. While the geometric primitives underlying its constructions, such as perfect lines and circles, do not often occur in the natural world, humans rarely struggle to perceive and reason with them. Will computer vision models trained on natural images show the same sensitivity to Euclidean geometry? Here we explore these question… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: To appear at NeurIPS 2022

  35. Peano: Learning Formal Mathematical Reasoning

    Authors: Gabriel Poesia, Noah D. Goodman

    Abstract: General mathematical reasoning is computationally undecidable, but humans routinely solve new problems. Moreover, discoveries developed over centuries are taught to subsequent generations quickly. What structure enables this, and how might that inform automated mathematical reasoning? We posit that central to both puzzles is the structure of procedural abstractions underlying mathematics. We explo… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  36. arXiv:2210.16877  [pdf, ps, other

    cs.LG cs.AI

    On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning

    Authors: Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

    Abstract: Throughout the cognitive-science literature, there is widespread agreement that decision-making agents operating in the real world do so under limited information-processing capabilities and without access to unbounded cognitive or computational resources. Prior work has drawn inspiration from this fact and leveraged an information-theoretic model of such behaviors or policies as communication cha… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted to the NeurIPS Workshop on Information-Theoretic Principles in Cognitive Systems (InfoCog) 2022. arXiv admin note: text overlap with arXiv:2206.02072

  37. arXiv:2209.08141  [pdf, other

    cs.CL cs.AI cs.LG

    Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models

    Authors: Ben Prystawski, Paul Thibodeau, Christopher Potts, Noah D. Goodman

    Abstract: Probabilistic models of language understanding are valuable tools for investigating human language use. However, they need to be hand-designed for a particular domain. In contrast, large language models (LLMs) are trained on text that spans a wide array of domains, but they lack the structure and interpretability of probabilistic models. In this paper, we use chain-of-thought prompts to introduce… ▽ More

    Submitted 19 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 7 pages, 1 figure

  38. arXiv:2205.09172  [pdf, other

    cs.CL

    Color Overmodification Emerges from Data-Driven Learning and Pragmatic Reasoning

    Authors: Fei Fang, Kunal Sinha, Noah D. Goodman, Christopher Potts, Elisa Kreiss

    Abstract: Speakers' referential expressions often depart from communicative ideals in ways that help illuminate the nature of pragmatic language use. Patterns of overmodification, in which a speaker uses a modifier that is redundant given their communicative goal, have proven especially informative in this regard. It seems likely that these patterns are shaped by the environment a speaker is exposed to in c… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Proceedings of the Annual Meeting of the Cognitive Science Society (2022)

  39. arXiv:2203.14465  [pdf, other

    cs.LG cs.AI cs.CL

    STaR: Bootstrapping Reasoning With Reasoning

    Authors: Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman

    Abstract: Generating step-by-step "chain-of-thought" rationales improves language model performance on complex reasoning tasks like mathematics or commonsense question-answering. However, inducing language model rationale generation currently requires either constructing massive rationale datasets or sacrificing accuracy by using only few-shot inference. We propose a technique to iteratively leverage a smal… ▽ More

    Submitted 20 May, 2022; v1 submitted 27 March, 2022; originally announced March 2022.

  40. arXiv:2112.02505  [pdf, other

    cs.CL cs.LG

    Causal Distillation for Language Models

    Authors: Zhengxuan Wu, Atticus Geiger, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman

    Abstract: Distillation efforts have led to language models that are more compact and efficient without serious drops in performance. The standard approach to distillation trains a student model against two objectives: a task-specific objective (e.g., language modeling) and an imitation objective that encourages the hidden states of the student model to be similar to those of the larger teacher model. In thi… ▽ More

    Submitted 3 June, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 7 pages, 2 figures

    Journal ref: NAACL 2022

  41. arXiv:2112.00826  [pdf, other

    cs.LG

    Inducing Causal Structure for Interpretable Neural Networks

    Authors: Atticus Geiger, Zhengxuan Wu, Hanson Lu, Josh Rozner, Elisa Kreiss, Thomas Icard, Noah D. Goodman, Christopher Potts

    Abstract: In many areas, we have well-founded insights about causal structure that would be useful to bring into our trained models while still allowing them to learn in a data-driven fashion. To achieve this, we present the new method of interchange intervention training (IIT). In IIT, we (1) align variables in a causal model (e.g., a deterministic program or Bayesian network) with representations in a neu… ▽ More

    Submitted 20 July, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  42. arXiv:2110.05422  [pdf, other

    cs.CL cs.AI cs.LG cs.MA

    Calibrate your listeners! Robust communication-based training for pragmatic speakers

    Authors: Rose E. Wang, Julia White, Jesse Mu, Noah D. Goodman

    Abstract: To be good conversational partners, natural language processing (NLP) systems should be trained to produce contextually useful utterances. Prior work has investigated training NLP systems with communication-based objectives, where a neural listener stands in as a communication partner. However, these systems commonly suffer from semantic drift where the learned language diverges radically from nat… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Findings of EMNLP 2021 Code: https://github.com/rosewang2008/calibrate_your_listeners

  43. arXiv:2109.13861  [pdf, other

    cs.CV

    Visual resemblance and communicative context constrain the emergence of graphical conventions

    Authors: Robert D. Hawkins, Megumi Sano, Noah D. Goodman, Judith E. Fan

    Abstract: From photorealistic sketches to schematic diagrams, drawing provides a versatile medium for communicating about the visual world. How do images spanning such a broad range of appearances reliably convey meaning? Do viewers understand drawings based solely on their ability to resemble the entities they refer to (i.e., as images), or do they understand drawings based on shared but arbitrary associat… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 26 pages; 8 figures; submitted version of manuscript

  44. arXiv:2107.13377  [pdf, other

    cs.CL cs.AI

    Learning to solve complex tasks by growing knowledge culturally across generations

    Authors: Michael Henry Tessler, Jason Madeano, Pedro A. Tsividis, Brin Harper, Noah D. Goodman, Joshua B. Tenenbaum

    Abstract: Knowledge built culturally across generations allows humans to learn far more than an individual could glean from their own experience in a lifetime. Cultural knowledge in turn rests on language: language is the richest record of what previous generations believed, valued, and practiced, and how these evolved over time. The power and mechanisms of language as a means of cultural learning, however,… ▽ More

    Submitted 16 December, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: Presented at the NeurIPS 2021 Cooperative AI Workshop (Dec 2021) and the 43rd Annual Meeting of the Cognitive Science Society (July 2021)

  45. arXiv:2104.08376  [pdf, other

    cs.CL

    Concadia: Towards Image-Based Text Generation with a Purpose

    Authors: Elisa Kreiss, Fei Fang, Noah D. Goodman, Christopher Potts

    Abstract: Current deep learning models often achieve excellent results on benchmark image-to-text datasets but fail to generate texts that are useful in practice. We argue that to close this gap, it is vital to distinguish descriptions from captions based on their distinct communicative roles. Descriptions focus on visual features and are meant to replace an image (often to increase accessibility), whereas… ▽ More

    Submitted 27 October, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Proceedings of EMNLP 2022

  46. arXiv:2104.05857  [pdf, other

    cs.CL cs.AI

    From partners to populations: A hierarchical Bayesian account of coordination and convention

    Authors: Robert D. Hawkins, Michael Franke, Michael C. Frank, Adele E. Goldberg, Kenny Smith, Thomas L. Griffiths, Noah D. Goodman

    Abstract: Languages are powerful solutions to coordination problems: they provide stable, shared expectations about how the words we say correspond to the beliefs and intentions in our heads. Yet language use in a variable and non-stationary social environment requires linguistic representations to be flexible: old words acquire new ad hoc or partner-specific meanings on the fly. In this paper, we introduce… ▽ More

    Submitted 2 December, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: In press at Psychological Review

  47. arXiv:2006.00418  [pdf, other

    cs.CL

    Learning to refer informatively by amortizing pragmatic reasoning

    Authors: Julia White, Jesse Mu, Noah D. Goodman

    Abstract: A hallmark of human language is the ability to effectively and efficiently convey contextually relevant information. One theory for how humans reason about language is presented in the Rational Speech Acts (RSA) framework, which captures pragmatic phenomena via a process of recursive social reasoning (Goodman & Frank, 2016). However, RSA represents ideal reasoning in an unconstrained setting. We e… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: Accepted to CogSci 2020

  48. arXiv:2002.01510  [pdf, other

    cs.CL cs.SI

    Generalizing meanings from partners to populations: Hierarchical inference supports convention formation on networks

    Authors: Robert D. Hawkins, Noah D. Goodman, Adele E. Goldberg, Thomas L. Griffiths

    Abstract: A key property of linguistic conventions is that they hold over an entire community of speakers, allowing us to communicate efficiently even with people we have never met before. At the same time, much of our language use is partner-specific: we know that words may be understood differently by different people based on our shared history. This poses a challenge for accounts of convention formation… ▽ More

    Submitted 30 May, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: CogSci 2020

  49. arXiv:1912.07199  [pdf, other

    cs.CL

    Characterizing the dynamics of learning in repeated reference games

    Authors: Robert D. Hawkins, Michael C. Frank, Noah D. Goodman

    Abstract: The language we use over the course of conversation changes as we establish common ground and learn what our partner finds meaningful. Here we draw upon recent advances in natural language processing to provide a finer-grained characterization of the dynamics of this learning process. We release an open corpus (>15,000 utterances) of extended dyadic interactions in a classic repeated reference gam… ▽ More

    Submitted 13 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted at Cognitive Science

  50. arXiv:1911.09896  [pdf, other

    cs.CL

    Continual adaptation for efficient machine communication

    Authors: Robert D. Hawkins, Minae Kwon, Dorsa Sadigh, Noah D. Goodman

    Abstract: To communicate with new partners in new contexts, humans rapidly form new linguistic conventions. Recent neural language models are able to comprehend and produce the existing conventions present in their training data, but are not able to flexibly and interactively adapt those conventions on the fly as humans do. We introduce an interactive repeated reference task as a benchmark for models of ada… ▽ More

    Submitted 13 October, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted at CoNLL