Showing 1–45 of 45 results for author: Bau, D

Searching in archive cs.
  1. arXiv:2405.01536  [pdf, other]

    cs.CV cs.GR cs.LG

    Customizing Text-to-Image Models with a Single Image Pair

    Authors: Maxwell Jones, Sheng-Yu Wang, Nupur Kumari, David Bau, Jun-Yan Zhu

    Abstract: Art reinterpretation is the practice of creating a variation of a reference work, making a paired artwork that exhibits a distinct artistic style. We ask if such an image pair can be used to customize a generative model to capture the demonstrated stylistic difference. We propose Pair Customization, a new customization method that learns stylistic difference from a single image pair and then appli…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: project page: https://paircustomization.github.io/

  2. arXiv:2404.03646  [pdf, other]

    cs.CL

    Locating and Editing Factual Associations in Mamba

    Authors: Arnab Sen Sharma, David Atkinson, David Bau

    Abstract: We investigate the mechanisms of factual recall in the Mamba state space model. Our work is inspired by previous findings in autoregressive transformer language models suggesting that their knowledge recall is localized to particular modules at specific token locations; we therefore ask whether factual recall in Mamba can be similarly localized. To investigate this, we conduct four lines of experi…

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2403.19647  [pdf, other]

    cs.LG cs.AI cs.CL

    Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

    Authors: Samuel Marks, Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller

    Abstract: We introduce methods for discovering and applying sparse feature circuits. These are causally implicated subnetworks of human-interpretable features for explaining language model behaviors. Circuits identified in prior work consist of polysemantic and difficult-to-interpret units like attention heads or neurons, rendering them unsuitable for many downstream applications. In contrast, sparse featur…

    Submitted 31 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Code and data at https://github.com/saprmarks/feature-circuits. Demonstration at https://feature-circuits.xyz

  4. arXiv:2403.02327  [pdf, other]

    cs.DB cs.AI

    Model Lakes

    Authors: Koyena Pal, David Bau, Renée J. Miller

    Abstract: Given a set of deep learning models, it can be hard to find models appropriate to a task, understand the models, and characterize how models are different one from another. Currently, practitioners rely on manually-written documentation to understand and choose models. However, not all models have complete and reliable documentation. As the number of machine learning models increases, this issue o…

    Submitted 4 March, 2024; originally announced March 2024.

  5. arXiv:2402.14811  [pdf, other]

    cs.CL cs.LG

    Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

    Authors: Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau

    Abstract: Fine-tuning on generalized tasks such as instruction following, code generation, and mathematics has been shown to enhance language models' performance on a range of tasks. Nevertheless, explanations of how such fine-tuning influences the internal computations in these models remain elusive. We study how fine-tuning affects the internal mechanisms implemented in language models. As a case study, w…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. 26 pages, 13 figures. Code and data at https://finetuning.baulab.info/

  6. arXiv:2402.10962  [pdf, other]

    cs.CL cs.AI cs.LG

    Measuring and Controlling Instruction (In)Stability in Language Model Dialogs

    Authors: Kenneth Li, Tianle Liu, Naomi Bashkansky, David Bau, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: System-prompting is a standard tool for customizing language-model chatbots, enabling them to follow a specific instruction. An implicit assumption in the use of system prompts is that they will be stable, so the chatbot will continue to generate text according to the stipulated instructions for the duration of a conversation. We propose a quantitative benchmark to test this assumption, evaluating…

    Submitted 1 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Code: https://github.com/likenneth/persona_drift

  7. arXiv:2401.14446  [pdf, other]

    cs.CY cs.AI cs.CR

    Black-Box Access is Insufficient for Rigorous AI Audits

    Authors: Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell

    Abstract: External audits of AI systems are increasingly recognized as a key mechanism for AI governance. The effectiveness of an audit, however, depends on the degree of access granted to auditors. Recent audits of state-of-the-art AI systems have primarily relied on black-box access, in which auditors can only query the system and observe its outputs. However, white-box access to the system's inner workin…

    Submitted 29 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: FAccT 2024

    Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24), June 3-6, 2024, Rio de Janeiro, Brazil

  8. arXiv:2311.12092  [pdf, other]

    cs.CV

    Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

    Authors: Rohit Gandikota, Joanna Materzynska, Tingrui Zhou, Antonio Torralba, David Bau

    Abstract: We present a method to create interpretable concept sliders that enable precise control over attributes in image generations from diffusion models. Our approach identifies a low-rank parameter direction corresponding to one concept while minimizing interference with other attributes. A slider is created using a small set of prompts or sample images; thus slider directions can be created for either…

    Submitted 27 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  9. arXiv:2311.11350  [pdf, ps, other]

    cs.CY

    An Alternative to Regulation: The Case for Public AI

    Authors: Nicholas Vincent, David Bau, Sarah Schwettmann, Joshua Tan

    Abstract: Can governments build AI? In this paper, we describe an ongoing effort to develop "public AI" -- publicly accessible AI models funded, provisioned, and governed by governments or other public bodies. Public AI presents both an alternative and a complement to standard regulatory approaches to AI, but it also suggests new technical and policy challenges. We present a roadmap for how the ML researc…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: To be presented at Regulatable ML @ NeurIPS2023 workshop

  10. arXiv:2311.10538  [pdf, other]

    cs.AI

    Testing Language Model Agents Safely in the Wild

    Authors: Silen Naihin, David Atkinson, Marc Green, Merwane Hamadi, Craig Swift, Douglas Schonholtz, Adam Tauman Kalai, David Bau

    Abstract: A prerequisite for safe autonomy-in-the-wild is safe testing-in-the-wild. Yet real-world autonomous tests face several unique safety challenges, both due to the possibility of causing harm during a test, as well as the risk of encountering new unsafe agent behavior through interactions with real-world and potentially malicious actors. We propose a framework for conducting safe autonomous agent tes…

    Submitted 3 December, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

  11. Future Lens: Anticipating Subsequent Tokens from a Single Hidden State

    Authors: Koyena Pal, Jiuding Sun, Andrew Yuan, Byron C. Wallace, David Bau

    Abstract: We conjecture that hidden state vectors corresponding to individual input tokens encode information sufficient to accurately predict several tokens ahead. More concretely, in this paper we ask: Given a hidden (internal) representation of a single token at position $t$ in an input, can we reliably anticipate the tokens that will appear at positions $\geq t + 2$? To test this, we measure linear appr…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted at CoNLL 2023

  12. arXiv:2310.15213  [pdf, other]

    cs.CL cs.LG

    Function Vectors in Large Language Models

    Authors: Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau

    Abstract: We report the presence of a simple neural mechanism that represents an input-output function as a vector within autoregressive transformer language models (LMs). Using causal mediation analysis on a diverse range of in-context-learning (ICL) tasks, we find that a small number of attention heads transport a compact representation of the demonstrated task, which we call a function vector (FV). FVs are…

    Submitted 25 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. 52 pages, 30 figures, 23 tables. Code and data at https://functions.baulab.info

  13. arXiv:2309.03886  [pdf, other]

    cs.CL cs.AI cs.LG

    FIND: A Function Description Benchmark for Evaluating Interpretability Methods

    Authors: Sarah Schwettmann, Tamar Rott Shaham, Joanna Materzynska, Neil Chowdhury, Shuang Li, Jacob Andreas, David Bau, Antonio Torralba

    Abstract: Labeling neural network submodules with human-legible descriptions is useful for many downstream tasks: such descriptions can surface failures, guide interventions, and perhaps even explain important model behaviors. To date, most mechanistic descriptions of trained networks have involved small models, narrowly delimited phenomena, and large amounts of human labor. Labeling all human-interpretable…

    Submitted 8 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: 28 pages, 10 figures

    Journal ref: NeurIPS 2023

  14. arXiv:2308.14761  [pdf, other]

    cs.CV cs.LG

    Unified Concept Editing in Diffusion Models

    Authors: Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau

    Abstract: Text-to-image models suffer from various safety issues that may limit their suitability for deployment. Previous methods have separately addressed individual issues of bias, copyright, and offensive content in text-to-image models. However, in the real world, all of these issues appear simultaneously in the same model. We present a method that tackles all issues with a single approach. Our method,…

    Submitted 25 August, 2023; originally announced August 2023.

  15. arXiv:2308.09124  [pdf, other]

    cs.CL

    Linearity of Relation Decoding in Transformer Language Models

    Authors: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau

    Abstract: Much of the knowledge encoded in transformer language models (LMs) may be expressed in terms of relations: relations between words and their synonyms, entities and their attributes, etc. We show that, for a subset of relations, this computation is well-approximated by a single linear transformation on the subject representation. Linear relation representations may be obtained by constructing a fir…

    Submitted 15 February, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  16. arXiv:2308.01544  [pdf, other]

    cs.CV cs.CL

    Multimodal Neurons in Pretrained Text-Only Transformers

    Authors: Sarah Schwettmann, Neil Chowdhury, Samuel Klein, David Bau, Antonio Torralba

    Abstract: Language models demonstrate remarkable capacity to generalize representations learned in one modality to downstream tasks in other modalities. Can we trace this ability to individual neurons? We study the case where a frozen text transformer is augmented with vision using a self-supervised visual encoder and a single linear projection learned on an image-to-text task. Outputs of the projection lay…

    Submitted 1 October, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Oral presentation at ICCV CLVL 2023

  17. arXiv:2307.03637  [pdf, other]

    cs.AI

    Discovering Variable Binding Circuitry with Desiderata

    Authors: Xander Davies, Max Nadeau, Nikhil Prakash, Tamar Rott Shaham, David Bau

    Abstract: Recent work has shown that computation in language models may be human-understandable, with successful efforts to localize and intervene on both single-unit features and input-output circuits. Here, we introduce an approach which extends causal mediation experiments to automatically identify model components responsible for performing a specific subtask by solely specifying a set of \textit{deside…

    Submitted 7 July, 2023; originally announced July 2023.

  18. arXiv:2303.07345  [pdf, other]

    cs.CV

    Erasing Concepts from Diffusion Models

    Authors: Rohit Gandikota, Joanna Materzynska, Jaden Fiotto-Kaufman, David Bau

    Abstract: Motivated by recent advancements in text-to-image diffusion, we study erasure of specific concepts from the model's weights. While Stable Diffusion has shown promise in producing explicit or realistic artwork, it has raised concerns regarding its potential for misuse. We propose a fine-tuning method that can erase a visual concept from a pre-trained diffusion model, given only the name of the styl…

    Submitted 20 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  19. arXiv:2210.13382  [pdf, other]

    cs.LG cs.AI cs.CL

    Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

    Authors: Kenneth Li, Aspen K. Hopkins, David Bau, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: Language models show a surprising range of capabilities, but the source of their apparent competence is unclear. Do these networks just memorize a collection of surface statistics, or do they rely on internal representations of the process that generates the sequences they see? We investigate this question by applying a variant of the GPT model to the task of predicting legal moves in a simple boa…

    Submitted 26 June, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 oral (notable-top-5%): https://openreview.net/forum?id=DeG07_TcZvT ; code: https://github.com/likenneth/othello_world

  20. arXiv:2210.07229  [pdf, other]

    cs.CL cs.LG

    Mass-Editing Memory in a Transformer

    Authors: Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, David Bau

    Abstract: Recent work has shown exciting promise in updating large language models with new memories, so as to replace obsolete information or add specialized knowledge. However, this line of work is predominantly limited to updating single associations. We develop MEMIT, a method for directly updating a language model with many memories, demonstrating experimentally that it can scale up to thousands of ass…

    Submitted 1 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 18 pages, 11 figures. Code and data at https://memit.baulab.info

  21. arXiv:2210.03116  [pdf, other]

    cs.CV cs.GR cs.IR cs.LG

    Content-Based Search for Deep Generative Models

    Authors: Daohan Lu, Sheng-Yu Wang, Nupur Kumari, Rohan Agarwal, Mia Tang, David Bau, Jun-Yan Zhu

    Abstract: The growing proliferation of customized and pretrained generative models has made it infeasible for a user to be fully cognizant of every model in existence. To address this need, we introduce the task of content-based model search: given a query and a large set of generative models, finding the models that best match the query. As each generative model produces a distribution of images, we formul…

    Submitted 24 October, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Our project page is hosted at https://generative-intelligence-lab.github.io/modelverse/

  22. arXiv:2207.14288  [pdf, other]

    cs.CV

    Rewriting Geometric Rules of a GAN

    Authors: Sheng-Yu Wang, David Bau, Jun-Yan Zhu

    Abstract: Deep generative models make visual content creation more accessible to novice users by automating the synthesis of diverse, realistic content based on a collected dataset. However, the current machine learning approaches miss a key element of the creative process -- the ability to synthesize things that go far beyond the data distribution and everyday experience. To begin to address this issue, we…

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: SIGGRAPH 2022 website: https://peterwang512.github.io/GANWarping/ code: https://github.com/PeterWang512/GANWarping

  23. arXiv:2207.02774  [pdf, other]

    cs.CV cs.GR

    Local Relighting of Real Scenes

    Authors: Audrey Cui, Ali Jahanian, Agata Lapedriza, Antonio Torralba, Shahin Mahdizadehaghdam, Rohit Kumar, David Bau

    Abstract: We introduce the task of local relighting, which changes a photograph of a scene by switching on and off the light sources that are visible within the image. This new task differs from the traditional image relighting problem, as it introduces the challenge of detecting light sources and inferring the pattern of light that emanates from them. We propose an approach for local relighting that trains…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 15 pages, 15 figures

  24. arXiv:2206.07835  [pdf, other]

    cs.CV

    Disentangling visual and written concepts in CLIP

    Authors: Joanna Materzynska, Antonio Torralba, David Bau

    Abstract: The CLIP network measures the similarity between natural text and images; in this work, we investigate the entanglement of the representation of word images and natural images in its image encoder. First, we find that the image encoder has an ability to match word images with natural images of scenes described by those words. This is consistent with previous research that suggests that the meaning…

    Submitted 15 June, 2022; originally announced June 2022.

  25. arXiv:2202.05262  [pdf, other]

    cs.CL cs.LG

    Locating and Editing Factual Associations in GPT

    Authors: Kevin Meng, David Bau, Alex Andonian, Yonatan Belinkov

    Abstract: We analyze the storage and recall of factual associations in autoregressive transformer language models, finding evidence that these associations correspond to localized, directly-editable computations. We first develop a causal intervention for identifying neuron activations that are decisive in a model's factual predictions. This reveals a distinct set of steps in middle-layer feed-forward modul…

    Submitted 13 January, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022. 35 pages, 30 figures. Code and data at https://rome.baulab.info/

    ACM Class: I.2.7

  26. arXiv:2201.11114  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Natural Language Descriptions of Deep Visual Features

    Authors: Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas

    Abstract: Some neurons in deep networks specialize in recognizing highly specific perceptual, structural, or semantic features of inputs. In computer vision, techniques exist for identifying neurons that respond to individual concept categories like colors, textures, and object classes. But these techniques are limited in scope, labeling only a small subset of neurons and behaviors in any network. Is a rich…

    Submitted 18 April, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: To be published as a conference paper at ICLR 2022

  27. arXiv:2112.01008  [pdf, other]

    cs.LG cs.CV

    Editing a classifier by rewriting its prediction rules

    Authors: Shibani Santurkar, Dimitris Tsipras, Mahalaxmi Elango, David Bau, Antonio Torralba, Aleksander Madry

    Abstract: We present a methodology for modifying the behavior of a classifier by directly rewriting its prediction rules. Our approach requires virtually no additional data collection and can be applied to a variety of settings, including adapting a model to new environments, and modifying it to ignore spurious features. Our code is available at https://github.com/MadryLab/EditingClassifiers .

    Submitted 2 December, 2021; originally announced December 2021.

  28. arXiv:2110.04292  [pdf, other]

    cs.CV cs.AI

    Toward a Visual Concept Vocabulary for GAN Latent Space

    Authors: Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba

    Abstract: A large body of recent work has identified transformations in the latent spaces of generative adversarial networks (GANs) that consistently and interpretably transform generated images. But existing techniques for identifying these transformations rely on either a fixed vocabulary of pre-specified visual concepts, or on unsupervised disentanglement techniques whose alignment with human judgments a…

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: 15 pages, 13 figures. Accepted to ICCV 2021. Project page: https://visualvocab.csail.mit.edu

    ACM Class: I.4

  29. arXiv:2108.02774  [pdf, other]

    cs.CV cs.LG

    Sketch Your Own GAN

    Authors: Sheng-Yu Wang, David Bau, Jun-Yan Zhu

    Abstract: Can a user create a deep generative model by sketching a single example? Traditionally, creating a GAN model has required the collection of a large-scale dataset of exemplars and specialized knowledge in deep learning. In contrast, sketching is possibly the most universally accessible way to convey a visual concept. In this work, we present a method, GAN Sketching, for rewriting GANs with one or m…

    Submitted 20 September, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 website: https://peterwang512.github.io/GANSketching code: https://github.com/PeterWang512/GANSketching

  30. arXiv:2103.10951  [pdf, other]

    cs.CV cs.AI cs.GR

    Paint by Word

    Authors: Alex Andonian, Sabrina Osmany, Audrey Cui, YeonHwan Park, Ali Jahanian, Antonio Torralba, David Bau

    Abstract: We investigate the problem of zero-shot semantic image painting. Instead of painting modifications into an image using only concrete colors or a finite set of semantic concepts, we ask how to create semantic paint based on open full-text descriptions: our goal is to be able to point to a location in a synthesized image and apply an arbitrary new concept such as "rustic" or "opulent" or "happy dog.…

    Submitted 23 March, 2023; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: 10 pages, 9 figures

    ACM Class: I.2.10; I.4; I.3

  31. arXiv:2009.05041  [pdf, other]

    cs.CV cs.LG cs.NE

    Understanding the Role of Individual Units in a Deep Neural Network

    Authors: David Bau, Jun-Yan Zhu, Hendrik Strobelt, Agata Lapedriza, Bolei Zhou, Antonio Torralba

    Abstract: Deep neural networks excel at finding hierarchical representations that solve complex tasks over large data sets. How can we humans understand these learned representations? In this work, we present network dissection, an analytic framework to systematically identify the semantics of individual hidden units within image classification and image generation networks. First, we analyze a convolutiona…

    Submitted 12 September, 2020; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: Proceedings of the National Academy of Sciences 2020. Code at https://github.com/davidbau/dissect/ and website at https://dissect.csail.mit.edu/

    MSC Class: 68T07

    ACM Class: I.4; I.2

  32. arXiv:2008.10588  [pdf, other]

    cs.CV

    What makes fake images detectable? Understanding properties that generalize

    Authors: Lucy Chai, David Bau, Ser-Nam Lim, Phillip Isola

    Abstract: The quality of image generation and manipulation is reaching impressive levels, making it increasingly difficult for a human to distinguish between what is real and what is fake. However, deep networks can still pick up on the subtle artifacts in these doctored images. We seek to understand what properties of fake images make them detectable and identify what generalizes across different model arc…

    Submitted 24 August, 2020; originally announced August 2020.

  33. arXiv:2007.15646  [pdf, other]

    cs.CV cs.GR cs.LG

    Rewriting a Deep Generative Model

    Authors: David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba

    Abstract: A deep generative model such as a GAN learns to model a rich set of semantic and physical rules about the target distribution, but up to now, it has been obscure how such rules are encoded in the network, or how a rule could be changed. In this paper, we introduce a new problem setting: manipulation of specific rules encoded by a deep generative model. To address the problem, we propose a formulat…

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: ECCV 2020 (oral). Code at https://github.com/davidbau/rewriting. For videos and demos see https://rewriting.csail.mit.edu/

    ACM Class: I.2.6; I.2.10; I.3.3

  34. arXiv:2006.10728  [pdf, other]

    cs.CV cs.LG

    Diverse Image Generation via Self-Conditioned GANs

    Authors: Steven Liu, Tongzhou Wang, David Bau, Jun-Yan Zhu, Antonio Torralba

    Abstract: We introduce a simple but effective unsupervised method for generating realistic and diverse images. We train a class-conditional GAN model without using manually annotated class labels. Instead, our model is conditional on labels automatically derived from clustering in the discriminator's feature space. Our clustering step automatically discovers diverse modes, and explicitly requires the genera…

    Submitted 9 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: CVPR 2020. Code: https://github.com/stevliu/self-conditioned-gan. Webpage: http://selfcondgan.csail.mit.edu/

  35. arXiv:2005.07727  [pdf, other]

    cs.CV cs.GR cs.LG

    Semantic Photo Manipulation with a Generative Image Prior

    Authors: David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, Antonio Torralba

    Abstract: Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the high-level attributes of an existing natural photograph with GANs is challenging for two reasons. First, it is hard for GANs to precisely reproduce an input image. Second, after manipulation, the newly synthesized pixels often do not fit the original ima…

    Submitted 12 September, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2019

    ACM Class: I.2.10; I.4; I.3

    Journal ref: ACM Transactions on Graphics (TOG) 38.4 (2019)

  36. arXiv:1910.11626  [pdf, other]

    cs.CV cs.GR cs.LG eess.IV

    Seeing What a GAN Cannot Generate

    Authors: David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba

    Abstract: Despite the success of Generative Adversarial Networks (GANs), mode collapse remains a serious issue during GAN training. To date, little work has focused on understanding and quantifying which modes have been dropped by a model. In this work, we visualize mode collapse at both the distribution level and the instance level. First, we deploy a semantic segmentation network to compare the distributi…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: ICCV 2019 oral; http://ganseeing.csail.mit.edu/

  37. arXiv:1907.00262  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Dissecting Pruned Neural Networks

    Authors: Jonathan Frankle, David Bau

    Abstract: Pruning is a standard technique for removing unnecessary structure from a neural network to reduce its storage footprint, computational demands, or energy consumption. Pruning can reduce the parameter-counts of many state-of-the-art neural networks by an order of magnitude without compromising accuracy, meaning these networks contain a vast amount of unnecessary structure. In this paper, we study…

    Submitted 29 June, 2019; originally announced July 2019.

  38. arXiv:1901.09887  [pdf, other]

    cs.LG stat.ML

    On the Units of GANs (Extended Abstract)

    Authors: David Bau, Jun-Yan Zhu, Hendrik Strobelt, Bolei Zhou, Joshua B. Tenenbaum, William T. Freeman, Antonio Torralba

    Abstract: Generative Adversarial Networks (GANs) have achieved impressive results for many real-world applications. As an active research topic, many GAN variants have emerged with improvements in sample quality and training stability. However, visualization and understanding of GANs is largely missing. How does a GAN represent our visual world internally? What causes the artifacts in GAN results? How do ar…

    Submitted 6 August, 2020; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: In AAAI-19 workshop on Network Interpretability for Deep Learning. arXiv admin note: substantial text overlap with arXiv:1811.10597

  39. arXiv:1812.09359  [pdf, other]

    cs.CL

    NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks

    Authors: Fahim Dalvi, Avery Nortonsmith, D. Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James Glass

    Abstract: We present a toolkit to facilitate the interpretation and understanding of neural network models. The toolkit provides several methods to identify salient neurons with respect to the model itself or an external task. A user can visualize selected neurons, ablate them to measure their effect on the model accuracy, and manipulate them to control the behavior of the model at the test time. Such an an…

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: AAAI Conference on Artificial Intelligence (AAAI 2019), Demonstration track, pages 2

  40. arXiv:1811.10597  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

    Authors: David Bau, Jun-Yan Zhu, Hendrik Strobelt, Bolei Zhou, Joshua B. Tenenbaum, William T. Freeman, Antonio Torralba

    Abstract: Generative Adversarial Networks (GANs) have recently achieved impressive results for many real-world applications, and many GAN variants have emerged with improvements in sample quality and training stability. However, they have not been well visualized or understood. How does a GAN represent our visual world internally? What causes the artifacts in GAN results? How do architectural choices affect…

    Submitted 8 December, 2018; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 18 pages, 19 figures

  41. arXiv:1806.02891  [pdf, other]

    cs.CV cs.AI cs.LG

    Revisiting the Importance of Individual Units in CNNs via Ablation

    Authors: Bolei Zhou, Yiyou Sun, David Bau, Antonio Torralba

    Abstract: We revisit the importance of the individual units in Convolutional Neural Networks (CNNs) for visual recognition. By conducting unit ablation experiments on CNNs trained on large scale image datasets, we demonstrate that, though ablating any individual unit does not hurt overall classification accuracy, it does lead to significant damage on the accuracy of specific classes. This result shows that…

    Submitted 7 June, 2018; originally announced June 2018.

  42. arXiv:1806.00069  [pdf, ps, other]

    cs.AI cs.LG stat.ML

    Explaining Explanations: An Overview of Interpretability of Machine Learning

    Authors: Leilani H. Gilpin, David Bau, Ben Z. Yuan, Ayesha Bajwa, Michael Specter, Lalana Kagal

    Abstract: There has recently been a surge of work in explanatory artificial intelligence (XAI). This research area tackles the important problem that complex machines and algorithms often cannot provide insights into their behavior and thought processes. XAI allows users and parts of the internal system to be more transparent, providing explanations of their decisions in some level of detail. These explanat…

    Submitted 3 February, 2019; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: The 5th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2018). [Research Track]

  43. arXiv:1711.05611  [pdf, other]

    cs.CV

    Interpreting Deep Visual Representations via Network Dissection

    Authors: Bolei Zhou, David Bau, Aude Oliva, Antonio Torralba

    Abstract: The success of recent deep convolutional neural networks (CNNs) depends on learning hidden representations that can summarize the important factors of variation behind the data. However, CNNs are often criticized as being black boxes that lack interpretability, since they have millions of unexplained model parameters. In this work, we describe Network Dissection, a method that interprets networks by p…

    Submitted 26 June, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: *B. Zhou and D. Bau contributed equally to this work. 15 pages, 27 figures

    ACM Class: I.2.10

  44. arXiv:1705.09413  [pdf, other]

    cs.PL cs.CY cs.HC cs.SE

    Learnable Programming: Blocks and Beyond

    Authors: David Bau, Jeff Gray, Caitlin Kelleher, Josh Sheldon, Franklyn Turbak

    Abstract: Blocks-based programming has become the lingua franca for introductory coding. Studies have found that experience with blocks-based programming can help beginners learn more traditional text-based languages. We explore how blocks environments improve learnability for novices by 1) favoring recognition over recall, 2) reducing cognitive load, and 3) preventing errors. Increased usability of blocks…

    Submitted 25 May, 2017; originally announced May 2017.

    ACM Class: K.3.2; H.5.2; D.1.7; D.2.6

    Journal ref: Communications of the ACM, June 2017, pp. 72-80

  45. arXiv:1704.05796  [pdf, other]

    cs.CV cs.AI

    Network Dissection: Quantifying Interpretability of Deep Visual Representations

    Authors: David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba

    Abstract: We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a broad data set of visual concepts to score the semantics of hidden units at each intermediate convolutional layer. The units wit…

    Submitted 19 April, 2017; originally announced April 2017.

    Comments: First two authors contributed equally. Oral presentation at CVPR 2017

    ACM Class: I.2.10