Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–10 of 10 results for author: Eberle, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07592  [pdf, other

    cs.LG cs.AI stat.ML

    MambaLRP: Explaining Selective State Space Sequence Models

    Authors: Farnoush Rezaei Jafari, Grégoire Montavon, Klaus-Robert Müller, Oliver Eberle

    Abstract: Recent sequence modeling approaches using Selective State Space Sequence Models, referred to as Mamba models, have seen a surge of interest. These models allow efficient processing of long sequences in linear time and are rapidly being adopted in a wide range of applications such as language modeling, demonstrating promising performance. To foster their reliable use in real-world scenarios, it is… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.04280  [pdf, other

    cs.LG cs.CV

    xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology

    Authors: Julius Hense, Mina Jamshidi Idaji, Oliver Eberle, Thomas Schnake, Jonas Dippel, Laure Ciernik, Oliver Buchstab, Andreas Mock, Frederick Klauschen, Klaus-Robert Müller

    Abstract: Multiple instance learning (MIL) is an effective and widely used approach for weakly supervised machine learning. In histopathology, MIL models have achieved remarkable success in tasks like tumor detection, biomarker prediction, and outcome prognostication. However, MIL explanation methods are still lagging behind, as they are limited to small bag sizes or disregard instance interactions. We revi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.06604  [pdf, other

    cs.CL cs.LG

    Explaining Text Similarity in Transformer Models

    Authors: Alexandros Vasileiou, Oliver Eberle

    Abstract: As Transformers have become state-of-the-art models for natural language processing (NLP) tasks, the need to understand and explain their predictions is increasingly apparent. Especially in unsupervised applications, such as information retrieval tasks, similarity models built on top of foundation model representations have been widely applied. However, their inner prediction mechanisms have mostl… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024

  4. arXiv:2402.19133  [pdf, other

    cs.CL

    Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations

    Authors: Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard, Nora Hollenstein

    Abstract: Rationales in the form of manually annotated input spans usually serve as ground truth when evaluating explainability methods in NLP. They are, however, time-consuming and often biased by the annotation process. In this paper, we debate whether human gaze, in the form of webcam-based eye-tracking recordings, poses a valid alternative when evaluating importance scores. We evaluate the additional in… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to LREC-COLING 2024

  5. arXiv:2310.11906  [pdf, other

    cs.CL

    Rather a Nurse than a Physician -- Contrastive Explanations under Investigation

    Authors: Oliver Eberle, Ilias Chalkidis, Laura Cabello, Stephanie Brandl

    Abstract: Contrastive explanations, where one decision is explained in contrast to another, are supposed to be closer to how humans explain a decision than non-contrastive explanations, where the decision is not necessarily referenced to an alternative. This claim has never been empirically validated. We analyze four English text-classification datasets (SST2, DynaSent, BIOS and DBpedia-Animals). We fine-tu… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 9 pages, long paper at EMNLP 2023 proceedings

  6. arXiv:2310.09091  [pdf, other

    cs.LG cs.AI cs.CY cs.DL

    Insightful analysis of historical sources at scales beyond human capabilities using unsupervised Machine Learning and XAI

    Authors: Oliver Eberle, Jochen Büttner, Hassan El-Hajj, Grégoire Montavon, Klaus-Robert Müller, Matteo Valleriani

    Abstract: Historical materials are abundant. Yet, piecing together how human knowledge has evolved and spread both diachronically and synchronically remains a challenge that can so far only be very selectively addressed. The vast volume of materials precludes comprehensive studies, given the restricted number of human specialists. However, as large amounts of historical materials are now available in digita… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  7. arXiv:2205.10226  [pdf, other

    cs.CL cs.LG

    Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?

    Authors: Stephanie Brandl, Oliver Eberle, Jonas Pilot, Anders Søgaard

    Abstract: Learned self-attention functions in state-of-the-art NLP models often correlate with human attention. We investigate whether self-attention in large-scale pre-trained language models is as predictive of human eye fixation patterns during task-reading as classical cognitive models of human attention. We compare attention functions across two task-specific reading datasets for sentiment analysis and… ▽ More

    Submitted 25 April, 2022; originally announced May 2022.

    Comments: Accepted to ACL 2022

  8. arXiv:2202.07304  [pdf, other

    cs.LG

    XAI for Transformers: Better Explanations through Conservative Propagation

    Authors: Ameen Ali, Thomas Schnake, Oliver Eberle, Grégoire Montavon, Klaus-Robert Müller, Lior Wolf

    Abstract: Transformers have become an important workhorse of machine learning, with numerous applications. This necessitates the development of reliable methods for increasing their transparency. Multiple interpretability methods, often based on gradient information, have been proposed. We show that the gradient in a Transformer reflects the function only locally, and thus fails to reliably identify the con… ▽ More

    Submitted 23 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  9. arXiv:2006.03589  [pdf, other

    cs.LG cs.AI stat.ML

    Higher-Order Explanations of Graph Neural Networks via Relevant Walks

    Authors: Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T. Schütt, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by ide… ▽ More

    Submitted 26 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 14 pages + 6 pages supplement

  10. Building and Interpreting Deep Similarity Models

    Authors: Oliver Eberle, Jochen Büttner, Florian Kräutli, Klaus-Robert Müller, Matteo Valleriani, Grégoire Montavon

    Abstract: Many learning algorithms such as kernel machines, nearest neighbors, clustering, or anomaly detection, are based on the concept of 'distance' or 'similarity'. Before similarities are used for training an actual machine learning model, we would like to verify that they are bound to meaningful patterns in the data. In this paper, we propose to make similarities interpretable by augmenting them with… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: 12 pages, 10 figures