Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 58 results for author: Kloft, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.21656  [pdf, other

    cs.LG

    Comgra: A Tool for Analyzing and Debugging Neural Networks

    Authors: Florian Dietz, Sophie Fellenz, Dietrich Klakow, Marius Kloft

    Abstract: Neural Networks are notoriously difficult to inspect. We introduce comgra, an open source python library for use with PyTorch. Comgra extracts data about the internal activations of a model and organizes it in a GUI (graphical user interface). It can show both summary statistics and individual data points, compare early and late stages of training, focus on individual samples of interest, and visu… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  2. arXiv:2406.16308  [pdf, other

    cs.LG cs.AI cs.CL

    Anomaly Detection of Tabular Data Using LLMs

    Authors: Aodong Li, Yunhan Zhao, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Large language models (LLMs) have shown their potential in long-context understanding and mathematical reasoning. In this paper, we study the problem of using LLMs to detect tabular anomalies and show that pre-trained LLMs are zero-shot batch-level anomaly detectors. That is, without extra distribution-specific model fitting, they can discover hidden outliers in a batch of data, demonstrating thei… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: accepted at the Anomaly Detection with Foundation Models workshop

  3. arXiv:2406.14866  [pdf, other

    cs.AI eess.IV

    AI-based Anomaly Detection for Clinical-Grade Histopathological Diagnostics

    Authors: Jonas Dippel, Niklas Prenißl, Julius Hense, Philipp Liznerski, Tobias Winterhoff, Simon Schallenberg, Marius Kloft, Oliver Buchstab, David Horst, Maximilian Alber, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen

    Abstract: While previous studies have demonstrated the potential of AI to diagnose diseases in imaging data, clinical implementation is still lagging behind. This is partly because AI models require training with large numbers of examples only available for common diseases. In clinical reality, however, only few diseases are common, whereas the majority of diseases are less frequent (long-tail distribution)… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2405.04671  [pdf, other

    cs.LG

    Interpretable Tensor Fusion

    Authors: Saurabh Varshneya, Antoine Ledent, Philipp Liznerski, Andriy Balinskyy, Purvanshi Mehta, Waleed Mustafa, Marius Kloft

    Abstract: Conventional machine learning methods are predominantly designed to predict outcomes based on a single data type. However, practical applications may encompass data of diverse types, such as text, images, and audio. We introduce interpretable tensor fusion (InTense), a multimodal learning method for training neural networks to simultaneously learn multimodal data representations and their interpre… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2403.00025  [pdf, ps, other

    cs.LG cs.AI

    On the Challenges and Opportunities in Generative AI

    Authors: Laura Manduchi, Kushagra Pandey, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt, Vincent Fortuin

    Abstract: The field of deep generative modeling has grown rapidly and consistently over the years. With the availability of massive amounts of training data coupled with advances in scalable unsupervised learning paradigms, recent large-scale generative models show tremendous promise in synthesizing high-resolution images and text, as well as structured data such as videos and molecules. However, we argue t… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

  6. arXiv:2402.14469  [pdf, other

    cs.CV cs.LG stat.ML

    Reimagining Anomalies: What If Anomalies Were Normal?

    Authors: Philipp Liznerski, Saurabh Varshneya, Ece Calikus, Sophie Fellenz, Marius Kloft

    Abstract: Deep learning-based methods have achieved a breakthrough in image anomaly detection, but their complexity introduces a considerable challenge to understanding why an instance is predicted to be anomalous. We introduce a novel explanation method that generates multiple counterfactual examples for each anomaly, capturing diverse concepts of anomalousness. A counterfactual example is a modification o… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 30 pages; preprint

  7. arXiv:2311.13594  [pdf, other

    cs.LG cs.AI stat.ML

    Labeling Neural Representations with Inverse Recognition

    Authors: Kirill Bykov, Laura Kopf, Shinichi Nakajima, Marius Kloft, Marina M. -C. Höhne

    Abstract: Deep Neural Networks (DNNs) demonstrate remarkable capabilities in learning complex hierarchical data representations, but the nature of these representations remains largely unknown. Existing global explainability methods, such as Network Dissection, face limitations such as reliance on segmentation masks, lack of statistical significance testing, and high computational demands. We propose Invers… ▽ More

    Submitted 18 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 25 pages, 16 figures

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  8. arXiv:2309.16606  [pdf, other

    cs.HC cs.AI

    "AI enhances our performance, I have no doubt this one will do the same": The Placebo effect is robust to negative descriptions of AI

    Authors: Agnes M. Kloft, Robin Welsch, Thomas Kosch, Steeven Villa

    Abstract: Heightened AI expectations facilitate performance in human-AI interactions through placebo effects. While lowering expectations to control for placebo effects is advisable, overly negative expectations could induce nocebo effects. In a letter discrimination task, we informed participants that an AI would either increase or decrease their performance by adapting the interface, but in reality, no AI… ▽ More

    Submitted 23 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  9. arXiv:2309.08627  [pdf, other

    cs.CL cs.IR cs.LG

    Evaluating Dynamic Topic Models

    Authors: Charu James, Mayank Nagda, Nooshin Haji Ghassemi, Marius Kloft, Sophie Fellenz

    Abstract: There is a lack of quantitative measures to evaluate the progression of topics through time in dynamic topic models (DTMs). Filling this gap, we propose a novel evaluation measure for DTMs that analyzes the changes in the quality of each topic over time. Additionally, we propose an extension combining topic quality with the model's temporal consistency. We demonstrate the utility of the proposed m… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  10. arXiv:2308.13577  [pdf, other

    cs.CL cs.LG

    Text Style Transfer Evaluation Using Large Language Models

    Authors: Phil Ostheimer, Mayank Nagda, Marius Kloft, Sophie Fellenz

    Abstract: Evaluating Text Style Transfer (TST) is a complex task due to its multifaceted nature. The quality of the generated text is measured based on challenging factors, such as style transfer accuracy, content preservation, and overall fluency. While human evaluation is considered to be the gold standard in TST assessment, it is costly and often hard to reproduce. Therefore, automated metrics are preval… ▽ More

    Submitted 23 September, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  11. arXiv:2306.00539  [pdf, other

    cs.LG cs.CL

    A Call for Standardization and Validation of Text Style Transfer Evaluation

    Authors: Phil Ostheimer, Mayank Nagda, Marius Kloft, Sophie Fellenz

    Abstract: Text Style Transfer (TST) evaluation is, in practice, inconsistent. Therefore, we conduct a meta-analysis on human and automated TST evaluation and experimentation that thoroughly examines existing literature in the field. The meta-analysis reveals a substantial standardization gap in human and automated evaluation. In addition, we also find a validation gap: only few automated metrics have been v… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to Findings of ACL 2023

  12. arXiv:2303.05904  [pdf, ps, other

    cs.LG

    Deep Anomaly Detection on Tennessee Eastman Process Data

    Authors: Fabian Hartung, Billy Joe Franks, Tobias Michels, Dennis Wagner, Philipp Liznerski, Steffen Reithermann, Sophie Fellenz, Fabian Jirasek, Maja Rudolph, Daniel Neider, Heike Leitte, Chen Song, Benjamin Kloepper, Stephan Mandt, Michael Bortz, Jakob Burger, Hans Hasse, Marius Kloft

    Abstract: This paper provides the first comprehensive evaluation and analysis of modern (deep-learning) unsupervised anomaly detection methods for chemical process data. We focus on the Tennessee Eastman process dataset, which has been a standard litmus test to benchmark anomaly detection methods for nearly three decades. Our extensive study will facilitate choosing appropriate anomaly detection methods in… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

  13. arXiv:2302.07849  [pdf, other

    cs.LG cs.AI stat.ML

    Zero-Shot Anomaly Detection via Batch Normalization

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection (AD) plays a crucial role in many safety-critical application domains. The challenge of adapting an anomaly detector to drift in the normal data distribution, especially when no training data is available for the "new normal," has led to the development of zero-shot AD techniques. In this paper, we propose a simple yet effective method called Adaptive Centered Representations (AC… ▽ More

    Submitted 7 November, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: accepted at NeurIPS 2023

  14. arXiv:2302.07832  [pdf, other

    cs.LG cs.AI

    Deep Anomaly Detection under Labeling Budget Constraints

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Stephan Mandt, Maja Rudolph

    Abstract: Selecting informative data points for expert feedback can significantly improve the performance of anomaly detection (AD) in various contexts, such as medical diagnostics or fraud detection. In this paper, we determine a set of theoretical conditions under which anomaly scores generalize from labeled queries to unlabeled data. Motivated by these results, we propose a data labeling strategy with op… ▽ More

    Submitted 4 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  15. arXiv:2301.09485  [pdf, other

    cs.LG

    Ordinal Regression for Difficulty Estimation of StepMania Levels

    Authors: Billy Joe Franks, Benjamin Dinkelmann, Sophie Fellenz, Marius Kloft

    Abstract: StepMania is a popular open-source clone of a rhythm-based video game. As is common in popular games, there is a large number of community-designed levels. It is often difficult for players and level authors to determine the difficulty level of such community contributions. In this work, we formalize and analyze the difficulty prediction task on StepMania levels as an ordinal regression (OR) task.… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  16. arXiv:2212.08339  [pdf, other

    cs.LG stat.ML

    Generalization Bounds for Inductive Matrix Completion in Low-noise Settings

    Authors: Antoine Ledent, Rodrigo Alves, Yunwen Lei, Yann Guermeur, Marius Kloft

    Abstract: We study inductive matrix completion (matrix completion with side information) under an i.i.d. subgaussian noise assumption at a low noise regime, with uniform sampling of the entries. We obtain for the first time generalization bounds with the following three properties: (1) they scale like the standard deviation of the noise and in particular approach zero in the exact recovery case; (2) even in… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: 30 Pages, 1 figure; Accepted for publication at AAAI 2023

    Journal ref: AAAI 2023

  17. arXiv:2210.14056  [pdf, other

    cs.LG cs.AI

    Unsupervised Anomaly Detection for Auditing Data and Impact of Categorical Encodings

    Authors: Ajay Chawda, Stefanie Grimm, Marius Kloft

    Abstract: In this paper, we introduce the Vehicle Claims dataset, consisting of fraudulent insurance claims for automotive repairs. The data belongs to the more broad category of Auditing data, which includes also Journals and Network Intrusion data. Insurance claim data are distinctively different from other auditing data (such as network intrusion data) in their high number of categorical attributes. We t… ▽ More

    Submitted 26 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: This work has been accepted at Proceedings of the Neurips 2022 Workshop on Synthetic Data 4ML

    ACM Class: I.2.m

  18. arXiv:2209.14933  [pdf, other

    cs.LG stat.ML

    Training Normalizing Flows from Dependent Data

    Authors: Matthias Kirchler, Christoph Lippert, Marius Kloft

    Abstract: Normalizing flows are powerful non-parametric statistical models that function as a hybrid between density estimators and generative models. Current learning algorithms for normalizing flows assume that data points are sampled independently, an assumption that is frequently violated in practice, which may lead to erroneous density estimation and data generation. We propose a likelihood objective o… ▽ More

    Submitted 30 May, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  19. arXiv:2205.13845  [pdf, other

    cs.LG cs.AI

    Raising the Bar in Graph-level Anomaly Detection

    Authors: Chen Qiu, Marius Kloft, Stephan Mandt, Maja Rudolph

    Abstract: Graph-level anomaly detection has become a critical topic in diverse areas, such as financial fraud detection and detecting anomalous activities in social networks. While most research has focused on anomaly detection for visual data such as images, where high detection accuracies have been obtained, existing deep learning approaches for graphs currently show considerably worse performance. This p… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: To appear in IJCAI-ECAI 2022

    Journal ref: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI-22), 2022

  20. arXiv:2205.11474  [pdf, other

    cs.CV cs.LG stat.ML

    Exposing Outlier Exposure: What Can Be Learned From Few, One, and Zero Outlier Images

    Authors: Philipp Liznerski, Lukas Ruff, Robert A. Vandermeulen, Billy Joe Franks, Klaus-Robert Müller, Marius Kloft

    Abstract: Due to the intractability of characterizing everything that looks unlike the normal data, anomaly detection (AD) is traditionally treated as an unsupervised problem utilizing only normal samples. However, it has recently been found that unsupervised image AD can be drastically improved through the utilization of huge corpora of random images to represent anomalousness; a technique which is known a… ▽ More

    Submitted 14 November, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: 47 pages; extended experiments; published in Transactions on Machine Learning Research. arXiv admin note: substantial text overlap with arXiv:2006.00339

  21. arXiv:2202.08088  [pdf, other

    cs.LG cs.AI

    Latent Outlier Exposure for Anomaly Detection with Contaminated Data

    Authors: Chen Qiu, Aodong Li, Marius Kloft, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection aims at identifying data points that show systematic deviations from the majority of data in an unlabeled dataset. A common assumption is that clean training data (free of anomalies) is available, which is often violated in practice. We propose a strategy for training an anomaly detector in the presence of unlabeled anomalies that is compatible with a broad class of models. The i… ▽ More

    Submitted 26 June, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear in ICML 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022, volume:162, pages:18153--18167

  22. arXiv:2202.03944  [pdf, other

    cs.LG cs.AI

    Detecting Anomalies within Time Series using Local Neural Transformations

    Authors: Tim Schneider, Chen Qiu, Marius Kloft, Decky Aspandi Latif, Steffen Staab, Stephan Mandt, Maja Rudolph

    Abstract: We develop a new method to detect anomalies within time series, which is essential in many application domains, reaching from self-driving cars, finance, and marketing to medical diagnosis and epidemiology. The method is based on self-supervised deep learning that has played a key role in facilitating deep anomaly detection on images, where powerful image transformations are available. However, su… ▽ More

    Submitted 20 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  23. arXiv:2112.04314  [pdf, ps, other

    cs.LG cs.AI stat.ML

    A systematic approach to random data augmentation on graph neural networks

    Authors: Billy Joe Franks, Markus Anders, Marius Kloft, Pascal Schweitzer

    Abstract: Random data augmentations (RDAs) are state of the art regarding practical graph neural networks that are provably universal. There is great diversity regarding terminology, methodology, benchmarks, and evaluation metrics used among existing RDAs. Not only does this make it increasingly difficult for practitioners to decide which technique to apply to a given problem, but it also stands in the way… ▽ More

    Submitted 21 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  24. Learning Interpretable Concept Groups in CNNs

    Authors: Saurabh Varshneya, Antoine Ledent, Robert A. Vandermeulen, Yunwen Lei, Matthias Enders, Damian Borth, Marius Kloft

    Abstract: We propose a novel training methodology -- Concept Group Learning (CGL) -- that encourages training of interpretable CNN filters by partitioning filters in each layer into concept groups, each of which is trained to learn a single visual concept. We achieve this through a novel regularization strategy that forces filters in the same group to be active in similar image regions for a given layer. We… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  25. arXiv:2109.07869  [pdf, other

    cs.LG cs.CV cs.HC stat.ML

    Explainability Requires Interactivity

    Authors: Matthias Kirchler, Martin Graf, Marius Kloft, Christoph Lippert

    Abstract: When explaining the decisions of deep neural networks, simple stories are tempting but dangerous. Especially in computer vision, the most popular explanation approaches give a false sense of comprehension to its users and provide an overly simplistic picture. We introduce an interactive framework to understand the highly complex decision boundaries of modern vision models. It allows the user to ex… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  26. arXiv:2108.10346  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Explaining Bayesian Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Adelaida Creosteanu, Klaus-Robert Müller, Frederick Klauschen, Shinichi Nakajima, Marius Kloft

    Abstract: To make advanced learning machines such as Deep Neural Networks (DNNs) more transparent in decision making, explainable AI (XAI) aims to provide interpretations of DNNs' predictions. These interpretations are usually given in the form of heatmaps, each one illustrating relevant patterns regarding the prediction for a given instance. Bayesian approaches such as Bayesian Neural Networks (BNNs) so fa… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 16 pages, 8 figures

  27. arXiv:2106.00115  [pdf, ps, other

    cs.LG stat.ML

    Fine-grained Generalization Analysis of Structured Output Prediction

    Authors: Waleed Mustafa, Yunwen Lei, Antoine Ledent, Marius Kloft

    Abstract: In machine learning we often encounter structured output prediction problems (SOPPs), i.e. problems where the output space admits a rich internal structure. Application domains where SOPPs naturally occur include natural language processing, speech recognition, and computer vision. Typical SOPPs have an extremely large label set, which grows exponentially as a function of the size of the output. E… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: To appearn in IJCAI 2021

  28. arXiv:2104.14173  [pdf, other

    cs.LG cs.AI

    Fine-grained Generalization Analysis of Vector-valued Learning

    Authors: Liang Wu, Antoine Ledent, Yunwen Lei, Marius Kloft

    Abstract: Many fundamental machine learning tasks can be formulated as a problem of learning with vector-valued functions, where we learn multiple scalar-valued functions together. Although there is some generalization analysis on different specific algorithms under the empirical risk minimization principle, a unifying analysis of vector-valued learning under a regularization framework is still lacking. In… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: To appear in AAAI 2021

  29. arXiv:2103.16440  [pdf, other

    cs.LG cs.AI

    Neural Transformation Learning for Deep Anomaly Detection Beyond Images

    Authors: Chen Qiu, Timo Pfrommer, Marius Kloft, Stephan Mandt, Maja Rudolph

    Abstract: Data transformations (e.g. rotations, reflections, and cropping) play an important role in self-supervised learning. Typically, images are transformed into different views, and neural networks trained on tasks involving these views produce useful feature representations for downstream tasks, including anomaly detection. However, for anomaly detection beyond image data, it is often unclear which tr… ▽ More

    Submitted 3 February, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, 2021, volume:139, pages:8703--8714

  30. arXiv:2009.11732  [pdf, other

    cs.LG cs.AI stat.ML

    A Unifying Review of Deep and Shallow Anomaly Detection

    Authors: Lukas Ruff, Jacob R. Kauffmann, Robert A. Vandermeulen, Grégoire Montavon, Wojciech Samek, Marius Kloft, Thomas G. Dietterich, Klaus-Robert Müller

    Abstract: Deep learning approaches to anomaly detection have recently improved the state of the art in detection performance on complex datasets such as large collections of images or text. These results have sparked a renewed interest in the anomaly detection problem and led to the introduction of a great variety of new methods. With the emergence of numerous such methods, including approaches based on gen… ▽ More

    Submitted 8 February, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: 40 pages; accepted for publication in the Proceedings of the IEEE;

    Journal ref: Proceedings of the IEEE (2021) 1-40

  31. arXiv:2009.06571  [pdf, other

    cs.LG stat.ML

    Input Hessian Regularization of Neural Networks

    Authors: Waleed Mustafa, Robert A. Vandermeulen, Marius Kloft

    Abstract: Regularizing the input gradient has shown to be effective in promoting the robustness of neural networks. The regularization of the input's Hessian is therefore a natural next step. A key challenge here is the computational complexity. Computing the Hessian of inputs is computationally infeasible. In this paper we propose an efficient algorithm to train deep neural networks with Hessian operator-n… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: Workshop on "Beyond first-order methods in ML systems" at the 37th International Conference on Machine Learning, Vienna, Austria, 2020

  32. arXiv:2008.11988  [pdf, other

    cs.CV cs.LG eess.IV

    Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

    Authors: Guang Yu, Siqi Wang, Zhiping Cai, En Zhu, Chuanfu Xu, Jianping Yin, Marius Kloft

    Abstract: As a vital topic in media content interpretation, video anomaly detection (VAD) has made fruitful progress via deep neural network (DNN). However, existing methods usually follow a reconstruction or frame prediction routine. They suffer from two gaps: (1) They cannot localize video activities in a both precise and comprehensive manner. (2) They lack sufficient abilities to utilize high-level seman… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: To be published as an oral paper in Proceedings of the 28th ACM International Conference on Multimedia (ACM MM '20). 9 pages, 7 figures

  33. arXiv:2007.01760  [pdf, other

    cs.CV cs.LG stat.ML

    Explainable Deep One-Class Classification

    Authors: Philipp Liznerski, Lukas Ruff, Robert A. Vandermeulen, Billy Joe Franks, Marius Kloft, Klaus-Robert Müller

    Abstract: Deep one-class classification variants for anomaly detection learn a mapping that concentrates nominal samples in feature space causing anomalies to be mapped away. Because this transformation is highly non-linear, finding interpretations poses a significant challenge. In this paper we present an explainable deep one-class classification method, Fully Convolutional Data Description (FCDD), where t… ▽ More

    Submitted 18 March, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 25 pages, published as a conference paper at ICLR 2021

  34. arXiv:2006.09000  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    How Much Can I Trust You? -- Quantifying Uncertainties in Explaining Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Klaus-Robert Müller, Shinichi Nakajima, Marius Kloft

    Abstract: Explainable AI (XAI) aims to provide interpretations for predictions made by learning machines, such as deep neural networks, in order to make the machines more transparent for the user and furthermore trustworthy also for applications in e.g. safety-critical areas. So far, however, no methods for quantifying uncertainties of explanations have been conceived, which is problematic in domains where… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 12 pages, 10 figures

  35. arXiv:2006.00339  [pdf, other

    cs.LG stat.ML

    Rethinking Assumptions in Deep Anomaly Detection

    Authors: Lukas Ruff, Robert A. Vandermeulen, Billy Joe Franks, Klaus-Robert Müller, Marius Kloft

    Abstract: Though anomaly detection (AD) can be viewed as a classification problem (nominal vs. anomalous) it is usually treated in an unsupervised manner since one typically does not have access to, or it is infeasible to utilize, a dataset that sufficiently characterizes what it means to be "anomalous." In this paper we present results demonstrating that this intuition surprisingly seems not to extend to d… ▽ More

    Submitted 27 January, 2023; v1 submitted 30 May, 2020; originally announced June 2020.

    Comments: 17 pages; accepted at the ICML 2021 Workshop on Uncertainty & Robustness in Deep Learning; An extended Journal paper of this work has been published in Transactions on Machine Learning Research: arXiv:2205.11474

  36. Orthogonal Inductive Matrix Completion

    Authors: Antoine Ledent, Rodrigo Alves, Marius Kloft

    Abstract: We propose orthogonal inductive matrix completion (OMIC), an interpretable approach to matrix completion based on a sum of multiple orthonormal side information terms, together with nuclear-norm regularization. The approach allows us to inject prior knowledge about the singular vectors of the ground truth matrix. We optimize the approach by a provably converging algorithm, which optimizes all… ▽ More

    Submitted 25 August, 2021; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: To appear in Transactions of Neural Networks and Learning Systems (TNNLS)

  37. Simple and Effective Prevention of Mode Collapse in Deep One-Class Classification

    Authors: Penny Chong, Lukas Ruff, Marius Kloft, Alexander Binder

    Abstract: Anomaly detection algorithms find extensive use in various fields. This area of research has recently made great advances thanks to deep learning. A recent method, the deep Support Vector Data Description (deep SVDD), which is inspired by the classic kernel-based Support Vector Data Description (SVDD), is capable of simultaneously learning a feature representation of the data and a data-enclosing… ▽ More

    Submitted 19 January, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted in 2020 International Joint Conference on Neural Networks (IJCNN)

  38. arXiv:1910.06239  [pdf, other

    stat.ML cs.LG stat.ME

    Two-sample Testing Using Deep Learning

    Authors: Matthias Kirchler, Shahryar Khorasani, Marius Kloft, Christoph Lippert

    Abstract: We propose a two-sample testing procedure based on learned deep neural network representations. To this end, we define two test statistics that perform an asymptotic location test on data samples mapped onto a hidden layer. The tests are consistent and asymptotically control the type-1 error rate. Their test statistics can be evaluated in linear time (in the sample size). Suitable data representat… ▽ More

    Submitted 10 March, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

  39. arXiv:1910.01249  [pdf, other

    cs.LG stat.ML

    Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator

    Authors: James A. Preiss, Sébastien M. R. Arnold, Chen-Yu Wei, Marius Kloft

    Abstract: We study the variance of the REINFORCE policy gradient estimator in environments with continuous state and action spaces, linear dynamics, quadratic cost, and Gaussian noise. These simple environments allow us to derive bounds on the estimator variance in terms of the environment and noise parameters. We compare the predictions of our bounds to the empirical variance in simulation experiments.

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 Workshop on Optimization Foundations for Reinforcement Learning. 7 pages + 6 pages appendix

  40. arXiv:1906.02694  [pdf, other

    cs.LG stat.ML

    Deep Semi-Supervised Anomaly Detection

    Authors: Lukas Ruff, Robert A. Vandermeulen, Nico Görnitz, Alexander Binder, Emmanuel Müller, Klaus-Robert Müller, Marius Kloft

    Abstract: Deep approaches to anomaly detection have recently shown promising results over shallow methods on large and complex datasets. Typically anomaly detection is treated as an unsupervised learning problem. In practice however, one may have---in addition to a large set of unlabeled samples---access to a small pool of labeled samples, e.g. a subset verified by some domain expert as being normal or anom… ▽ More

    Submitted 14 February, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: 23 pages, Published as a conference paper at ICLR 2020

  41. arXiv:1905.12430  [pdf, other

    cs.LG stat.ML

    Norm-based generalisation bounds for multi-class convolutional neural networks

    Authors: Antoine Ledent, Waleed Mustafa, Yunwen Lei, Marius Kloft

    Abstract: We show generalisation error bounds for deep learning with two main improvements over the state of the art. (1) Our bounds have no explicit dependence on the number of classes except for logarithmic factors. This holds even when formulating the bounds in terms of the $L^2$-norm of the weight matrices, where previous bounds exhibit at least a square-root dependence on the number of classes. (2) We… ▽ More

    Submitted 21 February, 2021; v1 submitted 29 May, 2019; originally announced May 2019.

  42. arXiv:1803.07868  [pdf, other

    stat.ML cs.LG

    Scalable Generalized Dynamic Topic Models

    Authors: Patrick Jähnichen, Florian Wenzel, Marius Kloft, Stephan Mandt

    Abstract: Dynamic topic models (DTMs) model the evolution of prevalent themes in literature, online media, and other forms of text over time. DTMs assume that word co-occurrence statistics change continuously and therefore impose continuous stochastic process priors on their model parameters. These dynamical priors make inference much harder than in regular topic models, and also limit scalability. In this… ▽ More

    Submitted 21 March, 2018; originally announced March 2018.

    Comments: Published version, International Conference on Artificial Intelligence and Statistics (AISTATS 2018)

  43. arXiv:1802.06383  [pdf, other

    stat.ML cs.LG

    Efficient Gaussian Process Classification Using Polya-Gamma Data Augmentation

    Authors: Florian Wenzel, Theo Galy-Fajou, Christan Donner, Marius Kloft, Manfred Opper

    Abstract: We propose a scalable stochastic variational approach to GP classification building on Polya-Gamma data augmentation and inducing points. Unlike former approaches, we obtain closed-form updates based on natural gradients that lead to efficient optimization. We evaluate the algorithm on real-world datasets containing up to 11 million data points and demonstrate that it is up to two orders of magnit… ▽ More

    Submitted 27 November, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

  44. arXiv:1707.05532  [pdf, other

    stat.ML cs.LG

    Bayesian Nonlinear Support Vector Machines for Big Data

    Authors: Florian Wenzel, Theo Galy-Fajou, Matthaeus Deutsch, Marius Kloft

    Abstract: We propose a fast inference method for Bayesian nonlinear support vector machines that leverages stochastic variational inference and inducing points. Our experiments show that the proposed method is faster than competing Bayesian approaches and scales easily to millions of data points. It provides additional features over frequentist competitors such as accurate predictive uncertainty estimates a… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: accepted as conference paper at ECML-PKDD 2017

  45. arXiv:1706.09814  [pdf, other

    cs.LG

    Data-dependent Generalization Bounds for Multi-class Classification

    Authors: Yunwen Lei, Urun Dogan, Ding-Xuan Zhou, Marius Kloft

    Abstract: In this paper, we study data-dependent generalization error bounds exhibiting a mild dependency on the number of classes, making them suitable for multi-class learning with a large number of label classes. The bounds generally hold for empirical multi-class risk minimization algorithms using an arbitrary norm as regularizer. Key to our analysis are new structural results for multi-class Gaussian c… ▽ More

    Submitted 29 December, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

  46. Distributed Optimization of Multi-Class SVMs

    Authors: Maximilian Alber, Julian Zimmert, Urun Dogan, Marius Kloft

    Abstract: Training of one-vs.-rest SVMs can be parallelized over the number of classes in a straight forward way. Given enough computational resources, one-vs.-rest SVMs can thus be trained on data involving a large number of classes. The same cannot be stated, however, for the so-called all-in-one SVMs, which require solving a quadratic program of size quadratically in the number of classes. We develop dis… ▽ More

    Submitted 8 December, 2016; v1 submitted 25 November, 2016; originally announced November 2016.

  47. arXiv:1611.07567  [pdf, other

    cs.AI cs.LG stat.ML

    Feature Importance Measure for Non-linear Learning Algorithms

    Authors: Marina M. -C. Vidovic, Nico Görnitz, Klaus-Robert Müller, Marius Kloft

    Abstract: Complex problems may require sophisticated, non-linear learning methods such as kernel machines or deep neural networks to achieve state of the art prediction accuracies. However, high prediction accuracies are not the only objective to consider when solving problems using machine learning. Instead, particular scientific applications require some explanation of the learned prediction function. Unf… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  48. arXiv:1602.05916  [pdf, ps, other

    cs.LG

    Local Rademacher Complexity-based Learning Guarantees for Multi-Task Learning

    Authors: Niloofar Yousefi, Yunwen Lei, Marius Kloft, Mansooreh Mollaghasemi, Georgios Anagnostopoulos

    Abstract: We show a Talagrand-type concentration inequality for Multi-Task Learning (MTL), using which we establish sharp excess risk bounds for MTL in terms of distribution- and data-dependent versions of the Local Rademacher Complexity (LRC). We also give a new bound on the LRC for norm regularized as well as strongly convex hypothesis classes, which applies not only to MTL but also to the standard i.i.d.… ▽ More

    Submitted 9 February, 2017; v1 submitted 18 February, 2016; originally announced February 2016.

    Comments: In this version, some arguments and results (of the previous version) have been corrected, or modified

  49. Sparse Probit Linear Mixed Model

    Authors: Stephan Mandt, Florian Wenzel, Shinichi Nakajima, John P. Cunningham, Christoph Lippert, Marius Kloft

    Abstract: Linear Mixed Models (LMMs) are important tools in statistical genetics. When used for feature selection, they allow to find a sparse set of genetic traits that best predict a continuous phenotype of interest, while simultaneously correcting for various confounding factors such as age, ethnicity and population structure. Formulated as models for linear regression, LMMs have been restricted to conti… ▽ More

    Submitted 17 July, 2017; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: Published version, 21 pages, 6 figures

    Journal ref: Machine Learning, 106(9), 1621-1642 (2017)

  50. arXiv:1506.09153  [pdf, other

    stat.ML cs.CE cs.LG

    Framework for Multi-task Multiple Kernel Learning and Applications in Genome Analysis

    Authors: Christian Widmer, Marius Kloft, Vipin T Sreedharan, Gunnar Rätsch

    Abstract: We present a general regularization-based framework for Multi-task learning (MTL), in which the similarity between tasks can be learned or refined using $\ell_p$-norm Multiple Kernel learning (MKL). Based on this very general formulation (including a general loss function), we derive the corresponding dual formulation using Fenchel duality applied to Hermitian matrices. We show that numerous estab… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.