Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–44 of 44 results for author: Igel, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05469  [pdf, other

    cs.LG

    Bayesian vs. PAC-Bayesian Deep Neural Network Ensembles

    Authors: Nick Hauptvogel, Christian Igel

    Abstract: Bayesian neural networks address epistemic uncertainty by learning a posterior distribution over model parameters. Sampling and weighting networks according to this posterior yields an ensemble model referred to as Bayes ensemble. Ensembles of neural networks (deep ensembles) can profit from the cancellation of errors effect: Errors by ensemble members may average out and the deep ensemble achieve… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2406.04949  [pdf, other

    cs.CV cs.AI cs.LG

    Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment

    Authors: Venkanna Babu Guthula, Stefan Oehmcke, Remigio Chilaule, Hui Zhang, Nico Lang, Ankit Kariryaa, Johan Mottelson, Christian Igel

    Abstract: As low-quality housing and in particular certain roof characteristics are associated with an increased risk of malaria, classification of roof types based on remote sensing imagery can support the assessment of malaria risk and thereby help prevent the disease. To support research in this area, we release the Nacala-Roof-Material dataset, which contains high-resolution drone images from Mozambique… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.01345  [pdf, other

    cs.LG stat.ML

    BMRS: Bayesian Model Reduction for Structured Pruning

    Authors: Dustin Wright, Christian Igel, Raghavendra Selvan

    Abstract: Modern neural networks are often massively overparameterized leading to high compute costs during training and at inference. One effective method to improve both the compute and energy efficiency of neural networks while maintaining good performance is structured pruning, where full network structures (e.g. neurons or convolutional filters) that have limited impact on the model output are removed.… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 17 pages; 8 figures; 2 tables

  4. arXiv:2405.02771  [pdf, other

    cs.CV cs.AI cs.LG

    MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

    Authors: Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke, Serge Belongie, Christian Igel, Nico Lang

    Abstract: The volume of unlabelled Earth observation (EO) data is huge, but many important applications lack labelled training data. However, EO data offers the unique opportunity to pair data from different modalities and sensors automatically based on geographic location and time, at virtually no human labor cost. We seize this opportunity to create a diverse multi-modal pretraining dataset at global scal… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Data and code is available on the project page: https://vishalned.github.io/mmearth

  5. arXiv:2403.12562  [pdf, other

    cs.LG cs.AI stat.ML

    Equity through Access: A Case for Small-scale Deep Learning

    Authors: Raghavendra Selvan, Bob Pepin, Christian Igel, Gabrielle Samuel, Erik B Dam

    Abstract: The recent advances in deep learning (DL) have been accelerated by access to large-scale data and compute. These large-scale resources have been used to train progressively larger models which are resource intensive in terms of compute, data, energy, and carbon emissions. These costs are becoming a new type of entry barrier to researchers and practitioners with limited access to resources at such… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Source code available at https://github.com/saintslab/PePR

  6. arXiv:2403.01932   

    cs.CV

    Tree Counting by Bridging 3D Point Clouds with Imagery

    Authors: Lei Li, Tianfang Zhang, Zhongyu Jiang, Cheng-Yen Yang, Jenq-Neng Hwang, Stefan Oehmcke, Dimitri Pierre Johannes Gominski, Fabian Gieseke, Christian Igel

    Abstract: Accurate and consistent methods for counting trees based on remote sensing data are needed to support sustainable forest management, assess climate change mitigation strategies, and build trust in tree carbon credits. Two-dimensional remote sensing imagery primarily shows overstory canopy, and it does not facilitate easy differentiation of individual trees in areas with a dense canopy and does not… ▽ More

    Submitted 11 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: need more experiments

  7. arXiv:2311.11592  [pdf, other

    cs.CV

    Predicting urban tree cover from incomplete point labels and limited background information

    Authors: Hui Zhang, Ankit Kariryaa, Venkanna Babu Guthula, Christian Igel, Stefan Oehmcke

    Abstract: Trees inside cities are important for the urban microclimate, contributing positively to the physical and mental health of the urban dwellers. Despite their importance, often only limited information about city trees is available. Therefore in this paper, we propose a method for mapping urban trees in high-resolution aerial imagery using limited datasets and deep learning. Deep learning has become… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  8. arXiv:2311.07981  [pdf, other

    cs.CV

    Benchmarking Individual Tree Mapping with Sub-meter Imagery

    Authors: Dimitri Gominski, Ankit Kariryaa, Martin Brandt, Christian Igel, Sizhuo Li, Maurice Mugabowindekwe, Rasmus Fensholt

    Abstract: There is a rising interest in mapping trees using satellite or aerial imagery, but there is no standardized evaluation protocol for comparing and enhancing methods. In dense canopy areas, the high variability of tree sizes and their spatial proximity makes it arduous to define the quality of the predictions. Concurrently, object-centric approaches such as bounding box detection usuallyperform poor… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  9. arXiv:2311.05006  [pdf, other

    cs.CV cs.LG

    Familiarity-Based Open-Set Recognition Under Adversarial Attacks

    Authors: Philip Enevoldsen, Christian Gundersen, Nico Lang, Serge Belongie, Christian Igel

    Abstract: Open-set recognition (OSR), the identification of novel categories, can be a critical component when deploying classification models in real-world applications. Recent work has shown that familiarity-based scoring rules such as the Maximum Softmax Probability (MSP) or the Maximum Logit Score (MLS) are strong baselines when the closed-set accuracy is high. However, one of the potential weaknesses o… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Published in: The 2nd Workshop and Challenges for Out-of-Distribution Generalization in Computer Vision, ICCV 2023

  10. arXiv:2309.02065  [pdf, other

    cs.LG cs.CY stat.ML

    Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

    Authors: Dustin Wright, Christian Igel, Gabrielle Samuel, Raghavendra Selvan

    Abstract: Artificial Intelligence (AI) is currently spearheaded by machine learning (ML) methods such as deep learning (DL) which have accelerated progress on many tasks thought to be out of reach of AI. These ML methods can often be compute hungry, energy intensive, and result in significant carbon emissions, a known driver of anthropogenic climate change. Additionally, the platforms on which ML systems ru… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 24 pages; 6 figures

  11. arXiv:2307.03798  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Fooling Contrastive Language-Image Pre-trained Models with CLIPMasterPrints

    Authors: Matthias Freiberger, Peter Kun, Christian Igel, Anders Sundnes Løvlie, Sebastian Risi

    Abstract: Models leveraging both visual and textual data such as Contrastive Language-Image Pre-training (CLIP), are the backbone of many recent advances in artificial intelligence. In this work, we show that despite their versatility, such models are vulnerable to what we refer to as fooling master images. Fooling master images are capable of maximizing the confidence score of a CLIP model for a significan… ▽ More

    Submitted 16 April, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: This work was supported by a research grant (40575) from VILLUM FONDEN

  12. arXiv:2306.01147  [pdf, other

    cs.LG cs.AI

    Smooth Min-Max Monotonic Networks

    Authors: Christian Igel

    Abstract: Monotonicity constraints are powerful regularizers in statistical modelling. They can support fairness in computer-aided decision making and increase plausibility in data-driven scientific models. The seminal min-max (MM) neural network architecture ensures monotonicity, but often gets stuck in undesired local optima during training because of partial derivatives of the MM nonlinearities being zer… ▽ More

    Submitted 25 May, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria. PMLR 235, 2024

  13. arXiv:2301.06190  [pdf, other

    cs.CV eess.IV

    BuildSeg: A General Framework for the Segmentation of Buildings

    Authors: Lei Li, Tianfang Zhang, Stefan Oehmcke, Fabian Gieseke, Christian Igel

    Abstract: Building segmentation from aerial images and 3D laser scanning (LiDAR) is a challenging task due to the diversity of backgrounds, building textures, and image quality. While current research using different types of convolutional and transformer networks has considerably improved the performance on this task, even more accurate segmentation methods for buildings are desirable for applications such… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

  14. arXiv:2212.09088  [pdf, other

    cs.CV eess.IV

    LR-CSNet: Low-Rank Deep Unfolding Network for Image Compressive Sensing

    Authors: Tianfang Zhang, Lei Li, Christian Igel, Stefan Oehmcke, Fabian Gieseke, Zhenming Peng

    Abstract: Deep unfolding networks (DUNs) have proven to be a viable approach to compressive sensing (CS). In this work, we propose a DUN called low-rank CS network (LR-CSNet) for natural image CS. Real-world image patches are often well-represented by low-rank approximations. LR-CSNet exploits this property by adding a low-rank prior to the CS optimization task. We derive a corresponding iterative optimizat… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

  15. EC-NAS: Energy Consumption Aware Tabular Benchmarks for Neural Architecture Search

    Authors: Pedram Bakhtiarifard, Christian Igel, Raghavendra Selvan

    Abstract: Energy consumption from the selection, training, and deployment of deep learning models has seen a significant uptick recently. This work aims to facilitate the design of energy-efficient deep learning models that require less computational resources and prioritize environmental sustainability by focusing on the energy consumption. Neural architecture search (NAS) benefits from tabular benchmarks,… ▽ More

    Submitted 22 March, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to be presented at the International Conference on Acoustics, Speech and Signal Processing (ICASSP-2024). Source code at https://github.com/saintslab/EC-NAS-Bench

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5660-5664

  16. arXiv:2205.10643  [pdf, other

    cs.CL cs.SD eess.AS

    Self-Supervised Speech Representation Learning: A Review

    Authors: Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe

    Abstract: Although supervised deep learning has revolutionized speech and audio processing, it has necessitated the building of specialist models for individual tasks and application scenarios. It is likewise difficult to apply this to dialects and languages for which only limited labeled data is available. Self-supervised representation learning methods promise a single universal model that would benefit a… ▽ More

    Submitted 27 October, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

  17. arXiv:2203.16470  [pdf, other

    cs.LG stat.ML

    Remember to correct the bias when using deep learning for regression!

    Authors: Christian Igel, Stefan Oehmcke

    Abstract: When training deep learning models for least-squares regression, we cannot expect that the training error residuals of the final model, selected after a fixed training time or based on performance on a hold-out data set, sum to zero. This can introduce a systematic error that accumulates if we are interested in the total aggregated performance over many data points. We suggest to adjust the bias o… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: 8 pages, 3 figures, 2 tables

    MSC Class: 68T07 ACM Class: I.2.0

  18. arXiv:2203.01829  [pdf, other

    eess.AS cs.LG cs.SD

    A Brief Overview of Unsupervised Neural Speech Representation Learning

    Authors: Lasse Borgholt, Jakob Drachmann Havtorn, Joakim Edin, Lars Maaløe, Christian Igel

    Abstract: Unsupervised representation learning for speech processing has matured greatly in the last few years. Work in computer vision and natural language processing has paved the way, but speech data offers unique challenges. As a result, methods from other domains rarely translate directly. We review the development of unsupervised representation learning for speech over the last decade. We identify two… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: The 2nd Workshop on Self-supervised Learning for Audio and Speech Processing (SAS) at AAAI

  19. arXiv:2112.11335  [pdf, other

    cs.CV cs.CY cs.LG

    Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass

    Authors: Stefan Oehmcke, Lei Li, Katerina Trepekli, Jaime Revenga, Thomas Nord-Larsen, Fabian Gieseke, Christian Igel

    Abstract: Quantification of forest biomass stocks and their dynamics is important for implementing effective climate change mitigation measures. The knowledge is needed, e.g., for local forest management, studying the processes driving af-, re-, and deforestation, and can improve the accuracy of carbon-accounting. Remote sensing using airborne LiDAR can be used to perform these measurements of vegetation st… ▽ More

    Submitted 21 February, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 31 pages, 14 figures, 4 tables

    ACM Class: I.2.10; I.2.1; J.0

  20. arXiv:2111.14842  [pdf, other

    eess.AS cs.CL cs.LG

    Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?

    Authors: Lasse Borgholt, Jakob Drachmann Havtorn, Mostafa Abdou, Joakim Edin, Lars Maaløe, Anders Søgaard, Christian Igel

    Abstract: Spoken language understanding (SLU) tasks are usually solved by first transcribing an utterance with automatic speech recognition (ASR) and then feeding the output to a text-based model. Recent advances in self-supervised representation learning for speech data have focused on improving the ASR component. We investigate whether representation learning for speech has matured enough to replace ASR i… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Under review as a conference paper at ICASSP 2022

  21. arXiv:2106.13624  [pdf, other

    cs.LG stat.ML

    Chebyshev-Cantelli PAC-Bayes-Bennett Inequality for the Weighted Majority Vote

    Authors: Yi-Shan Wu, Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a new second-order oracle bound for the expected risk of a weighted majority vote. The bound is based on a novel parametric form of the Chebyshev- Cantelli inequality (a.k.a. one-sided Chebyshev's), which is amenable to efficient minimization. The new form resolves the optimization challenge faced by prior oracle bounds based on the Chebyshev-Cantelli inequality, the C-bounds [Germain e… ▽ More

    Submitted 17 January, 2023; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: aligned with the camera-ready version published at NeurIPS 2021

  22. arXiv:2106.12912  [pdf, other

    cs.LG

    Information Bottleneck: Exact Analysis of (Quantized) Neural Networks

    Authors: Stephan Sloth Lorenzen, Christian Igel, Mads Nielsen

    Abstract: The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably, separate fitting and compression phases during training have been reported. This led to some controversy including claims that the observations are not reproduc… ▽ More

    Submitted 14 February, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

  23. arXiv:2102.09928  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Do End-to-End Speech Recognition Models Care About Context?

    Authors: Lasse Borgholt, Jakob Drachmann Havtorn, Željko Agić, Anders Søgaard, Lars Maaløe, Christian Igel

    Abstract: The two most common paradigms for end-to-end speech recognition are connectionist temporal classification (CTC) and attention-based encoder-decoder (AED) models. It has been argued that the latter is better suited for learning an implicit language model. We test this hypothesis by measuring temporal context sensitivity and evaluate how the models perform when we constrain the amount of contextual… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Published in the proceedings of INTERSPEECH 2020, pp. 4352-4356

  24. arXiv:2102.00850  [pdf, other

    eess.AS cs.LG cs.SD

    On Scaling Contrastive Representations for Low-Resource Speech Recognition

    Authors: Lasse Borgholt, Tycho Max Sylvester Tax, Jakob Drachmann Havtorn, Lars Maaløe, Christian Igel

    Abstract: Recent advances in self-supervised learning through contrastive training have shown that it is possible to learn a competitive speech recognition system with as little as 10 minutes of labeled data. However, these systems are computationally expensive since they require pre-training followed by fine-tuning in a large parameter space. We explore the performance of such systems without fine-tuning b… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  25. arXiv:2101.07240  [pdf, other

    cs.LG cs.AI

    Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts

    Authors: Svetlana Kutuzova, Oswin Krause, Douglas McCloskey, Mads Nielsen, Christian Igel

    Abstract: Multimodal generative models should be able to learn a meaningful latent representation that enables a coherent joint generation of all modalities (e.g., images and text). Many applications also require the ability to accurately sample modalities conditioned on observations of a subset of the modalities. Often not all modalities may be observed for all training data points, so semi-supervised lear… ▽ More

    Submitted 30 July, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

  26. arXiv:2007.13532  [pdf, other

    cs.LG stat.ML

    Second Order PAC-Bayesian Bounds for the Weighted Majority Vote

    Authors: Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a novel analysis of the expected risk of weighted majority vote in multiclass classification. The analysis takes correlation of predictions by ensemble members into account and provides a bound that is amenable to efficient minimization, which yields improved weighting for the majority vote. We also provide a specialized version of our bound for binary classification, which allows to ex… ▽ More

    Submitted 17 December, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  27. arXiv:2006.15057  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    A Loss Function for Generative Neural Networks Based on Watson's Perceptual Model

    Authors: Steffen Czolbe, Oswin Krause, Ingemar Cox, Christian Igel

    Abstract: To train Variational Autoencoders (VAEs) to generate realistic imagery requires a loss function that reflects human perception of image similarity. We propose such a loss function based on Watson's perceptual model, which computes a weighted distance in frequency space and accounts for luminance and contrast masking. We extend the model to color images, increase its robustness to translation by us… ▽ More

    Submitted 6 January, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: Published at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  28. arXiv:2006.14999  [pdf, other

    stat.ML cs.LG

    On the convergence of the Metropolis algorithm with fixed-order updates for multivariate binary probability distributions

    Authors: Kai Brügge, Asja Fischer, Christian Igel

    Abstract: The Metropolis algorithm is arguably the most fundamental Markov chain Monte Carlo (MCMC) method. But the algorithm is not guaranteed to converge to the desired distribution in the case of multivariate binary distributions (e.g., Ising models or stochastic neural networks such as Boltzmann machines) if the variables (sites or neurons) are updated in a fixed order, a setting commonly used in practi… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

  29. arXiv:2004.14003  [pdf, other

    eess.IV cs.CV

    The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset

    Authors: Arjun D. Desai, Francesco Caliva, Claudia Iriondo, Naji Khosravan, Aliasghar Mortazi, Sachin Jambawalikar, Drew Torigian, Jutta Ellermann, Mehmet Akcakaya, Ulas Bagci, Radhika Tibrewala, Io Flament, Matthew O`Brien, Sharmila Majumdar, Mathias Perslev, Akshay Pai, Christian Igel, Erik B. Dam, Sibaji Gaj, Mingrui Yang, Kunio Nakamura, Xiaojuan Li, Cem M. Deniz, Vladimir Juras, Ravinder Regatte , et al. (4 additional authors not shown)

    Abstract: Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Ch… ▽ More

    Submitted 26 May, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Submitted to Radiology: Artificial Intelligence; Fixed typos

  30. arXiv:1911.06902  [pdf, other

    cs.CV cs.LG

    Label-similarity Curriculum Learning

    Authors: Urun Dogan, Aniket Anand Deshmukh, Marcin Machura, Christian Igel

    Abstract: Curriculum learning can improve neural network training by guiding the optimization to desirable optima. We propose a novel curriculum learning approach for image classification that adapts the loss function by changing the label representation. The idea is to use a probability distribution over classes as target label, where the class probabilities reflect the similarity to the true class. Gradua… ▽ More

    Submitted 22 July, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Accepted as a conference paper at ECCV 2020

  31. One Network to Segment Them All: A General, Lightweight System for Accurate 3D Medical Image Segmentation

    Authors: Mathias Perslev, Erik Bjørnager Dam, Akshay Pai, Christian Igel

    Abstract: Many recent medical segmentation systems rely on powerful deep learning models to solve highly specific tasks. To maximize performance, it is standard practice to evaluate numerous pipelines with varying model topologies, optimization parameters, pre- & postprocessing steps, and even model cascades. It is often not clear how the resulting pipeline transfers to different tasks. We propose a simple… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Journal ref: Medical Image Computing and Computer Assisted Intervention (MICCAI), LNCS 11765, pp. 30-38, Springer, 2019

  32. arXiv:1910.11162  [pdf, other

    cs.LG eess.SP stat.ML

    U-Time: A Fully Convolutional Network for Time Series Segmentation Applied to Sleep Staging

    Authors: Mathias Perslev, Michael Hejselbak Jensen, Sune Darkner, Poul Jørgen Jennum, Christian Igel

    Abstract: Neural networks are becoming more and more popular for the analysis of physiological time-series. The most successful deep learning systems in this domain combine convolutional and recurrent layers to extract useful features to model temporal relations. Unfortunately, these recurrent models are difficult to tune and optimize. In our experience, they often require task-specific modifications, which… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: To appear in Advances in Neural Information Processing Systems (NeurIPS), 2019

  33. arXiv:1908.07355  [pdf, other

    cs.LG eess.IV stat.ML

    Knowledge distillation for semi-supervised domain adaptation

    Authors: Mauricio Orbes-Arteaga, Jorge Cardoso, Lauge Sørensen, Christian Igel, Sebastien Ourselin, Marc Modat, Mads Nielsen, Akshay Pai

    Abstract: In the absence of sufficient data variation (e.g., scanner and protocol variability) in annotated data, deep neural networks (DNNs) tend to overfit during training. As a result, their performance is significantly lower on data from unseen sources compared to the performance on data from the same source as the training data. Semi-supervised domain adaptation methods can alleviate this problem by tu… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: MLCN MICCAI workshop

  34. The Liver Tumor Segmentation Benchmark (LiTS)

    Authors: Patrick Bilic, Patrick Christ, Hongwei Bran Li, Eugene Vorontsov, Avi Ben-Cohen, Georgios Kaissis, Adi Szeskin, Colin Jacobs, Gabriel Efrain Humpire Mamani, Gabriel Chartrand, Fabian Lohöfer, Julian Walter Holch, Wieland Sommer, Felix Hofmann, Alexandre Hostettler, Naama Lev-Cohain, Michal Drozdzal, Michal Marianne Amitai, Refael Vivantik, Jacob Sosna, Ivan Ezhov, Anjany Sekuboyina, Fernando Navarro, Florian Kofler, Johannes C. Paetzold , et al. (84 additional authors not shown)

    Abstract: In this work, we report the set-up and results of the Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI) 2017 and the International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2017 and 2018. The image dataset is diverse and contains primary and secondary tumors with… ▽ More

    Submitted 25 November, 2022; v1 submitted 13 January, 2019; originally announced January 2019.

    Comments: Patrick Bilic, Patrick Christ, Hongwei Bran Li, and Eugene Vorontsov made equal contributions to this work. Published in Medical Image Analysis

    Journal ref: Medical Image Analysis (2022) Pg. 102680

  35. arXiv:1810.09746  [pdf, ps, other

    cs.LG stat.ML

    On PAC-Bayesian Bounds for Random Forests

    Authors: Stephan Sloth Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: Existing guarantees in terms of rigorous upper bounds on the generalization error for the original random forest algorithm, one of the most frequently used machine learning methods, are unsatisfying. We discuss and evaluate various PAC-Bayesian approaches to derive such bounds. The bounds do not require additional hold-out data, because the out-of-bag samples from the bagging in the training proce… ▽ More

    Submitted 6 March, 2019; v1 submitted 23 October, 2018; originally announced October 2018.

  36. arXiv:1810.01928  [pdf, other

    cs.CV

    PADDIT: Probabilistic Augmentation of Data using Diffeomorphic Image Transformation

    Authors: Mauricio Orbes Arteaga, Lauge Sørensen, M. Jorge Cardoso, Marc Modat, Sebastien Ourselin, Stefan Sommer, Mads Nielsen, Christian Igel, Akshay Pai

    Abstract: For proper generalization performance of convolutional neural networks (CNNs) in medical image segmentation, the learnt features should be invariant under particular non-linear shape variations of the input. To induce invariance in CNNs to such transformations, we propose Probabilistic Augmentation of Data using Diffeomorphic Image Transformation (PADDIT) -- a systematic framework for generating r… ▽ More

    Submitted 9 March, 2020; v1 submitted 3 October, 2018; originally announced October 2018.

  37. arXiv:1802.06394  [pdf, other

    cs.LG stat.ML

    Training Big Random Forests with Little Resources

    Authors: Fabian Gieseke, Christian Igel

    Abstract: Without access to large compute clusters, building random forests on large datasets is still a challenging problem. This is, in particular, the case if fully-grown trees are desired. We propose a simple yet effective framework that allows to efficiently construct ensembles of huge trees for hundreds of millions or even billions of training instances using a cheap desktop computer with commodity ha… ▽ More

    Submitted 18 February, 2018; originally announced February 2018.

    Comments: 9 pages, 9 Figures

  38. arXiv:1704.04650  [pdf, other

    astro-ph.IM cs.CV stat.ML

    Big Universe, Big Data: Machine Learning and Image Analysis for Astronomy

    Authors: Jan Kremer, Kristoffer Stensbo-Smidt, Fabian Gieseke, Kim Steenstrup Pedersen, Christian Igel

    Abstract: Astrophysics and cosmology are rich with data. The advent of wide-area digital cameras on large aperture telescopes has led to ever more ambitious surveys of the sky. Data volumes of entire surveys a decade ago can now be acquired in a single night and real-time analysis is often desired. Thus, modern astronomy requires big data know-how, in particular it demands highly efficient machine learning… ▽ More

    Submitted 15 April, 2017; originally announced April 2017.

    Journal ref: IEEE Intelligent Systems, vol. 32, no. , pp. 16-22, Mar.-Apr. 2017

  39. arXiv:1608.05610  [pdf, other

    cs.LG stat.ML

    A Strongly Quasiconvex PAC-Bayesian Bound

    Authors: Niklas Thiemann, Christian Igel, Olivier Wintenberger, Yevgeny Seldin

    Abstract: We propose a new PAC-Bayesian bound and a way of constructing a hypothesis space, so that the bound is convex in the posterior distribution and also convex in a trade-off parameter between empirical performance of the posterior distribution and its complexity. The complexity is measured by the Kullback-Leibler divergence to a prior. We derive an alternating procedure for minimizing the bound. We s… ▽ More

    Submitted 24 August, 2017; v1 submitted 19 August, 2016; originally announced August 2016.

  40. arXiv:1512.02831  [pdf, other

    cs.DC cs.DS cs.LG

    Bigger Buffer k-d Trees on Multi-Many-Core Systems

    Authors: Fabian Gieseke, Cosmin Eugen Oancea, Ashish Mahabal, Christian Igel, Tom Heskes

    Abstract: A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are limited by the amount of points that can fit on a single device. In this work, we show how to modify the original data structure and the associated workflow to… ▽ More

    Submitted 9 December, 2015; originally announced December 2015.

  41. arXiv:1510.01624  [pdf, other

    cs.LG cs.NE stat.ML

    Population-Contrastive-Divergence: Does Consistency help with RBM training?

    Authors: Oswin Krause, Asja Fischer, Christian Igel

    Abstract: Estimating the log-likelihood gradient with respect to the parameters of a Restricted Boltzmann Machine (RBM) typically requires sampling using Markov Chain Monte Carlo (MCMC) techniques. To save computation time, the Markov chains are only run for a small number of steps, which leads to a biased estimate. This bias can cause RBM training algorithms such as Contrastive Divergence (CD) learning to… ▽ More

    Submitted 28 June, 2017; v1 submitted 6 October, 2015; originally announced October 2015.

    Comments: An updated version is under review

  42. arXiv:cs/0303032  [pdf, ps, other

    cs.NE math.OC nlin.AO

    Recent Results on No-Free-Lunch Theorems for Optimization

    Authors: Christian Igel, Marc Toussaint

    Abstract: The sharpened No-Free-Lunch-theorem (NFL-theorem) states that the performance of all optimization algorithms averaged over any finite set F of functions is equal if and only if F is closed under permutation (c.u.p.) and each target function in F is equally likely. In this paper, we first summarize some consequences of this theorem, which have been proven recently: The average number of evaluatio… ▽ More

    Submitted 31 March, 2003; originally announced March 2003.

    Comments: 10 pages, LaTeX, see http://www.neuroinformatik.rub.de/PROJECTS/SONN/

    ACM Class: G.1.6

  43. arXiv:nlin/0204038  [pdf, ps, other

    nlin.AO cs.NE q-bio

    Neutrality: A Necessity for Self-Adaptation

    Authors: Marc Toussaint, Christian Igel

    Abstract: Self-adaptation is used in all main paradigms of evolutionary computation to increase efficiency. We claim that the basis of self-adaptation is the use of neutrality. In the absence of external control neutrality allows a variation of the search distribution without the risk of fitness loss.

    Submitted 16 April, 2002; originally announced April 2002.

    Comments: 6 pages, 3 figures, LaTeX

    Journal ref: Proceedings of the Congress on Evolutionary Computation (CEC 2002), 1354-1359.

  44. arXiv:cs/0108011  [pdf, ps, other

    cs.NE math.OC nlin.AO

    On Classes of Functions for which No Free Lunch Results Hold

    Authors: Christian Igel, Marc Toussaint

    Abstract: In a recent paper it was shown that No Free Lunch results hold for any subset F of the set of all possible functions from a finite set X to a finite set Y iff F is closed under permutation of X. In this article, we prove that the number of those subsets can be neglected compared to the overall number of possible subsets. Further, we present some arguments why problem classes relevant in practice… ▽ More

    Submitted 21 August, 2001; originally announced August 2001.

    Comments: 8 pages, 1 figure, see http://www.neuroinformatik.ruhr-uni-bochum.de/

    ACM Class: G.1.6