-
Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution
Authors:
Maximilian Fischer,
Peter Neher,
Tassilo Wald,
Silvia Dias Almeida,
Shuhan Xiao,
Peter Schüffler,
Rickmer Braren,
Michael Götz,
Alexander Muckenhuber,
Jens Kleesiek,
Marco Nolden,
Klaus Maier-Hein
Abstract:
Processing histopathological Whole Slide Images (WSI) leads to massive storage requirements for clinics worldwide. Even after lossy image compression during image acquisition, additional lossy compression is frequently possible without substantially affecting the performance of deep learning-based (DL) downstream tasks. In this paper, we show that the commonly used JPEG algorithm is not best suite…
▽ More
Processing histopathological Whole Slide Images (WSI) leads to massive storage requirements for clinics worldwide. Even after lossy image compression during image acquisition, additional lossy compression is frequently possible without substantially affecting the performance of deep learning-based (DL) downstream tasks. In this paper, we show that the commonly used JPEG algorithm is not best suited for further compression and we propose Stain Quantized Latent Compression (SQLC ), a novel DL based histopathology data compression approach. SQLC compresses staining and RGB channels before passing it through a compression autoencoder (CAE ) in order to obtain quantized latent representations for maximizing the compression. We show that our approach yields superior performance in a classification downstream task, compared to traditional approaches like JPEG, while image quality metrics like the Multi-Scale Structural Similarity Index (MS-SSIM) is largely preserved. Our method is online available.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
cOOpD: Reformulating COPD classification on chest CT scans as anomaly detection using contrastive representations
Authors:
Silvia D. Almeida,
Carsten T. Lüth,
Tobias Norajitra,
Tassilo Wald,
Marco Nolden,
Paul F. Jaeger,
Claus P. Heussel,
Jürgen Biederer,
Oliver Weinheimer,
Klaus Maier-Hein
Abstract:
Classification of heterogeneous diseases is challenging due to their complexity, variability of symptoms and imaging findings. Chronic Obstructive Pulmonary Disease (COPD) is a prime example, being underdiagnosed despite being the third leading cause of death. Its sparse, diffuse and heterogeneous appearance on computed tomography challenges supervised binary classification. We reformulate COPD bi…
▽ More
Classification of heterogeneous diseases is challenging due to their complexity, variability of symptoms and imaging findings. Chronic Obstructive Pulmonary Disease (COPD) is a prime example, being underdiagnosed despite being the third leading cause of death. Its sparse, diffuse and heterogeneous appearance on computed tomography challenges supervised binary classification. We reformulate COPD binary classification as an anomaly detection task, proposing cOOpD: heterogeneous pathological regions are detected as Out-of-Distribution (OOD) from normal homogeneous lung regions. To this end, we learn representations of unlabeled lung regions employing a self-supervised contrastive pretext model, potentially capturing specific characteristics of diseased and healthy unlabeled regions. A generative model then learns the distribution of healthy representations and identifies abnormalities (stemming from COPD) as deviations. Patient-level scores are obtained by aggregating region OOD scores. We show that cOOpD achieves the best performance on two public datasets, with an increase of 8.2% and 7.7% in terms of AUROC compared to the previous supervised state-of-the-art. Additionally, cOOpD yields well-interpretable spatial anomaly maps and patient-level scores which we show to be of additional value in identifying individuals in the early stage of progression. Experiments in artificially designed real-world prevalence settings further support that anomaly detection is a powerful way of tackling COPD classification.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Precise Energy Consumption Measurements of Heterogeneous Artificial Intelligence Workloads
Authors:
René Caspart,
Sebastian Ziegler,
Arvid Weyrauch,
Holger Obermaier,
Simon Raffeiner,
Leon Pascal Schuhmacher,
Jan Scholtyssek,
Darya Trofimova,
Marco Nolden,
Ines Reinartz,
Fabian Isensee,
Markus Götz,
Charlotte Debus
Abstract:
With the rise of AI in recent years and the increase in complexity of the models, the growing demand in computational resources is starting to pose a significant challenge. The need for higher compute power is being met with increasingly more potent accelerators and the use of large compute clusters. However, the gain in prediction accuracy from large models trained on distributed and accelerated…
▽ More
With the rise of AI in recent years and the increase in complexity of the models, the growing demand in computational resources is starting to pose a significant challenge. The need for higher compute power is being met with increasingly more potent accelerators and the use of large compute clusters. However, the gain in prediction accuracy from large models trained on distributed and accelerated systems comes at the price of a substantial increase in energy demand, and researchers have started questioning the environmental friendliness of such AI methods at scale. Consequently, energy efficiency plays an important role for AI model developers and infrastructure operators alike. The energy consumption of AI workloads depends on the model implementation and the utilized hardware. Therefore, accurate measurements of the power draw of AI workflows on different types of compute nodes is key to algorithmic improvements and the design of future compute clusters and hardware. To this end, we present measurements of the energy consumption of two typical applications of deep learning models on different types of compute nodes. Our results indicate that 1. deriving energy consumption directly from runtime is not accurate, but the consumption of the compute node needs to be considered regarding its composition; 2. neglecting accelerator hardware on mixed nodes results in overproportional inefficiency regarding energy consumption; 3. energy consumption of model training and inference should be considered separately - while training on GPUs outperforms all other node types regarding both runtime and energy consumption, inference on CPU nodes can be comparably efficient. One advantage of our approach is that the information on energy consumption is available to all users of the supercomputer, enabling an easy transfer to other workloads alongside a raise in user-awareness of energy consumption.
△ Less
Submitted 3 December, 2022;
originally announced December 2022.
-
An open-source software platform for translational photoacoustic research and its application to motion-corrected blood oxygenation estimation
Authors:
Thomas Kirchner,
Janek Gröhl,
Franz Sattler,
Moritz S. Bischoff,
Angelika Laha,
Marco Nolden,
Lena Maier-Hein
Abstract:
Photoacoustic (PA) imaging systems based on clinical linear ultrasound arrays have become increasingly popular in translational PA research. Such systems can be more easily integrated in a clinical workflow due to the simultaneous access to ultrasonic imaging and their familiarity of use to clinicians. In contrast to more complex setups, hand held linear probes can be applied to a large variety of…
▽ More
Photoacoustic (PA) imaging systems based on clinical linear ultrasound arrays have become increasingly popular in translational PA research. Such systems can be more easily integrated in a clinical workflow due to the simultaneous access to ultrasonic imaging and their familiarity of use to clinicians. In contrast to more complex setups, hand held linear probes can be applied to a large variety of clinical use cases. However, most translational work with such scanners is based on proprietary development and as such not accessible to the community. In this contribution, we present a custom-built, hybrid, multispectral, real-time photoacoustic and ultrasonic imaging system with a linear array probe that is controlled by software developed within the Medical Imaging Interaction Toolkit (MITK) a highly customizable and extendable open-source software platform. Our software offers direct control of both the laser and the ultrasonic system and may serve as a starting point for various translational research projects and developments. To demonstrate the applicability of the platform, we used it to implement a new method for blood oxygenation estimation in the presence of non-rigid inter-frame motion caused by pulsing arteries. Initial results from experiments with healthy human volunteers demonstrate the suitability of the method with the sample clinical application of imaging the common carotid artery as well as peripheral extremity vessels.
△ Less
Submitted 28 January, 2019;
originally announced January 2019.
-
MITK-ModelFit: A generic open-source framework for model fits and their exploration in medical imaging -- design, implementation and application on the example of DCE-MRI
Authors:
Charlotte Debus,
Ralf Floca,
Michael Ingrisch,
Ina Kompan,
Klaus Maier-Hein,
Amir Abdollahi,
Marco Nolden
Abstract:
Many medical imaging techniques utilize fitting approaches for quantitative parameter estimation and analysis. Common examples are pharmacokinetic modeling in DCE MRI/CT, ADC calculations and IVIM modeling in diffusion-weighted MRI and Z-spectra analysis in chemical exchange saturation transfer MRI. Most available software tools are limited to a special purpose and do not allow for own development…
▽ More
Many medical imaging techniques utilize fitting approaches for quantitative parameter estimation and analysis. Common examples are pharmacokinetic modeling in DCE MRI/CT, ADC calculations and IVIM modeling in diffusion-weighted MRI and Z-spectra analysis in chemical exchange saturation transfer MRI. Most available software tools are limited to a special purpose and do not allow for own developments and extensions. Furthermore, they are mostly designed as stand-alone solutions using external frameworks and thus cannot be easily incorporated natively in the analysis workflow. We present a framework for medical image fitting tasks that is included in MITK, following a rigorous open-source, well-integrated and operating system independent policy. Software engineering-wise, the local models, the fitting infrastructure and the results representation are abstracted and thus can be easily adapted to any model fitting task on image data, independent of image modality or model. Several ready-to-use libraries for model fitting and use-cases, including fit evaluation and visualization, were implemented. Their embedding into MITK allows for easy data loading, pre- and post-processing and thus a natural inclusion of model fitting into an overarching workflow. As an example, we present a comprehensive set of plug-ins for the analysis of DCE MRI data, which we validated on existing and novel digital phantoms, yielding competitive deviations between fit and ground truth. Providing a very flexible environment, our software mainly addresses developers of medical imaging software that includes model fitting algorithms and tools. Additionally, the framework is of high interest to users in the domain of perfusion MRI, as it offers feature-rich, freely available, validated tools to perform pharmacokinetic analysis on DCE MRI data, with both interactive and automatized batch processing workflows.
△ Less
Submitted 12 February, 2019; v1 submitted 19 July, 2018;
originally announced July 2018.
-
What does it all mean? Capturing Semantics of Surgical Data and Algorithms with Ontologies
Authors:
Darko Katić,
Maria Maleshkova,
Sandy Engelhardt,
Ivo Wolf,
Keno März,
Lena Maier-Hein,
Marco Nolden,
Martin Wagner,
Hannes Kenngott,
Beat Peter Müller-Stich,
Rüdiger Dillmann,
Stefanie Speidel
Abstract:
Every year approximately 234 million major surgeries are performed, leading to plentiful, highly diverse data. This is accompanied by a matching number of novel algorithms for the surgical domain. To garner all benefits of surgical data science it is necessary to have an unambiguous, shared understanding of algorithms and data. This includes inputs and outputs of algorithms and thus their function…
▽ More
Every year approximately 234 million major surgeries are performed, leading to plentiful, highly diverse data. This is accompanied by a matching number of novel algorithms for the surgical domain. To garner all benefits of surgical data science it is necessary to have an unambiguous, shared understanding of algorithms and data. This includes inputs and outputs of algorithms and thus their function, but also the semantic content, i.e. meaning of data such as patient parameters. We therefore propose the establishment of a new ontology for data and algorithms in surgical data science. Such an ontology can be used to provide common data sets for the community, encouraging sharing of knowledge and comparison of algorithms on common data. We hold that this is a necessary foundation towards new methods for applications such as semantic-based content retrieval and similarity measures and that it is overall vital for the future of surgical data science.
△ Less
Submitted 22 May, 2017;
originally announced May 2017.