-
SPLICE -- Streamlining Digital Pathology Image Processing
Authors:
Areej Alsaafin,
Peyman Nejat,
Abubakr Shafique,
Jibran Khan,
Saghir Alfasly,
Ghazal Alabtah,
H. R. Tizhoosh
Abstract:
Digital pathology and the integration of artificial intelligence (AI) models have revolutionized histopathology, opening new opportunities. With the increasing availability of Whole Slide Images (WSIs), there's a growing demand for efficient retrieval, processing, and analysis of relevant images from vast biomedical archives. However, processing WSIs presents challenges due to their large size and content complexity. Full computer digestion of WSIs is impractical, and processing all patches individually is prohibitively expensive. In this paper, we propose an unsupervised patching algorithm, Sequential Patching Lattice for Image Classification and Enquiry (SPLICE). This novel approach condenses a histopathology WSI into a compact set of representative patches, forming a "collage" of WSI while minimizing redundancy. SPLICE prioritizes patch quality and uniqueness by sequentially analyzing a WSI and selecting non-redundant representative features. We evaluated SPLICE for search and match applications, demonstrating improved accuracy, reduced computation time, and storage requirements compared to existing state-of-the-art methods. As an unsupervised method, SPLICE effectively reduces storage requirements for representing tissue images by 50%. This reduction enables numerous algorithms in computational pathology to operate much more efficiently, paving the way for accelerated adoption of digital pathology.
Submitted 26 April, 2024;
originally announced April 2024.
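The abstract above describes scanning a WSI sequentially and keeping only non-redundant patches. The following is a minimal sketch of that general idea, not the SPLICE algorithm itself: the feature vectors, the Euclidean distance, the threshold value, and the function names are all assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_non_redundant(features, threshold):
    """Scan patches in order and keep a patch only if it is farther than
    `threshold` from every patch kept so far (hypothetical redundancy rule)."""
    selected = []
    for idx, feat in enumerate(features):
        if all(euclidean(feat, features[j]) > threshold for j in selected):
            selected.append(idx)
    return selected


# Toy 2-D "features" standing in for per-patch descriptors (made-up values).
patches = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (0.0, 5.0)]
collage = select_non_redundant(patches, threshold=1.0)
print(collage)  # indices of the patches kept
```

In this toy run, near-duplicate patches (indices 1 and 3) are skipped, so the kept set is smaller than the input. How the published method measures redundancy and orders the scan is not stated in the abstract.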
-
Analysis and Validation of Image Search Engines in Histopathology
Authors:
Isaiah Lahr,
Saghir Alfasly,
Peyman Nejat,
Jibran Khan,
Luke Kottom,
Vaishnavi Kumbhar,
Areej Alsaafin,
Abubakr Shafique,
Sobhan Hemati,
Ghazal Alabtah,
Nneka Comfere,
Dennis Murphree,
Aaron Mangold,
Saba Yasir,
Chady Meroueh,
Lisa Boardman,
Vijay H. Shah,
Joaquin J. Garcia,
H. R. Tizhoosh
Abstract:
Searching for similar images in archives of histology and histopathology images is a crucial task that may aid in patient matching for various purposes, ranging from triaging and diagnosis to prognosis and prediction. Whole slide images (WSIs) are highly detailed digital representations of tissue specimens mounted on glass slides. Matching WSI to WSI can serve as the critical method for patient matching. In this paper, we report extensive analysis and validation of four search methods: bag of visual words (BoVW), Yottixel, SISH, and RetCCL, along with some of their potential variants. We analyze their algorithms and structures and assess their performance. For this evaluation, we utilized four internal datasets ($1269$ patients) and three public datasets ($1207$ patients), totaling more than $200,000$ patches from $38$ different classes/subtypes across five primary sites. Certain search engines, for example, BoVW, exhibit notable efficiency and speed but suffer from low accuracy. Conversely, search engines like Yottixel demonstrate efficiency and speed, providing moderately accurate results. Recent proposals, including SISH, display inefficiency and yield inconsistent outcomes, while alternatives like RetCCL prove inadequate in both accuracy and efficiency. Further research is imperative to address the dual aspects of accuracy and minimal storage requirements in histopathological image search.
Submitted 8 June, 2024; v1 submitted 6 January, 2024;
originally announced January 2024.
-
Selection of Distinct Morphologies to Divide & Conquer Gigapixel Pathology Images
Authors:
Abubakr Shafique,
Saghir Alfasly,
Areej Alsaafin,
Peyman Nejat,
Jibran A. Khan,
H. R. Tizhoosh
Abstract:
Whole slide images (WSIs) are massive digital pathology files illustrating intricate tissue structures. Selecting a small, representative subset of patches from each WSI is essential yet challenging. Therefore, following the "Divide & Conquer" approach becomes essential to facilitate WSI analysis including the classification and the WSI matching in computational pathology. To this end, we propose a novel method termed "Selection of Distinct Morphologies" (SDM) to choose a subset of WSI patches. The aim is to encompass all inherent morphological variations within a given WSI while simultaneously minimizing the number of selected patches to represent these variations, ensuring a compact yet comprehensive set of patches. This systematically curated patch set forms what we term a "montage". We assess the representativeness of the SDM montage across various public and private histopathology datasets. This is conducted by using the leave-one-out WSI search and matching evaluation method, comparing it with the state-of-the-art Yottixel's mosaic. SDM demonstrates remarkable efficacy across all datasets during its evaluation. Furthermore, SDM eliminates the necessity for empirical parameterization, a crucial aspect of Yottixel's mosaic, by inherently optimizing the selection process to capture the distinct morphological features within the WSI.
Submitted 16 November, 2023;
originally announced November 2023.
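The leave-one-out search and matching evaluation mentioned in the abstract above can be illustrated with a small sketch. This is a simplified reading, not the authors' protocol: each WSI is reduced here to a single hypothetical feature vector (a real montage is a set of patches), and the distance measure, the value of `k`, and the toy data are assumptions.

```python
from collections import Counter
import math


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def leave_one_out_accuracy(vectors, labels, k=3):
    """Use each item as the query, search all remaining items, and predict
    the query label by majority vote over the k nearest matches."""
    correct = 0
    for q, (qvec, qlabel) in enumerate(zip(vectors, labels)):
        others = [i for i in range(len(vectors)) if i != q]
        others.sort(key=lambda i: distance(qvec, vectors[i]))
        votes = Counter(labels[i] for i in others[:k])
        predicted = votes.most_common(1)[0][0]
        correct += predicted == qlabel
    return correct / len(vectors)


# Made-up descriptors: two well-separated groups of four "slides" each.
vectors = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (0.3, 0.2),
           (4.0, 4.0), (4.2, 4.1), (4.1, 3.9), (3.9, 4.2)]
labels = ["A", "A", "A", "A", "B", "B", "B", "B"]
print(leave_one_out_accuracy(vectors, labels, k=3))
```

Because the query is always excluded from the searched archive, the score reflects how well a slide is matched by other slides rather than by itself.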
-
Rotation-Agnostic Image Representation Learning for Digital Pathology
Authors:
Saghir Alfasly,
Abubakr Shafique,
Peyman Nejat,
Jibran Khan,
Areej Alsaafin,
Ghazal Alabtah,
H. R. Tizhoosh
Abstract:
This paper addresses complex challenges in histopathological image analysis through three key contributions. Firstly, it introduces a fast patch selection method, FPS, for whole-slide image (WSI) analysis, significantly reducing computational cost while maintaining accuracy. Secondly, it presents PathDino, a lightweight histopathology feature extractor with a minimal configuration of five Transformer blocks and only 9 million parameters, markedly fewer than alternatives. Thirdly, it introduces a rotation-agnostic representation learning paradigm using self-supervised learning, effectively mitigating overfitting. We also show that our compact model outperforms existing state-of-the-art histopathology-specific vision transformers on 12 diverse datasets, including both internal datasets spanning four sites (breast, liver, skin, and colorectal) and seven public datasets (PANDA, CAMELYON16, BRACS, DigestPath, Kather, PanNuke, and WSSS4LUAD). Notably, even with a training dataset of 6 million histopathology patches from The Cancer Genome Atlas (TCGA), our approach demonstrates an average 8.5% improvement in patch-level majority vote performance. These contributions provide a robust framework for enhancing image analysis in digital pathology, rigorously validated through extensive evaluation. Project Page: https://kimialabmayo.github.io/PathDino-Page/
Submitted 12 March, 2024; v1 submitted 14 November, 2023;
originally announced November 2023.
-
When is a Foundation Model a Foundation Model
Authors:
Saghir Alfasly,
Peyman Nejat,
Sobhan Hemati,
Jibran Khan,
Isaiah Lahr,
Areej Alsaafin,
Abubakr Shafique,
Nneka Comfere,
Dennis Murphree,
Chady Meroueh,
Saba Yasir,
Aaron Mangold,
Lisa Boardman,
Vijay Shah,
Joaquin J. Garcia,
H. R. Tizhoosh
Abstract:
Recently, several studies have reported on the fine-tuning of foundation models for image-text modeling in the field of medicine, utilizing images from online data sources such as Twitter and PubMed. Foundation models are large, deep artificial neural networks capable of learning the context of a specific domain through training on exceptionally extensive datasets. Through validation, we have observed that the representations generated by such models exhibit inferior performance in retrieval tasks within digital pathology when compared to those generated by significantly smaller, conventional deep networks.
Submitted 14 September, 2023;
originally announced September 2023.
-
A Preliminary Investigation into Search and Matching for Tumour Discrimination in WHO Breast Taxonomy Using Deep Networks
Authors:
Abubakr Shafique,
Ricardo Gonzalez,
Liron Pantanowitz,
Puay Hoon Tan,
Alberto Machado,
Ian A Cree,
Hamid R. Tizhoosh
Abstract:
Breast cancer is one of the most common cancers affecting women worldwide. It comprises a group of malignant neoplasms with a variety of biological, clinical, and histopathological characteristics. There are more than 35 different histological forms of breast lesions that can be classified and diagnosed histologically according to cell morphology, growth, and architecture patterns. Recently, deep learning, in the field of artificial intelligence, has drawn a lot of attention for the computerized representation of medical images. Searchable digital atlases can provide pathologists with patch matching tools allowing them to search among evidently diagnosed and treated archival cases, a technology that may be regarded as computational second opinion. In this study, we indexed and analyzed the WHO breast taxonomy (Classification of Tumours 5th Ed.) spanning 35 tumour types. We visualized all tumour types using deep features extracted from a state-of-the-art deep learning model, pre-trained on millions of diagnostic histopathology images from the TCGA repository. Furthermore, we test the concept of a digital "atlas" as a reference for search and matching with rare test cases. The patch similarity search within the WHO breast taxonomy data reached over 88% accuracy when validating through "majority vote" and more than 91% accuracy when validating using top-n tumour types. These results show for the first time that complex relationships among common and rare breast lesions can be investigated using an indexed digital archive.
Submitted 21 August, 2023;
originally announced August 2023.
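The two validation schemes named in the abstract above, "majority vote" and top-n tumour types, can be contrasted in a short sketch. The interpretation of top-n used here (the true type appears among the n most frequent types in the retrieved list) is an assumption, as are the function names and the example labels.

```python
from collections import Counter


def majority_vote(retrieved_labels):
    """Return the most frequent label among the retrieved matches."""
    return Counter(retrieved_labels).most_common(1)[0][0]


def top_n_hit(retrieved_labels, true_label, n):
    """Assumed reading of top-n validation: the query counts as correct if
    its true label is among the n most frequent retrieved labels."""
    counts = Counter(retrieved_labels)
    top_types = [label for label, _ in counts.most_common(n)]
    return true_label in top_types


# Hypothetical labels of the patches retrieved for one query patch.
retrieved = ["ductal", "lobular", "ductal", "mucinous", "lobular", "ductal"]
print(majority_vote(retrieved))
print(top_n_hit(retrieved, "lobular", n=2))
```

Under this reading, top-n is the more lenient criterion, which is consistent with the higher accuracy reported for it in the abstract.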
-
Immunohistochemistry Biomarkers-Guided Image Search for Histopathology
Authors:
Abubakr Shafique,
Morteza Babaie,
Ricardo Gonzalez,
H. R. Tizhoosh
Abstract:
Medical practitioners use a number of diagnostic tests to make a reliable diagnosis. Traditionally, Haematoxylin and Eosin (H&E) stained glass slides have been used for cancer diagnosis and tumor detection. However, recently a variety of immunohistochemistry (IHC) stained slides can be requested by pathologists to examine and confirm diagnoses for determining the subtype of a tumor when this is difficult using H&E slides only. Deep learning (DL) has received a lot of interest recently for image search engines to extract features from tissue regions, which may or may not be the target region for diagnosis. This approach generally fails to capture high-level patterns corresponding to the malignant or abnormal content of histopathology images. In this work, we propose a targeted image search approach, inspired by the pathologists' workflow, which may use information from multiple IHC biomarker images when available. These IHC images could be aligned, filtered, and merged together to generate a composite biomarker image (CBI) that could eventually be used to generate an attention map to guide the search engine for localized search. In our experiments, we observed that an IHC-guided image search engine can retrieve relevant data more accurately than a conventional (i.e., H&E-only) search engine without IHC guidance. Moreover, such engines are also able to accurately conclude the subtypes through majority votes.
Submitted 24 April, 2023;
originally announced April 2023.
-
Composite Biomarker Image for Advanced Visualization in Histopathology
Authors:
Abubakr Shafique,
Morteza Babaie,
Ricardo Gonzalez,
Adrian Batten,
Soma Sikdar,
H. R. Tizhoosh
Abstract:
Immunohistochemistry (IHC) biomarkers are essential tools for reliable cancer diagnosis and subtyping. Their use requires cross-staining comparison among Whole Slide Images (WSIs) of IHC and hematoxylin and eosin (H&E) slides. Currently, pathologists examine the visually co-localized areas across IHC and H&E glass slides for a final diagnosis, which is a tedious and challenging task. Moreover, visually inspecting different IHC slides back and forth to analyze local co-expressions is inherently subjective and prone to error, even when carried out by experienced pathologists. Relying on digital pathology, we propose the Composite Biomarker Image (CBI) in this work. A CBI is a single image that can be composed using different filtered IHC biomarker images for better visualization. We present a CBI produced in two steps by the proposed solution for better visualization and hence a more efficient clinical workflow. In the first step, IHC biomarker images are aligned with the H&E images using one coordinate system and orientation. In the second step, the positive or negative IHC regions from each biomarker image (based on the pathologists' recommendation) are filtered and combined into one image using a fuzzy inference system. The resulting CBIs were evaluated qualitatively by expert pathologists. The CBI concept helps pathologists identify the suspected target tissues more easily, which could be further assessed by examining the actual WSIs at the same suspected regions.
Submitted 24 April, 2023;
originally announced April 2023.
-
Comments on 'Fast and scalable search of whole-slide images via self-supervised deep learning'
Authors:
Milad Sikaroudi,
Mehdi Afshari,
Abubakr Shafique,
Shivam Kalra,
H. R. Tizhoosh
Abstract:
Chen et al. [Chen2022] recently published the article 'Fast and scalable search of whole-slide images via self-supervised deep learning' in Nature Biomedical Engineering. The authors call their method 'self-supervised image search for histology', or SISH for short. We express our concerns that SISH is an incremental modification of Yottixel, has used MinMax binarization but does not cite the original works, and is based on a misnomer, 'self-supervised image search'. We also point to several other concerns regarding experiments and comparisons performed by Chen et al.
Submitted 14 June, 2023; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Lattice Instability and Ultralow Lattice Thermal Conductivity of Layered PbIF
Authors:
N. Yedukondalu,
Aamir Shafique,
S. C. Rakesh Roshan,
Mohamed Barhoumi,
Rajmohan Muthaiah,
Lars Ehm,
John B. Parise,
Udo Schwingenschlögl
Abstract:
Understanding the interplay between various design strategies (for instance, bonding heterogeneity and lone pair induced anharmonicity) to achieve ultralow lattice thermal conductivity ($κ_l$) is indispensable for discovering novel functional materials for thermal energy applications. In the present study, we investigate layered PbXF (X = Cl, Br, I), which offers bonding heterogeneity through the layered crystal structure, anharmonicity through the Pb$^{2+}$ $6s^2$ lone pair, and phonon softening through the mass difference between F and Pb/X. The weak inter-layer van der Waals bonding and the strong intra-layer ionic bonding with partial covalent bonding result in a significant bonding heterogeneity and a poor phonon transport in the out-of-plane direction. Large average Grüneisen parameters ($\geq$ 2.5) demonstrate strong anharmonicity. The computed phonon dispersions show flat bands, which suggest short phonon lifetimes, especially for PbIF. Enhanced Born effective charges are due to cross-band-gap hybridization. PbIF shows lattice instability at a small volume expansion of 0.1$\%$. The $κ_l$ values obtained by the two channel transport model are 20-50$\%$ higher than those obtained by solving the Boltzmann transport equation. Overall, ultralow $κ_l$ values are found at 300 K, especially for PbIF. We propose that the interplay of bonding heterogeneity, lone pair induced anharmonicity, and constituent elements with high mass difference aids the design of low $κ_l$ materials for thermal energy applications.
Submitted 2 September, 2022;
originally announced September 2022.
-
Two-dimensional pentagonal material Penta-PdPSe: A first-principle study
Authors:
A. Bafekry,
M. M. Fadlallah,
M. Faraji,
A. Shafique,
H. R. Jappor,
I. Abdolhoseini Sarsari,
Yee Sin Ang,
M. Ghergherehchi
Abstract:
Low-symmetry penta-PdPSe with intrinsic in-plane anisotropy was synthesized successfully [P. Li et al., Adv. Mater., 2102541 (2021)]. Motivated by this experimental discovery, we investigate the structural, mechanical, electronic, optical and thermoelectric properties of the PdPSe monolayer via density functional theory calculations. The phonon dispersion, molecular dynamics simulation, cohesive energy, and mechanical properties of the penta-PdPSe monolayer are examined to confirm its stability. The phonon spectrum shows a striking gap between the high-frequency and the low-frequency optical branches and an out-of-plane flexure mode with a quadratic dispersion in the long-wavelength limit. The Poisson's ratio indicates that penta-PdPSe is a brittle monolayer. The penta-PdPSe monolayer is an indirect semiconductor with a bandgap of 1.40 (2.07) eV using the PBE (HSE06) functional. Optical properties simulation suggests that PdPSe is capable of absorbing a substantial range of visible to ultraviolet light. Band alignment analysis also reveals the compatibility of PdPSe for water splitting photocatalysis application. By combining the electrical and thermal transport properties of PdPSe, we show that a high power factor (PF) is achievable at room temperature, thus making PdPSe a candidate material for thermoelectric application. Our findings reveal the strong potential of the penta-PdPSe monolayer for a wide array of applications, including optoelectronic, water splitting and thermoelectric device applications.
Submitted 11 September, 2021; v1 submitted 8 September, 2021;
originally announced September 2021.
-
Automatic Multi-Stain Registration of Whole Slide Images in Histopathology
Authors:
Abubakr Shafique,
Morteza Babaie,
Mahjabin Sajadi,
Adrian Batten,
Soma Sikdar,
H. R. Tizhoosh
Abstract:
Joint analysis of multiple biomarker images and tissue morphology is important for disease diagnosis, treatment planning and drug development. It requires cross-staining comparison among Whole Slide Images (WSIs) of immuno-histochemical and hematoxylin and eosin (H&E) microscopic slides. However, automatic and fast cross-staining alignment of enormous gigapixel WSIs at single-cell precision is challenging. In addition to morphological deformations introduced during slide preparation, there are large variations in cell appearance and tissue morphology across different stainings. In this paper, we propose a two-step automatic feature-based cross-staining WSI alignment to assist localization of even tiny metastatic foci in the assessment of lymph nodes. Image pairs were aligned allowing for translation, rotation, and scaling. The registration was performed automatically by first detecting landmarks in both images using the scale-invariant feature transform (SIFT), then finding point correspondences with the fast sample consensus (FSC) protocol, and finally aligning the images. The registration results were evaluated using both visual and quantitative criteria, the latter based on the Jaccard index. The average Jaccard similarity index of the results produced by the proposed system is 0.942 when compared with the manual registration.
Submitted 29 July, 2021;
originally announced July 2021.
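The Jaccard index used above to compare automatic and manual registration is the size of the intersection of two regions divided by the size of their union. A minimal sketch follows, assuming the two registration results are available as binary tissue masks of equal size; the mask representation, the function name, and the toy values are illustrative assumptions, not details from the paper.

```python
def jaccard_index(mask_a, mask_b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two binary masks given as
    nested lists of 0/1 values with the same shape."""
    a = {(r, c) for r, row in enumerate(mask_a) for c, v in enumerate(row) if v}
    b = {(r, c) for r, row in enumerate(mask_b) for c, v in enumerate(row) if v}
    union = a | b
    if not union:
        # Two empty masks are treated as identical (a convention chosen here).
        return 1.0
    return len(a & b) / len(union)


# Toy masks: tissue pixels after automatic vs. manual alignment (made up).
automatic = [[0, 1, 1],
             [0, 1, 1],
             [0, 0, 0]]
manual = [[0, 1, 1],
          [0, 1, 0],
          [0, 0, 0]]
print(jaccard_index(automatic, manual))  # 3 shared pixels / 4 in the union
```

A value of 1.0 means the two masks coincide exactly, so the reported average of 0.942 indicates close agreement with manual registration.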