
Showing 1–49 of 49 results for author: Saltz, J

Searching in archive cs.
  1. arXiv:2407.14709  [pdf, other]

    cs.CV

    $\infty$-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

    Authors: Minh-Quan Le, Alexandros Graikos, Srikar Yellapragada, Rajarsi Gupta, Joel Saltz, Dimitris Samaras

    Abstract: Synthesizing high-resolution images from intricate, domain-specific information remains a significant challenge in generative modeling, particularly for applications in large-image domains such as digital histopathology and remote sensing. Existing methods face critical limitations: conditional diffusion models in pixel or latent space cannot exceed the resolution on which they were trained withou…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024. Project page: https://histodiffusion.github.io

  2. arXiv:2403.17255  [pdf, other]

    eess.IV cs.CV

    Decoding the visual attention of pathologists to reveal their level of expertise

    Authors: Souradeep Chakraborty, Dana Perez, Paul Friedman, Natallia Sheuka, Constantin Friedman, Oksana Yaskiv, Rajarsi Gupta, Gregory J. Zelinsky, Joel H. Saltz, Dimitris Samaras

    Abstract: We present a method for classifying the expertise of a pathologist based on how they allocated their attention during a cancer reading. We engage this decoding task by developing a novel method for predicting the attention of pathologists as they read whole-slide images (WSIs) of prostate and make cancer grade classifications. Our ground truth measure of a pathologist's attention is the x, y and z…

    Submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2312.15010  [pdf, other]

    cs.CV

    SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology

    Authors: Saarthak Kapse, Pushpak Pati, Srijan Das, Jingwei Zhang, Chao Chen, Maria Vakalopoulou, Joel Saltz, Dimitris Samaras, Rajarsi R. Gupta, Prateek Prasanna

    Abstract: Introducing interpretability and reasoning into Multiple Instance Learning (MIL) methods for Whole Slide Image (WSI) analysis is challenging, given the complexity of gigapixel slides. Traditionally, MIL interpretability is limited to identifying salient regions deemed pertinent for downstream tasks, offering little insight to the end-user (pathologist) regarding the rationale behind these selectio…

    Submitted 18 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  4. arXiv:2312.07330  [pdf, other]

    cs.CV

    Learned representation-guided diffusion models for large-image generation

    Authors: Alexandros Graikos, Srikar Yellapragada, Minh-Quan Le, Saarthak Kapse, Prateek Prasanna, Joel Saltz, Dimitris Samaras

    Abstract: To synthesize high-fidelity samples, diffusion models typically require auxiliary data to guide the generation process. However, it is impractical to procure the painstaking patch-level annotation effort required in specialized domains like histopathology and satellite imagery; it is often performed by domain experts and involves hundreds of millions of patches. Modern-day self-supervised learning…

    Submitted 28 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  5. arXiv:2309.06439  [pdf, other]

    cs.CV

    Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning

    Authors: Saarthak Kapse, Srijan Das, Jingwei Zhang, Rajarsi R. Gupta, Joel Saltz, Dimitris Samaras, Prateek Prasanna

    Abstract: We propose DiRL, a Diversity-inducing Representation Learning technique for histopathology imaging. Self-supervised learning techniques, such as contrastive and non-contrastive approaches, have been shown to learn rich and effective representations of digitized tissue samples with limited pathologist supervision. Our analysis of vanilla SSL-pretrained models' attention distribution reveals an insi…

    Submitted 12 September, 2023; originally announced September 2023.

  6. Open and reusable deep learning for pathology with WSInfer and QuPath

    Authors: Jakub R. Kaczmarzyk, Alan O'Callaghan, Fiona Inglis, Tahsin Kurc, Rajarsi Gupta, Erich Bremer, Peter Bankhead, Joel H. Saltz

    Abstract: The field of digital pathology has seen a proliferation of deep learning models in recent years. Despite substantial progress, it remains rare for other researchers and pathologists to be able to access models published in the literature and apply them to their own images. This is due to difficulties in both sharing and running models. To address these concerns, we introduce WSInfer: a new, open-s…

    Submitted 8 September, 2023; originally announced September 2023.

  7. arXiv:2309.00748  [pdf, other]

    cs.CV cs.LG

    PathLDM: Text conditioned Latent Diffusion Model for Histopathology

    Authors: Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin Kurc, Joel Saltz, Dimitris Samaras

    Abstract: To achieve high-quality results, diffusion models must be trained on large datasets. This can be notably prohibitive for models in specialized domains, such as computational pathology. Conditioning on labeled data is known to help in data-efficient model training. Therefore, histopathology reports, which are rich in valuable clinical information, are an ideal choice as guidance for a histopatholog…

    Submitted 30 November, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: WACV 2024 publication

  8. arXiv:2307.09570  [pdf, other]

    eess.IV cs.CV

    SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology

    Authors: Jingwei Zhang, Ke Ma, Saarthak Kapse, Joel Saltz, Maria Vakalopoulou, Prateek Prasanna, Dimitris Samaras

    Abstract: Semantic segmentations of pathological entities have crucial clinical value in computational pathology workflows. Foundation models, such as the Segment Anything Model (SAM), have been recently proposed for universal use in segmentation tasks. SAM shows remarkable promise in instance segmentation on natural images. However, the applicability of SAM to computational pathology tasks is limited due t…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Submitted to MedAGI 2023

  9. arXiv:2304.10612  [pdf]

    cs.HC cs.CV q-bio.QM

    Halcyon -- A Pathology Imaging and Feature analysis and Management System

    Authors: Erich Bremer, Tammy DiPrima, Joseph Balsamo, Jonas Almeida, Rajarsi Gupta, Joel Saltz

    Abstract: Halcyon is a new pathology imaging analysis and feature management system based on W3C linked-data open standards and is designed to scale to support the needs for the voluminous production of features from deep-learning feature pipelines. Halcyon can support multiple users with a web-based UX with access to all user data over a standards-based web API allowing for integration with other processes…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 15 pages, 11 figures. arXiv admin note: text overlap with arXiv:2005.06469

  10. arXiv:2304.02255  [pdf, other]

    eess.IV cs.CV

    Topology-Guided Multi-Class Cell Context Generation for Digital Pathology

    Authors: Shahira Abousamra, Rajarsi Gupta, Tahsin Kurc, Dimitris Samaras, Joel Saltz, Chao Chen

    Abstract: In digital pathology, the spatial context of cells is important for cell classification, cancer diagnosis and prognosis. To model such complex cell context, however, is challenging. Cells form different mixtures, lineages, clusters and holes. To model such structural patterns in a learnable fashion, we introduce several mathematical tools from spatial statistics and topological data analysis. We i…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: To be published in proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  11. arXiv:2303.12214  [pdf, other]

    cs.CV

    Prompt-MIL: Boosting Multi-Instance Learning Schemes via Task-specific Prompt Tuning

    Authors: Jingwei Zhang, Saarthak Kapse, Ke Ma, Prateek Prasanna, Joel Saltz, Maria Vakalopoulou, Dimitris Samaras

    Abstract: Whole slide image (WSI) classification is a critical task in computational pathology, requiring the processing of gigapixel-sized images, which is challenging for current deep-learning methods. Current state-of-the-art methods are based on multi-instance learning schemes (MIL), which usually rely on pretrained features to represent the instances. Due to the lack of task-specific annotated data, th…

    Submitted 4 October, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted to MICCAI 2023 (Oral)

  12. arXiv:2212.12105  [pdf, other]

    cs.CV

    Precise Location Matching Improves Dense Contrastive Learning in Digital Pathology

    Authors: Jingwei Zhang, Saarthak Kapse, Ke Ma, Prateek Prasanna, Maria Vakalopoulou, Joel Saltz, Dimitris Samaras

    Abstract: Dense prediction tasks such as segmentation and detection of pathological entities hold crucial clinical value in computational pathology workflows. However, obtaining dense annotations on large cohorts is usually tedious and expensive. Contrastive learning (CL) is thus often employed to leverage large volumes of unlabeled data to pre-train the backbone network. To boost CL for dense prediction, s…

    Submitted 22 March, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted to IPMI 2023

  13. arXiv:2207.09654  [pdf, other]

    cs.CV

    Learning Topological Interactions for Multi-Class Medical Image Segmentation

    Authors: Saumya Gupta, Xiaoling Hu, James Kaan, Michael Jin, Mutshipay Mpoy, Katherine Chung, Gagandeep Singh, Mary Saltz, Tahsin Kurc, Joel Saltz, Apostolos Tassiopoulos, Prateek Prasanna, Chao Chen

    Abstract: Deep learning methods have achieved impressive performance for multi-class medical image segmentation. However, they are limited in their ability to encode topological interactions among different classes (e.g., containment and exclusion). These constraints naturally arise in biomedical images and can be crucial in improving segmentation quality. In this paper, we introduce a novel topological int…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022 (Oral); 32 pages, 19 figures

  14. Gigapixel Whole-Slide Images Classification using Locally Supervised Learning

    Authors: Jingwei Zhang, Xin Zhang, Ke Ma, Rajarsi Gupta, Joel Saltz, Maria Vakalopoulou, Dimitris Samaras

    Abstract: Histopathology whole slide images (WSIs) play a very important role in clinical studies and serve as the gold standard for many cancer diagnoses. However, generating automatic tools for processing WSIs is challenging due to their enormous sizes. Currently, to deal with this issue, conventional methods rely on a multiple instance learning (MIL) strategy to process a WSI at patch level. Although eff…

    Submitted 26 September, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: Accepted to MICCAI 2022 Oral

    Journal ref: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2022

  15. arXiv:2207.01734  [pdf]

    q-bio.QM cs.SE eess.IV

    ImageBox3: No-Server Tile Serving to Traverse Whole Slide Images on the Web

    Authors: Praphulla MS Bhawsar, Erich Bremer, Máire A Duggan, Stephen Chanock, Montserrat Garcia-Closas, Joel Saltz, Jonas S Almeida

    Abstract: Whole slide imaging (WSI) has become the primary modality for digital pathology data. However, due to the size and high-resolution nature of these images, they are generally only accessed in smaller sections or tiles via specialized platforms, most of which require extensive setup and/or costly infrastructure. These platforms typically also need a copy of the images to be locally available to them…

    Submitted 5 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 9 pages, 3 figures

  16. arXiv:2206.07573  [pdf]

    cs.AI q-bio.QM q-bio.TO

    AI and Pathology: Steering Treatment and Predicting Outcomes

    Authors: Rajarsi Gupta, Jakub Kaczmarzyk, Soma Kobayashi, Tahsin Kurc, Joel Saltz

    Abstract: The combination of data analysis methods, increasing computing capacity, and improved sensors enables quantitative, granular, multi-scale, cell-based analyses. We describe the rich set of application challenges related to tissue interpretation and survey AI methods currently used to address these challenges. We focus on a particular class of targeted human tissue analysis - histopathology - aimed at…

    Submitted 15 June, 2022; originally announced June 2022.

  17. arXiv:2206.06862  [pdf, other]

    q-bio.QM cs.CV cs.LG eess.IV

    Evaluating histopathology transfer learning with ChampKit

    Authors: Jakub R. Kaczmarzyk, Tahsin M. Kurc, Shahira Abousamra, Rajarsi Gupta, Joel H. Saltz, Peter K. Koo

    Abstract: Histopathology remains the gold standard for diagnosis of various cancers. Recent advances in computer vision, specifically deep learning, have facilitated the analysis of histopathology images for various tasks, including immune cell detection and microsatellite instability classification. The state-of-the-art for each task often employs base architectures that have been pretrained for image clas…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Track on Datasets and Benchmarks. Source code available at https://github.com/kaczmarj/champkit

    ACM Class: J.3; I.4.9; D.2.13

  18. arXiv:2204.12283  [pdf]

    q-bio.QM cs.CV eess.IV

    A Novel Framework for Characterization of Tumor-Immune Spatial Relationships in Tumor Microenvironment

    Authors: Mahmudul Hasan, Jakub R. Kaczmarzyk, David Paredes, Lyanne Oblein, Jaymie Oentoro, Shahira Abousamra, Michael Horowitz, Dimitris Samaras, Chao Chen, Tahsin Kurc, Kenneth R. Shroyer, Joel Saltz

    Abstract: Understanding the impact of tumor biology on the composition of nearby cells often requires characterizing the impact of biologically distinct tumor regions. Biomarkers have been developed to label biologically distinct tumor regions, but challenges arise because of differences in the spatial extent and distribution of differentially labeled regions. In this work, we present a framework for system…

    Submitted 1 May, 2022; v1 submitted 23 April, 2022; originally announced April 2022.

  19. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc…

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  20. arXiv:2203.16622  [pdf, other]

    eess.IV cs.CV cs.LG

    Federated Learning for the Classification of Tumor Infiltrating Lymphocytes

    Authors: Ujjwal Baid, Sarthak Pati, Tahsin M. Kurc, Rajarsi Gupta, Erich Bremer, Shahira Abousamra, Siddhesh P. Thakur, Joel H. Saltz, Spyridon Bakas

    Abstract: We evaluate the performance of federated learning (FL) in developing deep learning models for analysis of digitized tissue sections. A classification application was considered as the example use case, on quantifying the distribution of tumor infiltrating lymphocytes within whole slide images (WSIs). A deep learning classification model was trained using 50*50 square micron patches extracted from…

    Submitted 31 March, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  21. Visual attention analysis of pathologists examining whole slide images of Prostate cancer

    Authors: Souradeep Chakraborty, Ke Ma, Rajarsi Gupta, Beatrice Knudsen, Gregory J. Zelinsky, Joel H. Saltz, Dimitris Samaras

    Abstract: We study the attention of pathologists as they examine whole-slide images (WSIs) of prostate cancer tissue using a digital microscope. To the best of our knowledge, our study is the first to report in detail how pathologists navigate WSIs of prostate cancer as they accumulate information for their diagnoses. We collected slide navigation data (i.e., viewport location, magnification level, and time…

    Submitted 2 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: ISBI 2022 (Oral presentation)

  22. arXiv:2110.10780  [pdf]

    cs.CL cs.IR

    An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)

    Authors: Sijia Liu, Andrew Wen, Liwei Wang, Huan He, Sunyang Fu, Robert Miller, Andrew Williams, Daniel Harris, Ramakanth Kavuluru, Mei Liu, Noor Abu-el-rub, Dalton Schutte, Rui Zhang, Masoud Rouhizadeh, John D. Osborne, Yongqun He, Umit Topaloglu, Stephanie S Hong, Joel H Saltz, Thomas Schaffter, Emily Pfaff, Christopher G. Chute, Tim Duong, Melissa A. Haendel, Rafael Fuentes, et al. (7 additional authors not shown)

    Abstract: While we pay attention to the latest advances in clinical natural language processing (NLP), we can notice some resistance in the clinical and translational research community to adopt NLP models due to limited transparency, interpretability, and usability. In this study, we proposed an open natural language processing development framework. We evaluated it through the implementation of NLP algori…

    Submitted 21 March, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: update on contents

  23. arXiv:2110.04886  [pdf, other]

    cs.CV

    Multi-Class Cell Detection Using Spatial Context Representation

    Authors: Shahira Abousamra, David Belinsky, John Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin Kurc, Dimitris Samaras, Joel Saltz, Chao Chen

    Abstract: In digital pathology, both detection and classification of cells are important for automatic diagnostic and prognostic tasks. Classifying cells into subtypes, such as tumor cells, lymphocytes or stromal cells is particularly challenging. Existing methods focus on morphological appearance of individual cells, whereas in practice pathologists often infer cell classes through their spatial context. I…

    Submitted 5 June, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

  24. GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical Imaging

    Authors: Sarthak Pati, Siddhesh P. Thakur, İbrahim Ethem Hamamcı, Ujjwal Baid, Bhakti Baheti, Megh Bhalerao, Orhun Güley, Sofia Mouchtaris, David Lang, Spyridon Thermos, Karol Gotkowski, Camila González, Caleb Grenko, Alexander Getka, Brandon Edwards, Micah Sheller, Junwen Wu, Deepthi Karkada, Ravi Panchumarthy, Vinayak Ahluwalia, Chunrui Zou, Vishnu Bashyam, Yuemeng Li, Babak Haghighi, Rhea Chitalia, et al. (17 additional authors not shown)

    Abstract: Deep Learning (DL) has the potential to optimize machine learning in both the scientific and clinical communities. However, greater expertise is required to develop DL algorithms, and the variability of implementations hinders their reproducibility, translation, and deployment. Here we present the community-driven Generally Nuanced Deep Learning Framework (GaNDLF), with the goal of lowering these…

    Submitted 16 May, 2023; v1 submitted 25 February, 2021; originally announced March 2021.

    Comments: Deep Learning, Framework, Segmentation, Regression, Classification, Cross-validation, Data augmentation, Deployment, Clinical, Workflows

    Journal ref: Commun Eng 2, 23 (2023)

  25. arXiv:2010.04589  [pdf]

    cs.LG cs.CY stat.ML

    Identifying Risk of Opioid Use Disorder for Patients Taking Opioid Medications with Deep Learning

    Authors: Xinyu Dong, Jianyuan Deng, Sina Rashidian, Kayley Abell-Hart, Wei Hou, Richard N Rosenthal, Mary Saltz, Joel Saltz, Fusheng Wang

    Abstract: The United States is experiencing an opioid epidemic, with more than 10 million opioid misusers aged 12 or older each year. Identifying patients at high risk of Opioid Use Disorder (OUD) can enable early clinical interventions to reduce the risk of OUD. Our goal is to predict OUD patients among opioid prescription users through analyzing electronic health records with machine learn…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 20 pages, 6 figures

  26. arXiv:2005.06469  [pdf]

    cs.GR cs.DB cs.IR q-bio.QM

    Representing Whole Slide Cancer Image Features with Hilbert Curves

    Authors: Erich Bremer, Jonas Almeida, Joel Saltz

    Abstract: Regions of Interest (ROI) that contain morphological features in pathology whole slide images (WSI) are delimited with polygons[1]. These polygons are often represented in either a textual notation (with the array of edges) or in a binary mask form. Textual notations have an advantage of human readability and portability, whereas binary mask representations are more useful as the input and output of f…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 9 pages, 5 figures

  27. Dataset of Segmented Nuclei in Hematoxylin and Eosin Stained Histopathology Images of 10 Cancer Types

    Authors: Le Hou, Rajarsi Gupta, John S. Van Arnam, Yuwei Zhang, Kaustubh Sivalenka, Dimitris Samaras, Tahsin M. Kurc, Joel H. Saltz

    Abstract: The distribution and appearance of nuclei are essential markers for the diagnosis and study of cancer. Despite the importance of nuclear morphology, there is a lack of large scale, accurate, publicly accessible nucleus segmentation data. To address this, we developed an analysis pipeline that segments nuclei in whole slide tissue images from multiple cancer types with a quality control process. We…

    Submitted 30 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    ACM Class: I.4.6; J.3

    Journal ref: Sci Data 7, 185 (2020)

  28. arXiv:1910.14548  [pdf, other]

    cs.DC

    Run-time Parameter Sensitivity Analysis Optimizations

    Authors: Eduardo Scartezini, Willian Barreiros Jr., Tahsin Kurc, Jun Kong, Alba C. M. A. Melo, Joel Saltz, George Teodoro

    Abstract: Efficient execution of parameter sensitivity analysis (SA) is critical to allow for its routine use. The pathology image processing application investigated in this work processes high-resolution whole-slide cancer tissue images from large datasets to characterize and classify the disease. However, the application is parameterized and changes in parameter values may significantly affect its resu…

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 8 pages, 8 figures

  29. arXiv:1909.12291  [pdf, other]

    cs.LG cs.DC stat.ML

    Exascale Deep Learning to Accelerate Cancer Research

    Authors: Robert M. Patton, J. Travis Johnston, Steven R. Young, Catherine D. Schuman, Thomas E. Potok, Derek C. Rose, Seung-Hwan Lim, Junghoon Chae, Le Hou, Shahira Abousamra, Dimitris Samaras, Joel Saltz

    Abstract: Deep learning, through the use of neural networks, has demonstrated remarkable ability to automate many routine tasks when presented with sufficient data for training. The neural network architecture (e.g. number of layers, types of layers, connections between layers, etc.) plays a critical role in determining what, if anything, the neural network is able to learn from the training data. The trend…

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: Submitted to IEEE Big Data

  30. arXiv:1907.03960  [pdf, other]

    eess.IV cs.CV

    Learning from Thresholds: Fully Automated Classification of Tumor Infiltrating Lymphocytes for Multiple Cancer Types

    Authors: Shahira Abousamra, Le Hou, Rajarsi Gupta, Chao Chen, Dimitris Samaras, Tahsin Kurc, Rebecca Batiste, Tianhao Zhao, Shroyer Kenneth, Joel Saltz

    Abstract: Deep learning classifiers for characterization of whole slide tissue morphology require large volumes of annotated data to learn variations across different tissue and cancer types. As is well known, manual generation of digital pathology training data is time consuming and expensive. In this paper, we propose a semi-automated method for annotating a group of similar instances at once, instead of…

    Submitted 8 July, 2019; originally announced July 2019.

  31. arXiv:1905.10841  [pdf]

    eess.IV cs.CV

    Utilizing Automated Breast Cancer Detection to Identify Spatial Distributions of Tumor Infiltrating Lymphocytes in Invasive Breast Cancer

    Authors: Han Le, Rajarsi Gupta, Le Hou, Shahira Abousamra, Danielle Fassler, Tahsin Kurc, Dimitris Samaras, Rebecca Batiste, Tianhao Zhao, Arvind Rao, Alison L. Van Dyke, Ashish Sharma, Erich Bremer, Jonas S. Almeida, Joel Saltz

    Abstract: Quantitative assessment of Tumor-TIL spatial relationships is increasingly important in both basic science and clinical aspects of breast cancer research. We have developed and evaluated convolutional neural network (CNN) analysis pipelines to generate combined maps of cancer regions and tumor infiltrating lymphocytes (TILs) in routine diagnostic breast cancer whole slide tissue images (WSIs). We…

    Submitted 13 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: The American Journal of Pathology

  32. arXiv:1904.04429  [pdf, other]

    cs.CV

    Label Super Resolution with Inter-Instance Loss

    Authors: Maozheng Zhao, Le Hou, Han Le, Dimitris Samaras, Nebojsa Jojic, Danielle Fassler, Tahsin Kurc, Rajarsi Gupta, Kolya Malkin, Shroyer Kenneth, Joel Saltz

    Abstract: For the task of semantic segmentation, high-resolution (pixel-level) ground truth is very expensive to collect, especially for high resolution images such as gigapixel pathology images. On the other hand, collecting low resolution labels (labels for a block of pixels) for these high resolution images is much more cost efficient. Conventional methods trained on these low-resolution labels are only…

    Submitted 7 January, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

  33. arXiv:1811.11818  [pdf, other]

    cs.LG stat.ML

    Disease phenotyping using deep learning: A diabetes case study

    Authors: Sina Rashidian, Janos Hajagos, Richard Moffitt, Fusheng Wang, Xinyu Dong, Kayley Abell-Hart, Kimberly Noel, Rajarsi Gupta, Mathew Tharakan, Veena Lingam, Joel Saltz, Mary Saltz

    Abstract: Characterization of a patient clinical phenotype is central to biomedical informatics. ICD codes, assigned to inpatient encounters by coders, are important for population health and cohort discovery when clinical information is limited. While ICD codes are assigned to patients by professionals trained and certified in coding, there is substantial variability in coding. We present a methodology that…

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/38

  34. arXiv:1810.13230  [pdf]

    cs.CV

    Methods for Segmentation and Classification of Digital Microscopy Tissue Images

    Authors: Quoc Dang Vu, Simon Graham, Minh Nguyen Nhat To, Muhammad Shaban, Talha Qaiser, Navid Alemi Koohbanani, Syed Ali Khurram, Tahsin Kurc, Keyvan Farahani, Tianhao Zhao, Rajarsi Gupta, Jin Tae Kwak, Nasir Rajpoot, Joel Saltz

    Abstract: High-resolution microscopy images of tissue specimens provide detailed information about the morphology of normal and diseased tissue. Image analysis of tissue morphology can help cancer researchers develop a better understanding of cancer biology. Segmentation of nuclei and classification of tissue images are two common tasks in tissue image analysis. Development of accurate and efficient algorit…

    Submitted 16 November, 2018; v1 submitted 31 October, 2018; originally announced October 2018.

  35. arXiv:1810.02911  [pdf]

    cs.DC

    Tuning for Tissue Image Segmentation Workflows for Accuracy and Performance

    Authors: Luis F. R. Taveira, Tahsin Kurc, Alba C. M. A. Melo, Jun Kong, Erich Bremer, Joel H. Saltz, George Teodoro

    Abstract: We propose a software platform that integrates methods and tools for multi-objective parameter auto-tuning in tissue image segmentation workflows. The goal of our work is to provide an approach for improving the accuracy of nucleus/cell segmentation pipelines by tuning their input parameters. The shape, size and texture features of nuclei in tissue are important biomarkers for disease prognosis,…

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: 29 pages, 5 figures

  36. arXiv:1712.05021  [pdf, other]

    cs.CV

    Unsupervised Histopathology Image Synthesis

    Authors: Le Hou, Ayush Agarwal, Dimitris Samaras, Tahsin M. Kurc, Rajarsi R. Gupta, Joel H. Saltz

    Abstract: Hematoxylin and Eosin stained histopathology image analysis is essential for the diagnosis and study of complicated diseases such as cancer. Existing state-of-the-art approaches demand an extensive amount of supervised training data from trained pathologists. In this work we synthesize, in an unsupervised manner, large histopathology image datasets suitable for supervised training tasks. We propose a…

    Submitted 13 December, 2017; originally announced December 2017.

  37. arXiv:1704.00406  [pdf, other]

    cs.CV

    Sparse Autoencoder for Unsupervised Nucleus Detection and Representation in Histopathology Images

    Authors: Le Hou, Vu Nguyen, Dimitris Samaras, Tahsin M. Kurc, Yi Gao, Tianhao Zhao, Joel H. Saltz

    Abstract: Histopathology images are crucial to the study of complex diseases such as cancer. The histologic characteristics of nuclei play a key role in disease diagnosis, prognosis and analysis. In this work, we propose a sparse Convolutional Autoencoder (CAE) for fully unsupervised, simultaneous nucleus detection and feature extraction in histopathology tissue images. Our CAE detects and encodes nuclei in…

    Submitted 10 April, 2017; v1 submitted 2 April, 2017; originally announced April 2017.

  38. arXiv:1612.06825  [pdf]

    cs.CV

    Center-Focusing Multi-task CNN with Injected Features for Classification of Glioma Nuclear Images

    Authors: Veda Murthy, Le Hou, Dimitris Samaras, Tahsin M. Kurc, Joel H. Saltz

    Abstract: Classifying the various shapes and attributes of a glioma cell nucleus is crucial for diagnosis and understanding the disease. We investigate automated classification of glioma nuclear shapes and visual attributes using Convolutional Neural Networks (CNNs) on pathology images of automatically segmented nuclei. We propose three methods that improve the performance of a previously-developed semi-sup…

    Submitted 10 January, 2017; v1 submitted 20 December, 2016; originally announced December 2016.

  39. arXiv:1612.03413  [pdf, other]

    cs.DC

    Efficient Methods and Parallel Execution for Algorithm Sensitivity Analysis with Parameter Tuning on Microscopy Imaging Datasets

    Authors: George Teodoro, Tahsin Kurc, Luis F. R. Taveira, Alba C. M. A. Melo, Jun Kong, Joel Saltz

    Abstract: Background: We describe an informatics framework for researchers and clinical investigators to efficiently perform parameter sensitivity analysis and auto-tuning for algorithms that segment and classify image features in a large dataset of high-resolution images. The computational cost of the sensitivity analysis process can be very high, because the process requires processing the input dataset s…

    Submitted 11 December, 2016; originally announced December 2016.

    Comments: 36 pages, 10 figures

  40. arXiv:1608.06557  [pdf, other]

    cs.CV

    Neural Networks with Smooth Adaptive Activation Functions for Regression

    Authors: Le Hou, Dimitris Samaras, Tahsin M. Kurc, Yi Gao, Joel H. Saltz

    Abstract: In Neural Networks (NN), Adaptive Activation Functions (AAF) have parameters that control the shapes of activation functions. These parameters are trained along with other parameters in the NN. AAFs have improved performance of Neural Networks (NN) in multiple classification tasks. In this paper, we propose and apply AAFs on feedforward NNs for regression tasks. We argue that applying AAFs in the…

    Submitted 23 August, 2016; originally announced August 2016.

  41. arXiv:1505.03819  [pdf, other]

    cs.DC

    Performance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs

    Authors: George Teodoro, Tahsin Kurc, Guilherme Andrade, Jun Kong, Renato Ferreira, Joel Saltz

    Abstract: We carry out a comparative performance study of multi-core CPUs, GPUs and Intel Xeon Phi (Many Integrated Core - MIC) with a microscopy image analysis application. We experimentally evaluate the performance of computing devices on core operations of the application. We correlate the observed performance with the characteristics of computing devices and data access patterns, computation complexitie…

    Submitted 14 May, 2015; originally announced May 2015.

    Comments: 22 pages, 12 figures, 6 tables

  42. arXiv:1504.07947  [pdf, other]

    cs.CV

    Patch-based Convolutional Neural Network for Whole Slide Tissue Image Classification

    Authors: Le Hou, Dimitris Samaras, Tahsin M. Kurc, Yi Gao, James E. Davis, Joel H. Saltz

    Abstract: Convolutional Neural Networks (CNN) are state-of-the-art models for many image classification tasks. However, to recognize cancer subtypes automatically, training a CNN on gigapixel resolution Whole Slide Tissue Images (WSI) is currently computationally impossible. The differentiation of cancer subtypes is based on cellular-level visual features observed on image patch scale. Therefore, we argue t…

    Submitted 9 March, 2016; v1 submitted 29 April, 2015; originally announced April 2015.

    ACM Class: J.3; I.4; I.5

  43. arXiv:1405.7958  [pdf, other]

    cs.DC

    Region Templates: Data Representation and Management for Large-Scale Image Analysis

    Authors: George Teodoro, Tony Pan, Tahsin Kurc, Jun Kong, Lee Cooper, Scott Klasky, Joel Saltz

    Abstract: Distributed memory machines equipped with CPUs and GPUs (hybrid computing nodes) are hard to program because of the multiple layers of memory and heterogeneous computing configurations. In this paper, we introduce a region template abstraction for the efficient management of common data types used in analysis of large datasets of high resolution images on clusters of hybrid computing nodes. The re…

    Submitted 30 May, 2014; originally announced May 2014.

    Comments: 43 pages, 17 figures

  44. arXiv:1311.0378  [pdf, other]

    cs.DC cs.PF

    Comparative Performance Analysis of Intel Xeon Phi, GPU, and CPU

    Authors: George Teodoro, Tahsin Kurc, Jun Kong, Lee Cooper, Joel Saltz

    Abstract: We investigate and characterize the performance of an important class of operations on GPUs and Many Integrated Core (MIC) architectures. Our work is motivated by applications that analyze low-dimensional spatial datasets captured by high resolution sensors, such as image datasets obtained from whole slide tissue specimens using microscopy image scanners. We identify the data access and computatio…

    Submitted 2 November, 2013; originally announced November 2013.

    Comments: 11 pages, 2 figures

    ACM Class: C.4; D.1.3; D.2.6

  45. arXiv:1310.4136  [pdf, other]

    cs.DC cs.DB cs.IR

    Scalable Locality-Sensitive Hashing for Similarity Search in High-Dimensional, Large-Scale Multimedia Datasets

    Authors: Thiago S. F. X. Teixeira, George Teodoro, Eduardo Valle, Joel H. Saltz

    Abstract: Similarity search is critical for many database applications, including the increasingly popular online services for Content-Based Multimedia Retrieval (CBMR). These services, which include image search engines, must handle an overwhelming volume of data, while keeping low response times. Thus, scalability is imperative for similarity search in Web-scale applications, but most existing methods are…

    Submitted 15 October, 2013; originally announced October 2013.

  46. arXiv:1209.3332  [pdf, other]

    cs.DC eess.SY

    High-throughput Execution of Hierarchical Analysis Pipelines on Hybrid Cluster Platforms

    Authors: George Teodoro, Tony Pan, Tahsin M. Kurc, Jun Kong, Lee A. D. Cooper, Joel H. Saltz

    Abstract: We propose, implement, and experimentally evaluate a runtime middleware to support high-throughput execution on hybrid cluster machines of large-scale analysis applications. A hybrid cluster machine consists of computation nodes which have multiple CPUs and general purpose graphics processing units (GPUs). Our work targets scientific analysis applications in which datasets are processed in applica…

    Submitted 14 September, 2012; originally announced September 2012.

    Comments: 12 pages, 14 figures

  47. arXiv:1209.3314  [pdf, other]

    cs.DC cs.DS

    Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines

    Authors: George Teodoro, Tony Pan, Tahsin Kurc, Jun Kong, Lee Cooper, Joel Saltz

    Abstract: In this paper, we address the problem of efficient execution of a computation pattern, referred to here as the irregular wavefront propagation pattern (IWPP), on hybrid systems with multiple CPUs and GPUs. The IWPP is common in several image processing operations. In the IWPP, data elements in the wavefront propagate waves to their neighboring elements on a grid if a propagation condition is satis…

    Submitted 14 September, 2012; originally announced September 2012.

    Comments: 37 pages, 16 figures

  48. arXiv:1209.0410  [pdf, other]

    cs.MM cs.DB cs.DC

    Approximate Similarity Search for Online Multimedia Services on Distributed CPU-GPU Platforms

    Authors: George Teodoro, Eduardo Valle, Nathan Mariano, Ricardo Torres, Wagner Meira Jr, Joel H. Saltz

    Abstract: Similarity search in high-dimensional spaces is a pivotal operation found in a variety of database applications. Recently, there has been increased interest in similarity search for online content-based multimedia services. Those services, however, introduce new challenges with respect to the very large volumes of data that have to be indexed/searched, and the need to minimize response times observ…

    Submitted 3 September, 2012; originally announced September 2012.

    Comments: 25 pages

  49. arXiv:1208.0277  [pdf, other]

    cs.DB

    Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems

    Authors: Kaibo Wang, Yin Huai, Rubao Lee, Fusheng Wang, Xiaodong Zhang, Joel H. Saltz

    Abstract: As an important application of spatial databases in pathology imaging analysis, cross-comparing the spatial boundaries of a huge amount of segmented micro-anatomic objects demands extremely data- and compute-intensive operations, requiring high throughput at an affordable cost. However, the performance of spatial database systems has not been satisfactory since their implementations of spatial ope…

    Submitted 1 August, 2012; originally announced August 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 11, pp. 1543-1554 (2012)