Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 77 results for author: Chen, D Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.19763  [pdf, other

    eess.IV cs.CV

    TeleOR: Real-time Telemedicine System for Full-Scene Operating Room

    Authors: Yixuan Wu, Kaiyuan Hu, Qian Shao, Jintai Chen, Danny Z. Chen, Jian Wu

    Abstract: The advent of telemedicine represents a transformative development in leveraging technology to extend the reach of specialized medical expertise to remote surgeries, a field where the immediacy of expert guidance is paramount. However, the intricate dynamics of Operating Room (OR) scene pose unique challenges for telemedicine, particularly in achieving high-fidelity, real-time scene reconstruction… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  2. arXiv:2407.09790  [pdf, other

    cs.LG

    Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs

    Authors: Jiahuan Yan, Jintai Chen, Qianxing Wang, Danny Z. Chen, Jian Wu

    Abstract: Tabular datasets play a crucial role in various applications. Thus, developing efficient, effective, and widely compatible prediction algorithms for tabular data is important. Currently, two prominent model types, Gradient Boosted Decision Trees (GBDTs) and Deep Neural Networks (DNNs), have demonstrated performance advantages on distinct tabular prediction tasks. However, selecting an effective mo… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted at KDD 2024 Research Track, codes will be available at https://github.com/jyansir/tmlp

  3. arXiv:2406.11026  [pdf, other

    cs.CV cs.AI

    Boosting Medical Image Classification with Segmentation Foundation Model

    Authors: Pengfei Gu, Zihan Zhao, Hongxiao Wang, Yaopeng Peng, Yizhe Zhang, Nishchal Sapkota, Chaoli Wang, Danny Z. Chen

    Abstract: The Segment Anything Model (SAM) exhibits impressive capabilities in zero-shot segmentation for natural images. Recently, SAM has gained a great deal of attention for its applications in medical image segmentation. However, to our best knowledge, no studies have shown how to harness the power of SAM for medical image classification. To fill this gap and make SAM a true ``foundation model'' for med… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  4. arXiv:2406.10519  [pdf, other

    cs.CV cs.AI

    Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation

    Authors: Pengfei Gu, Yejia Zhang, Huimin Li, Chaoli Wang, Danny Z. Chen

    Abstract: Masked Autoencoders (MAEs) have been shown to be effective in pre-training Vision Transformers (ViTs) for natural and medical image analysis problems. By reconstructing missing pixel/voxel information in visible patches, a ViT encoder can aggregate contextual information for downstream tasks. But, existing MAE pre-training methods, which were specifically developed with the ViT architecture, lack… ▽ More

    Submitted 15 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2405.10255  [pdf, other

    cs.CV cs.RO

    When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

    Authors: Xianzheng Ma, Yash Bhalgat, Brandon Smart, Shuai Chen, Xinghui Li, Jian Ding, Jindong Gu, Dave Zhenyu Chen, Songyou Peng, Jia-Wang Bian, Philip H Torr, Marc Pollefeys, Matthias Nießner, Ian D Reid, Angel X. Chang, Iro Laina, Victor Adrian Prisacariu

    Abstract: As large language models (LLMs) evolve, their integration with 3D spatial data (3D-LLMs) has seen rapid progress, offering unprecedented capabilities for understanding and interacting with physical spaces. This survey provides a comprehensive overview of the methodologies enabling LLMs to process, understand, and generate 3D data. Highlighting the unique advantages of LLMs, such as in-context lear… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  6. arXiv:2405.00915  [pdf, other

    cs.CV cs.AI cs.LG

    EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

    Authors: Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam

    Abstract: We present EchoScene, an interactive and controllable generative model that generates 3D indoor scenes on scene graphs. EchoScene leverages a dual-branch diffusion model that dynamically adapts to scene graphs. Existing methods struggle to handle scene graphs due to varying numbers of nodes, multiple edge combinations, and manipulator-induced node-edge operations. EchoScene overcomes this by assoc… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 25 pages. 10 figures

  7. arXiv:2404.18906  [pdf, other

    cs.CG

    On Clustering Induced Voronoi Diagrams

    Authors: Danny Z. Chen, Ziyun Huang, Yangwei Liu, Jinhui Xu

    Abstract: In this paper, we study a generalization of the classical Voronoi diagram, called clustering induced Voronoi diagram (CIVD). Different from the traditional model, CIVD takes as its sites the power set $U$ of an input set $P$ of objects. For each subset $C$ of $P$, CIVD uses an influence function $F(C,q)$ to measure the total (or joint) influence of all objects in $C$ on an arbitrary point $q$ in t… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: https://info.arxiv.org/help/prep#comments

  8. arXiv:2404.05235  [pdf, other

    cs.AI

    Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

    Authors: Dillon Z. Chen, Sylvie Thiébaux

    Abstract: Heuristic search is a powerful approach for solving planning problems and numeric planning is no exception. In this paper, we boost the performance of heuristic search for numeric planning with various powerful techniques orthogonal to improving heuristic informedness: numeric novelty heuristics, the Manhattan distance heuristic, and exploring the use of multi-queue search and portfolios for combi… ▽ More

    Submitted 11 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Extended version of SoCS 2024 paper

  9. arXiv:2403.16508  [pdf, other

    cs.AI

    Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

    Authors: Dillon Z. Chen, Felipe Trevizan, Sylvie Thiébaux

    Abstract: Current approaches for learning for planning have yet to achieve competitive performance against classical planners in several domains, and have poor overall performance. In this work, we construct novel graph representations of lifted planning tasks and use the WL algorithm to generate features from them. These features are used with classical machine learning methods which have up to 2 orders of… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Extended version of ICAPS 2024 paper

  10. arXiv:2403.11375  [pdf, other

    cs.CV cs.LG q-bio.GN

    Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction

    Authors: Hongxiao Wang, Yang Yang, Zhuo Zhao, Pengfei Gu, Nishchal Sapkota, Danny Z. Chen

    Abstract: For predicting cancer survival outcomes, standard approaches in clinical research are often based on two main modalities: pathology images for observing cell morphology features, and genomic (e.g., bulk RNA-seq) for quantifying gene expressions. However, existing pathology-genomic multi-modal algorithms face significant challenges: (1) Valuable biological insights regarding genes and gene-gene int… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE International Symposium on Biomedical Imaging (ISBI 2024)

  11. arXiv:2403.01841  [pdf, other

    cs.CL cs.LG

    Making Pre-trained Language Models Great on Tabular Prediction

    Authors: Jiahuan Yan, Bo Zheng, Hongxia Xu, Yiheng Zhu, Danny Z. Chen, Jimeng Sun, Jian Wu, Jintai Chen

    Abstract: The transferability of deep neural networks (DNNs) has made significant progress in image and language processing. However, due to the heterogeneity among tables, such DNN bonus is still far from being well exploited on tabular data prediction (e.g., regression or classification tasks). Condensing knowledge from diverse domains, language models (LMs) possess the capability to comprehend feature na… ▽ More

    Submitted 12 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to ICLR 2024 as spotlight presentation (Notable Top 5%). OpenReview link is https://openreview.net/forum?id=anzIzGZuLi, codes will be available at https://github.com/jyansir/tp-berta

  12. arXiv:2402.03697  [pdf, other

    cs.CV

    SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification

    Authors: Nishchal Sapkota, Yejia Zhang, Sirui Li, Peixian Liang, Zhuo Zhao, Jingjing Zhang, Xiaomin Zha, Yiru Zhou, Yunxia Cao, Danny Z Chen

    Abstract: Male infertility accounts for about one-third of global infertility cases. Manual assessment of sperm abnormalities through head morphology analysis encounters issues of observer variability and diagnostic discrepancies among experts. Its alternative, Computer-Assisted Semen Analysis (CASA), suffers from low-quality sperm images, small datasets, and noisy class labels. We propose a new approach fo… ▽ More

    Submitted 5 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Published on ISBI 2024

  13. arXiv:2402.03695  [pdf, other

    eess.IV cs.CV

    ConUNETR: A Conditional Transformer Network for 3D Micro-CT Embryonic Cartilage Segmentation

    Authors: Nishchal Sapkota, Yejia Zhang, Susan M. Motch Perrine, Yuhan Hsi, Sirui Li, Meng Wu, Greg Holmes, Abdul R. Abdulai, Ethylin W. Jabs, Joan T. Richtsmeier, Danny Z Chen

    Abstract: Studying the morphological development of cartilaginous and osseous structures is critical to the early detection of life-threatening skeletal dysmorphology. Embryonic cartilage undergoes rapid structural changes within hours, introducing biological variations and morphological shifts that limit the generalization of deep learning-based segmentation models that infer across multiple embryonic age… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Published in ISBI 2024

  14. arXiv:2402.03093  [pdf, other

    cs.CV cs.HC

    AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey

    Authors: Yixuan Wu, Kaiyuan Hu, Danny Z. Chen, Jian Wu

    Abstract: With the rapid advance of computer graphics and artificial intelligence technologies, the ways we interact with the world have undergone a transformative shift. Virtual Reality (VR) technology, aided by artificial intelligence (AI), has emerged as a dominant interaction media in multiple application areas, thanks to its advantage of providing users with immersive experiences. Among those applicati… ▽ More

    Submitted 11 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  15. arXiv:2402.02649  [pdf, other

    cs.CV

    Densely Decoded Networks with Adaptive Deep Supervision for Medical Image Segmentation

    Authors: Suraj Mishra, Danny Z. Chen

    Abstract: Medical image segmentation using deep neural networks has been highly successful. However, the effectiveness of these networks is often limited by inadequate dense prediction and inability to extract robust features. To achieve refined dense prediction, we propose densely decoded networks (ddn), by selectively introducing 'crutch' network connections. Such 'crutch' connections in each upsampling s… ▽ More

    Submitted 4 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  16. arXiv:2312.11143  [pdf, other

    cs.AI

    Learning Domain-Independent Heuristics for Grounded and Lifted Planning

    Authors: Dillon Z. Chen, Sylvie Thiébaux, Felipe Trevizan

    Abstract: We present three novel graph representations of planning tasks suitable for learning domain-independent heuristics using Graph Neural Networks (GNNs) to guide search. In particular, to mitigate the issues caused by large grounded GNNs we present the first method for learning domain-independent heuristics with only the lifted representation of a planning task. We also provide a theoretical analysis… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Extended version of AAAI 2024 paper

  17. arXiv:2312.09899  [pdf, other

    eess.IV cs.CV cs.LG

    SQA-SAM: Segmentation Quality Assessment for Medical Images Utilizing the Segment Anything Model

    Authors: Yizhe Zhang, Shuo Wang, Tao Zhou, Qi Dou, Danny Z. Chen

    Abstract: Segmentation quality assessment (SQA) plays a critical role in the deployment of a medical image based AI system. Users need to be informed/alerted whenever an AI system generates unreliable/incorrect predictions. With the introduction of the Segment Anything Model (SAM), a general foundation segmentation model, new research opportunities emerged in how one can utilize SAM for medical image segmen… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Work in progress;

  18. arXiv:2311.17791  [pdf, other

    eess.IV cs.CV

    U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation

    Authors: Yaopeng Peng, Milan Sonka, Danny Z. Chen

    Abstract: In this paper, we introduce U-Net v2, a new robust and efficient U-Net variant for medical image segmentation. It aims to augment the infusion of semantic information into low-level features while simultaneously refining high-level features with finer details. For an input image, we begin by extracting multi-level features with a deep neural network encoder. Next, we enhance the feature map of eac… ▽ More

    Submitted 30 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  19. arXiv:2311.17261  [pdf, other

    cs.CV

    SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors

    Authors: Dave Zhenyu Chen, Haoxuan Li, Hsin-Ying Lee, Sergey Tulyakov, Matthias Nießner

    Abstract: We propose SceneTex, a novel method for effectively generating high-quality and style-consistent textures for indoor scenes using depth-to-image diffusion priors. Unlike previous methods that either iteratively warp 2D views onto a mesh surface or distillate diffusion latent features without accurate geometric and style cues, SceneTex formulates the texture synthesis task as an optimization proble… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Project website: https://daveredrum.github.io/SceneTex/

  20. arXiv:2311.17243  [pdf, other

    cs.CV eess.IV

    PHG-Net: Persistent Homology Guided Medical Image Classification

    Authors: Yaopeng Peng, Hongxiao Wang, Milan Sonka, Danny Z. Chen

    Abstract: Modern deep neural networks have achieved great successes in medical image analysis. However, the features captured by convolutional neural networks (CNNs) or Transformers tend to be optimized for pixel intensities and neglect key anatomical structures such as connected components and loops. In this paper, we propose a persistent homology guided approach (PHG-Net) that explores topological feature… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted by WACV 2024

  21. arXiv:2310.19516  [pdf, other

    cs.CV

    Generating Context-Aware Natural Answers for Questions in 3D Scenes

    Authors: Mohammed Munzer Dwedari, Matthias Niessner, Dave Zhenyu Chen

    Abstract: 3D question answering is a young field in 3D vision-language that is yet to be explored. Previous methods are limited to a pre-defined answer space and cannot generate answers naturally. In this work, we pivot the question answering task to a sequence generation task to generate free-form natural answers for questions in 3D scenes (Gen3DQA). To this end, we optimize our model directly on the langu… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  22. arXiv:2309.13671  [pdf, other

    cs.CV

    OneSeg: Self-learning and One-shot Learning based Single-slice Annotation for 3D Medical Image Segmentation

    Authors: Yixuan Wu, Bo Zheng, Jintai Chen, Danny Z. Chen, Jian Wu

    Abstract: As deep learning methods continue to improve medical image segmentation performance, data annotation is still a big bottleneck due to the labor-intensive and time-consuming burden on medical experts, especially for 3D images. To significantly reduce annotation efforts while attaining competitive segmentation accuracy, we propose a self-learning and one-shot learning based framework for 3D medical… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  23. arXiv:2309.08888  [pdf, other

    cs.CV cs.AI

    GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels

    Authors: Yixuan Wu, Jintai Chen, Jiahuan Yan, Yiheng Zhu, Danny Z. Chen, Jian Wu

    Abstract: Since annotating medical images for segmentation tasks commonly incurs expensive costs, it is highly desirable to design an annotation-efficient method to alleviate the annotation burden. Recently, contrastive learning has exhibited a great potential in learning robust representations to boost downstream tasks with limited labels. In medical imaging scenarios, ready-made meta labels (i.e., specifi… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

  24. arXiv:2309.04760  [pdf, other

    cs.LG cs.AI cs.CV

    RR-CP: Reliable-Region-Based Conformal Prediction for Trustworthy Medical Image Classification

    Authors: Yizhe Zhang, Shuo Wang, Yejia Zhang, Danny Z. Chen

    Abstract: Conformal prediction (CP) generates a set of predictions for a given test sample such that the prediction set almost always contains the true label (e.g., 99.5\% of the time). CP provides comprehensive predictions on possible labels of a given test sample, and the size of the set indicates how certain the predictions are (e.g., a set larger than one is `uncertain'). Such distinct properties of CP… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: UNSURE2023 (Uncertainty for Safe Utilization of Machine Learning in Medical Imaging) at MICCAI2023; Spotlight

  25. arXiv:2308.13759  [pdf, other

    cs.CV cs.AI cs.LG

    SamDSK: Combining Segment Anything Model with Domain-Specific Knowledge for Semi-Supervised Learning in Medical Image Segmentation

    Authors: Yizhe Zhang, Tao Zhou, Shuo Wang, Ye Wu, Pengfei Gu, Danny Z. Chen

    Abstract: The Segment Anything Model (SAM) exhibits a capability to segment a wide array of objects in natural images, serving as a versatile perceptual tool for various downstream image segmentation tasks. In contrast, medical image segmentation tasks often rely on domain-specific knowledge (DSK). In this paper, we propose a novel method that combines the segmentation foundation model (i.e., SAM) with doma… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 15 pages, 7 figures, Github: https://github.com/yizhezhang2000/SamDSK

  26. arXiv:2307.12429  [pdf, other

    cs.CV

    SwIPE: Efficient and Robust Medical Image Segmentation with Implicit Patch Embeddings

    Authors: Yejia Zhang, Pengfei Gu, Nishchal Sapkota, Danny Z. Chen

    Abstract: Modern medical image segmentation methods primarily use discrete representations in the form of rasterized masks to learn features and generate predictions. Although effective, this paradigm is spatially inflexible, scales poorly to higher-resolution images, and lacks direct understanding of object shapes. To address these limitations, some recent works utilized implicit neural representations (IN… ▽ More

    Submitted 21 March, 2024; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: Accepted to the 2023 International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'23)

  27. arXiv:2304.11332  [pdf, other

    cs.CV cs.AI cs.LG

    Input Augmentation with SAM: Boosting Medical Image Segmentation with Segmentation Foundation Model

    Authors: Yizhe Zhang, Tao Zhou, Shuo Wang, Peixian Liang, Danny Z. Chen

    Abstract: The Segment Anything Model (SAM) is a recently developed large model for general-purpose segmentation for computer vision tasks. SAM was trained using 11 million images with over 1 billion masks and can produce segmentation results for a wide range of objects in natural scene images. SAM can be viewed as a general perception model for segmentation (partitioning images into semantically meaningful… ▽ More

    Submitted 21 June, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: GitHub: https://github.com/yizhezhang2000/SAMAug. Comments and questions are welcome

  28. arXiv:2303.11396  [pdf, other

    cs.CV

    Text2Tex: Text-driven Texture Synthesis via Diffusion Models

    Authors: Dave Zhenyu Chen, Yawar Siddiqui, Hsin-Ying Lee, Sergey Tulyakov, Matthias Nießner

    Abstract: We present Text2Tex, a novel method for generating high-quality textures for 3D meshes from the given text prompts. Our method incorporates inpainting into a pre-trained depth-aware image diffusion model to progressively synthesize high resolution partial textures from multiple viewpoints. To avoid accumulating inconsistent and stretched artifacts across views, we dynamically segment the rendered… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Project page: https://daveredrum.github.io/Text2Tex/

  29. arXiv:2302.08722  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    GPT4MIA: Utilizing Generative Pre-trained Transformer (GPT-3) as A Plug-and-Play Transductive Model for Medical Image Analysis

    Authors: Yizhe Zhang, Danny Z. Chen

    Abstract: In this paper, we propose a novel approach (called GPT4MIA) that utilizes Generative Pre-trained Transformer (GPT) as a plug-and-play transductive inference tool for medical image analysis (MIA). We provide theoretical analysis on why a large pre-trained language model such as GPT-3 can be used as a plug-and-play transductive inference model for MIA. At the methodological level, we develop several… ▽ More

    Submitted 21 March, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Version 3: Added appendix with more results and visualizations. Questions and suggestions are welcome

  30. arXiv:2301.02819  [pdf, other

    cs.LG

    ExcelFormer: A neural network surpassing GBDTs on tabular data

    Authors: Jintai Chen, Jiahuan Yan, Qiyuan Chen, Danny Ziyi Chen, Jian Wu, Jimeng Sun

    Abstract: Data organized in tabular format is ubiquitous in real-world applications, and users often craft tables with biased feature definitions and flexibly set prediction targets of their interests. Thus, a rapid development of a robust, effective, dataset-versatile, user-friendly tabular prediction approach is highly desired. While Gradient Boosting Decision Trees (GBDTs) and existing deep neural networ… ▽ More

    Submitted 25 July, 2024; v1 submitted 7 January, 2023; originally announced January 2023.

  31. arXiv:2212.00836  [pdf, other

    cs.CV

    UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding

    Authors: Dave Zhenyu Chen, Ronghang Hu, Xinlei Chen, Matthias Nießner, Angel X. Chang

    Abstract: Performing 3D dense captioning and visual grounding requires a common and shared understanding of the underlying multimodal relationships. However, despite some previous attempts on connecting these two related tasks with highly task-specific neural modules, it remains understudied how to explicitly depict their shared nature to learn them simultaneously. In this work, we propose UniT3D, a simple… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  32. arXiv:2211.16887  [pdf, other

    cs.LG

    T2G-Former: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction

    Authors: Jiahuan Yan, Jintai Chen, Yixuan Wu, Danny Z. Chen, Jian Wu

    Abstract: Recent development of deep neural networks (DNNs) for tabular learning has largely benefited from the capability of DNNs for automatic feature interaction. However, the heterogeneity nature of tabular features makes such features relatively independent, and developing effective methods to promote tabular feature interaction still remains an open problem. In this paper, we propose a novel Graph Est… ▽ More

    Submitted 5 March, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: AAAI 2023, Oral. Code: https://github.com/jyansir/t2g-former

  33. arXiv:2211.08643  [pdf, other

    cs.CV

    Keep Your Friends Close & Enemies Farther: Debiasing Contrastive Learning with Spatial Priors in 3D Radiology Images

    Authors: Yejia Zhang, Nishchal Sapkota, Pengfei Gu, Yaopeng Peng, Hao Zheng, Danny Z. Chen

    Abstract: Understanding of spatial attributes is central to effective 3D radiology image analysis where crop-based learning is the de facto standard. Given an image patch, its core spatial properties (e.g., position & orientation) provide helpful priors on expected object sizes, appearances, and structures through inherent anatomical consistencies. Spatial correspondences, in particular, can effectively gau… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM'22)

  34. arXiv:2211.08564  [pdf, other

    cs.CV

    ConvFormer: Combining CNN and Transformer for Medical Image Segmentation

    Authors: Pengfei Gu, Yejia Zhang, Chaoli Wang, Danny Z. Chen

    Abstract: Convolutional neural network (CNN) based methods have achieved great successes in medical image segmentation, but their capability to learn global representations is still limited due to using small effective receptive fields of convolution operations. Transformer based methods are capable of modelling long-range dependencies of information for capturing global representations, yet their ability t… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  35. arXiv:2211.08557  [pdf, other

    cs.CV

    Unsupervised Feature Clustering Improves Contrastive Representation Learning for Medical Image Segmentation

    Authors: Yejia Zhang, Xinrong Hu, Nishchal Sapkota, Yiyu Shi, Danny Z. Chen

    Abstract: Self-supervised instance discrimination is an effective contrastive pretext task to learn feature representations and address limited medical image annotations. The idea is to make features of transformed versions of the same images similar while forcing all other augmented images' representations to contrast. However, this instance-based contrastive learning leaves performance on the table by fai… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM'22) proceedings

  36. arXiv:2211.08533  [pdf, other

    cs.CV

    A Point in the Right Direction: Vector Prediction for Spatially-aware Self-supervised Volumetric Representation Learning

    Authors: Yejia Zhang, Pengfei Gu, Nishchal Sapkota, Hao Zheng, Peixian Liang, Danny Z. Chen

    Abstract: High annotation costs and limited labels for dense 3D medical imaging tasks have recently motivated an assortment of 3D self-supervised pretraining methods that improve transfer learning performance. However, these methods commonly lack spatial awareness despite its centrality in enabling effective 3D image analysis. More specifically, position, scale, and orientation are not only informative but… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  37. arXiv:2211.06614  [pdf, other

    cs.LG cs.AI

    Robust Training of Graph Neural Networks via Noise Governance

    Authors: Siyi Qian, Haochao Ying, Renjun Hu, Jingbo Zhou, Jintai Chen, Danny Z. Chen, Jian Wu

    Abstract: Graph Neural Networks (GNNs) have become widely-used models for semi-supervised learning. However, the robustness of GNNs in the presence of label noise remains a largely under-explored problem. In this paper, we consider an important yet challenging scenario where labels on nodes of graphs are not only noisy but also scarce. In this scenario, the performance of GNNs is prone to degrade due to lab… ▽ More

    Submitted 25 February, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 9 pages, accepted to WSDM 2023 Research Track

  38. arXiv:2209.01527  [pdf, other

    cs.CV

    Data-Driven Deep Supervision for Skin Lesion Classification

    Authors: Suraj Mishra, Yizhe Zhang, Li Zhang, Tianyu Zhang, X. Sharon Hu, Danny Z. Chen

    Abstract: Automatic classification of pigmented, non-pigmented, and depigmented non-melanocytic skin lesions have garnered lots of attention in recent years. However, imaging variations in skin texture, lesion shape, depigmentation contrast, lighting condition, etc. hinder robust feature extraction, affecting classification accuracy. In this paper, we propose a new deep neural network that exploits input da… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: MICCAI 2022

  39. arXiv:2208.11311  [pdf, other

    cs.LG cs.AI cs.CR

    Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments

    Authors: Rui Song, Dai Liu, Dave Zhenyu Chen, Andreas Festag, Carsten Trinitis, Martin Schulz, Alois Knoll

    Abstract: In federated learning, all networked clients contribute to the model training cooperatively. However, with model sizes increasing, even sharing the trained partial models often leads to severe communication bottlenecks in underlying networks, especially when communicated iteratively. In this paper, we introduce a federated learning framework FedD3 requiring only one-shot communication by integrati… ▽ More

    Submitted 19 May, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Accepted at IJCNN 2023

  40. arXiv:2207.10670  [pdf, other

    cs.LG cs.AI eess.SP

    ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases

    Authors: Jintai Chen, Kuanlun Liao, Kun Wei, Haochao Ying, Danny Z. Chen, Jian Wu

    Abstract: Electrocardiogram (ECG) is a widely used non-invasive diagnostic tool for heart diseases. Many studies have devised ECG analysis models (e.g., classifiers) to assist diagnosis. As an upstream task, researches have built generative models to synthesize ECG data, which are beneficial to providing training samples, privacy protection, and annotation reduction. However, previous generative methods for… ▽ More

    Submitted 29 May, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Journal ref: In International Conference on Machine Learning, 3360--3370, (2022), PMLR

  41. arXiv:2207.00156  [pdf, other

    eess.IV cs.CV cs.LG

    Usable Region Estimate for Assessing Practical Usability of Medical Image Segmentation Models

    Authors: Yizhe Zhang, Suraj Mishra, Peixian Liang, Hao Zheng, Danny Z. Chen

    Abstract: We aim to quantitatively measure the practical usability of medical image segmentation models: to what extent, how often, and on which samples a model's predictions can be used/trusted. We first propose a measure, Correctness-Confidence Rank Correlation (CCRC), to capture how predictions' confidence estimates correlate with their correctness scores in rank. A model with a high value of CCRC means… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI2022

  42. arXiv:2206.10592  [pdf, other

    cs.AI cs.LG eess.SP

    Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network

    Authors: Yuexin Bian, Jintai Chen, Xiaojun Chen, Xiaoxian Yang, Danny Z. Chen, JIan Wu

    Abstract: A large number of people suffer from life-threatening cardiac abnormalities, and electrocardiogram (ECG) analysis is beneficial to determining whether an individual is at risk of such abnormalities. Automatic ECG classification methods, especially the deep learning based ones, have been proposed to detect cardiac abnormalities using ECG records, showing good potential to improve clinical diagnosis… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022

  43. arXiv:2206.01309  [pdf, other

    cs.CV

    H-EMD: A Hierarchical Earth Mover's Distance Method for Instance Segmentation

    Authors: Peixian Liang, Yizhe Zhang, Yifan Ding, Jianxu Chen, Chinedu S. Madukoma, Tim Weninger, Joshua D. Shrout, Danny Z. Chen

    Abstract: Deep learning (DL) based semantic segmentation methods have achieved excellent performance in biomedical image segmentation, producing high quality probability maps to allow extraction of rich instance information to facilitate good instance segmentation. While numerous efforts were put into developing new DL semantic segmentation models, less attention was paid to a key issue of how to effectivel… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: Accepted at IEEE Transactions On Medical Imaging (TMI)

  44. arXiv:2202.07191  [pdf, other

    cs.CV cs.LG

    Improving Human Sperm Head Morphology Classification with Unsupervised Anatomical Feature Distillation

    Authors: Yejia Zhang, Jingjing Zhang, Xiaomin Zha, Yiru Zhou, Yunxia Cao, Danny Z. Chen

    Abstract: With rising male infertility, sperm head morphology classification becomes critical for accurate and timely clinical diagnosis. Recent deep learning (DL) morphology analysis methods achieve promising benchmark results, but leave performance and robustness on the table by relying on limited and possibly noisy class labels. To address this, we introduce a new DL training framework that leverages ana… ▽ More

    Submitted 16 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to ISBI 2022 proceedings

  45. arXiv:2201.00462  [pdf, other

    cs.CV cs.AI

    D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation

    Authors: Yixuan Wu, Kuanlun Liao, Jintai Chen, Jinhong Wang, Danny Z. Chen, Honghao Gao, Jian Wu

    Abstract: Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN) based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented c… ▽ More

    Submitted 9 January, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

  46. AGMI: Attention-Guided Multi-omics Integration for Drug Response Prediction with Graph Neural Networks

    Authors: Ruiwei Feng, Yufeng Xie, Minshan Lai, Danny Z. Chen, Ji Cao, Jian Wu

    Abstract: Accurate drug response prediction (DRP) is a crucial yet challenging task in precision medicine. This paper presents a novel Attention-Guided Multi-omics Integration (AGMI) approach for DRP, which first constructs a Multi-edge Graph (MeG) for each cell line, and then aggregates multi-omics features to predict drug response using a novel structure, called Graph edge-aware Network (GeNet). For the f… ▽ More

    Submitted 9 January, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  47. arXiv:2112.02962  [pdf, other

    cs.LG cs.AI

    DANets: Deep Abstract Networks for Tabular Data Classification and Regression

    Authors: Jintai Chen, Kuanlun Liao, Yao Wan, Danny Z. Chen, Jian Wu

    Abstract: Tabular data are ubiquitous in real world applications. Although many commonly-used neural components (e.g., convolution) and extensible neural networks (e.g., ResNet) have been developed by the machine learning community, few of them were effective for tabular data and few designs were adequately tailored for tabular data structures. In this paper, we propose a novel and flexible neural component… ▽ More

    Submitted 7 September, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: @inproceedings{danets, title={DANets: Deep Abstract Networks for Tabular Data Classification and Regression}, author={Chen, Jintai and Liao, Kuanlun and Wan, Yao and Chen, Danny Z and Wu, Jian}, booktitle={AAAI}, year={2022} }

  48. arXiv:2112.01551  [pdf, other

    cs.CV

    D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

    Authors: Dave Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang

    Abstract: Recent studies on dense captioning and visual grounding in 3D have achieved impressive results. Despite developments in both areas, the limited amount of available 3D vision-language data causes overfitting issues for 3D visual grounding and 3D dense captioning methods. Also, how to discriminatively describe objects in complex 3D environments is not fully studied yet. To address these challenges,… ▽ More

    Submitted 22 July, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Project website: https://daveredrum.github.io/D3Net/

  49. arXiv:2107.04886  [pdf, other

    cs.CV

    Hierarchical Self-Supervised Learning for Medical Image Segmentation Based on Multi-Domain Data Aggregation

    Authors: Hao Zheng, Jun Han, Hongxiao Wang, Lin Yang, Zhuo Zhao, Chaoli Wang, Danny Z. Chen

    Abstract: A large labeled dataset is a key to the success of supervised deep learning, but for medical image segmentation, it is highly challenging to obtain sufficient annotated images for model training. In many scenarios, unannotated images are abundant and easy to acquire. Self-supervised learning (SSL) has shown great potentials in exploiting raw data information and representation learning. In this pa… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted to MICCAI 2021

  50. arXiv:2107.02927  [pdf, other

    eess.IV cs.CV

    Image Complexity Guided Network Compression for Biomedical Image Segmentation

    Authors: Suraj Mishra, Danny Z. Chen, X. Sharon Hu

    Abstract: Compression is a standard procedure for making convolutional neural networks (CNNs) adhere to some specific computing resource constraints. However, searching for a compressed architecture typically involves a series of time-consuming training/validation experiments to determine a good compromise between network size and performance accuracy. To address this, we propose an image complexity-guided… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: ACM JETC