Search | arXiv e-print repository

Organic room-temperature polariton condensate in a higher-order topological lattice

Authors: Christoph Bennenhei, Hangyong Shan, Marti Struve, Nils Kunte, Falk Eilenberger, Jürgen Ohmer, Utz Fischer, Stefan Schumacher, Xuekai Ma, Christian Schneider, Martin Esmann

Abstract: Organic molecule exciton-polaritons in photonic lattices are a versatile platform to emulate unconventional phases of matter at ambient conditions, including protected interface modes in topological insulators. Here, we investigate bosonic condensation in the most prototypical higher-order topological lattice: a 2D-version of the Su-Schrieffer-Heeger (SSH) model, supporting both 0D and 1D topologi… ▽ More Organic molecule exciton-polaritons in photonic lattices are a versatile platform to emulate unconventional phases of matter at ambient conditions, including protected interface modes in topological insulators. Here, we investigate bosonic condensation in the most prototypical higher-order topological lattice: a 2D-version of the Su-Schrieffer-Heeger (SSH) model, supporting both 0D and 1D topological modes. We study fluorescent protein-filled, structured microcavities defining a staggered photonic trapping potential and observe the resulting first- and higher-order topologically protected modes via spatially resolved photoluminescence spectroscopy. We account for the spatial mode patterns by tight-binding calculations and theoretically characterize the topological invariants of the lattice. Under strong optical pumping, we observe bosonic condensation into the topological modes. Via interferometric measurements, we map the spatial first-order coherence in the protected 1D modes extending over 10 microns. Our findings pave the way towards organic on-chip polaritonics using higher-order topology as a tool for the generation of robustly confined polaritonic lasing states. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 23 pages, 7 figures

arXiv:2307.06891 [pdf, other]

Engineering the impact of phonon dephasing on the coherence of a WSe$_{2}$ single-photon source via cavity quantum electrodynamics

Authors: Victor Nikolaevich Mitryakhin, Alexander Steinhoff, Jens-Christian Drawer, Hangyong Shan, Matthias Florian, Lukas Lackner, Bo Han, Falk Eilenberger, Sefaattin Tongay, Kenji Watanabe, Takashi Taniguchi, Carlos Antón-Solanas, Ana Predojević, Christopher Gies, Martin Esmann, Christian Schneider

Abstract: Emitter dephasing is one of the key issues in the performance of solid-state single photon sources. Among the various sources of dephasing, acoustic phonons play a central role in adding decoherence to the single photon emission. Here, we demonstrate, that it is possible to tune and engineer the coherence of photons emitted from a single WSe$_2$ monolayer quantum dot via selectively coupling it to… ▽ More Emitter dephasing is one of the key issues in the performance of solid-state single photon sources. Among the various sources of dephasing, acoustic phonons play a central role in adding decoherence to the single photon emission. Here, we demonstrate, that it is possible to tune and engineer the coherence of photons emitted from a single WSe$_2$ monolayer quantum dot via selectively coupling it to a spectral cavity resonance. We utilize an open cavity to demonstrate spectral enhancement, leveling, and suppression of the highly asymmetric phonon sideband, finding excellent agreement with a microscopic description of the exciton-phonon dephasing in a truly two-dimensional system. Moreover, the impact of cavity tuning on the dephasing is directly assessed via optical interferometry, which points out the capability to utilize light-matter coupling to steer and design dephasing and coherence of quantum emitters in atomically thin crystals. △ Less

Submitted 4 April, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

arXiv:2304.01814 [pdf, other]

doi 10.1109/TMI.2023.3320812

CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization

Authors: Qi Gao, Zilong Li, Junping Zhang, Yi Zhang, Hongming Shan

Abstract: Low-dose computed tomography (CT) images suffer from noise and artifacts due to photon starvation and electronic noise. Recently, some works have attempted to use diffusion models to address the over-smoothness and training instability encountered by previous deep-learning-based denoising models. However, diffusion models suffer from long inference times due to the large number of sampling steps i… ▽ More Low-dose computed tomography (CT) images suffer from noise and artifacts due to photon starvation and electronic noise. Recently, some works have attempted to use diffusion models to address the over-smoothness and training instability encountered by previous deep-learning-based denoising models. However, diffusion models suffer from long inference times due to the large number of sampling steps involved. Very recently, cold diffusion model generalizes classical diffusion models and has greater flexibility. Inspired by the cold diffusion, this paper presents a novel COntextual eRror-modulated gEneralized Diffusion model for low-dose CT (LDCT) denoising, termed CoreDiff. First, CoreDiff utilizes LDCT images to displace the random Gaussian noise and employs a novel mean-preserving degradation operator to mimic the physical process of CT degradation, significantly reducing sampling steps thanks to the informative LDCT images as the starting point of the sampling process. Second, to alleviate the error accumulation problem caused by the imperfect restoration operator in the sampling process, we propose a novel ContextuaL Error-modulAted Restoration Network (CLEAR-Net), which can leverage contextual information to constrain the sampling process from structural distortion and modulate time step embedding features for better alignment with the input at the next time step. Third, to rapidly generalize to a new, unseen dose level with as few resources as possible, we devise a one-shot learning framework to make CoreDiff generalize faster and better using only a single LDCT image (un)paired with NDCT. Extensive experimental results on two datasets demonstrate that our CoreDiff outperforms competing methods in denoising and generalization performance, with a clinically acceptable inference time. Source code is made available at https://github.com/qgao21/CoreDiff. △ Less

Submitted 6 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

Comments: IEEE Transactions on Medical Imaging, 2023

Journal ref: IEEE Transactions on Medical Imaging, 43(2), 2024

arXiv:2302.10630 [pdf, other]

doi 10.1109/TMI.2024.3351723

LIT-Former: Linking In-plane and Through-plane Transformers for Simultaneous CT Image Denoising and Deblurring

Authors: Zhihao Chen, Chuang Niu, Qi Gao, Ge Wang, Hongming Shan

Abstract: This paper studies 3D low-dose computed tomography (CT) imaging. Although various deep learning methods were developed in this context, typically they focus on 2D images and perform denoising due to low-dose and deblurring for super-resolution separately. Up to date, little work was done for simultaneous in-plane denoising and through-plane deblurring, which is important to obtain high-quality 3D… ▽ More This paper studies 3D low-dose computed tomography (CT) imaging. Although various deep learning methods were developed in this context, typically they focus on 2D images and perform denoising due to low-dose and deblurring for super-resolution separately. Up to date, little work was done for simultaneous in-plane denoising and through-plane deblurring, which is important to obtain high-quality 3D CT images with lower radiation and faster imaging speed. For this task, a straightforward method is to directly train an end-to-end 3D network. However, it demands much more training data and expensive computational costs. Here, we propose to link in-plane and through-plane transformers for simultaneous in-plane denoising and through-plane deblurring, termed as LIT-Former, which can efficiently synergize in-plane and through-plane sub-tasks for 3D CT imaging and enjoy the advantages of both convolution and transformer networks. LIT-Former has two novel designs: efficient multi-head self-attention modules (eMSM) and efficient convolutional feedforward networks (eCFN). First, eMSM integrates in-plane 2D self-attention and through-plane 1D self-attention to efficiently capture global interactions of 3D self-attention, the core unit of transformer networks. Second, eCFN integrates 2D convolution and 1D convolution to extract local information of 3D convolution in the same fashion. As a result, the proposed LIT-Former synergize these two subtasks, significantly reducing the computational complexity as compared to 3D counterparts and enabling rapid convergence. Extensive experimental results on simulated and clinical datasets demonstrate superior performance over state-of-the-art models. The source code is made available at https://github.com/hao1635/LIT-Former. △ Less

Submitted 7 January, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: 15 pages, 12 figures

Journal ref: IEEE Transactions on Medical Imaging, 2024

arXiv:2205.04329 [pdf, other]

doi 10.1016/j.compbiomed.2023.106717

SAN-Net: Learning Generalization to Unseen Sites for Stroke Lesion Segmentation with Self-Adaptive Normalization

Authors: Weiyi Yu, Zhizhong Huang, Junping Zhang, Hongming Shan

Abstract: There are considerable interests in automatic stroke lesion segmentation on magnetic resonance (MR) images in the medical imaging field, as stroke is an important cerebrovascular disease. Although deep learning-based models have been proposed for this task, generalizing these models to unseen sites is difficult due to not only the large inter-site discrepancy among different scanners, imaging prot… ▽ More There are considerable interests in automatic stroke lesion segmentation on magnetic resonance (MR) images in the medical imaging field, as stroke is an important cerebrovascular disease. Although deep learning-based models have been proposed for this task, generalizing these models to unseen sites is difficult due to not only the large inter-site discrepancy among different scanners, imaging protocols, and populations, but also the variations in stroke lesion shape, size, and location. To tackle this issue, we introduce a self-adaptive normalization network, termed SAN-Net, to achieve adaptive generalization on unseen sites for stroke lesion segmentation. Motivated by traditional z-score normalization and dynamic network, we devise a masked adaptive instance normalization (MAIN) to minimize inter-site discrepancies, which standardizes input MR images from different sites into a site-unrelated style by dynamically learning affine parameters from the input; \ie, MAIN can affinely transform the intensity values. Then, we leverage a gradient reversal layer to force the U-net encoder to learn site-invariant representation with a site classifier, which further improves the model generalization in conjunction with MAIN. Finally, inspired by the ``pseudosymmetry'' of the human brain, we introduce a simple yet effective data augmentation technique, termed symmetry-inspired data augmentation (SIDA), that can be embedded within SAN-Net to double the sample size while halving memory consumption. Experimental results on the benchmark Anatomical Tracings of Lesions After Stroke (ATLAS) v1.2 dataset, which includes MR images from 9 different sites, demonstrate that under the ``leave-one-site-out'' setting, the proposed SAN-Net outperforms recently published methods in terms of quantitative metrics and qualitative comparisons. △ Less

Submitted 24 February, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: 18 pages, 9 figures

Journal ref: Computers in Biology and Medicine, 156, 106717, 2023

arXiv:2203.15725 [pdf, other]

doi 10.1109/MSP.2022.3204407

Physics-/Model-Based and Data-Driven Methods for Low-Dose Computed Tomography: A survey

Authors: Wenjun Xia, Hongming Shan, Ge Wang, Yi Zhang

Abstract: Since 2016, deep learning (DL) has advanced tomographic imaging with remarkable successes, especially in low-dose computed tomography (LDCT) imaging. Despite being driven by big data, the LDCT denoising and pure end-to-end reconstruction networks often suffer from the black box nature and major issues such as instabilities, which is a major barrier to apply deep learning methods in low-dose CT app… ▽ More Since 2016, deep learning (DL) has advanced tomographic imaging with remarkable successes, especially in low-dose computed tomography (LDCT) imaging. Despite being driven by big data, the LDCT denoising and pure end-to-end reconstruction networks often suffer from the black box nature and major issues such as instabilities, which is a major barrier to apply deep learning methods in low-dose CT applications. An emerging trend is to integrate imaging physics and model into deep networks, enabling a hybridization of physics/model-based and data-driven elements. %This type of hybrid methods has become increasingly influential. In this paper, we systematically review the physics/model-based data-driven methods for LDCT, summarize the loss functions and training strategies, evaluate the performance of different methods, and discuss relevant issues and future directions. △ Less

Submitted 24 March, 2023; v1 submitted 29 March, 2022; originally announced March 2022.

Journal ref: IEEE Signal Processing Magazine, 40(2), 89-100, 2023

arXiv:2202.08303 [pdf, other]

doi 10.1088/1361-6560/ac8044

OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines

Authors: Aaron Babier, Rafid Mahmood, Binghao Zhang, Victor G. L. Alves, Ana Maria Barragán-Montero, Joel Beaudry, Carlos E. Cardenas, Yankui Chang, Zijie Chen, Jaehee Chun, Kelly Diaz, Harold David Eraso, Erik Faustmann, Sibaji Gaj, Skylar Gay, Mary Gronberg, Bingqi Guo, Junjun He, Gerd Heilemann, Sanchit Hira, Yuliang Huang, Fuxin Ji, Dashan Jiang, Jean Carlo Jimenez Giraldo, Hoyeon Lee , et al. (34 additional authors not shown)

Abstract: We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization mode… ▽ More We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code is freely available at https://github.com/ababier/open-kbp-opt. △ Less

Submitted 16 February, 2022; originally announced February 2022.

Comments: 19 pages, 7 tables, 6 figures

arXiv:2111.06890 [pdf, other]

doi 10.1016/j.artmed.2023.102555

Impact of loss functions on the performance of a deep neural network designed to restore low-dose digital mammography

Authors: Hongming Shan, Rodrigo de Barros Vimieiro, Lucas Rodrigues Borges, Marcelo Andrade da Costa Vieira, Ge Wang

Abstract: Digital mammography is still the most common imaging tool for breast cancer screening. Although the benefits of using digital mammography for cancer screening outweigh the risks associated with the x-ray exposure, the radiation dose must be kept as low as possible while maintaining the diagnostic utility of the generated images, thus minimizing patient risks. Many studies investigated the feasibil… ▽ More Digital mammography is still the most common imaging tool for breast cancer screening. Although the benefits of using digital mammography for cancer screening outweigh the risks associated with the x-ray exposure, the radiation dose must be kept as low as possible while maintaining the diagnostic utility of the generated images, thus minimizing patient risks. Many studies investigated the feasibility of dose reduction by restoring low-dose images using deep neural networks. In these cases, choosing the appropriate training database and loss function is crucial and impacts the quality of the results. In this work, a modification of the ResNet architecture, with hierarchical skip connections, is proposed to restore low-dose digital mammography. We compared the restored images to the standard full-dose images. Moreover, we evaluated the performance of several loss functions for this task. For training purposes, we extracted 256,000 image patches from a dataset of 400 images of retrospective clinical mammography exams, where different dose levels were simulated to generate low and standard-dose pairs. To validate the network in a real scenario, a physical anthropomorphic breast phantom was used to acquire real low-dose and standard full-dose images in a commercially avaliable mammography system, which were then processed through our trained model. An analytical restoration model for low-dose digital mammography, previously presented, was used as a benchmark in this work. Objective assessment was performed through the signal-to-noise ratio (SNR) and mean normalized squared error (MNSE), decomposed into residual noise and bias. Results showed that the perceptual loss function (PL4) is able to achieve virtually the same noise levels of a full-dose acquisition, while resulting in smaller signal bias compared to other loss functions. △ Less

Submitted 12 November, 2021; originally announced November 2021.

Comments: 15 pages, 12 figures

Journal ref: Artificial Intelligence In Medicine, 142(2023), 102555, 2023

arXiv:2108.10772 [pdf, other]

doi 10.1109/TIM.2021.3128703

DU-GAN: Generative Adversarial Networks with Dual-Domain U-Net Based Discriminators for Low-Dose CT Denoising

Authors: Zhizhong Huang, Junping Zhang, Yi Zhang, Hongming Shan

Abstract: LDCT has drawn major attention in the medical imaging field due to the potential health risks of CT-associated X-ray radiation to patients. Reducing the radiation dose, however, decreases the quality of the reconstructed images, which consequently compromises the diagnostic performance. Various deep learning techniques have been introduced to improve the image quality of LDCT images through denois… ▽ More LDCT has drawn major attention in the medical imaging field due to the potential health risks of CT-associated X-ray radiation to patients. Reducing the radiation dose, however, decreases the quality of the reconstructed images, which consequently compromises the diagnostic performance. Various deep learning techniques have been introduced to improve the image quality of LDCT images through denoising. GANs-based denoising methods usually leverage an additional classification network, i.e. discriminator, to learn the most discriminate difference between the denoised and normal-dose images and, hence, regularize the denoising model accordingly; it often focuses either on the global structure or local details. To better regularize the LDCT denoising model, this paper proposes a novel method, termed DU-GAN, which leverages U-Net based discriminators in the GANs framework to learn both global and local difference between the denoised and normal-dose images in both image and gradient domains. The merit of such a U-Net based discriminator is that it can not only provide the per-pixel feedback to the denoising network through the outputs of the U-Net but also focus on the global structure in a semantic level through the middle layer of the U-Net. In addition to the adversarial training in the image domain, we also apply another U-Net based discriminator in the image gradient domain to alleviate the artifacts caused by photon starvation and enhance the edge of the denoised CT images. Furthermore, the CutMix technique enables the per-pixel outputs of the U-Net based discriminator to provide radiologists with a confidence map to visualize the uncertainty of the denoised results, facilitating the LDCT-based screening and diagnosis. Extensive experiments on the simulated and real-world datasets demonstrate superior performance over recently published methods both qualitatively and quantitatively. △ Less

Submitted 7 November, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

Comments: Accepted by IEEE Transactions on Instrumentation and Measurement

Journal ref: IEEE Transactions on Instrumentation and Measurement, 4500512, 2022

arXiv:2103.12995 [pdf, other]

MANAS: Multi-Scale and Multi-Level Neural Architecture Search for Low-Dose CT Denoising

Authors: Zexin Lu, Wenjun Xia, Yongqiang Huang, Hongming Shan, Hu Chen, Jiliu Zhou, Yi Zhang

Abstract: Lowering the radiation dose in computed tomography (CT) can greatly reduce the potential risk to public health. However, the reconstructed images from the dose-reduced CT or low-dose CT (LDCT) suffer from severe noise, compromising the subsequent diagnosis and analysis. Recently, convolutional neural networks have achieved promising results in removing noise from LDCT images; the network architect… ▽ More Lowering the radiation dose in computed tomography (CT) can greatly reduce the potential risk to public health. However, the reconstructed images from the dose-reduced CT or low-dose CT (LDCT) suffer from severe noise, compromising the subsequent diagnosis and analysis. Recently, convolutional neural networks have achieved promising results in removing noise from LDCT images; the network architectures used are either handcrafted or built on top of conventional networks such as ResNet and U-Net. Recent advance on neural network architecture search (NAS) has proved that the network architecture has a dramatic effect on the model performance, which indicates that current network architectures for LDCT may be sub-optimal. Therefore, in this paper, we make the first attempt to apply NAS to LDCT and propose a multi-scale and multi-level NAS for LDCT denoising, termed MANAS. On the one hand, the proposed MANAS fuses features extracted by different scale cells to capture multi-scale image structural details. On the other hand, the proposed MANAS can search a hybrid cell- and network-level structure for better performance. Extensively experimental results on three different dose levels demonstrate that the proposed MANAS can achieve better performance in terms of preserving image structural details than several state-of-the-art methods. In addition, we also validate the effectiveness of the multi-scale and multi-level architecture for LDCT denoising. △ Less

Submitted 24 March, 2021; originally announced March 2021.

arXiv:2103.10459 [pdf]

doi 10.1038/s41467-021-26715-9

Spatial coherence of room-temperature monolayer WSe$_2$ exciton-polaritons in a trap

Authors: Hangyong Shan, Lukas Lackner, Bo Han, Evgeny Sedov, Christoph Rupprecht, Heiko Knopf, Falk Eilenberger, Johannes Beierlein, Nils Kunte, Martin Esmann, Kentaro Yumigeta, Kenji Watanabe, Takashi Taniguchi, Sebastian Klembt, Sven Höfling, Alexey V. Kavokin, Sefaattin Tongay, Christian Schneider, Carlos Antón-Solanas

Abstract: The emergence of spatial and temporal coherence of light emitted from solid-state systems is a fundamental phenomenon, rooting in a plethora of microscopic processes. It is intrinsically aligned with the control of light-matter coupling, and canonical for laser oscillation. However, it also emerges in the superradiance of multiple, phase-locked emitters, and more recently, coherence and long-range… ▽ More The emergence of spatial and temporal coherence of light emitted from solid-state systems is a fundamental phenomenon, rooting in a plethora of microscopic processes. It is intrinsically aligned with the control of light-matter coupling, and canonical for laser oscillation. However, it also emerges in the superradiance of multiple, phase-locked emitters, and more recently, coherence and long-range order have been investigated in bosonic condensates of thermalized light, as well as in exciton-polaritons driven to a ground state via stimulated scattering. Here, we experimentally show that the interaction between photons in a Fabry-Perot microcavity and excitons in an atomically thin WSe$_2$ layer is sufficient such that the system enters the hybridized regime of strong light-matter coupling at ambient conditions. Via Michelson interferometry, we capture clear evidence of increased spatial and temporal coherence of the emitted light from the spatially confined system ground-state. The coherence build-up is accompanied by a threshold-like behaviour of the emitted light intensity, which is a fingerprint of a polariton laser effect. Valley-physics is manifested in the presence of an external magnetic field, which allows us to manipulate K and K' polaritons via the Valley-Zeeman-effect. Our findings are of high application relevance, as they confirm the possibility to use atomically thin crystals as simple and versatile components of coherent light-sources, and in valleytronic applications at room temperature. △ Less

Submitted 9 November, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

Comments: 13 pages, 4 figures

Journal ref: Shan, H. et al., Nature Communications 12, 6406 (2021)

arXiv:2006.12700 [pdf, other]

Cine Cardiac MRI Motion Artifact Reduction Using a Recurrent Neural Network

Authors: Qing Lyu, Hongming Shan, Yibin Xie, Debiao Li, Ge Wang

Abstract: Cine cardiac magnetic resonance imaging (MRI) is widely used for diagnosis of cardiac diseases thanks to its ability to present cardiovascular features in excellent contrast. As compared to computed tomography (CT), MRI, however, requires a long scan time, which inevitably induces motion artifacts and causes patients' discomfort. Thus, there has been a strong clinical motivation to develop techniq… ▽ More Cine cardiac magnetic resonance imaging (MRI) is widely used for diagnosis of cardiac diseases thanks to its ability to present cardiovascular features in excellent contrast. As compared to computed tomography (CT), MRI, however, requires a long scan time, which inevitably induces motion artifacts and causes patients' discomfort. Thus, there has been a strong clinical motivation to develop techniques to reduce both the scan time and motion artifacts. Given its successful applications in other medical imaging tasks such as MRI super-resolution and CT metal artifact reduction, deep learning is a promising approach for cardiac MRI motion artifact reduction. In this paper, we propose a recurrent neural network to simultaneously extract both spatial and temporal features from under-sampled, motion-blurred cine cardiac images for improved image quality. The experimental results demonstrate substantially improved image quality on two clinical test datasets. Also, our method enables data-driven frame interpolation at an enhanced temporal resolution. Compared with existing methods, our deep learning approach gives a superior performance in terms of structural similarity (SSIM) and peak signal-to-noise ratio (PSNR). △ Less

Submitted 22 June, 2020; originally announced June 2020.

Comments: 10 pages, 11 figures

arXiv:1910.07735 [pdf]

Deep learning for accelerating Monte Carlo radiation transport simulation in intensity-modulated radiation therapy

Authors: Zhao Peng, Hongming Shan, Tianyu Liu, Xi Pei, Jieping Zhou, Ge Wang, X. George Xu

Abstract: Cancer is a primary cause of morbidity and mortality worldwide. The radiotherapy plays a more and more important role in cancer treatment. In the radiotherapy, the dose distribution maps in patient need to be calculated and evaluated for the purpose of killing tumor and protecting healthy tissue. Monte Carlo (MC) radiation transport calculation is able to account for all aspects of radiological ph… ▽ More Cancer is a primary cause of morbidity and mortality worldwide. The radiotherapy plays a more and more important role in cancer treatment. In the radiotherapy, the dose distribution maps in patient need to be calculated and evaluated for the purpose of killing tumor and protecting healthy tissue. Monte Carlo (MC) radiation transport calculation is able to account for all aspects of radiological physics within 3D heterogeneous media such as the human body and generate the dose distribution maps accurately. However, an MC calculation for doses in radiotherapy usually takes a great mass of time to achieve acceptable statistical uncertainty, impeding the MC methods from wider clinic applications. Here we introduce a convolutional neural network (CNN), termed as Monte Carlo Denoising Net (MCDNet), to achieve the acceleration of the MC dose calculations in radiotherapy, which is trained to directly predict the high-photon (noise-free) dose maps from the low-photon (noise-much) dose maps. Thirty patients with postoperative rectal cancer who accepted intensity-modulated radiation therapy (IMRT) were enrolled in this study. 3D Gamma Index Passing Rate (GIPR) is used to evaluate the performance of predicted dose maps. The experimental results demonstrate that the MCDNet can improve the GIPR of dose maps of 1x107 photons over that of 1x108 photons, yielding over 10x speed-up in terms of photon numbers used in the MC simulations of IMRT. It is of great potential to investigate the performance of this method on the other tumor sites and treatment modalities. △ Less

Submitted 17 October, 2019; originally announced October 2019.

arXiv:1909.11721 [pdf]

doi 10.1117/12.2530234

Deep-learning-based Breast CT for Radiation Dose Reduction

Authors: Wenxiang Cong, Hongming Shan, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Ge Wang

Abstract: Cone-beam breast computed tomography (CT) provides true 3D breast images with isotropic resolution and high-contrast information, detecting calcifications as small as a few hundred microns and revealing subtle tissue differences. However, breast is highly sensitive to x-ray radiation. It is critically important for healthcare to reduce radiation dose. Few-view cone-beam CT only uses a fraction of… ▽ More Cone-beam breast computed tomography (CT) provides true 3D breast images with isotropic resolution and high-contrast information, detecting calcifications as small as a few hundred microns and revealing subtle tissue differences. However, breast is highly sensitive to x-ray radiation. It is critically important for healthcare to reduce radiation dose. Few-view cone-beam CT only uses a fraction of x-ray projection data acquired by standard cone-beam breast CT, enabling significant reduction of the radiation dose. However, insufficient sampling data would cause severe streak artifacts in CT images reconstructed using conventional methods. In this study, we propose a deep-learning-based method to establish a residual neural network model for the image reconstruction, which is applied for few-view breast CT to produce high quality breast CT images. We respectively evaluate the deep-learning-based image reconstruction using one third and one quarter of x-ray projection views of the standard cone-beam breast CT. Based on clinical breast imaging dataset, we perform a supervised learning to train the neural network from few-view CT images to corresponding full-view CT images. Experimental results show that the deep learning-based image reconstruction method allows few-view breast CT to achieve a radiation dose <6 mGy per cone-beam CT scan, which is a threshold set by FDA for mammographic screening. △ Less

Submitted 25 September, 2019; originally announced September 2019.

Comments: 7 pages, 4 figures

arXiv:1908.01612 [pdf, other]

doi 10.1109/TMI.2020.2974858

Multi-Contrast Super-Resolution MRI Through a Progressive Network

Authors: Qing Lyu, Hongming Shan, Ge Wang

Abstract: Magnetic resonance imaging (MRI) is widely used for screening, diagnosis, image-guided therapy, and scientific research. A significant advantage of MRI over other imaging modalities such as computed tomography (CT) and nuclear imaging is that it clearly shows soft tissues in multi-contrasts. Compared with other medical image super-resolution (SR) methods that are in a single contrast, multi-contra… ▽ More Magnetic resonance imaging (MRI) is widely used for screening, diagnosis, image-guided therapy, and scientific research. A significant advantage of MRI over other imaging modalities such as computed tomography (CT) and nuclear imaging is that it clearly shows soft tissues in multi-contrasts. Compared with other medical image super-resolution (SR) methods that are in a single contrast, multi-contrast super-resolution studies can synergize multiple contrast images to achieve better super-resolution results. In this paper, we propose a one-level non-progressive neural network for low up-sampling multi-contrast super-resolution and a two-level progressive network for high up-sampling multi-contrast super-resolution. Multi-contrast information is combined in high-level feature space. Our experimental results demonstrate that the proposed networks can produce MRI super-resolution images with good image quality and outperform other multi-contrast super-resolution methods in terms of structural similarity and peak signal-to-noise ratio. Also, the progressive network produces a better SR image quality than the non-progressive network, even if the original low-resolution images were highly down-sampled. △ Less

Submitted 6 August, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

Comments: 10 figures, 5 tables, 11 pages

Journal ref: IEEE Transactions on Medical Imaging, early access, 2020

arXiv:1908.00360 [pdf]

doi 10.1002/mp.14131

A Method of Rapid Quantification of Patient-Specific Organ Dose for CT Using Coupled Deep-Learning based Multi-Organ Segmentation and GPU-accelerated Monte Carlo Dose Computing

Authors: Zhao Peng, Xi Fang, Pingkun Yan, Hongming Shan, Tianyu Liu, Xi Pei, Ge Wang, Bob Liu, Mannudeep K. Kalra, X. George Xu

Abstract: Purpose: This paper describes a new method to apply deep-learning algorithms for automatic segmentation of radiosensitive organs from 3D tomographic CT images before computing organ doses using a GPU-based Monte Carlo code. Methods: A deep convolutional neural network (CNN) for organ segmentation is trained to automatically delineate radiosensitive organs from CT. With a GPU-based Monte Carlo dose… ▽ More Purpose: This paper describes a new method to apply deep-learning algorithms for automatic segmentation of radiosensitive organs from 3D tomographic CT images before computing organ doses using a GPU-based Monte Carlo code. Methods: A deep convolutional neural network (CNN) for organ segmentation is trained to automatically delineate radiosensitive organs from CT. With a GPU-based Monte Carlo dose engine (ARCHER) to derive CT dose of a phantom made from a subject's CT scan, we are then able to compute the patient-specific CT dose for each of the segmented organs. The developed tool is validated by using Relative Dose Error (RDE) against the organ doses calculated by ARCHER with manual segmentation performed by radiologists. The dose computation results are also compared against organ doses from population-average phantoms to demonstrate the improvement achieved by using the developed tool. In this study, two datasets were used: The Lung CT Segmentation Challenge 2017 (LCTSC) dataset, which contains 60 thoracic CT scan patients each with 5 segmented organs, and the Pancreas-CT (PCT) dataset, which contains 43 abdominal CT scan patients each with 8 segmented organs. Five-fold cross-validation of the new method is performed on both datasets. Results: Comparing with the traditional organ dose evaluation method that based on population-average phantom, our proposed method achieved the smaller RDE range on all organs with -4.3%~1.5% vs -31.5%~33.9% (lung), -7.0%~2.3% vs -15.2%~125.1% (heart), -18.8%~40.2% vs -10.3%~124.1% (esophagus) in the LCTSC dataset and -5.6%~1.6% vs -20.3%~57.4% (spleen), -4.5%~4.6% vs -19.5%~61.0% (pancreas), -2.3%~4.4% vs -37.8%~75.8% (left kidney), -14.9%~5.4% vs -39.9% ~14.6% (gall bladder), -0.9%~1.6% vs -30.1%~72.5% (liver), and -23.0%~11.1% vs -52.5%~-1.3% (stomach) in the PCT dataset. △ Less

Submitted 29 September, 2019; v1 submitted 1 August, 2019; originally announced August 2019.

arXiv:1907.03063 [pdf]

doi 10.1109/TCI.2020.2964201

MRI Super-Resolution with Ensemble Learning and Complementary Priors

Authors: Qing Lyu, Hongming Shan, Ge Wang

Abstract: Magnetic resonance imaging (MRI) is a widely used medical imaging modality. However, due to the limitations in hardware, scan time, and throughput, it is often clinically challenging to obtain high-quality MR images. The super-resolution approach is potentially promising to improve MR image quality without any hardware upgrade. In this paper, we propose an ensemble learning and deep learning frame… ▽ More Magnetic resonance imaging (MRI) is a widely used medical imaging modality. However, due to the limitations in hardware, scan time, and throughput, it is often clinically challenging to obtain high-quality MR images. The super-resolution approach is potentially promising to improve MR image quality without any hardware upgrade. In this paper, we propose an ensemble learning and deep learning framework for MR image super-resolution. In our study, we first enlarged low resolution images using 5 commonly used super-resolution algorithms and obtained differentially enlarged image datasets with complementary priors. Then, a generative adversarial network (GAN) is trained with each dataset to generate super-resolution MR images. Finally, a convolutional neural network is used for ensemble learning that synergizes the outputs of GANs into the final MR super-resolution images. According to our results, the ensemble learning results outcome any one of GAN outputs. Compared with some state-of-the-art deep learning-based super-resolution methods, our approach is advantageous in suppressing artifacts and keeping more image details. △ Less

Submitted 5 July, 2019; originally announced July 2019.

Journal ref: IEEE Transactions on Computational Imaging, vol. 6, pp. 615-624, 2020

arXiv:1811.03691 [pdf, other]

doi 10.1038/s42256-019-0057-9

Can Deep Learning Outperform Modern Commercial CT Image Reconstruction Methods?

Authors: Hongming Shan, Atul Padole, Fatemeh Homayounieh, Uwe Kruger, Ruhani Doda Khera, Chayanin Nitiwarangkul, Mannudeep K. Kalra, Ge Wang

Abstract: Commercial iterative reconstruction techniques on modern CT scanners target radiation dose reduction but there are lingering concerns over their impact on image appearance and low contrast detectability. Recently, machine learning, especially deep learning, has been actively investigated for CT. Here we design a novel neural network architecture for low-dose CT (LDCT) and compare it with commercia… ▽ More Commercial iterative reconstruction techniques on modern CT scanners target radiation dose reduction but there are lingering concerns over their impact on image appearance and low contrast detectability. Recently, machine learning, especially deep learning, has been actively investigated for CT. Here we design a novel neural network architecture for low-dose CT (LDCT) and compare it with commercial iterative reconstruction methods used for standard of care CT. While popular neural networks are trained for end-to-end mapping, driven by big data, our novel neural network is intended for end-to-process mapping so that intermediate image targets are obtained with the associated search gradients along which the final image targets are gradually reached. This learned dynamic process allows to include radiologists in the training loop to optimize the LDCT denoising workflow in a task-specific fashion with the denoising depth as a key parameter. Our progressive denoising network was trained with the Mayo LDCT Challenge Dataset, and tested on images of the chest and abdominal regions scanned on the CT scanners made by three leading CT vendors. The best deep learning based reconstructions are systematically compared to the best iterative reconstructions in a double-blinded reader study. It is found that our deep learning approach performs either comparably or favorably in terms of noise suppression and structural fidelity, and runs orders of magnitude faster than the commercial iterative CT reconstruction algorithms. △ Less

Submitted 8 November, 2018; originally announced November 2018.

Comments: 17 pages, 7 figures

Journal ref: Nature Machine Intelligence, 1(6) (2019) 269-276

arXiv:1810.06776 [pdf]

Super-resolution MRI through Deep Learning

Authors: Qing Lyu, Chenyu You, Hongming Shan, Ge Wang

Abstract: Magnetic resonance imaging (MRI) is extensively used for diagnosis and image-guided therapeutics. Due to hardware, physical and physiological limitations, acquisition of high-resolution MRI data takes long scan time at high system cost, and could be limited to low spatial coverage and also subject to motion artifacts. Super-resolution MRI can be achieved with deep learning, which is a promising ap… ▽ More Magnetic resonance imaging (MRI) is extensively used for diagnosis and image-guided therapeutics. Due to hardware, physical and physiological limitations, acquisition of high-resolution MRI data takes long scan time at high system cost, and could be limited to low spatial coverage and also subject to motion artifacts. Super-resolution MRI can be achieved with deep learning, which is a promising approach and has a great potential for preclinical and clinical imaging. Compared with polynomial interpolation or sparse-coding algorithms, deep learning extracts prior knowledge from big data and produces superior MRI images from a low-resolution counterpart. In this paper, we adapt two state-of-the-art neural network models for CT denoising and deblurring, transfer them for super-resolution MRI, and demonstrate encouraging super-resolution MRI results toward two-fold resolution enhancement. △ Less

Submitted 15 October, 2018; originally announced October 2018.

arXiv:1805.12006 [pdf]

A Synergized Pulsing-Imaging Network (SPIN)

Authors: Qing Lyu, Tao Xu, Hongming Shan, Ge Wang

Abstract: Currently, the deep neural network is the mainstream for machine learning, and being actively developed for biomedical imaging applications with an increasing emphasis on tomographic reconstruction for MRI, CT, and other imaging modalities. Multiple deep-learning-based approaches were applied to MRI image reconstruction from k-space samples to final images. Each of these studies assumes a given pu… ▽ More Currently, the deep neural network is the mainstream for machine learning, and being actively developed for biomedical imaging applications with an increasing emphasis on tomographic reconstruction for MRI, CT, and other imaging modalities. Multiple deep-learning-based approaches were applied to MRI image reconstruction from k-space samples to final images. Each of these studies assumes a given pulse sequence that produces incomplete and/or inconsistent data in the Fourier space, and targets a trained neural network that recovers an underlying image as close as possible to the ground truth. For the first time, in this paper we view data acquisition and the image reconstruction as the two key parts of an integrated MRI process, and optimize both the pulse sequence and the reconstruction scheme seamlessly in the machine learning framework. Our pilot simulation results show an exemplary embodiment of our new MRI strategy. Clearly, this work can be extended to other imaging modalities and their combinations as well, such as ultrasound imaging, and also potentially simultaneous emission-transmission tomography aided by polarized radiotracers. △ Less

Submitted 27 May, 2018; originally announced May 2018.

Comments: 13 pages, 9 figures, 16 references

arXiv:1801.08432 [pdf, ps, other]

BIGSTICK: A flexible configuration-interaction shell-model code

Authors: Calvin W. Johnson, W. Erich Ormand, Kenneth S. McElvain, Hongzhang Shan

Abstract: We present BIGSTICK, a flexible configuration-interaction open-source shell-model code for the many-fermion problem. Written mostly in Fortran 90 with some later extensions, BIGSTICK utilizes a factorized on-the-fly algorithm for computing many-body matrix elements, and has both MPI (distributed memory) and OpenMP (shared memory) parallelization, and can run on platforms ranging from laptops to th… ▽ More We present BIGSTICK, a flexible configuration-interaction open-source shell-model code for the many-fermion problem. Written mostly in Fortran 90 with some later extensions, BIGSTICK utilizes a factorized on-the-fly algorithm for computing many-body matrix elements, and has both MPI (distributed memory) and OpenMP (shared memory) parallelization, and can run on platforms ranging from laptops to the largest parallel supercomputers. It uses a flexible yet efficient many-body truncation scheme, and reads input files in multiple formats, allowing one to tackle both phenomenological (major valence shell space) and ab initio (the so-called no-core shell model) calculations. BIGSTICK can generate energy spectra, static and transition one-body densities, and expectation values of scalar operators. Using the built-in Lanczos algorithm one can compute transition probability distributions and decompose wave functions into components defined by group theory. This manual provides a general guide to compiling and running BIGSTICK, which comes with numerous sample input files, as well as some of the basic theory underlying the code. △ Less

Submitted 24 January, 2018; originally announced January 2018.

Comments: This code is distributed under the MIT Open Source License. The source code and sample inputs are found at github.com/cwjsdsu/BigstickPublick

Report number: LLNL-SM-739926

Showing 1–21 of 21 results for author: Shan, H