Abstract
Automated tissue classification is an essential step for the quantitative analysis and treatment of emphysema. Although many studies have been conducted in this area, two major challenges remain. First, different emphysematous tissues appear at different scales, which we call "inter-class variations". Second, the intensities of CT images acquired from different patients, scanners, or scanning protocols may vary, which we call "intra-class variations". In this paper, we present a novel multi-scale residual network with two channels: the raw CT image and its differential excitation component. We incorporate multi-scale information into our networks to address the challenge of inter-class variations. In addition to the conventional raw CT image, we use its differential excitation component as a paired input channel to handle intra-class variations. Experimental results show that our approach outperforms the state-of-the-art methods, achieving a classification accuracy of 93.74% on our own emphysema database.
1 Introduction
Emphysema is a major component of chronic obstructive pulmonary disease (COPD), which is emerging as a worldwide health problem. Generally, as shown in Fig. 1, emphysema can be classified into three subtypes: centrilobular emphysema (CLE), which generally appears as scattered small low-attenuation areas; paraseptal emphysema (PSE), which appears as low-attenuation areas aligned in a row along the visceral pleura [1]; and panlobular emphysema (PLE), which usually manifests as a widespread low-attenuation region with fewer and smaller lung vessels [1]. These subtypes have different pathophysiological significance [2]. Hence, the classification and quantification of emphysema are important.
Much research has been conducted to classify lung tissue into different emphysema subtypes. One common approach is based on the local intensity distribution, such as kernel density estimation (KDE) [3]. Another class of approaches describes the morphology of emphysema using texture analysis techniques [1, 4,5,6]. In recent years, some attempts have revealed the potential of deep learning for lung disease classification, but it has been applied to emphysema classification in only two studies [7, 8]. The networks in these two studies are very preliminary, using only two or three convolutional layers, so they cannot capture high-level features. Since the classification of emphysema depends mainly on texture and intensity features, two major challenges remain. (1) "Inter-class variations": as can be seen in Fig. 1, different emphysematous tissues appear at different scales. Existing methods ignore the scales of the different emphysema subtypes, which are useful clues for diagnosis, so it is highly desirable to develop models that take full advantage of information from multiple scales. (2) "Intra-class variations": in clinical practice, the intensities of CT images acquired from different patients, scanners, or scanning protocols may vary [9]. This variation affects the classification accuracy of emphysema, so it is necessary to design models that are robust to such variability. In addition, existing methods for emphysema classification are limited to extracting low-level or mid-level features, which have limited ability to distinguish different patterns.
In this paper, we focus on the supervised classification of emphysema. We propose a novel deep learning method using a multi-scale residual network (MS-ResNet) [16] with two channels: the raw CT image and its differential excitation component. In contrast to previous works, our method discovers high-level features that better characterize emphysema lesions. We incorporate multi-scale information into our networks to address the challenge of inter-class variations. Moreover, to handle intra-class variations, we first transform the raw image data into the differential excitation domain of human perception based on Weber's law, which is robust to intensity variability. We then use the raw CT images and the transformed images as different channels of the network inputs. The experiments show that our method achieves higher classification accuracy than the state-of-the-art methods. Based on the classification results, we calculate the area percentage of each class (CLE%, PLE%, and PSE%, respectively). We then show the relationship between these quantitative results (area percentages) and the forced expiratory volume in one second expressed as a percentage of the predicted value (FEV1%), which is the primary indicator of pulmonary function tests (PFTs).
2 Methods
In this section, we first describe how to transform the raw CT image into the differential excitation domain. Subsequently, we present our multi-scale residual network with two channels of the raw CT image and its differential excitation component. An overview of the proposed method is shown in Fig. 2.
2.1 Differential Excitation Component
Ernst Heinrich Weber, an experimental psychologist of the 19th century, observed that the ratio of the perceived change in a stimulus to the initial stimulus is a constant [10], which is well known as Weber's law and can be expressed as \( \Delta I/I = \alpha \), where \( \Delta I \) denotes the perceived change in the stimulus, \( I \) denotes the initial stimulus, and \( \alpha \) is referred to as the Weber fraction for detecting changes in the stimulus.
Inspired by Weber's law, which shows that human perception of a pattern depends not only on the absolute intensity of the stimulus but also on the relative variation of the stimulus, we transform the raw image into the differential excitation domain of human perception, which is robust to intensity variability [10]. To do so, we first compute the difference between a focused pixel and its neighbors, which can be formulated as

\( \Delta I_{c} = \sum\nolimits_{i = 0}^{p - 1} {\left( {I_{c}^{i} - I_{c} } \right)} \)
where \( I_{c} \) is the intensity at position \( x_{c} \), \( I_{c}^{i} \left( {i = 0,\,1,\, \ldots ,\,p - 1} \right) \) is the intensity of the ith neighbor of c, and p is the number of neighbors. The differential excitation component of the focused pixel c is then defined as

\( \xi \left( {x_{c} } \right) = \arctan \left( {\frac{{\Delta I_{c} }}{{I_{c} + \lambda }}} \right) \)
where λ is a constant that prevents the denominator from being zero when the intensity is zero; λ is set to one in our experiments.
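As an illustration, the transform can be computed as in the following minimal NumPy sketch, assuming an 8-connected neighborhood (p = 8), λ = 1, and intensities rescaled to be non-negative; the function name and the use of SciPy here are illustrative choices, not a prescribed implementation.

```python
import numpy as np
from scipy.signal import correlate2d

def differential_excitation(img, lam=1.0):
    """Map a 2-D image into the differential excitation domain (p = 8).

    Assumes intensities have been rescaled to non-negative values so that
    img + lam is never zero.
    """
    img = img.astype(np.float64)
    # Sum of (I_c^i - I_c) over the 8 neighbors equals
    # (sum of neighbors) - 8 * I_c, computed with one correlation.
    kernel = np.array([[1.0,  1.0, 1.0],
                       [1.0, -8.0, 1.0],
                       [1.0,  1.0, 1.0]])
    delta = correlate2d(img, kernel, mode='same', boundary='symm')
    # arctan bounds the response; lam avoids division by zero.
    return np.arctan(delta / (img + lam))
```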
2.2 MS-ResNet with Raw and Excitation Channels
MS-ResNet.
Due to the inter-class variations of emphysema, each target category tends to be identified best at a certain scale, and the most suitable scales for different target classes may differ. That is, no single scale is best for all cases. Thus, it is essential to incorporate information from different scales into our deep neural networks [16].
As a baseline, we build a 20-layer ResNet [11], which has been shown to achieve excellent performance on image classification. To adapt it to our problem (small inputs and only 4 classes), we remove the pooling layer and modify the configuration of some layers. Figure 2 (bottom) presents the details of our ResNet. As shown in Fig. 2 (top), for each annotated pixel, we extract patches at different scales from its neighborhood. The label assigned to each patch is that of its central pixel. Note that, in this paper, different scales mean different input sizes. Figure 2 (middle) presents two ways of fusing information from different scales: multi-scale early fusion (MSEF) and multi-scale late fusion (MSLF); both are sketched in code below. For MSEF, we employ independent convolutional layers for each scale. The outputs of the average pooling layers are concatenated and fed into a shared 4-way fully connected layer with softmax to compute a cross-entropy classification loss. For MSLF, we train three separate networks, each focusing on one scale. During the fusion step, we average the probability vectors yielded by the different networks.
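The following minimal PyTorch sketch illustrates the two fusion schemes; the simple convolutional trunks stand in for the per-scale ResNets of Fig. 2, and the layer sizes are illustrative assumptions rather than our exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_trunk(in_ch=2, feat=64):
    # Stand-in for one per-scale ResNet trunk ending in global average pooling.
    return nn.Sequential(
        nn.Conv2d(in_ch, feat, 3, padding=1), nn.ReLU(),
        nn.Conv2d(feat, feat, 3, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten())

class MSEF(nn.Module):
    """Early fusion: concatenated pooled features, shared 4-way classifier."""
    def __init__(self, n_scales=3, feat=64, n_classes=4):
        super().__init__()
        self.trunks = nn.ModuleList([make_trunk(feat=feat)
                                     for _ in range(n_scales)])
        self.fc = nn.Linear(n_scales * feat, n_classes)

    def forward(self, patches):  # one tensor per scale, e.g. 27/41/61 crops
        fused = torch.cat([t(p) for t, p in zip(self.trunks, patches)], dim=1)
        return self.fc(fused)    # logits for the cross-entropy loss

def mslf_predict(models, patches):
    """Late fusion: average the softmax outputs of separately trained nets."""
    probs = [F.softmax(m(p), dim=1) for m, p in zip(models, patches)]
    return torch.stack(probs).mean(dim=0)
```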
Fused Representation of Raw Image and its Differential Excitation Component.
As mentioned in the Introduction, emphysema classification faces the challenge of intra-class variations. As shown in Fig. 2, in order to reduce the impact of intensity variability, we first transform the raw image data into the differential excitation domain of human perception, which is robust to intensity variability. We then use the raw CT images and their differential excitation components as different channels of the network inputs.
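Assembling the two-channel input is straightforward; a minimal sketch (reusing the differential excitation function sketched in Sect. 2.1, with normalization details omitted as implementation choices):

```python
import numpy as np

def two_channel_patch(raw_patch, excitation_fn):
    """Stack a raw patch with its differential excitation component.

    excitation_fn is, e.g., the differential_excitation function from the
    Sect. 2.1 sketch; the returned array has shape (2, H, W).
    """
    excitation = excitation_fn(raw_patch)
    return np.stack([raw_patch, excitation], axis=0)
```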
3 Experimental Results
3.1 Materials
Our dataset contains 101 HRCT volumes. The first part includes 91 HRCT volumes annotated manually by two experienced radiologists and checked by one experienced chest radiologist. Four types of patterns were annotated: CLE, PLE, PSE, and non-emphysema (NE), which corresponds to tissue without emphysema. This part of the dataset is used for the evaluation of classification accuracy in Sect. 3.2. Since it does not include complete pulmonary function evaluations, we collected an additional 10 HRCT volumes from patients with complete pulmonary function evaluations for the quantitative analysis of emphysema in Sect. 3.3. All data came from two hospitals and were acquired using seven types of CT scanners with a slice collimation of 1 mm–2 mm, a matrix of 512 × 512 pixels, and an in-plane resolution of 0.62 mm–0.71 mm.
3.2 Evaluation of Classification Accuracy
Experimental Setup.
Our classification experiments are conducted on the 91 annotated subjects (the first part of the dataset): 59 subjects (about 720,000 patches) for training, 14 subjects (about 140,000 patches) for validation, and 18 subjects (about 160,000 patches) for testing. A 20-layer ResNet is chosen as the baseline in this work (we found that 8-layer, 32-layer, 44-layer, and 56-layer ResNets decreased performance on our data compared to the 20-layer ResNet). We conducted extensive experiments to select patch sizes, and the results show that the most suitable scale (patch size) differs across target categories: for non-emphysema tissue, 27 × 27 inputs generate the best result; for CLE, the best scale is 41 × 41; for PLE and PSE, the highest classification accuracy is obtained with 61 × 61 inputs. Therefore, patches of sizes 27 × 27, 41 × 41, and 61 × 61 are selected as inputs to the multi-scale neural networks; a patch-extraction sketch follows.
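A minimal sketch of extracting the three patch scales around an annotated pixel; reflect padding at the image border is an illustrative choice, not a detail specified above.

```python
import numpy as np

SCALES = (27, 41, 61)  # patch sizes selected above

def extract_patches(slice_2d, row, col, scales=SCALES):
    """Return one square patch per scale, centered on (row, col)."""
    half = max(scales) // 2
    padded = np.pad(slice_2d, half, mode='reflect')  # handle borders
    r, c = row + half, col + half                    # shift into padded frame
    return [padded[r - s // 2: r + s // 2 + 1,
                   c - s // 2: c + s // 2 + 1] for s in scales]
```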
Single Scale versus Multiple Scales.
In this section, to investigate the effect of fusing multi-scale information on classification accuracy, we use only raw images as network inputs. As shown in Table 1, both the MSEF and MSLF models outperform the single-scale models (27 × 27, 41 × 41, and 61 × 61). To test the statistical significance of the differences in classification accuracy between the single-scale and multi-scale models, we calculated the classification accuracy for each patient and then applied a t-test, as sketched below. The analysis confirmed the statistically significant (p-value < 0.05) superiority of the multi-scale models over all single-scale models. Since fusing multi-scale information leads to higher accuracy, we conclude that the multi-scale methods are beneficial compared to the single-scale setting.
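A minimal sketch of this significance test; we assume a paired t-test here, since both models are evaluated on the same patients, and the accuracy values below are placeholders rather than our measured results.

```python
import numpy as np
from scipy import stats

# Placeholder per-patient accuracies; in practice there is one value per
# test subject for each model being compared.
single_scale_acc = np.array([0.89, 0.91, 0.90, 0.88])  # illustrative only
multi_scale_acc  = np.array([0.92, 0.94, 0.93, 0.91])  # illustrative only

t_stat, p_value = stats.ttest_rel(multi_scale_acc, single_scale_acc)
print(f"p = {p_value:.4f}, significant at 0.05: {p_value < 0.05}")
```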
Single Channel versus Multiple Channels.
This part compares the classification accuracy of the single-channel models (using only raw images as inputs) and the multi-channel models (using raw CT images and their differential excitation components as different input channels). As shown in Table 2, in both the single-scale and multi-scale settings, the multi-channel models offer superior performance to the single-channel models (p-value < 0.05).
Comparison to the State-of-the-Art Methods.
In this section, our approaches are compared with other state-of-the-art methods. The first five rows of Table 3 compare our methods with machine learning (ML) methods for emphysema classification; the results demonstrate the superior performance of our methods, which outperform the rest by 14% to 20%. The remainder of Table 3 shows a comparison with other deep learning methods. Since the existing deep learning methods for emphysema classification [7, 8] are very preliminary, using only two or three convolutional layers, we also compare our approaches with CNNs for interstitial lung disease (ILD) classification [12, 14]. The results show that our approaches outperform the other deep learning methods.
3.3 Emphysema Quantification
In this section, based on the classification results, we quantify the whole lung area of the 10 subjects in the second part of the dataset (those with complete pulmonary function evaluations) by calculating the area percentage of each class (CLE%, PLE%, and PSE%; see the sketch at the end of this section), and we examine the relationship between these quantitative results and FEV1%, the forced expiratory volume in one second expressed as a percentage of the predicted value, which is the primary indicator of pulmonary function tests (PFTs). Some visual results of full-lung classification are shown in Fig. 3. It can be seen that the automatic annotations (classification results) of the proposed method are similar to the radiologists' manual annotations. The relationship between the area percentages and FEV1% of the 10 subjects is shown in Table 4. According to [15], FEV1% is an effective indicator of both functional and symptomatic impairment in COPD; symptoms arise in relation to a relative loss of FEV1. More specifically, FEV1% reflects the severity of airflow obstruction in the lungs: the lower the FEV1%, the more severe the obstruction. Our results show that a larger CLE% (or PLE%) corresponds to a lower FEV1%, i.e., more severe airflow obstruction. In our experiments, we found no relationship between PSE% and FEV1%. According to the literature [1], PSE is often not associated with significant symptoms or physiological impairment, which agrees closely with our experimental results.
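A minimal sketch of the area-percentage computation: given a per-pixel class map and a lung mask, each class percentage is the fraction of lung pixels assigned to that class. The label encoding is an illustrative assumption.

```python
import numpy as np

LABELS = {"NE": 0, "CLE": 1, "PLE": 2, "PSE": 3}  # assumed encoding

def area_percentages(class_map, lung_mask):
    """Compute CLE%, PLE%, and PSE% over the segmented lung field."""
    lung = class_map[lung_mask]  # restrict to lung pixels (assumed non-empty)
    n = lung.size
    return {name: 100.0 * np.count_nonzero(lung == lab) / n
            for name, lab in LABELS.items() if name != "NE"}
```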
4 Conclusions
In this paper, we proposed a novel deep learning approach for emphysema classification using a multi-scale ResNet with two channels: the raw CT image and its differential excitation component. Our approach achieved a classification accuracy of 93.74%, which is superior to the state-of-the-art methods.
References
Sørensen, L., et al.: Quantitative analysis of pulmonary emphysema using local binary patterns. IEEE Trans. Med. Imaging 29(2), 559–569 (2010)
Lynch, D.A., et al.: CT-definable subtypes of chronic obstructive pulmonary disease: a statement of the Fleischner Society. Radiology 277(1), 192–205 (2015)
Mendoza, C.S., et al.: Emphysema quantification in a multi-scanner HRCT cohort using local intensity distributions. In: ISBI 2012, pp. 474–477 (2012)
Gangeh, M.J., Sørensen, L., Shaker, S.B., Kamel, M.S., de Bruijne, M., Loog, M.: A texton-based approach for the classification of lung parenchyma in CT images. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010. LNCS, vol. 6363, pp. 595–602. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15711-0_74
Yang, J., et al.: Texton and sparse representation based texture classification of lung parenchyma in CT images. In: EMBC 2016, pp. 1276–1279 (2016)
Peng, L., et al.: Joint Weber-based rotation invariant uniform local ternary pattern for classification of pulmonary emphysema in CT images. In: ICIP 2017, pp. 2050–2054 (2017)
Karabulut, E.M., et al.: Emphysema discrimination from raw HRCT images by convolutional neural networks. In: ELECO 2015, pp. 705–708 (2015)
Pei, X.: Emphysema classification using convolutional neural networks. In: Liu, H., Kubota, N., Zhu, X., Dillmann, R., Zhou, D. (eds.) ICIRA 2015. LNCS (LNAI), vol. 9244, pp. 455–461. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22879-2_42
Cheplygina, V., et al.: Transfer learning for multi-center classification of chronic obstructive pulmonary disease. IEEE J. Biomed. Health Inform. (2018)
Han, X., et al.: Integration of spatial and orientation contexts in local ternary patterns for HEp-2 cell classification. Pattern Recognit. Lett. 82, 23–27 (2016)
He, K., et al.: Deep residual learning for image recognition. In: CVPR 2016, pp. 770–778 (2016)
Shin, H.C., et al.: Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 35(5), 1285–1298 (2016)
Wang, Q., et al.: Multiscale rotation-invariant convolutional neural networks for lung texture classification. IEEE J. Biomed. Health Inform. 22, 184–195 (2017)
Anthimopoulos, M., et al.: Lung pattern classification for interstitial lung diseases using a deep convolutional neural network. IEEE Trans. Med. Imaging 35(5), 1207–1216 (2016)
Jakeways, N., et al.: Relationship between FEV1 reduction and respiratory symptoms in the general population. Eur. Respir. J. 21(4), 658–663 (2003)
Peng, L., et al.: Classification of pulmonary emphysema in CT images based on multi-scale deep convolutional neural networks. In: ICIP 2018 (in press)
Acknowledgements
This work was supported in part by the National Key R&D Program of China under Grant No. 2017YFB0309800, in part by the Science and Technology Support Program of Hangzhou under Grant No. 20172011A038, and in part by Grants-in-Aid for Scientific Research from the Japanese Ministry of Education, Culture, Sports, Science and Technology (MEXT) under Grant Nos. 18H03267 and 17H00754.