Abstract
Esophageal cancer is ranked as the sixth most fatal cancer type. Most esophageal cancers are believed to arise from overlooked abnormalities in the esophagus tube. The early detection of these abnormalities is considered challenging due to their different appearance and random location throughout the esophagus tube. In this paper, a novel Gabor Fractal DenseNet Faster R-CNN (GFD Faster R-CNN) is proposed which is a two-input network adapted from the Faster R-CNN to address the challenges of esophageal abnormality detection. First, a Gabor Fractal (GF) image is generated using various Gabor filter responses considering different orientations and scales, obtained from the original endoscopic image that strengthens the fractal texture information within the image. Secondly, we incorporate Densely Connected Convolutional Network (DenseNet) as the backbone network to extract features from both original endoscopic image and the generated GF image separately; the DenseNet provides a reduction in the trained parameters while supporting the network accuracy and enables a maximum flow of information. Features extracted from the GF and endoscopic images are fused through bilinear fusion before ROI pooling stage in Faster R-CNN, providing a rich feature representation that boosts the performance of final detection. The proposed architecture was trained and tested on two different datasets independently: Kvasir (1000 images) and MICCAI’15 (100 images). Extensive experiments have been carried out to evaluate the performance of the model, with a recall of 0.927 and precision of 0.942 for Kvasir dataset, and a recall of 0.97 and precision of 0.92 for MICCAI’15 dataset, demonstrating a high detection performance compared to the state-of-the-art.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cancer.Net. https://www.cancer.net/cancer-types/esophageal-cancer/statistics
Mendel, R., Ebigbo, A., Probst, A., Messmann, H., Palm, C.: Barrett’s esophagus analysis using convolutional neural networks. Bildverarbeitung für die Medizin 2017. I, pp. 80–85. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-662-54345-0_23
Van Riel, S., Van Der Sommen, F., Zinger, S., Schoon, E.J., de With, P.H.: Automatic detection of early esophageal cancer with CNNS using transfer learning. In: IEEE ICIP, pp. 1383–1387 (2018)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016)
Girshick R.: Fast R-CNN. In: ICCV 2015, pp. 1440–1448. IEEE (2015)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: CVPR 2017, pp. 4700–4708, IEEE (2017)
Shangzhen, L., Chen, C., Zhang, B., Han, J., Liu, J.: Gabor convolutional networks. IEEE Trans. Image Process. 27(9), 4357–4366 (2018). https://doi.org/10.1109/TIP.2018.2835143
Feichtenhofer, C., Pinz, A., Zisserman, A.: Convolutional two-stream network fusion for video action recognition. In: CVPR 2016, pp. 1933–1941. IEEE (2016)
Sub-Challenge Early Barrett’s cancer detection. https://endovissub-barrett.grand-challenge.org
Pogorelov, K., et al.: KVASIR: a multi-class image dataset for computer aided gastrointestinal disease detection. In: The 8th ACM on Multimedia Systems Conference, pp. 164–169. ACM (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Ghatwary, N., Zolgharni, M., Ye, X. (2019). GFD Faster R-CNN: Gabor Fractal DenseNet Faster R-CNN for Automatic Detection of Esophageal Abnormalities in Endoscopic Images. In: Suk, HI., Liu, M., Yan, P., Lian, C. (eds) Machine Learning in Medical Imaging. MLMI 2019. Lecture Notes in Computer Science(), vol 11861. Springer, Cham. https://doi.org/10.1007/978-3-030-32692-0_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-32692-0_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32691-3
Online ISBN: 978-3-030-32692-0
eBook Packages: Computer ScienceComputer Science (R0)