Abstract
Individual tooth segmentation and identification from cone-beam computed tomography images are preoperative prerequisites for orthodontic treatments. Instance segmentation methods using convolutional neural networks have demonstrated ground-breaking results on individual tooth segmentation tasks, and are used in various medical imaging applications. While point-based detection networks achieve superior results on dental images, it is still a challenging task to distinguish adjacent teeth because of their similar topologies and proximate nature. In this study, we propose a point-based tooth localization network that effectively disentangles each individual tooth based on a Gaussian disentanglement objective function. The proposed network first performs heatmap regression accompanied by box regression for all the anatomical teeth. A novel Gaussian disentanglement penalty is employed by minimizing the sum of the pixel-wise multiplication of the heatmaps for all adjacent teeth pairs. Subsequently, individual tooth segmentation is performed by converting a pixel-wise labeling task to a distance map regression task to minimize false positives in adjacent regions of the teeth. Experimental results demonstrate that the proposed algorithm outperforms state-of-the-art approaches by increasing the average precision of detection by 9.1%, which results in a high performance in terms of individual tooth segmentation. The primary significance of the proposed method is two-fold: (1) the introduction of a point-based tooth detection framework that does not require additional classification and (2) the design of a novel loss function that effectively separates Gaussian distributions based on heatmap responses in the point-based detection framework.
Similar content being viewed by others
References
Çiçek Ö, Abdulkadir A, Lienkamp SS, Brox T, Ronneberger O (2016) 3d u-net: learning dense volumetric segmentation from sparse annotation. In: International conference on medical image computing computer-assisted intervention. Springer, Berlin, pp 424–432
Chung M, Lee M, Park S, Lee M, Lee CE, Lee J, Shin Y-G (2020) Individual tooth detection and identification from dental panoramic x-ray images via point-wise localization and distance regularization. Artif Intell Med 111:101996
Chung M, Lee M, Hong J, Park S, Lee J, Lee J, Yang I-H, Lee J, Shin Y-G (2020) Pose-aware instance segmentation framework from cone beam ct images for tooth segmentation. Comput Biol Med 120:103720
Cui Z, Li C, Wang W (2019) Toothnet: Automatic tooth instance segmentation and identification from cone beam ctimages. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 6368–6377
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol 1. IEEE, pp 886–893
Duan K, Bai S, Xie L, Qi H, Huang Q, Tian Q (2019) Centernet: Keypoint triplets for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 6569–6578
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmen-tation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 580–587
He K, Gkioxari G (2017) Piotr Dollár, and Ross Girshick. Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2961–2969
Law H, Deng J (2018) Cornernet: Detecting objects as paired keypoints. In: Proceedings of the European Conference on Computer Vision(ECCV), pp 734–750
Lee J-H, Han S-S, Kim YH, Lee C, Kim I (2020) Application of a fully deep convolutional neural network to the automation of tooth segmentation on panoramic radiographs. Oral Surg Oral Med Oral Pathol Oral Radiol 129(6):635–642
Lienhart R, Maydt J (2002) An extended set of haar-like features for rapid object detection. In: Proceedings. International Conferenceon Image Processing, vol 1. IEEE, pp I–I
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3431–3440
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Miki Y, Muramatsu C, Hayashi T, Zhou X, Hara T, Katsumata A, Fujita H (2017) Classification of teeth in cone-beam CT using deep convolutional neural network. Comput Biol Med 80:24–29
Newell A, Yang K, Deng J (2016) Stacked hourglass networks for human pose estimation. In: European Conference on Computer Vision. Springer, Berlin, pp 483–499
Pedro F, Felzenszwalb RB, Girshick D, McAllester, Ramanan D (2009) Object detection with discriminatively trained part-basedmodels. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767
Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 7263–7271
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 779–788
Ren R, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp 91–99
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Berlin, pp 234–241
Rosenfeld A, Pfaltz JL (1968) Distance functions on digital pictures. Pattern Recogn 1(1):33–61
Tuzoff DV, Tuzova LN, Bornstein M, Krasnov AS, Kharchenko MA, Nikolenko SI, Sveshnikov MM, Bednenko GB (2019) Tooth detection and numbering in panoramic radiographs using convolutional neural networks. Dento Maxillo Fac Radiol 48(4):20180051
Van Ginneken B, Ter Romeny BM, Viergever MA (2001) Computer-aided diagnosis in chest radiography: a survey. IEEE Trans Med Imaging 20(12):1228–1241
Wu K, Chen L, Li J, Zhou Y (2014) Tooth segmentation on dental meshes using morphologic skeleton. Comput Graph 38:199–211
Xu X, Liu C, Zheng Y (2019) 3D tooth segmentation and labeling using deep convolutional neural networks. IEEE Trans Vis Comput Graph 25(7):2336–2348. https://doi.org/10.1109/TVCG.2018.2839685
Zhao Z-Q, Zheng P, Xu Shou-tao, Wu X (2019) Object detection with deep learning: A review. IEEE Trans Neural Networks Learn Syst 30(11):3212–3232
Zhou X, Wang D, Krähenbühl P (2019) Objects as points. arXiv preprint arXiv:1904.07850
Acknowledgements
This work was supported by Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No.2021-0-00511, Robust AI and Distributed Attack Detection for Edge AI Security).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None declared.
Author agreement
All authors including Jusang Lee, Minyoung Chung, Minkyung Lee, and Yeong-Gil Shin agreed the submission of this manuscript.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lee, J., Chung, M., Lee, M. et al. Tooth instance segmentation from cone-beam CT images through point-based detection and Gaussian disentanglement. Multimed Tools Appl 81, 18327–18342 (2022). https://doi.org/10.1007/s11042-022-12524-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12524-9