Introduction

As neonates grow, the cranium undergoes significant changes, such as the narrowing of cranial sutures and the closure of fontanels, which are critical markers of normal cranial development1,2. Craniosynostosis—a condition characterized by the premature fusion of cranial sutures that affects approximately 1 in every 2500 live births—can lead to severe neurodevelopmental impairments if left untreated, so accurate diagnostic tools for its early detection are crucial3,4,5,6. Moreover, developing diagnostic tools capable of distinguishing such pathological conditions first requires reference tools that establish criteria for normal cranial development. However, the lack of comprehensive research on these developmental milestones has resulted in a scarcity of reference data, making it challenging to assess whether an infant's cranial growth aligns with normal chronological changes.

Plain skull X-ray imaging, with its low radiation exposure and non-invasiveness, offers a valuable resource for evaluating cranial development, yet its potential has been underutilized in the context of age estimation.

With the advent of convolutional neural network (CNN) models, deep learning has revolutionized the ability to classify and interpret medical images with precision surpassing traditional methods7,8. This technological advancement opens new avenues for the application of plain skull X-rays in the precise estimation of postnatal age, facilitating the early detection of cranial anomalies such as craniosynostosis.

Therefore, this study aims to assess the utility of a deep learning model developed to predict the postnatal age of infants using plain skull X-ray images, evaluating its applicability in clinical medicine and various related fields. By integrating class activation mapping (CAM) into the deep learning model9, this research seeks not only to improve the accuracy of age estimation but also to provide a novel tool that facilitates the assessment of both normal and pathological cranial development. Thus, the model aims to make a significant contribution in areas such as pediatric care, forensic science, and archaeological research.

Methods

Data collection

Infants under 12 months of age who underwent plain skull X-rays for head trauma evaluation from January 2010 to December 2021 were included in this study. Patients with congenital cranial malformations were excluded. The study was approved by the Institutional Review Board of Hallym University Sacred Hospital (No. 2023-01-002), which waived the requirement for informed consent owing to the retrospective design, and all image data were anonymized.

All plain skull X-ray images (74 kVp, 200 mA, 100 ms) were acquired with a digital radiographic device (GC85A, SAMSUNG, Korea), retrieved from the institution's Picture Archiving and Communication System (PACS, Infinite version 3.0.9) in DICOM format, and converted to .png format. Personal information and annotations were removed during conversion to ensure patient confidentiality, and improperly focused images were systematically excluded from the database. To maintain the integrity of the dataset, the acquired images were reviewed by a neurosurgical expert (H.S.L.), who excluded AP, Towne, and lateral view radiographs that deviated significantly from the standard projections.
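A minimal sketch of this conversion step is given below for illustration; it assumes the pydicom and Pillow packages and hypothetical file names, since the actual conversion pipeline is not described in further detail.

```python
import numpy as np
import pydicom
from PIL import Image

def dicom_to_png(dicom_path: str, png_path: str) -> None:
    """Convert a skull X-ray from DICOM to an 8-bit PNG without header data.

    Only the pixel array is written out, so DICOM header fields
    (including patient identifiers) are discarded.
    """
    ds = pydicom.dcmread(dicom_path)
    arr = ds.pixel_array.astype(np.float32)

    # Rescale pixel values to the 8-bit range expected by PNG.
    arr = (arr - arr.min()) / (arr.max() - arr.min() + 1e-8) * 255.0

    # Some detectors store inverted grayscale; flip if indicated.
    if getattr(ds, "PhotometricInterpretation", "") == "MONOCHROME1":
        arr = 255.0 - arr

    Image.fromarray(arr.astype(np.uint8)).save(png_path)

# Hypothetical usage:
# dicom_to_png("skull_ap_0001.dcm", "skull_ap_0001.png")
```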

Additionally, another 864 images from 216 distinct patients at the same institution were used for external validation. These images were obtained from skull X-ray examinations performed between January 2017 and December 2021. The mean vertical resolution of the internal dataset was 2046 ± 80 (1382–2177) pixels, and the mean horizontal resolution was 1703 ± 74 (1382–2177) pixels. In the external validation dataset, the mean vertical resolution was 1991 ± 188 (1453–2177) pixels, and the mean horizontal resolution was 1641 ± 181 (1160–1814) pixels.

The skull X-ray images included four different types: anteroposterior (AP), Towne, right lateral, and left lateral views. Some patients had all four types of X-rays, while others had only some of them. Before the study, the X-ray views were categorized into two groups: the AP view dataset, which included both AP and Towne views, and the lateral view dataset, comprising right and left lateral views. Two separate deep learning models were independently developed for these datasets.

Dataset construction

Each image was labeled according to the patient's age group, categorized into 12 monthly age categories. As presented in Table 1, the entire dataset was divided into training, validation, and test subsets by random sampling at an 8:1:1 ratio. These subsets were mutually exclusive, and the validation dataset was used to determine the optimal stopping point of the training process. Sampling was stratified by age group to maintain consistent class proportions in each subset. To enhance the reliability of the performance estimates, the dataset split was repeated three times with three different random seeds, and deep learning models were trained separately on each split.

Table 1 Data composition of enrolled plain skull X-rays of anterior–posterior (AP) and lateral views in the internal datasets.
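As an illustration of the stratified 8:1:1 split repeated with three random seeds, a minimal sketch using scikit-learn is shown below; the file list, label array, and seed values are hypothetical.

```python
from sklearn.model_selection import train_test_split

# Hypothetical inputs: one image path and one monthly age label (0-11) per image.
image_paths = [f"skull_{i:04d}.png" for i in range(1200)]
age_labels = [i % 12 for i in range(1200)]

def split_dataset(paths, labels, seed):
    """Stratified 8:1:1 split into mutually exclusive train/validation/test subsets."""
    train_x, rest_x, train_y, rest_y = train_test_split(
        paths, labels, test_size=0.2, stratify=labels, random_state=seed)
    val_x, test_x, val_y, test_y = train_test_split(
        rest_x, rest_y, test_size=0.5, stratify=rest_y, random_state=seed)
    return (train_x, train_y), (val_x, val_y), (test_x, test_y)

# Three independent splits, each used to train and evaluate a separate model.
splits = [split_dataset(image_paths, age_labels, seed) for seed in (0, 1, 2)]
```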

Data pre-processing

To eliminate potential age prediction biases unrelated to the skull, all images were pre-processed to hide teeth and paranasal sinus areas. The region of exclusion (ROE) containing the orbital and mandibular regions in the skull X-ray was identified. The border lines of the ROE were defined as follows:

  1. On the AP or Towne's view skull X-ray image, they encompassed the upper margin of the supraorbital rim and the lower margin of the mandible (Fig. 1A).

  2. On the lateral skull X-ray image, they included the supraorbital rim, the foremost part of the mandible, and the posterior margin of the cervical spinous process (Fig. 1B).

Figure 1

The defined region of exclusion (ROE) in the skull X-ray for image tailoring. (A) Anteroposterior (AP) or Towne's view skull X-ray showing the defined ROE. The borders of the ROE extend from the upper margin of the supraorbital rim to the lower margin of the mandible. (B) Lateral skull X-ray with the ROE including the supraorbital rim, the foremost part of the mandible, and the posterior margin of the cervical spinous process. (C) Post-processed AP or Towne's view skull X-ray. The region below the upper margin of the ROE has been removed. (D) Post-processed lateral skull X-ray. A square box, defined by the upper and right margins of the ROE, has been removed.

The defined area on each of 293 skull X-ray images was labeled as the ROE by a neurosurgical expert (H.S.L.). The entire ROE dataset was divided into training, validation, and test subsets through random sampling at an 8:1:1 ratio. A MobileNetV3 model was trained for object detection of the labeled ROE. Regarding training parameters, the Adam optimizer was used with an initial learning rate of 1e-3 and a batch size of 16. Subsequently, post-processing was performed on all images to remove the detected ROEs according to the following criteria:

  1. On the AP or Towne's view skull X-ray, the region below the upper margin of the ROE was removed (Fig. 1C).

  2. On the lateral skull X-ray, the square box defined by the upper and right margins of the ROE was removed (Fig. 1D).

All tailored images were then reviewed by a neurosurgeon (H.S.L.) and adjusted for any misprocessing. After tailoring the region of interest (ROI), all images were zero-padded symmetrically about the center into squares whose side matched the longer of the width and height. Bilinear interpolation was then applied to resize the squared images of varying sizes to a uniform 1024 × 1024 pixels, and min–max normalization was applied to all images.
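A minimal sketch of the tailoring and normalization steps is shown below for illustration. It assumes the detected ROE is available as a bounding box (x_min, y_min, x_max, y_max) in pixel coordinates with the origin at the top-left corner and uses OpenCV for resizing; these are assumptions, not the original implementation.

```python
import numpy as np
import cv2

def remove_roe(img: np.ndarray, roe_box, view: str) -> np.ndarray:
    """Blank out the region of exclusion (ROE) according to the view type."""
    x_min, y_min, x_max, y_max = roe_box
    out = img.copy()
    if view == "ap":
        # AP/Towne: remove everything below the upper margin of the ROE.
        out[y_min:, :] = 0
    else:
        # Lateral: remove the box bounded by the upper and right margins of the ROE
        # (interpreted here as extending to the bottom and left image borders).
        out[y_min:, :x_max] = 0
    return out

def pad_resize_normalize(img: np.ndarray, size: int = 1024) -> np.ndarray:
    """Zero-pad symmetrically to a square, resize bilinearly, then min-max normalize."""
    h, w = img.shape[:2]
    side = max(h, w)
    top, left = (side - h) // 2, (side - w) // 2
    square = np.zeros((side, side), dtype=img.dtype)
    square[top:top + h, left:left + w] = img
    resized = cv2.resize(square, (size, size), interpolation=cv2.INTER_LINEAR).astype(np.float32)
    return (resized - resized.min()) / (resized.max() - resized.min() + 1e-8)

# Hypothetical usage (AP view, box coordinates from the ROE detector):
# img = cv2.imread("skull_ap_0001.png", cv2.IMREAD_GRAYSCALE)
# processed = pad_resize_normalize(remove_roe(img, (400, 900, 1300, 1700), view="ap"))
```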

Training CNN models

To construct the deep-learning models, two CNN architectures, DenseNet-121 and EfficientNet-V2-M, were adopted. DenseNet-121 provides efficient feature representation and learning and has been effective in medical image classification10, whereas EfficientNet-V2-M was introduced more recently and has shown higher performance in general image classification tasks at low computational cost11,12. In brief, DenseNet consists of dense blocks that link the feature maps of previous layers together, whereas EfficientNet-V2-M, like EfficientNet, was designed by searching for the most effective CNN architecture using neural architecture search. Both DenseNet-121 and EfficientNet-V2-M had been pre-trained on the ImageNet dataset and were fine-tuned with their weights unfrozen11,12,13; all layers were unfrozen, allowing fine-tuning of every layer in the network.
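A minimal sketch of how the two ImageNet-pretrained backbones could be adapted to the twelve monthly age classes with every layer trainable, using torchvision, is given below; details beyond those stated above (for example, replicating the single-channel X-ray to three input channels) are assumptions.

```python
import torch.nn as nn
from torchvision import models

NUM_CLASSES = 12  # monthly age groups (0-11 months)

def build_densenet121() -> nn.Module:
    # ImageNet-pretrained backbone; the classifier head is replaced for 12 classes.
    model = models.densenet121(weights=models.DenseNet121_Weights.IMAGENET1K_V1)
    model.classifier = nn.Linear(model.classifier.in_features, NUM_CLASSES)
    return model  # all parameters keep requires_grad=True, so every layer is fine-tuned

def build_efficientnet_v2_m() -> nn.Module:
    model = models.efficientnet_v2_m(weights=models.EfficientNet_V2_M_Weights.IMAGENET1K_V1)
    model.classifier[1] = nn.Linear(model.classifier[1].in_features, NUM_CLASSES)
    return model

# Note (assumption): grayscale skull X-rays are replicated to three channels before
# being fed to these ImageNet-pretrained networks.
```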

The batch size was set to 8 for DenseNet-121 and 4 for EfficientNet-V2-M, the maximum that the GPU memory of our hardware could accommodate for each architecture. Categorical cross-entropy was used as the loss function, and the Adam optimizer was applied14. The initial learning rate was set to 0.0001 and reduced by a factor of 0.1 every 10 epochs. Early stopping, based on the validation loss with a patience of 10 epochs, was activated after the 20th epoch, and training was limited to a total of 100 epochs. During training, the model was saved only when the validation loss fell below the minimum achieved so far; thus, the checkpoint from the epoch with the minimum validation loss was chosen as the final model to prevent overfitting.
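The schedule described above could be sketched as follows; the data loaders are assumed, and the mapping of "after the 20th epoch" onto the epoch counter is an interpretation.

```python
import copy
import torch
from torch import nn

@torch.no_grad()
def validation_loss(model, loader, criterion, device):
    model.eval()
    total, n = 0.0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        total += criterion(model(images), labels).item() * labels.size(0)
        n += labels.size(0)
    return total / n

def train(model, train_loader, val_loader, device, max_epochs=100, patience=10):
    model.to(device)
    criterion = nn.CrossEntropyLoss()                       # categorical cross-entropy
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)

    best_loss, best_state, bad_epochs = float("inf"), None, 0
    for epoch in range(max_epochs):
        model.train()
        for images, labels in train_loader:
            images, labels = images.to(device), labels.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), labels)
            loss.backward()
            optimizer.step()
        scheduler.step()                                    # decay LR by 0.1 every 10 epochs

        val_loss = validation_loss(model, val_loader, criterion, device)
        if val_loss < best_loss:
            # Save a checkpoint only when the validation loss improves.
            best_loss, best_state, bad_epochs = val_loss, copy.deepcopy(model.state_dict()), 0
        else:
            bad_epochs += 1
            if epoch >= 20 and bad_epochs >= patience:      # early stopping after the 20th epoch
                break

    model.load_state_dict(best_state)                       # epoch with minimum validation loss
    return model
```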

The deep-learning model used in this study was implemented on a PyTorch platform using a hardware system comprising an NVIDIA GeForce RTX 4090 graphics processing unit and Intel Xeon Silver central processing unit with a customized water-cooling system.

Performance evaluation and statistical analysis

After training, the performance of each deep-learning model was evaluated on the test dataset three times, once for each random seed. For external validation, the trained deep learning models were tested on the external validation dataset described above.

The primary outcome measure for the established deep learning models was the classification accuracy in predicting the twelve monthly age groups. The secondary outcome was the one-month relaxation accuracy of the deep learning models. Continuous variables are presented as means with standard deviations. The Mann–Whitney U test was used to compare prediction performance between age groups. A P-value of < 0.05 was considered statistically significant, and all tests were two-sided. A gradient-weighted class activation map (Grad-CAM++) was applied to the neural network to localize the discriminative regions used by the deep-learning tool to determine the specific class in a given image15. To validate the superiority of the proposed method, comparison experiments were conducted with the RSNA Bone Challenge winner model16. The RSNA Bone Challenge is a competition for estimating the bone age of pediatric patients from hand radiographs. The RSNA winner model used InceptionV3 as the deep learning network and additionally incorporated sex as an input feature.
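For clarity, the exact and one-month relaxation accuracies can be computed from the predicted and true month labels as sketched below; the label arrays are hypothetical.

```python
import numpy as np

def relaxed_accuracy(y_true: np.ndarray, y_pred: np.ndarray, tolerance: int = 1) -> float:
    """Fraction of predictions within `tolerance` months of the true age group."""
    return float(np.mean(np.abs(y_true - y_pred) <= tolerance))

y_true = np.array([0, 3, 7, 9, 11])    # hypothetical true monthly labels
y_pred = np.array([0, 4, 7, 11, 10])   # hypothetical predicted labels

exact = relaxed_accuracy(y_true, y_pred, tolerance=0)    # standard accuracy: 0.40
relaxed = relaxed_accuracy(y_true, y_pred, tolerance=1)  # one-month relaxation accuracy: 0.80
```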

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee (Institutional Review Board of Hallym University Sacred Hospital) and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Informed consent

This study was carried out as a retrospective analysis, wherein all patient data were anonymized prior to utilization. Informed consent was waived due to the retrospective nature of the study by the Institutional Review Board of Hallym University Sacred Hospital (No. 2023-01-002).

Results

Patient characteristics

The entire dataset, including internal and external data, comprised a total of 5797 images from 1552 children. Of these, the internal dataset included 2401 X-ray images from 1336 patients in the AP view dataset and 2532 images from 1321 patients in the lateral view dataset. The data composition of the training and test datasets is presented in Table 1. The mean age in the internal dataset was 7.3 ± 3.2 months. In the internal dataset, 632 (47.1%) of the children were female, while in the external dataset, 106 individuals (49.1%) were female. The external dataset included 864 images from 216 children.

Performance of deep learning models for age prediction

The prediction performance of the deep learning models for the internal and external datasets is presented in Table 2. For the internal dataset, the accuracy of the DenseNet-121 models in age prediction was 38.5 ± 4.0% for the AP view images and 39.7 ± 1.8% for the lateral view images. The accuracy of the EfficientNet-V2-M models was 39.1 ± 5.5% for the AP view images and 47.8 ± 1.5% for the lateral view images. EfficientNet-V2-M models thus exhibited 0.6% and 8.1% higher accuracy than DenseNet-121 models on the internal AP and lateral view images, respectively. The confusion matrices for monthly accuracy are shown in Fig. 2.

Table 2 Diagnostic performance for age prediction of machine learning.
Figure 2

Heatmaps of the confusion matrices for the per-month accuracy of the DenseNet-121 and EfficientNet-V2-M models for age prediction in the internal test dataset. (A) DenseNet-121 for AP view. (B) DenseNet-121 for lateral view. (C) EfficientNet-V2-M for AP view. (D) EfficientNet-V2-M for lateral view.

When considering a margin of error of ± 1 month, the maximum corrected accuracy of DenseNet-121 for the AP view images reached 78.2%, with an average of 78.0 ± 1.5%, as presented in Fig. 3. For the lateral images, the maximum corrected accuracy under the same error margin was 84.2%, with an average of 81.1 ± 2.9%. On the other hand, for EfficientNet-V2-M, when considering a margin of error of ± 1 month, the maximum corrected accuracy for the AP view images reached 79.1%, with an average of 77.0 ± 2.3%. For the lateral images, the maximum corrected accuracy under the same error margin was 87.3%, with an average of 85.1 ± 2.5%.

Figure 3

One-month relaxation results. Heatmaps of the confusion matrices for the per-month one-month relaxation accuracy of the DenseNet-121 and EfficientNet-V2-M models for age prediction in the internal test dataset. (A) DenseNet-121 for AP view. (B) DenseNet-121 for lateral view. (C) EfficientNet-V2-M for AP view. (D) EfficientNet-V2-M for lateral view.

To compare per-class prediction performance across subgroups, the per-class accuracy of EfficientNet-V2-M, which performed slightly better overall than DenseNet-121, was analyzed. In terms of one-month relaxation performance in the internal dataset, the accuracy for the AP view was highest in the 1-month subgroup (100 ± 0%) and lowest in the 9-month subgroup (51 ± 8%), with no statistically significant difference between the two subgroups (p = 0.064). For the lateral view, the accuracy was highest in the 0-month subgroup (100 ± 0%) and lowest in the 9-month subgroup (68 ± 8%), and these values were also not significantly different (p = 0.064).

External validation results

In the evaluation of the external dataset, the accuracy of the DenseNet-121 models in age prediction was 33.9 ± 2.1% for the AP view images and 28.3 ± 1.3% for the lateral view images, as presented in Table 2. The accuracy of the EfficientNet-V2-M models was 32.8 ± 2.5% for the AP view images and 29.5 ± 1.3% for the lateral view images. EfficientNet-V2-M models demonstrated 0.7% lower accuracy than DenseNet-121 models on the external AP view images and 1.2% higher accuracy on the external lateral view images. These results indicate that models trained on the internal dataset can predict the ages of skull images in external datasets effectively.

Regarding the one-month relaxation results, the maximum corrected accuracy of DenseNet-121 reached 76.4% for the AP view images, with an average of 75.5 ± 1.1%, and 72.5% for the lateral images, with an average of 71.1 ± 1.3%. For EfficientNet-V2-M, the maximum corrected accuracy reached 77.8% for the AP view images, with an average of 75.3 ± 2.2%, and 75.2% for the lateral images, with an average of 74.1 ± 1.2%. There were no statistically significant differences in the F1-scores of one-month relaxation prediction between DenseNet-121 and EfficientNet-V2-M for AP view (p = 1.000) or lateral view (p = 0.190) images in the internal dataset, nor in the external dataset (p = 1.000 and p = 0.081, respectively).

To delineate the majority decision areas in classifying age categories, saliency maps were generated using Grad-CAM++. Analysis of these maps revealed that the coronal suture and fontanels in AP skull X-ray images, along with the lambdoid suture and variations in cortical bone density in lateral skull X-ray images, serve as the predominant discriminative regions for age classification within the test dataset (Fig. 4).
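A sketch of how such saliency maps might be generated is shown below. It assumes a recent version of the open-source pytorch-grad-cam package and uses an untrained EfficientNet-V2-M with a random input as stand-ins for the trained model and a pre-processed skull X-ray; neither the package nor these placeholders are part of the original implementation.

```python
import torch
from torchvision import models
from pytorch_grad_cam import GradCAMPlusPlus
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget

# Placeholder 12-class EfficientNet-V2-M; trained weights would be loaded in practice.
model = models.efficientnet_v2_m(weights=None)
model.classifier[1] = torch.nn.Linear(model.classifier[1].in_features, 12)
model.eval()

target_layers = [model.features[-1]]            # last convolutional block
cam = GradCAMPlusPlus(model=model, target_layers=target_layers)

input_tensor = torch.rand(1, 3, 1024, 1024)     # stands in for a pre-processed skull X-ray
predicted_month = int(model(input_tensor).argmax(dim=1))
saliency = cam(input_tensor=input_tensor,
               targets=[ClassifierOutputTarget(predicted_month)])[0]  # (1024, 1024) heat map
```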

Figure 4

Comparative visualization of majority decision areas at different postnatal stages using Grad-CAM++ in the convolutional neural network algorithm. (A, B) show the anteroposterior and lateral radiographic views of the skull, respectively, with Grad-CAM++ saliency maps highlighting the majority decision areas for age classification. (A) Saliency maps for anteroposterior (AP) skull X-ray images across different age categories. The maps show the coronal suture and fontanels as the primary discriminative features used by both the EfficientNet and DenseNet models for classifying ages under 12 months. (B) Saliency maps for lateral skull X-ray images, highlighting the lambdoid suture and cortical bone density as critical regions for age classification. Each column represents a different age group, providing a visual representation of the significance of these features across developmental stages.

Comparison with the RSNA Bone Challenge winner model

The performance of the RSNA winner model is presented in Table 2. In terms of one-month relaxation prediction, its accuracy was 66.7 ± 3.4% for AP view images and 73.8 ± 4.2% for lateral view images in the internal test dataset. On the external test dataset, the mean accuracy of the RSNA winner model reached 62.4 ± 8.2% for AP view images and 59.9 ± 7.8% for lateral view images. Our best models outperformed the RSNA winner model in prediction accuracy by approximately 10% on each test dataset.

Discussion

With advances in deep-learning techniques, it has become possible to develop computational models composed of multiple processing layers that learn representations of data with multiple levels of abstraction17. To apply deep-learning systems to disease assessment using medical imaging, it is important to achieve highly accurate classification on test datasets as well as reasonable feature extraction of target lesions. Traditional machine-learning methods for disease classification, such as support vector machines, K-means clustering, and the naïve Bayes classifier, require expert knowledge and time-consuming manual adjustments to extract specific features18,19,20; that is, they require features representing the characteristics of interest to be extracted using various segmentation methods. In contrast, recent deep-learning architectures can acquire useful feature representations directly from the data. The CNN model is known to be a powerful image classifier and is widely used to evaluate radiologic images such as X-ray, computed tomography, and magnetic resonance images21. In addition, Grad-CAM++ enables classification-trained CNNs to localize characteristic features without any bounding box annotations22. In the present study, using a dataset of infantile skull X-ray images, the resulting decision algorithm proved to be an efficient model for classifying the images into age categories.

The regions of focus identified by the Grad-CAM++ algorithm provide a valuable tool for estimating an infant's postnatal age by charting the predictable sequence of cranial suture closure and bone development. Furthermore, the descriptive focus of the Grad-CAM++ algorithm allows progressive developmental alterations in skull X-ray images across various postnatal stages to be inferred (Supplementary Information).

To our knowledge, the present study is the first to develop a deep-learning model for the prediction of infantile age using skull X-rays. The average accuracies achieved by the CNN-based classifier reached 78.0 ± 1.5% for the AP view and 78.0 ± 1.5% for the lateral view when classifying the age category with a one-month relaxation. Interestingly, we found that age classification decisions revealed by Grad-CAM++ relied on certain regions of the skull X-ray images: the fontanels and coronal sutures were used to a remarkable extent on AP images, while the lambdoid suture region and cortical bone density were used prominently on lateral images. Table 3 details the representative morphological hallmarks, as discerned through the Grad-CAM++ visualizations, that characterize each specified age stage. For instance, off-the-midline patterns observed in the Grad-CAM++ saliency maps of both the sagittal and metopic sutures are considered indicative of the ongoing ossification of these sutures around the 5th–6th postnatal month.

Table 3 The representative morphological hallmarks of the descriptive focus from Grad-CAM++ saliency maps.

Considering the chronological changes during infantile cranial development, CNN-based deep learning effectively identified age categories with reasonable detection of characteristic features representative of cranial development.

The areas highlighted by Grad-CAM++ align with the regions known for characteristic changes in infant skull development over various stages post-birth23,24.

Moreover, by examining the Grad-CAM++ areas corresponding to different developmental stages in infants, we were able to deduce retrospectively the specific regions where significant changes occur in the skull development of infants over time. By correlating these observed features with known timelines of cranial development, radiologists and pediatricians can estimate the postnatal age of an infant.

We expect that the novel findings from the present study will be useful for the individual assessment of normal cranial development according to postnatal age. Screening skull X-rays could be used to detect overdevelopment of the cranial bones relative to the actual postnatal age. Additionally, screening for certain conditions, such as premature closure of a cranial suture, may be possible without computed tomography of the head, which requires a higher radiation dose. This method may also be helpful in the follow-up of patients who have undergone surgical correction of craniosynostosis, facilitating assessment of normal cranial development after surgery.

Furthermore, the CNN-based deep learning used in the present study is expected to be applicable in legal medicine and archaeology. For instance, the developed algorithm could be used to estimate an individual's actual age at the time of death.

In this study, we presented and evaluated the performance of a deep learning model for age prediction based on cranial development, focusing specifically on cranial sutures and cortical bone development, excluding the facial region of the pediatric skull. However, building on this research, future studies will explore a broader range of cranial metrics, including cephalic index and head circumference, to provide a more nuanced understanding of cranial development. In addition, we plan to explore the integration of these metrics with advanced imaging technologies and machine learning algorithms to improve diagnostic accuracy and prognostic capabilities in cranial pathology.

Study limitations

The present study is subject to several limitations. Firstly, the deployed deep-learning model was trained exclusively on data from infants aged under 12 months. To enhance the model's applicability, future iterations should incorporate a broader age range, extending to at least 24 postnatal months, with classification conducted on a monthly basis. Secondly, the current deep-learning model was unable to distinguish individual cranial sutures within the skull X-ray images when the Grad-CAM++ technique was applied. Subsequent enhancements enabling precise delineation of each cranial suture will be essential before this method can be employed in the definitive diagnosis of single-suture craniosynostosis. Thirdly, variation in the spatial resolution of X-rays collected under different circumstances over the past decade could potentially have affected deep learning training. Future research should address these potential effects through a technical evaluation using cranial X-ray data from other institutions over the same time period, acquired with similar equipment and the same X-ray dose. Building upon the algorithm developed in this study, we aim to refine the predictive accuracy concerning infant age, normal cranial development, and pathological cranial anomalies in future research.

Future perspectives

In the present study, age is a numerical variable, so the targeted problem could have been approached as a regression problem. Nevertheless, we treated it as a classification problem because skull bone development does not simply exhibit features that increase linearly with age; features may change in a nonlinear and discontinuous manner, such as sutures disappearing and shapes changing. We plan to explore regression formulations in future research.

Building on the present study, we anticipate developing deep learning algorithms not only for predicting the age of infants under 12 months but also for estimating the age of older children. Additionally, we plan to conduct research aimed at enhancing the accuracy of age prediction by applying the same deep learning algorithm to age-specific CT data of children.

It may also be possible to train a deep learning model on skull X-rays together with hand bone X-rays, or X-rays of other parts of the body, to predict growth parameters such as height.

Conclusion

The CNN models developed in the present study showed good performance in predicting postnatal age categories in the infantile population. The EfficientNet-V2-M model outperformed the DenseNet-121 model in predicting age from both AP and lateral view skull images. Furthermore, analysis of the Grad-CAM++ visualizations showed that the deep learning models predict skull age by attending to cranial sutures, which are anatomically meaningful structures related to pediatric growth.

We expect that using plain skull X-rays will help in estimating actual postnatal age and evaluating normal cranial development.