\jmlrproceedings

MIDLMedical Imaging with Deep Learning \jmlrpages \jmlryear2024 \jmlrworkshopShort Paper – MIDL 2024 submission \jmlrvolume– Under Review \midlauthor\NameDeepa Krishnaswamy\nametag¹ \Emaildkrishnaswamy@bwh.harvard.edu
\NameBálint Kovács\nametag²
\NameStefan Denner\nametag²
\NameSteve Pieper\nametag³
\NameDavid Clunie\nametag⁴
\NameChristopher P. Bridge\nametag⁵
\NameTina Kapur\nametag¹
\NameKlaus H. Maier-Hein\nametag²
\NameAndrey Fedorov\nametag¹
\addr¹ Brigham and Women’s Hospital, Boston, MA, USA
\addr² Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany
\addr³ Isomics, Cambridge, MA, USA
\addr⁴ PixelMed Publishing, Bangor, PA, USA
\addr⁵ Massachusetts General Hospital, Boston, MA, USA

Automatic classification of prostate MR series type using image content and metadata

Abstract

With the wealth of medical image data, efficient curation is essential. Assigning the sequence type to magnetic resonance images is necessary for scientific studies and artificial intelligence-based analysis. However, incomplete or missing metadata prevents effective automation. We therefore propose a deep-learning method for classification of prostate cancer scanning sequences based on a combination of image data and DICOM metadata. We demonstrate superior results compared to metadata or image data alone, and make our code publicly available at \urlhttps://github.com/deepakri201/DICOMScanClassification.

keywords:

classification, DICOM, MRI, prostate cancer, convolutional neural network

^†^†editors: Under Review for MIDL 2024

1 Introduction

To diagnose clinically significant prostate cancer (PCa), multi-parametric magnetic resonance imaging (MRI) scans are acquired, typically following the PI-RADS v2 guidelines [Weinreb et al.(2016)Weinreb, Barentsz, Choyke, Cornud, Haider, Macura, Margolis, Schnall, Shtern, Tempany, and Thoeny, Turkbey et al.(2019)Turkbey, Rosenkrantz, Haider, Padhani, Villeirs, Macura, Tempany, Choyke, Cornud, Margolis, Thoeny, Verma, Barentsz, and Weinreb]. However, metadata describing the type of scan (free text DICOM field SeriesDescription configured by the operator) is prone to user input errors, institutional conventions, and removal during de-identification. As most machine learning (ML) tasks for segmentation and detection of lesions require specific sequences as input [Saha et al.(2023)Saha, Bosma, Twilt, van Ginneken, Yakar, Elschot, Veltman, Fütterer, and de Rooij, Adams et al.(2022)Adams, Makowski, Engel, Rattunde, Busch, Asbach, Niehues, Vinayahalingam, van Ginneken, Litjens, and Bressem], efficient curation without relying on free text fields or manual input is necessary. Approaches for series classification have been developed based on metadata [Gauriau et al.(2020)Gauriau, Bridge, Chen, Kitamura, Tenenholtz, Kirsch, Andriole, Michalski, and Bizzo, Cluceru et al.(2023)Cluceru, Lupo, Interian, Bove, and Crane], and convolutional neural networks (CNN) using images [Kasmanoff et al.(2023)Kasmanoff, Lee, Razavian, and Lui, Salome et al.(2023)Salome, Sforazzini, Brugnara, Kudak, Dostal, Herold-Mende, Heiland, Debus, Abdollahi, and Knoll, van der Voort et al.(2021)van der Voort, Smits, and Klein]. However most approaches do not combine metadata with image data in a learned fashion [Cluceru et al.(2023)Cluceru, Lupo, Interian, Bove, and Crane], and relatively few methods have performed classification of prostate/pelvis sequences [Baumgärtner et al.(2023)Baumgärtner, Hamm, Schulze-Weddige, Ruppel, Beetz, Rudolph, Dräger, Froböse, Posch, Lenk, and Biessmann, Helm et al.(2024)Helm, Mathai, Kim, Mukherjee, Liu, and Summers].

We propose a CNN-based method for classification of prostate MRI scans, where A) we integrate the image data and DICOM metadata (acquisition parameters captured in fields populated by the scanner) in a single CNN, which has not been done yet for scan classification, B) we train and evaluate our methods using entirely publicly available DICOM PCa MRI collections, and C) we compare our method with a random forest approach using the metadata, and a CNN-based approach using the image data.

Table 1: Collections from Imaging Data Commons and the corresponding number of MR series (patients in parentheses) included for the analysis. ERC = endorectal coil was used, †=multiple manufacturers, ‡=multiple magnetic field strengths.

Dataset	With ERC	T2W	DWI	ADC	DCE	Train	Val	Test
QIN-Prostate-Repeatability	\checkmark	30 (15)	30 (15)	30 (15)	30 (15)	\checkmark	\checkmark	\checkmark
[Fedorov et al.(2018)Fedorov, Schwier, Clunie, Herz, Pieper, Kikinis, Tempany, and Fennessy]
ProstateX†	–	431 (346)	357 (346)	356 (346)	15456 (346)	\checkmark	\checkmark	\checkmark
[Litjens et al.(2014)Litjens, Debats, Barentsz, Karssemeijer, and Huisman, Litjens et al.(2017)Litjens, Debats, Barentsz, Karssemeijer, and Huisman]
Prostate-MRI	\checkmark	26 (26)	52 (26)	–	51 (26)	–	–	\checkmark
[Choyke et al.(2016)Choyke, Turkbey, Pinto, Merino, and Wood]
Prostate-3T†	–	64 (64)	–	–	–	–	–	\checkmark
[Litjens et al.(2016)Litjens, Futterer, and Huisman]
Prostate-Diagnosis	\checkmark	93 (91)	–	–	–	–	–	\checkmark
[Bloch et al.(2015)Bloch, Jain, and CC]
Prostate-MRI-US-Biopsy†‡	\checkmark	958 (792)	110 (108)	1019 (836)	–	–	–	\checkmark
[Natarajan et al.(2013)Natarajan, Priester, Margolis, Huang, and Marks]
[Sonn et al.(2013)Sonn, Natarajan, Margolis, MacAiran, Lieu, Huang, Dorey, and Marks]
Prostate-Fused-MRI-Pathology	\checkmark	46 (27)	13 (12)	12 (12)	102 (28)	–	–	\checkmark
[Singanamalli et al.(2016)Singanamalli, Rusu, Sparks, Shih, Ziober, Wang, Tomaszewski, Rosen, Feldman, and Madabhushi]
[Madabhushi and Feldman(2016)]

2 Methodology

Data

We use publicly available data from NCI Imaging Data Commons (IDC), a cloud-based repository of cancer imaging data [Fedorov et al.(2023)Fedorov, Longabaugh, Pot, Clunie, Pieper, Gibbs, Bridge, Herrmann, Homeyer, Lewis, and Aerts] as seen in Table 1. The ground truth series type was assigned semi-automatically by manually checking all possible regular SeriesDescription expressions specific for these datasets, where 84.1% of all series were assigned to the T2-weighted (T2W), diffusion-weighted imaging (DWI), apparent diffusion coefficient (ADC) and dynamic contrast enhanced (DCE) classes. Series not included consisted of those acquired sagitally/coronally, localizers, and calculated b-value DWI images. We split the data patient-wise, using 60% of QIN-Prostate-Repeatability [Fedorov et al.(2018)Fedorov, Schwier, Clunie, Herz, Pieper, Kikinis, Tempany, and Fennessy] and ProstateX [Litjens et al.(2014)Litjens, Debats, Barentsz, Karssemeijer, and Huisman, Litjens et al.(2017)Litjens, Debats, Barentsz, Karssemeijer, and Huisman] for training, and 20% of the same datasets for validation. Both collections contain the sequences to classify, including T2W, DWI, ADC and DCE images. We split testing into internal (20% of QIN-Prostate-Repeatability and ProstateX), and external (five collections not seen during training). These external collections were curated in the same manner, to only contain the four classes used for the classification.

Methods

We propose a CNN-based method that leverages image data and DICOM metadata. The following metadata attributes were used, as these are machine-generated, standardized, and not removed during de-identification: RepetitionTime, EchoTime, FlipAngle, ScanningSequence, and ContrastBolusAgent. A derived is4D attribute was assigned based on whether spatially overlapping slices were detected within the series. Feature scaling was applied to relevant metadata. Figure LABEL:hiplot summarizes the metadata distribution. Center slices were extracted from 3D volumes, resampled to 64x64, and normalized between 0 and 1. The CNN consisted of three convolution layers each followed by max pooling, followed by three dense layers, where the metadata was concatenated. Sparse categorical cross entropy loss was used with Adam optimization. K-fold cross validation was performed (4 folds), where each model was trained for 10 epochs with early stopping, and probability outputs were ensembled. The combined image and metadata approach was compared to A) a random forest classifier using only the metadata, and a B) CNN model utilizing only the images.

Refer to caption — Figure 1: Hiplot visualization [Haziza et al.(2020)Haziza, Rapin, and Synnaeve] of DICOM metadata parameters, colored by the assigned ground truth scan type.

Table 2: Quantitative results for the three methods on the two test datasets. The mean is provided for the F-beta score when four fold cross-validation is performed.

Method	Internal	Internal	Internal	Internal	External	External	External	External
	T2Ax	DWI	ADC	DCE	T2Ax	DWI	ADC	DCE
Metadata	1.00	1.00	1.00	1.00	0.98	0.60	0.91	1.00
Images	0.99	0.99	0.99	0.99	0.93	0.59	0.99	0.89
Images + metadata	1.00	1.00	0.99	0.99	0.98	0.72	0.99	0.99

3 Results and Discussion

Table 2 displays the evaluation. We note the higher accuracy on the internal dataset, as patients are from the same collection as training/validation, compared to the external test set, which contains five collections not seen during training. The metadata-only approach performs poorly on the external test set due to DWI misclassified as ADC and vice versa. Depending on the format of the DWI data (single series is a 4D volume, or multiple series each with a 3D volume), the latter can be considered similar to ADC. In the images and metadata combination approach, the performance of DWI in the external test improves, but DWI still suffers from misclassification as ADC due to the is4D parameter, and the intensity similarity of the two. Future work involves refining the metadata used, and including the ability to differentiate between low and high b-value images.

\midlacknowledgments

We acknowledge Dr. Clare Tempany, grants P41EB028741-04, P41EB028741-03S1, and NIH NCI under Task Order No. HHSN2611 0071 under Contract No. HHSN261201500003l, which made this research possible.

References

[Adams et al.(2022)Adams, Makowski, Engel, Rattunde, Busch, Asbach, Niehues, Vinayahalingam, van Ginneken, Litjens, and Bressem] LC Adams, MR Makowski, G Engel, M Rattunde, F Busch, P Asbach, SM Niehues, S Vinayahalingam, B van Ginneken, G Litjens, and KK Bressem. Prostate158-an expert-annotated 3T MRI dataset and algorithm for prostate cancer detection. Computers in Biology and Medicine, 148:105817, 2022.
[Baumgärtner et al.(2023)Baumgärtner, Hamm, Schulze-Weddige, Ruppel, Beetz, Rudolph, Dräger, Froböse, Posch, Lenk, and Biessmann] GL Baumgärtner, CA Hamm, S Schulze-Weddige, R Ruppel, NL Beetz, M Rudolph, F Dräger, KP Froböse, H Posch, J Lenk, and F Biessmann. Metadata-independent classification of MRI sequences using convolutional neural networks: Successful application to prostate MRI. European Journal of Radiology, 166:110964, 2023.
[Bloch et al.(2015)Bloch, Jain, and CC] BN Bloch, A Jain, and Jaffe CC. Data from PROSTATE-DIAGNOSIS [dataset]. 2015. https://doi.org/10.7937/K9/TCIA.2015.FOQEUJVT.
[Choyke et al.(2016)Choyke, Turkbey, Pinto, Merino, and Wood] P Choyke, B Turkbey, P Pinto, M Merino, and B Wood. Data from PROSTATE-MRI. 2016. https://doi.org/10.7937/K9/TCIA.2016.6046GUDv.
[Cluceru et al.(2023)Cluceru, Lupo, Interian, Bove, and Crane] J Cluceru, JM Lupo, Y Interian, R Bove, and JC Crane. Improving the automatic classification of brain MRI acquisition contrast with machine learning. Journal of Digital Imaging, 36(1):289–305, 2023.
[Fedorov et al.(2018)Fedorov, Schwier, Clunie, Herz, Pieper, Kikinis, Tempany, and Fennessy] A Fedorov, M Schwier, D Clunie, C Herz, S Pieper, R Kikinis, C Tempany, and F Fennessy. An annotated test-retest collection of prostate multiparametric MRI. Scientific data, 5(1):1–3, 2018.
[Fedorov et al.(2023)Fedorov, Longabaugh, Pot, Clunie, Pieper, Gibbs, Bridge, Herrmann, Homeyer, Lewis, and Aerts] A Fedorov, WJ Longabaugh, D Pot, DA Clunie, SD Pieper, DL Gibbs, C Bridge, MD Herrmann, A Homeyer, R Lewis, and HJ Aerts. National cancer institute imaging data commons: Toward transparency, reproducibility, and scalability in imaging artificial intelligence. Radiographics, 43(12):e230180, 2023.
[Gauriau et al.(2020)Gauriau, Bridge, Chen, Kitamura, Tenenholtz, Kirsch, Andriole, Michalski, and Bizzo] R Gauriau, C Bridge, L Chen, F Kitamura, NA Tenenholtz, JE Kirsch, KP Andriole, MH Michalski, and BC Bizzo. Using DICOM metadata for radiological image series categorization: a feasibility study on large clinical brain MRI datasets. Journal of digital imaging, 33:747–62, 2020.
[Haziza et al.(2020)Haziza, Rapin, and Synnaeve] D. Haziza, J. Rapin, and G. Synnaeve. Hiplot, interactive high-dimensionality plots. \urlhttps://github.com/facebookresearch/hiplot, 2020.
[Helm et al.(2024)Helm, Mathai, Kim, Mukherjee, Liu, and Summers] K Helm, TS Mathai, B Kim, P Mukherjee, J Liu, and RM Summers. Automated classification of body MRI sequence type using convolutional neural networks, 2024. URL \urlhttps://arxiv.org/abs/2402.08098.
[Kasmanoff et al.(2023)Kasmanoff, Lee, Razavian, and Lui] N Kasmanoff, MD Lee, N Razavian, and YW Lui. Deep multi-task learning and random forest for series classification by pulse sequence type and orientation. Neuroradiology, 65:77–87, 2023.
[Litjens et al.(2014)Litjens, Debats, Barentsz, Karssemeijer, and Huisman] G Litjens, O Debats, J Barentsz, N Karssemeijer, and H Huisman. Computer-aided detection of prostate cancer in MRI. IEEE transactions on medical imaging, 33(5):1083–92, 2014.
[Litjens et al.(2016)Litjens, Futterer, and Huisman] G Litjens, J Futterer, and H Huisman. Data from prostate-3T. 2016. https://doi.org/10.7937/K9/TCIA.2015.QJTV5IL5.
[Litjens et al.(2017)Litjens, Debats, Barentsz, Karssemeijer, and Huisman] G Litjens, O Debats, J Barentsz, N Karssemeijer, and H Huisman. SPIE-AAPM PROSTATEx challenge data (version 2) [dataset]. 2017. https://doi.org/10.7937/K9TCIA.2017.MURS5CL.
[Madabhushi and Feldman(2016)] A Madabhushi and M Feldman. Fused radiology-pathology prostate dataset (prostate fused-MRI-pathology). 2016. https://doi.org/10.7937/k9/TCIA.2016.tlpmr1am.
[Natarajan et al.(2013)Natarajan, Priester, Margolis, Huang, and Marks] S Natarajan, A Priester, D Margolis, J Huang, and L Marks. Prostate MRI and ultrasound with pathology and coordinates of tracked biopsy (Prostate-MRI-US-Biopsy) (version 2) [data set]. 2013. https://doi.org/10.7937/TCIA.2020.A61IOC1A.
[Saha et al.(2023)Saha, Bosma, Twilt, van Ginneken, Yakar, Elschot, Veltman, Fütterer, and de Rooij] A Saha, J Bosma, J Twilt, B van Ginneken, D Yakar, M Elschot, J Veltman, J Fütterer, and M de Rooij. Artificial intelligence and radiologists at prostate cancer detection in MRI—the pi-cai challenge. In Medical Imaging with Deep Learning, short paper track 2023, 2023.
[Salome et al.(2023)Salome, Sforazzini, Brugnara, Kudak, Dostal, Herold-Mende, Heiland, Debus, Abdollahi, and Knoll] P Salome, F Sforazzini, G Brugnara, A Kudak, M Dostal, C Herold-Mende, S Heiland, J Debus, A Abdollahi, and M Knoll. MR-Class: A python tool for brain mr image classification utilizing one-vs-all dcnns to deal with the open-set recognition problem. Cancers, 15:1820, 2023.
[Singanamalli et al.(2016)Singanamalli, Rusu, Sparks, Shih, Ziober, Wang, Tomaszewski, Rosen, Feldman, and Madabhushi] A Singanamalli, M Rusu, RE Sparks, NN Shih, A Ziober, L Wang, J Tomaszewski, M Rosen, M Feldman, and A Madabhushi. Identifying in vivo DCE MRI markers associated with microvessel architecture and gleason grades of prostate cancer. Journal of Magnetic Resonance Imaging, 43:149–158, 2016.
[Sonn et al.(2013)Sonn, Natarajan, Margolis, MacAiran, Lieu, Huang, Dorey, and Marks] GA Sonn, S Natarajan, DJ Margolis, M MacAiran, P Lieu, J Huang, FJ Dorey, and LS Marks. Targeted biopsy in the detection of prostate cancer using an office based magnetic resonance ultrasound fusion device. Journal of Urology, 189(1):86–91, 2013.
[Turkbey et al.(2019)Turkbey, Rosenkrantz, Haider, Padhani, Villeirs, Macura, Tempany, Choyke, Cornud, Margolis, Thoeny, Verma, Barentsz, and Weinreb] B Turkbey, AB Rosenkrantz, MA Haider, AR Padhani, G Villeirs, KJ Macura, CM Tempany, PL Choyke, F Cornud, DJ Margolis, HC Thoeny, S Verma, J Barentsz, and JC Weinreb. Prostate imaging reporting and data system version 2.1: 2019 update of prostate imaging reporting and data system version 2. European urology, 76(3):340–351, 2019.
[van der Voort et al.(2021)van der Voort, Smits, and Klein] SR van der Voort, M Smits, and S Klein. DeepDicomSort: an automatic sorting algorithm for brain magnetic resonance imaging data. Neuroinformatics, 19(1):159–84, 2021.
[Weinreb et al.(2016)Weinreb, Barentsz, Choyke, Cornud, Haider, Macura, Margolis, Schnall, Shtern, Tempany, and Thoeny] JC Weinreb, JO Barentsz, PL Choyke, F Cornud, MA Haider, KJ Macura, D Margolis, MD Schnall, F Shtern, CM Tempany, and HC Thoeny. PI-RADS prostate imaging–reporting and data system: 2015, version 2. European urology, 69(1):16–40, 2016.