Figure 1.
The images illustrate the distinct visual designs of different medication packaging types. These packaging classes demonstrate substantial variations in size, shape, color, text, packaging material, and label markings. Note that Amp denotes ampoule, and Supp denotes suppository.
Figure 1.
The images illustrate the distinct visual designs of different medication packaging types. These packaging classes demonstrate substantial variations in size, shape, color, text, packaging material, and label markings. Note that Amp denotes ampoule, and Supp denotes suppository.
Figure 2.
The data-preparation procedure: (a) Place yellow dots on the mat at a distance of 20 cm from each dot, creating a square environment measuring 40 cm by 40 cm. (b) Determine the position and angle of the medication by aligning with the yellow dots. Then, rotate the medication approximately 45 degrees for each placement. (c) Maintain the camera height between 15 and 25 cm, allowing for a range of heights to capture the complete appearance of all medications.
Figure 2.
The data-preparation procedure: (a) Place yellow dots on the mat at a distance of 20 cm from each dot, creating a square environment measuring 40 cm by 40 cm. (b) Determine the position and angle of the medication by aligning with the yellow dots. Then, rotate the medication approximately 45 degrees for each placement. (c) Maintain the camera height between 15 and 25 cm, allowing for a range of heights to capture the complete appearance of all medications.
Figure 3.
(a) Cefa, (b) Nebilet, (c) Kentamin. We demonstrate the process of capturing medication images with three different packaging types. Firstly, we position the medication at the center of the mat and capture photos from nine different angles, including various tilt and pitch angles. Next, we rotate the medication approximately 45 degrees and capture photos from the same nine camera angles. Finally, we rotated the medication in eight different orientations, completing the collection of 72 images.
Figure 3.
(a) Cefa, (b) Nebilet, (c) Kentamin. We demonstrate the process of capturing medication images with three different packaging types. Firstly, we position the medication at the center of the mat and capture photos from nine different angles, including various tilt and pitch angles. Next, we rotate the medication approximately 45 degrees and capture photos from the same nine camera angles. Finally, we rotated the medication in eight different orientations, completing the collection of 72 images.
Figure 4.
The examples demonstrate similar drugs (SDs) that possess similar features and are prone to be mistaken for one another: (a) Despas vs. Novamin: they have a dark-colored ampoule and a white label; (b) Mycomb vs. Totifen: they are both white bottles with red letters and packaging; (c) Lanoxin vs. Prochlorperazine: they are packaged in clip chain bags and are white, circular pills; (d) Alcos anal vs. Bisacodyl: they are packaged in white plastic film with green fonts.
Figure 4.
The examples demonstrate similar drugs (SDs) that possess similar features and are prone to be mistaken for one another: (a) Despas vs. Novamin: they have a dark-colored ampoule and a white label; (b) Mycomb vs. Totifen: they are both white bottles with red letters and packaging; (c) Lanoxin vs. Prochlorperazine: they are packaged in clip chain bags and are white, circular pills; (d) Alcos anal vs. Bisacodyl: they are packaged in white plastic film with green fonts.
Figure 5.
Overview of the proposed TSIDL method: In the first stage, we use the first-stage CNN to train the CNN Model 0 and then use the 5-fold cross-validation method from Model 0 to obtain the 5-fold CVCM. The 5-fold CVCM was subjected to a similar-drugs grouping algorithm to classify SGs, forming N SGs (SG1-N). In the IDL stage, pharmacists utilize their expertise to determine the optimal cropping size for different regions of interest (ROIs) based on the characteristics of each SG. The optimal drug’s ROI images were then used to train Model 1-N in the second stage, corresponding to N SGs.
Figure 5.
Overview of the proposed TSIDL method: In the first stage, we use the first-stage CNN to train the CNN Model 0 and then use the 5-fold cross-validation method from Model 0 to obtain the 5-fold CVCM. The 5-fold CVCM was subjected to a similar-drugs grouping algorithm to classify SGs, forming N SGs (SG1-N). In the IDL stage, pharmacists utilize their expertise to determine the optimal cropping size for different regions of interest (ROIs) based on the characteristics of each SG. The optimal drug’s ROI images were then used to train Model 1-N in the second stage, corresponding to N SGs.
Figure 6.
An example of the proposed grouping algorithm for grouping similar medications. Firstly, we constructed a 5-fold CVCM by using Model 0. Then, we examined the true positive values for each medication in the 5-fold CVCM. If the value was less than or equal to 48, we marked that medication red. Next, we checked the false negative values for each medication with a red label. If the value exceeded one or equaled one, we marked those medications in yellow. Subsequently, we grouped the medications with red and yellow labels into an SD list and marked them in green. Then, we could obtain the SD list for the similarity group. We applied the same process for all drug categories. We could obtain the SD lists for SG1-N, and finally, we could integrate them into the SG map.
Figure 6.
An example of the proposed grouping algorithm for grouping similar medications. Firstly, we constructed a 5-fold CVCM by using Model 0. Then, we examined the true positive values for each medication in the 5-fold CVCM. If the value was less than or equal to 48, we marked that medication red. Next, we checked the false negative values for each medication with a red label. If the value exceeded one or equaled one, we marked those medications in yellow. Subsequently, we grouped the medications with red and yellow labels into an SD list and marked them in green. Then, we could obtain the SD list for the similarity group. We applied the same process for all drug categories. We could obtain the SD lists for SG1-N, and finally, we could integrate them into the SG map.
Figure 7.
The flowchart of the proposed IDL for SGs is illustrated for pill classification in the CCB class. To capture subtle differences in packaging features, we simulate the steps of a pharmacist carefully examining the medication by adjusting the region of interest (ROI). This lets us discover smaller distinctions within similar drug characteristics, such as texture and imprints. To begin, we load an image of size () pixels. Then, we position the image center at (N/2, N/2). Next, we expand and crop four windows of different sizes () from the center. Finally, an experienced pharmacist selects the optimal ROI from the four images of varying sizes, resulting in an image with the best cropping size, sized (). This image with the optimal ROI is then further used within the IDL framework for additional applications.
Figure 7.
The flowchart of the proposed IDL for SGs is illustrated for pill classification in the CCB class. To capture subtle differences in packaging features, we simulate the steps of a pharmacist carefully examining the medication by adjusting the region of interest (ROI). This lets us discover smaller distinctions within similar drug characteristics, such as texture and imprints. To begin, we load an image of size () pixels. Then, we position the image center at (N/2, N/2). Next, we expand and crop four windows of different sizes () from the center. Finally, an experienced pharmacist selects the optimal ROI from the four images of varying sizes, resulting in an image with the best cropping size, sized (). This image with the optimal ROI is then further used within the IDL framework for additional applications.
Figure 8.
The flowchart of the inference process. In the first stage, CNN Model 0 generates inference results at the drug-name level and the package level. In the second stage, we use the SG map to allocate these drugs to N SGs. Subsequently, the images of SG1-N are cropped by using their respective optimal ROI. Next, each SG image is classified by using the dedicated CNN model corresponding to that SG (CNN Model 1-N), resulting in inference outputs. Finally, the inference outputs from Model 1-N are consolidated to yield the overall inference results of TSIDL.
Figure 8.
The flowchart of the inference process. In the first stage, CNN Model 0 generates inference results at the drug-name level and the package level. In the second stage, we use the SG map to allocate these drugs to N SGs. Subsequently, the images of SG1-N are cropped by using their respective optimal ROI. Next, each SG image is classified by using the dedicated CNN model corresponding to that SG (CNN Model 1-N), resulting in inference outputs. Finally, the inference outputs from Model 1-N are consolidated to yield the overall inference results of TSIDL.
Figure 9.
The training process of Model 0. In the upper part of the figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss.
Figure 9.
The training process of Model 0. In the upper part of the figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss.
Figure 10.
The classification results for drug-name level were obtained from the 5-fold CVCM of the CNN model in the first stage by using the testing set. Upon reviewing the results, it can be observed that significant errors are present in specific packaging classes, namely AMP, BOT, CCB, and SUPP. Particularly, the errors in the CCB classes are particularly pronounced. In the CCB’s 5-fold CVCM, it can be seen that Prochlorperazine has a TP value of only 54, with 26 instances misclassified as Lanoxin and 30 instances mistakenly classified as Spironolactone. Similar situations exist for Spironolactone, with a TP value of only 63, including 17 instances classified as Lanoxin, 8 as Magnesium Oxide, and 22 as Prochlorperazine.
Figure 10.
The classification results for drug-name level were obtained from the 5-fold CVCM of the CNN model in the first stage by using the testing set. Upon reviewing the results, it can be observed that significant errors are present in specific packaging classes, namely AMP, BOT, CCB, and SUPP. Particularly, the errors in the CCB classes are particularly pronounced. In the CCB’s 5-fold CVCM, it can be seen that Prochlorperazine has a TP value of only 54, with 26 instances misclassified as Lanoxin and 30 instances mistakenly classified as Spironolactone. Similar situations exist for Spironolactone, with a TP value of only 63, including 17 instances classified as Lanoxin, 8 as Magnesium Oxide, and 22 as Prochlorperazine.
Figure 11.
The package-level classification results are as follows: (a) the 5-fold CVCM of the validation set demonstrates perfect accuracy; (b) the 5-fold CVCM of the testing set exhibits high accuracy in classifying different packages. These 5-fold CVCMs were obtained from the CNN model in the first stage of deep learning. It can be seen that there are some values of 1 in the 5-fold CVCM of the testing set. These represent drugs that are misclassified at the package level.
Figure 11.
The package-level classification results are as follows: (a) the 5-fold CVCM of the validation set demonstrates perfect accuracy; (b) the 5-fold CVCM of the testing set exhibits high accuracy in classifying different packages. These 5-fold CVCMs were obtained from the CNN model in the first stage of deep learning. It can be seen that there are some values of 1 in the 5-fold CVCM of the testing set. These represent drugs that are misclassified at the package level.
Figure 12.
The error classification images at the package level reveal certain similar features despite their differences: (a) Mycomb lotion and Sintrix; (b) Diphenhydramine and Clexane; (c) Frotin and Totifen.
Figure 12.
The error classification images at the package level reveal certain similar features despite their differences: (a) Mycomb lotion and Sintrix; (b) Diphenhydramine and Clexane; (c) Frotin and Totifen.
Figure 13.
Result of the proposed grouping algorithm. We utilized Model 0 from the first stage to generate a 5-fold CVCM on the validation set. This resulted in the classification of five distinct SGs, comprising six types of AMP (SG1), four types of BOT (SG2), four types of CCB (SG3), two types of SUPP (SG4), and ninety-two types of other classes (SG5), which are labeled green in corresponding fields of 5-fold CVCM.
Figure 13.
Result of the proposed grouping algorithm. We utilized Model 0 from the first stage to generate a 5-fold CVCM on the validation set. This resulted in the classification of five distinct SGs, comprising six types of AMP (SG1), four types of BOT (SG2), four types of CCB (SG3), two types of SUPP (SG4), and ninety-two types of other classes (SG5), which are labeled green in corresponding fields of 5-fold CVCM.
Figure 14.
The group-level classification results are (a) the 5-fold CVCM for the validation set and (b) the 5-fold CVCM for the testing set obtained by using the first-stage CNN. Notably, no classification errors occurred in SG1-4; they only occurred in SG5. It can be seen that there are some values of 1, 2, and 3 in the 5-fold CVCM for the testing set. These represent drugs that are misclassified at the group level.
Figure 14.
The group-level classification results are (a) the 5-fold CVCM for the validation set and (b) the 5-fold CVCM for the testing set obtained by using the first-stage CNN. Notably, no classification errors occurred in SG1-4; they only occurred in SG5. It can be seen that there are some values of 1, 2, and 3 in the 5-fold CVCM for the testing set. These represent drugs that are misclassified at the group level.
Figure 15.
The nine misclassified SD images and misclassified images after grouping, where the drugs have similar appearances, which led to classification errors.
Figure 15.
The nine misclassified SD images and misclassified images after grouping, where the drugs have similar appearances, which led to classification errors.
Figure 16.
Training image examples of SG3, with the original images in the top row and the region of interest (ROI) images in the bottom row.
Figure 16.
Training image examples of SG3, with the original images in the top row and the region of interest (ROI) images in the bottom row.
Figure 17.
The training process on the SG3 testing dataset with and without the proposed IDL method. (a) EfficientNet-B0; (b) ResNet-101; (c) Inception-v3; (d) MobileNetV2; (e) SSCNN; (f) SSCNN with IDL. In the upper part of each figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss. It can be observed that all CNN models trained without the proposed IDL method (a–e) exhibited validation accuracies all below 75%, and all validation losses exceeded 0.5 throughout the training process. Notably, when combining the SSCNN with the proposed IDL method (f), after 1000 epochs, the validation loss consistently remained below 0.3, and the validation accuracy exceeded 95%. Moreover, the best validation accuracy achieved a satisfactory classification accuracy of 97.50%.
Figure 17.
The training process on the SG3 testing dataset with and without the proposed IDL method. (a) EfficientNet-B0; (b) ResNet-101; (c) Inception-v3; (d) MobileNetV2; (e) SSCNN; (f) SSCNN with IDL. In the upper part of each figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss. It can be observed that all CNN models trained without the proposed IDL method (a–e) exhibited validation accuracies all below 75%, and all validation losses exceeded 0.5 throughout the training process. Notably, when combining the SSCNN with the proposed IDL method (f), after 1000 epochs, the validation loss consistently remained below 0.3, and the validation accuracy exceeded 95%. Moreover, the best validation accuracy achieved a satisfactory classification accuracy of 97.50%.
Figure 18.
The cropping sizes and corresponding ROI images for SG1-4. In the AMP class, the pattern and color of the label are of paramount importance, leading us to select a size of 748 × 748. In the case of BOT, it is crucial to observe distinct bottle contours and color distribution, which led to the choice of 1496 × 1496. For CCB, a clear view of the tablet markings and labels is necessary. Thus, the size is set to 374 × 374. Regarding SUPP, capturing the distribution of label’s text is crucial. Hence, we opted for a 374 × 374 size.
Figure 18.
The cropping sizes and corresponding ROI images for SG1-4. In the AMP class, the pattern and color of the label are of paramount importance, leading us to select a size of 748 × 748. In the case of BOT, it is crucial to observe distinct bottle contours and color distribution, which led to the choice of 1496 × 1496. For CCB, a clear view of the tablet markings and labels is necessary. Thus, the size is set to 374 × 374. Regarding SUPP, capturing the distribution of label’s text is crucial. Hence, we opted for a 374 × 374 size.
Figure 19.
The training process for Models 1–5 was generated by SSCNN with IDL on SG1-5. (a) Model 1: AMP; (b) Model 2: BOT; (c) Model 3: CCB; (d) Model 4: SUPP; (e) Model 5: Other drugs. In the upper part of each figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss.
Figure 19.
The training process for Models 1–5 was generated by SSCNN with IDL on SG1-5. (a) Model 1: AMP; (b) Model 2: BOT; (c) Model 3: CCB; (d) Model 4: SUPP; (e) Model 5: Other drugs. In the upper part of each figure, the blue line represents the training accuracy, and each epoch is denoted by a black dot indicating the recorded validation accuracy. In the lower part, the orange line represents the training loss, and each epoch corresponds to a black dot representing the recorded validation loss.
Figure 20.
(a–d) are the 5-fold CVCM for SSCNN without and with IDL for SG1-4, respectively. The improved TP values are indicated in green, and the unchanged value is marked in blue. It can be observed that the IDL method resulted in significant improvements in the TP values of each SD across all SGs, except for Suzole, where the values remained unchanged. Particularly in SG3, there are significant improvements in TP values for all drugs. Prochlorperazine increased from 54 to 93, and Spironolactone increased from 63 to 105.
Figure 20.
(a–d) are the 5-fold CVCM for SSCNN without and with IDL for SG1-4, respectively. The improved TP values are indicated in green, and the unchanged value is marked in blue. It can be observed that the IDL method resulted in significant improvements in the TP values of each SD across all SGs, except for Suzole, where the values remained unchanged. Particularly in SG3, there are significant improvements in TP values for all drugs. Prochlorperazine increased from 54 to 93, and Spironolactone increased from 63 to 105.
Figure 21.
Testing accuracies for SG1-5 of SSCNN without and with IDL. The IDL improves accuracies for SG1-5. Specifically, there is a significant increase in accuracy for SG3 (CCB), which exhibited the lowest accuracy among all models. The accuracy significantly improved from 70.00% to 91.59%.
Figure 21.
Testing accuracies for SG1-5 of SSCNN without and with IDL. The IDL improves accuracies for SG1-5. Specifically, there is a significant increase in accuracy for SG3 (CCB), which exhibited the lowest accuracy among all models. The accuracy significantly improved from 70.00% to 91.59%.
Table 1.
The medication list in this study. The list includes 108 medications distributed among 12 different types of packaging.
Table 1.
The medication list in this study. The list includes 108 medications distributed among 12 different types of packaging.
Package Type | Drug Name | Package Type | Drug Name |
---|
Amp | Despas | Syringe | Clexane |
Amp | Dexamethasone | Syringe | Humalog |
Amp | Diphenhydramine | Syringe | Levemir |
Amp | Furosemide | Syringe | NESP |
Amp | Imperan | Syringe | Novomix |
Amp | Laston | Syringe | Novorapid |
Amp | Novamin | Syringe | Recormon 2000 |
Amp | Tranexamic acid | Syringe | Recormon 5000 |
Amp | Voren | Syringe | Toujeo |
Blister | Anpo | Syrup | Anti-Phen |
Blister | Anwu | Syrup | Aswell |
Blister | Bafen | Syrup | Brown mixture |
Blister | Dogmatyl | Syrup | Cetirizine |
Blister | Dorison | Syrup | Cypromin |
Blister | Euricon | Syrup | Kidsolone |
Blister | Nebilet | Syrup | Meptin |
Blister | Syntam | Syrup | Musco |
Blister | Terodine | Syrup | Secorine |
Bot | Alminto | Tube | Biomycin |
Bot | Altropine0-125 | Tube | Fusidic acid |
Bot | Altropine0-3 | Tube | Gentaderm |
Bot | Betame | Tube | Gentamicin |
Bot | Exelderm | Tube | Mycomb |
Bot | Eyehelp | Tube | Nincort |
Bot | Flucason | Tube | Sinpharderm |
Bot | Jaline | Tube | Tetracycline |
Bot | Mycomb | Tube | Topsym |
Bot | Suzole | Vial | Cefa |
Bot | Topsym | Vial | Gentamycin |
Bot | Totifen | Vial | Lofatin |
ClipChainBag | Ceficin | Vial | Medason |
ClipChainBag | Erythromycin | Vial | Mepem |
ClipChainBag | Kentamin | Vial | Sintrix |
ClipChainBag | Lanoxin | Vial | Soonmelt |
ClipChainBag | Ligilin | Vial | Subacillin |
ClipChainBag | Magnesium oxide | Vial | U-Vanco |
ClipChainBag | Prochlorperazine | PaperBox | Azetin |
ClipChainBag | Spironolactone | PaperBox | Berotec |
ClipChainBag | Transamin | PaperBox | Ciproxin |
Enema | Sumgel | PaperBox | Claforan |
Powder | Actein | PaperBox | Cravit |
Powder | Biofermin R | PaperBox | Depakine |
Powder | Forlax | PaperBox | Invanz |
Powder | Hidrasec infant | PaperBox | Nimotop |
Powder | Histapp | PaperBox | Relvar |
Powder | Kalimate | PaperBox | Solu |
Powder | Miyarisan | PaperBox | Spiolto |
Powder | Normacol Plus | PaperBox | Spiriva |
Powder | Smecta | PaperBox | Stilamin |
Supp | Albothyl | PaperBox | Symbicort rapihaler |
Supp | Alcos anal | PaperBox | Symbicort turbuhaler |
Supp | Bisacodyl | PaperBox | Trelegy |
Supp | Frotin | PaperBox | Trisonin |
Supp | Voren | PaperBox | Vfend |
Table 2.
Proposed SSCNN based on AlexNet. The SSCNN architecture consists of eight layers, including five convolutional layers followed by three fully connected layers. We adjusted the output layer of AlexNet to accommodate the classification of 108 different packaging medications in this study.
Table 2.
Proposed SSCNN based on AlexNet. The SSCNN architecture consists of eight layers, including five convolutional layers followed by three fully connected layers. We adjusted the output layer of AlexNet to accommodate the classification of 108 different packaging medications in this study.
Layer | Type | Maps | Size | Kernel Size | Stride | Padding | Activation |
---|
Input | Input Layer | 3 | 227 × 227 | - | - | - | - |
Conv1 | Convolution | 96 | 55 × 55 | 11 × 11 | 4 | 0 | ReLU |
Pool1 | Max Pooling | 96 | 27 × 27 | 3 × 3 | 2 | 0 | - |
Conv2 | Convolution | 256 | 27 × 27 | 5 × 5 | 1 | 2 | ReLU |
Pool2 | Max Pooling | 256 | 13 × 13 | 3 × 3 | 2 | 0 | - |
Conv3 | Convolution | 384 | 13 × 13 | 3 × 3 | 1 | 1 | ReLU |
Conv4 | Convolution | 384 | 13 × 13 | 3 × 3 | 1 | 1 | ReLU |
Conv5 | Convolution | 256 | 13 × 13 | 3 × 3 | 1 | 1 | ReLU |
Pool5 | Max Pooling | 256 | 6 × 6 | 3 × 3 | 2 | 0 | - |
Fc6 | Fully Connected | - | 4096 × 1 | - | - | - | ReLU |
Fc7 | Fully Connected | - | 4096 × 1 | - | - | - | ReLU |
Fc8 | Fully Connected | - | 108 × 1 | - | - | - | Softmax |
Table 3.
The drug- and package-level classification results of the first-stage deep learning model.
Table 3.
The drug- and package-level classification results of the first-stage deep learning model.
Classify Target | Recall | Precision | F1-Score | Accuracy |
---|
Drug Name | 98.16% | 98.19% | 98.13% | 98.16% |
Package | 99.97% | 99.98% | 99.97% | 99.97% |
Table 4.
Examples of similar drugs for SG1-4.
Table 5.
The grouping level classification results achieved superior accuracy. Only a few drugs were misclassified.
Table 5.
The grouping level classification results achieved superior accuracy. Only a few drugs were misclassified.
Classify Target | Recall | Precision | F1-Score | Accuracy |
---|
Similarity Group | 99.84% | 99.65% | 99.74% | 99.92% |
Table 6.
A comparison between the proposed SSCNN with the IDL and the state-of-the-art CNN models without the IDL on the SG3 testing dataset. The proposed SSCNN with the IDL achieves a 91.59% testing accuracy, much higher than other state-of-the-art CNN models. Bold indicates the best performance.
Table 6.
A comparison between the proposed SSCNN with the IDL and the state-of-the-art CNN models without the IDL on the SG3 testing dataset. The proposed SSCNN with the IDL achieves a 91.59% testing accuracy, much higher than other state-of-the-art CNN models. Bold indicates the best performance.
Model | Recall | Precision | F1-Score | Accuracy |
---|
EfficientNet-B0 [34] | 59.55% | 60.37% | 59.13% | 59.55% |
ResNet-101 [35] | 61.14% | 61.59% | 61.06% | 61.14% |
Inception-v3 [36] | 53.64% | 53.71% | 53.37% | 53.64% |
MobileNetV2 [37] | 52.27% | 53.55% | 52.57% | 52.27% |
SSCNN | 66.36% | 67.55% | 66.49% | 66.36% |
SSCNN with IDL | 91.59% | 91.77% | 91.58% | 91.59% |
Table 7.
Comparison of the overall performance between the proposed TSIDL and the state-of-the-art CNN models. Bold indicates the best performance.
Table 7.
Comparison of the overall performance between the proposed TSIDL and the state-of-the-art CNN models. Bold indicates the best performance.
Model | Recall | Precision | F1-Score | Accuracy | Inference Time (ms) |
---|
EfficientNet-B0 [34] | 99.18% | 99.22% | 99.18% | 99.18% | 2.54 |
ResNet-101 [35] | 98.69% | 98.77% | 98.69% | 98.69% | 3.26 |
Inception-v3 [36] | 98.46% | 98.54% | 98.43% | 98.46% | 3.40 |
MobileNetV2 [37] | 98.79% | 98.84% | 98.78% | 98.79% | 1.54 |
SSCNN | 98.16% | 98.19% | 98.13% | 98.16% | 1.17 |
Proposed TSIDL | 99.39% | 99.41% | 99.39% | 99.39% | 3.12 |