
Transfer Learning for CNNs: Leveraging Pre-trained Models

Transfer learning is a machine learning technique where a pre-trained model is used as a starting point for a new task. In the
context of convolutional neural networks (CNNs), this means using a CNN that has been trained on a large dataset for one task
(e.g., ImageNet) as a foundation for a new task (e.g., classifying medical images).

Why Transfer Learning?

Reduced Training Time: Training a CNN from scratch on a large dataset can be computationally expensive and time-consuming.
Transfer learning allows you to leverage the knowledge learned by the pre-trained model, reducing training time significantly.
Improved Performance: Pre-trained models have often been trained on massive datasets, allowing them to learn general-purpose
features that can be useful for a wide range of tasks. Using these pre-trained models can improve the performance of your new task.
Smaller Datasets: Transfer learning can be particularly useful when you have a small dataset for your new task. By using a pre-
trained model, you can augment your limited data with the knowledge learned from the larger dataset.

How Transfer Learning Works:

Choose a Pre-trained Model: Select a pre-trained CNN that is suitable for your task. Common choices include VGG16, ResNet,
InceptionV3, and EfficientNet.
Freeze Layers: Typically, the earlier layers of a CNN learn general-purpose features, while the later layers learn more task-specific
features. You can freeze the earlier layers of the pre-trained model to prevent them from being updated during training. This helps to
preserve the learned features.
Add New Layers: Add new layers, such as fully connected layers or convolutional layers, to the end of the pre-trained model. These
layers will be trained on your new dataset to learn task-specific features.
Fine-tune: Train the new layers on your dataset while keeping the frozen layers fixed. This process is called fine-tuning.
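
The four steps map directly onto a few lines of Keras. A minimal sketch (the 224x224 input size, the single Dense head, and the 4-class output are placeholder choices, not recommendations):

from tensorflow.keras.applications import VGG16
from tensorflow.keras import layers, models

# 1. Choose a pre-trained model (ImageNet weights, without its classification head)
base = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

# 2. Freeze the pre-trained layers so their general-purpose features are preserved
base.trainable = False

# 3. Add new task-specific layers on top
model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(128, activation='relu'),
    layers.Dense(4, activation='softmax'),  # 4 target classes assumed
])

# 4. Fine-tune: only the newly added layers are updated during training
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
# model.fit(train_images, train_labels, validation_split=0.1, epochs=5)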

Common Transfer Learning Scenarios:

Feature Extraction: Extract features from the pre-trained model and use them as input to a different model, such as a support
vector machine (SVM) or a random forest.
Fine-tuning: Fine-tune the pre-trained model on your new dataset to adapt it to your specific task.
Hybrid Approach: Combine feature extraction and fine-tuning by extracting features from the pre-trained model and using them as
input to a new model, while also fine-tuning some layers of the pre-trained model.

Transfer learning is a powerful technique that can significantly improve the performance and efficiency of CNNs, especially
when working with limited datasets or time constraints.
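
The cells below demonstrate the fine-tuning scenario. The feature-extraction scenario is not shown, so here is a hedged sketch of it: a frozen CNN turns each image into a fixed-length vector, and a classical classifier such as an SVM is trained on those vectors. The image array and labels are placeholders.

import numpy as np
from tensorflow.keras.applications import MobileNetV2
from tensorflow.keras.applications.mobilenet_v2 import preprocess_input
from tensorflow.keras.layers import GlobalAveragePooling2D
from tensorflow.keras.models import Model
from sklearn.svm import SVC

# Frozen backbone that maps a 224x224 image to a 1280-dimensional feature vector
backbone = MobileNetV2(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
backbone.trainable = False
extractor = Model(backbone.input, GlobalAveragePooling2D()(backbone.output))

# Placeholder data: 8 images with pixel values in [0, 255] and their integer class labels
images = np.random.rand(8, 224, 224, 3) * 255.0
labels = np.array([0, 1, 2, 3, 0, 1, 2, 3])

# Extract features once, then train a classical classifier on them
features = extractor.predict(preprocess_input(images), verbose=0)
clf = SVC(kernel='rbf').fit(features, labels)
print(clf.score(features, labels))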

import numpy as np
import pandas as pd

# Visualization
import matplotlib.pyplot as plt
import seaborn as sns

# Class weight calculation


from sklearn.utils.class_weight import compute_class_weight

# Keras library
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout, GlobalAveragePooling2D
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from keras.utils import to_categorical
from tensorflow.keras.callbacks import EarlyStopping
from keras import regularizers
from keras.callbacks import ReduceLROnPlateau

# Different CNN Model


from tensorflow.keras.applications import VGG16, ResNet50, InceptionV3, MobileNetV2, DenseNet121

# To chain two different data augmented images for training


from itertools import chain

# Distributed Computing
import tensorflow as tf

import warnings
warnings.filterwarnings("ignore")

BATCH_SIZE = 48

image_height = 299
image_width = 299

# Data augmentation and pre-processing using tensorflow


data_generator_1 = ImageDataGenerator(
    rescale=1./255,
    rotation_range=5,
    width_shift_range=0.05,
    height_shift_range=0.05,
    shear_range=0.05,
    zoom_range=0.05,
    brightness_range=[0.95, 1.05],
    horizontal_flip=False,
    vertical_flip=False,
    fill_mode='nearest'
)

print('Data Augmentation 1 was created')

data_generator_2 = ImageDataGenerator(
    rescale=1./255,
    rotation_range=10,
    width_shift_range=0.1,
    height_shift_range=0.1,
    shear_range=0.1,
    zoom_range=0.1,
    brightness_range=[0.9, 1.1],
    horizontal_flip=False,
    vertical_flip=False,
    fill_mode='nearest'
)
print('Data Augmentation 2 was created')
print('Data Augmentation 2 was created')

data_generator_3 = ImageDataGenerator(rescale=1./255)

Data Augmentation 1 was created


Data Augmentation 2 was created

train_generator1 = data_generator_1.flow_from_directory(
    directory="/kaggle/input/brain-tumor-classification-mri/Training",
    color_mode="rgb",
    target_size=(image_height, image_width),  # image height, image width
    class_mode="categorical",
    batch_size=BATCH_SIZE,
    shuffle=True,
    seed=42)

print('Data Augmentation 1 was used to generate train data set\n')

Found 2870 images belonging to 4 classes.


Data Augmentation 1 was used to generate train data set

test_generator = data_generator_3.flow_from_directory(
    directory="/kaggle/input/brain-tumor-classification-mri/Testing",
    color_mode="rgb",
    target_size=(image_height, image_width),  # image height, image width
    class_mode="categorical",
    batch_size=BATCH_SIZE,
    shuffle=True,
    seed=42)

Found 394 images belonging to 4 classes.

dict_class = train_generator1.class_indices
print('Dictionary: {}'.format(dict_class))
class_names = list(dict_class.keys()) # storing class/breed names in a list
print('Class labels: {}'.format(class_names))

Dictionary: {'glioma_tumor': 0, 'meningioma_tumor': 1, 'no_tumor': 2, 'pituitary_tumor': 3}


Class labels: ['glioma_tumor', 'meningioma_tumor', 'no_tumor', 'pituitary_tumor']

frequency = np.unique(train_generator1.classes, return_counts=True)

plt.title("Training dataset", fontsize='20')


plt.pie(frequency[1], labels = class_names, autopct='%1.0f%%');
# Dataset characteristics
print("Dataset Characteristics of Train Data Set:\n")
print("Number of images:", len(train_generator1.classes))
print("Number of glioma_tumor images:", len([label for label in train_generator1.classes if label == 0]))
print("Number of meningioma_tumor images:", len([label for label in train_generator1.classes if label == 1]))
print("Number of no_tumor images:", len([label for label in train_generator1.classes if label == 2]))
print("Number of pituitary_tumor images:", len([label for label in train_generator1.classes if label == 3]))
print()

# Dataset characteristics
print("Dataset Characteristics of Test Data Set:\n")
print("Number of images:", len(test_generator.classes))
print("Number of glioma_tumor images:", len([label for label in test_generator.classes if label == 0]))
print("Number of meningioma_tumor images:", len([label for label in test_generator.classes if label == 1]))
print("Number of no_tumor images:", len([label for label in test_generator.classes if label == 2]))
print("Number of pituitary_tumor images:", len([label for label in test_generator.classes if label == 3]))
print()

Dataset Characteristics of Train Data Set:

Number of images: 2870


Number of glioma_tumor images: 826
Number of meningioma_tumor images: 822
Number of no_tumor images: 395
Number of pituitary_tumor images: 827

Dataset Characteristics of Test Data Set:

Number of images: 394


Number of glioma_tumor images: 100
Number of meningioma_tumor images: 115
Number of no_tumor images: 105
Number of pituitary_tumor images: 74

class_weights = compute_class_weight(class_weight="balanced", classes=np.unique(train_generator1.classes), y=train_generator1.classes)


class_weights = dict(zip(np.unique(train_generator1.classes), class_weights))
class_weights

{0: 0.8686440677966102,
1: 0.8728710462287105,
2: 1.8164556962025316,
3: 0.8675937122128174}
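
These weights are already in the dictionary format Keras expects. If used, they would typically be passed to fit through the class_weight argument so the under-represented no_tumor class contributes more to the loss; the lines below are only a sketch of that call, with model standing in for any of the compiled models defined later in this notebook.

# Sketch: weighting the loss with the computed class weights (illustrative, not a cell from the original run)
history = model.fit(
    train_generator1,
    validation_data=test_generator,
    epochs=EPOCHS,                # EPOCHS is defined further down in this notebook
    class_weight=class_weights    # e.g. {0: 0.87, 1: 0.87, 2: 1.82, 3: 0.87}
)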

print('Train image data from Data Augmentation 1')


img, label = next(train_generator1)
# print(len(label))

plt.figure(figsize=[20, 15])
for i in range(15):
    plt.subplot(3, 5, i+1)
    plt.imshow(img[i])
    plt.axis('off')
    plt.title(class_names[np.argmax(label[i])])
plt.show()

Train image data from Data Augmentation 1


Convolutional neural networks (CNNs)
# Define the epochs for training
EPOCHS = 2

# Define the number of GPUs to use


num_gpus = 2

# Merge augmented image data for training


# merged_train_generator = chain(train_generator1, train_generator2, train_generator3)

# Define early stopping criteria


early_stopping = EarlyStopping(monitor='val_accuracy', patience=2, verbose=1, restore_best_weights=True)

# Define the ReduceLROnPlateau callback


reduce_lr = ReduceLROnPlateau(monitor='val_accuracy', factor=0.001, patience=10, verbose=1)

# For development purpose, we first limit the train data set to the original image data set
# train_data = merged_train_generator
# train_data = train_generator1
train_data = train_generator1
# train_data = test_generator

VGG16
VGG16 is a convolutional neural network (CNN) architecture that was introduced in 2014. It was developed by researchers from
the University of Oxford and is known for its simplicity and effectiveness in image classification tasks.

Key features of VGG16:

Simple Architecture: VGG16 comprises 16 weight layers: 13 convolutional layers followed by three fully connected layers, with a final
softmax layer for classification.
Small Filters: The convolutional layers in VGG16 use small 3x3 filters, which are repeated multiple times to increase the depth of
the network.
Uniform Stride: The convolutional layers use a uniform stride of 1, which means that the filters are applied to every pixel in the input
image.
Max Pooling: After each block of convolutional layers, a max pooling layer is used to reduce the spatial dimensions of the feature
maps.

Benefits of VGG16:

Simplicity: VGG16's architecture is relatively simple and easy to understand, making it a popular choice for researchers and
practitioners.
Effectiveness: VGG16 achieved state-of-the-art performance on the ImageNet classification dataset when it was introduced,
demonstrating its effectiveness in image classification tasks.
Pre-trained Models: Pre-trained VGG16 models are widely available, which can be used as a starting point for transfer learning
tasks in other domains.

Applications of VGG16:

Image Classification: VGG16 is commonly used for image classification tasks, such as object recognition, scene classification, and
facial recognition.
Transfer Learning: VGG16 can be used as a feature extractor for transfer learning tasks, where the pre-trained model is fine-tuned
on a smaller dataset to solve a related task.
Object Detection: VGG16 has been used as a component of object detection architectures, such as Faster R-CNN and SSD.

In summary, VGG16 is a powerful and versatile CNN architecture that has made significant contributions to the field of
computer vision. Its simplicity, effectiveness, and availability of pre-trained models make it a popular choice for various image
classification and transfer learning tasks.

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained VGG16 model without the top classification layer
    base_model_VGG16 = VGG16(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_VGG16.layers:
        layer.trainable = False

    # Create a new model and add the VGG16 base model
    model_VGG16 = Sequential()
    model_VGG16.add(base_model_VGG16)

    # Add fully connected layers and an output layer for classification
    model_VGG16.add(GlobalAveragePooling2D())
    model_VGG16.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_VGG16.add(Dropout(0.4))
    model_VGG16.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_VGG16.add(Dropout(0.2))
    model_VGG16.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (VGG16):")
    model_VGG16.summary()
    print()

    # Compile the model
    model_VGG16.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model
    history_VGG16 = model_VGG16.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                    callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_VGG16, val_accuracy_VGG16 = model_VGG16.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_VGG16:.4f}')
    print(f'Validation Accuracy: {val_accuracy_VGG16:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/vgg16/vgg16_weights_tf_dim_ordering_tf_kernels_notop.h5
58889256/58889256 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (VGG16):
Model: "sequential"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ vgg16 (Functional) │ ? │ 14,714,688 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_1 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_1 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_2 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 14,714,688 (56.13 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 14,714,688 (56.13 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 88s 1s/step - accuracy: 0.2499 - loss: 1.6866 - val_accuracy: 0.3604 - val_loss: 1.4790
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 71s 1s/step - accuracy: 0.4193 - loss: 1.4111 - val_accuracy: 0.4873 - val_loss: 1.3680
Restoring model weights from the end of the best epoch: 2.
9/9 ━━━━━━━━━━━━━━━━━━━━ 3s 304ms/step - accuracy: 0.4723 - loss: 1.3854
Validation Loss: 1.3712
Validation Accuracy: 0.5025

MobileNetV2
MobileNetV2 is a convolutional neural network (CNN) architecture designed specifically for mobile and embedded vision
applications. It was introduced in 2018 and is known for its high efficiency and accuracy.

Key features of MobileNetV2:


Inverted Residual Blocks: MobileNetV2 uses inverted residual blocks as the building blocks of its architecture. These blocks
consist of a 1x1 expansion layer, a depthwise separable convolution, a 1x1 projection layer, and a residual connection.
Depthwise Separable Convolutions: Depthwise separable convolutions are a key component of MobileNetV2. They decompose
the standard convolution operation into two separate operations: depthwise convolutions and pointwise convolutions. This
decomposition significantly reduces the number of parameters and computations.
Pointwise Convolutions: Pointwise convolutions are used to combine the features produced by the depthwise convolutions. They
are equivalent to 1x1 convolutions.
ReLU6 Activation: MobileNetV2 uses the ReLU6 activation function, which is a variant of the ReLU function with a maximum value
of 6. This helps to prevent the vanishing gradient problem.

Benefits of MobileNetV2:

High Efficiency: MobileNetV2 is designed to be highly efficient, making it suitable for mobile and embedded devices with limited
computational resources.
High Accuracy: Despite its efficiency, MobileNetV2 achieves state-of-the-art accuracy on a variety of image classification
benchmarks.
Transfer Learning: MobileNetV2 can be used for transfer learning, where the pre-trained model is fine-tuned on a smaller dataset to
solve a related task.

Applications of MobileNetV2:

Mobile Vision: MobileNetV2 is widely used in mobile vision applications, such as object detection, image classification, and facial
recognition.
Embedded Vision: MobileNetV2 can be deployed on embedded devices, such as drones and robots, for tasks like real-time object
tracking and scene understanding.
Edge Computing: MobileNetV2 is well-suited for edge computing applications, where models are deployed on devices at the edge
of the network to reduce latency and bandwidth requirements.

In summary, MobileNetV2 is a highly efficient and accurate CNN architecture that is specifically designed for mobile and
embedded vision applications. Its inverted residual blocks, depthwise separable convolutions, and pointwise convolutions
make it a popular choice for developers working on resource-constrained devices.
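
To make the parameter savings concrete, the short sketch below compares a standard 3x3 convolution with its depthwise separable counterpart (Keras SeparableConv2D) on an arbitrary 64-channel input; the layer sizes are illustrative and not taken from MobileNetV2 itself.

import tensorflow as tf
from tensorflow.keras import layers

inputs = tf.keras.Input(shape=(32, 32, 64))

# Standard 3x3 convolution: 3*3*64*128 weights + 128 biases = 73,856 parameters
standard = layers.Conv2D(128, 3, padding='same')(inputs)

# Depthwise separable: 3x3 depthwise (3*3*64) + 1x1 pointwise (64*128) + 128 biases = 8,896 parameters
separable = layers.SeparableConv2D(128, 3, padding='same')(inputs)

print(tf.keras.Model(inputs, standard).count_params())   # 73856
print(tf.keras.Model(inputs, separable).count_params())  # 8896
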
%%time

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained MobileNetV2 model without the top classification layer
    base_model_MobileNet = MobileNetV2(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_MobileNet.layers:
        layer.trainable = False

    # Create a new model and add the MobileNetV2 base model
    model_MobileNet = Sequential()
    model_MobileNet.add(base_model_MobileNet)

    # Add a global average pooling layer and output layers for classification
    model_MobileNet.add(GlobalAveragePooling2D())
    model_MobileNet.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_MobileNet.add(Dropout(0.4))
    model_MobileNet.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_MobileNet.add(Dropout(0.2))
    model_MobileNet.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (MobileNetV2):")
    model_MobileNet.summary()
    print()

    # Compile the model
    model_MobileNet.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model
    history_MobileNet = model_MobileNet.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                            callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_MobileNet, val_accuracy_MobileNet = model_MobileNet.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_MobileNet:.4f}')
    print(f'Validation Accuracy: {val_accuracy_MobileNet:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/mobilenet_v2/mobilenet_v2_weights_tf_dim_ordering_tf_kernels_1.0_224_no_top.h5
9406464/9406464 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (MobileNetV2):
Model: "sequential_1"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ mobilenetv2_1.00_224 │ ? │ 2,257,984 │
│ (Functional) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_1 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_3 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_2 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_4 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_3 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_5 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 2,257,984 (8.61 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 2,257,984 (8.61 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 84s 1s/step - accuracy: 0.4051 - loss: 1.6491 - val_accuracy: 0.4112 - val_loss: 1.7908
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 65s 957ms/step - accuracy: 0.6881 - loss: 0.9968 - val_accuracy: 0.3959 - val_loss: 2.0328
Epoch 2: early stopping
Restoring model weights from the end of the best epoch: 1.

9/9 ━━━━━━━━━━━━━━━━━━━━ 2s 176ms/step - accuracy: 0.4386 - loss: 1.8038


Validation Loss: 1.8498
Validation Accuracy: 0.4467
CPU times: user 2min 36s, sys: 10.9 s, total: 2min 46s
Wall time: 2min 35s

DenseNet
DenseNet, introduced in 2017, is a convolutional neural network (CNN) architecture known for its efficient use of parameters
and its ability to achieve high accuracy with relatively few layers. Unlike traditional CNNs, where each layer's output is
passed only to the next layer, DenseNet connects each layer to every subsequent layer within a dense block. This dense
connectivity pattern enhances information flow and gradient propagation, leading to improved performance.

Key Features of DenseNet:

1. Dense Connectivity:

- Within a dense block, every layer is connected to every layer after it, creating a dense "feature
  highway" in which the feature maps of earlier layers are reused by later layers (see the code
  sketch after this list).
- This alleviates the vanishing gradient problem and enhances information flow.

2. Growth Rate (k):

- Each layer adds k new feature maps to its dense block.
- The growth rate controls how quickly the network widens: a small growth rate gives a more compact
  network, while a larger growth rate gives a wider, more expressive network.

3. Transition Layers:

- Transition layers reduce the number of feature maps and the spatial dimensions between dense blocks.
- They typically consist of a batch normalization layer, a 1x1 convolution, and a 2x2 average
  pooling layer.
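
Dense connectivity is easiest to see in code: each layer produces k new feature maps, and its input is the concatenation of everything produced before it. The toy block below uses an arbitrary growth rate and layer count, not the exact DenseNet121 block, and simply shows how the channel count grows.

import tensorflow as tf
from tensorflow.keras import layers

def toy_dense_block(x, num_layers=4, growth_rate=12):
    # Each iteration adds `growth_rate` feature maps and concatenates them
    # with all previously produced feature maps (dense connectivity)
    for _ in range(num_layers):
        y = layers.BatchNormalization()(x)
        y = layers.ReLU()(y)
        y = layers.Conv2D(growth_rate, 3, padding='same')(y)
        x = layers.Concatenate()([x, y])
    return x

inputs = tf.keras.Input(shape=(56, 56, 24))
outputs = toy_dense_block(inputs)
print(tf.keras.Model(inputs, outputs).output_shape)  # (None, 56, 56, 72): 24 input maps + 4*12 new ones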

Benefits of DenseNet:

Efficient Parameter Usage: DenseNet can achieve high accuracy with fewer parameters compared to traditional CNNs.
Improved Information Flow: The dense connectivity pattern helps to propagate information more effectively through the network,
leading to better gradient flow and reduced vanishing gradient problems.
Feature Reuse: Features from earlier layers are reused by later layers, which can help to improve the network's ability to learn
complex patterns.
Reduced Overfitting: The dense connectivity pattern can help to reduce overfitting by encouraging feature reuse and preventing the
network from learning redundant features.

Applications of DenseNet:

Image Classification: DenseNet has been successfully used for various image classification tasks, such as ImageNet classification
and fine-grained object recognition.
Object Detection: DenseNet has been incorporated into object detection architectures like Faster R-CNN and SSD.
Semantic Segmentation: DenseNet has been applied to semantic segmentation tasks, where the goal is to assign a semantic label
to each pixel in an image.

In conclusion, DenseNet is a powerful and efficient CNN architecture that has made significant contributions to the field of
computer vision. Its dense connectivity pattern and efficient parameter usage make it a popular choice for various image
analysis tasks.
%%time

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained DenseNet121 model without the top classification layer
    base_model_DenseNet = DenseNet121(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_DenseNet.layers:
        layer.trainable = False

    # Create a new model and add the DenseNet121 base model
    model_DenseNet = Sequential()
    model_DenseNet.add(base_model_DenseNet)

    # Add a global average pooling layer and output layers for classification
    model_DenseNet.add(GlobalAveragePooling2D())
    model_DenseNet.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_DenseNet.add(Dropout(0.4))
    model_DenseNet.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_DenseNet.add(Dropout(0.2))
    model_DenseNet.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (DenseNet121):")
    model_DenseNet.summary()
    print()

    # Compile the model
    model_DenseNet.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model
    history_DenseNet = model_DenseNet.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                          callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_DenseNet, val_accuracy_DenseNet = model_DenseNet.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_DenseNet:.4f}')
    print(f'Validation Accuracy: {val_accuracy_DenseNet:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/densenet/densenet121_weights_tf_dim_ordering_tf_kernels_notop.h5
29084464/29084464 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (DenseNet121):
Model: "sequential_2"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ densenet121 (Functional) │ ? │ 7,037,504 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_2 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_6 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_4 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_7 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_5 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_8 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 7,037,504 (26.85 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 7,037,504 (26.85 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 117s 1s/step - accuracy: 0.3769 - loss: 1.6322 - val_accuracy: 0.4010 - val_loss: 1.7286
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 68s 1s/step - accuracy: 0.6416 - loss: 0.9998 - val_accuracy: 0.4112 - val_loss: 1.8933
Epoch 2: early stopping
Restoring model weights from the end of the best epoch: 1.
9/9 ━━━━━━━━━━━━━━━━━━━━ 2s 200ms/step - accuracy: 0.3999 - loss: 1.7517
Validation Loss: 1.6654
Validation Accuracy: 0.4315
CPU times: user 3min 31s, sys: 12.5 s, total: 3min 43s
Wall time: 3min 15s

InceptionV3
InceptionV3 is a convolutional neural network (CNN) architecture introduced in 2015 that is known for its depth, width, and
computational efficiency. It builds upon the ideas of the Inception modules introduced in earlier Inception versions,
incorporating several enhancements to improve performance.

Key Features of InceptionV3:

Inception Modules: The core building block of InceptionV3 is the Inception module. It consists of a parallel combination of different
convolutional filters with different sizes (1x1, 3x3, 5x5) and a pooling layer. This allows the network to capture features at different
scales.
Factorization: InceptionV3 uses a factorization technique to decompose 5x5 convolutions into two 3x3 convolutions. This reduces
the computational cost while maintaining performance.
Label Smoothing: To regularize the network and prevent overfitting, InceptionV3 uses label smoothing. This technique assigns a
small probability to incorrect classes, forcing the network to be less confident in its predictions.
Auxiliary Classifiers: To improve training stability, InceptionV3 includes auxiliary classifiers at intermediate layers. These classifiers
help to guide the training process, especially in the early stages.

Benefits of InceptionV3:

High Accuracy: InceptionV3 has achieved state-of-the-art performance on various image classification benchmarks, including
ImageNet.
Computational Efficiency: The factorization technique and auxiliary classifiers help to improve the computational efficiency of the
network.
Flexibility: The Inception modules allow the network to capture features at different scales, making it more flexible and adaptable to
various image tasks.

Applications of InceptionV3:

Image Classification: InceptionV3 is widely used for image classification tasks, such as object recognition, scene classification, and
fine-grained categorization.
Object Detection: InceptionV3 has been incorporated into object detection architectures like Faster R-CNN and SSD.
Semantic Segmentation: InceptionV3 has been applied to semantic segmentation tasks, where the goal is to assign a semantic
label to each pixel in an image.

In conclusion, InceptionV3 is a powerful and efficient CNN architecture that has made significant contributions to the field of
computer vision. Its Inception modules, factorization techniques, and auxiliary classifiers make it a popular choice for various
image analysis tasks.
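
The factorization point can be checked directly: two stacked 3x3 convolutions cover the same 5x5 receptive field as a single 5x5 convolution but with fewer parameters. A small sketch with arbitrary channel counts:

import tensorflow as tf
from tensorflow.keras import layers

inputs = tf.keras.Input(shape=(35, 35, 64))

# Single 5x5 convolution: 5*5*64*64 weights + 64 biases = 102,464 parameters
five_by_five = layers.Conv2D(64, 5, padding='same')(inputs)

# Two stacked 3x3 convolutions (same receptive field): (3*3*64*64 + 64) * 2 = 73,856 parameters
stacked = layers.Conv2D(64, 3, padding='same')(inputs)
stacked = layers.Conv2D(64, 3, padding='same')(stacked)

print(tf.keras.Model(inputs, five_by_five).count_params())  # 102464
print(tf.keras.Model(inputs, stacked).count_params())       # 73856
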
%%time

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained InceptionV3 model without the top classification layer
    base_model_Inception = InceptionV3(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_Inception.layers:
        layer.trainable = False

    # Create a new model and add the InceptionV3 base model
    model_Inception = Sequential()
    model_Inception.add(base_model_Inception)

    # Add a global average pooling layer and output layers for classification
    model_Inception.add(GlobalAveragePooling2D())
    model_Inception.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_Inception.add(Dropout(0.4))
    model_Inception.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_Inception.add(Dropout(0.2))
    model_Inception.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (InceptionV3):")
    model_Inception.summary()
    print()

    # Compile the model
    model_Inception.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model with EarlyStopping
    history_Inception = model_Inception.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                            callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_Inception, val_accuracy_Inception = model_Inception.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_Inception:.4f}')
    print(f'Validation Accuracy: {val_accuracy_Inception:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/inception_v3/inception_v3_weights_tf_dim_ordering_tf_kernels_notop.h5
87910968/87910968 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (InceptionV3):
Model: "sequential_3"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ inception_v3 (Functional) │ ? │ 21,802,784 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_3 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_9 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_6 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_10 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_7 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_11 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 21,802,784 (83.17 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 21,802,784 (83.17 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 98s 1s/step - accuracy: 0.4116 - loss: 1.6481 - val_accuracy: 0.4924 - val_loss: 1.4737
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 66s 974ms/step - accuracy: 0.6280 - loss: 1.1214 - val_accuracy: 0.5178 - val_loss: 1.4841
Restoring model weights from the end of the best epoch: 2.
9/9 ━━━━━━━━━━━━━━━━━━━━ 2s 165ms/step - accuracy: 0.5253 - loss: 1.4499
Validation Loss: 1.3977
Validation Accuracy: 0.5127
CPU times: user 2min 59s, sys: 10.7 s, total: 3min 10s
Wall time: 2min 52s

ResNet
ResNet (Residual Network), introduced in 2015, is a type of convolutional neural network (CNN) architecture that has
significantly impacted the field of deep learning. The key innovation in ResNet is the introduction of residual blocks, which
allow the network to learn residual functions instead of the entire underlying mapping. This enables the training of extremely
deep networks without suffering from the vanishing gradient problem.

Residual Blocks:

Identity Mapping: A residual block consists of a stack of layers plus an identity (skip) connection. The skip connection bypasses
the stack and adds the input directly to the block's output.
Residual Function: The stacked layers learn only the residual, i.e., the difference between the desired output and the input,
rather than the full underlying mapping.

Why Residual Blocks Help:

Vanishing Gradient Problem: Residual blocks help to alleviate the vanishing gradient problem, which occurs when gradients
become very small during backpropagation, making it difficult for the network to learn. The identity connection provides a direct path
for gradients to flow, ensuring that they don't vanish completely.
Easier Optimization: Residual blocks make it easier for the network to learn deep representations. By learning the residual function,
the network can focus on learning the differences between the input and output, rather than the entire mapping.

Benefits of ResNet:

Deep Networks: ResNet has enabled the training of extremely deep networks, which have been shown to improve performance on
various tasks.
Improved Accuracy: ResNet has achieved state-of-the-art performance on many image classification benchmarks, such as
ImageNet.
Faster Training: Residual blocks can help to speed up training by making it easier for the network to learn deep representations.

Applications of ResNet:

Image Classification: ResNet is widely used for image classification tasks, such as object recognition and scene classification.
Object Detection: ResNet has been incorporated into object detection architectures like Faster R-CNN and SSD.
Semantic Segmentation: ResNet has been applied to semantic segmentation tasks, where the goal is to assign a semantic label to
each pixel in an image.

In conclusion, ResNet is a powerful and influential CNN architecture that has enabled the training of extremely deep networks
and has achieved state-of-the-art performance on various tasks. The introduction of residual blocks has been a major
breakthrough in deep learning, allowing for the development of more complex and accurate models.
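
A minimal residual block in the Keras functional API makes the idea concrete: the block outputs F(x) + x, so the skip connection gives gradients a direct path around the convolutional stack. This is a toy sketch, not the exact bottleneck block used inside ResNet50.

import tensorflow as tf
from tensorflow.keras import layers

def toy_residual_block(x, filters=64):
    # F(x): a small stack of convolutions that learns the residual
    y = layers.Conv2D(filters, 3, padding='same')(x)
    y = layers.BatchNormalization()(y)
    y = layers.ReLU()(y)
    y = layers.Conv2D(filters, 3, padding='same')(y)
    y = layers.BatchNormalization()(y)
    # Output is F(x) + x: the identity shortcut adds the input back in
    return layers.ReLU()(layers.Add()([x, y]))

inputs = tf.keras.Input(shape=(56, 56, 64))
outputs = toy_residual_block(inputs)
tf.keras.Model(inputs, outputs).summary()
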
%%time

import tensorflow as tf
from tensorflow.keras.applications import ResNet50
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D, Dropout
from tensorflow.keras import regularizers

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained ResNet50 model without the top classification layer
    base_model_ResNet = ResNet50(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_ResNet.layers:
        layer.trainable = False

    # Create a new model and add the ResNet50 base model
    model_ResNet = Sequential()
    model_ResNet.add(base_model_ResNet)

    # Add a global average pooling layer and output layers for classification
    model_ResNet.add(GlobalAveragePooling2D())
    model_ResNet.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_ResNet.add(Dropout(0.4))
    model_ResNet.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_ResNet.add(Dropout(0.2))
    model_ResNet.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (ResNet50):")
    model_ResNet.summary()
    print()

    # Compile the model
    model_ResNet.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model with EarlyStopping
    history_ResNet = model_ResNet.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                      callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_ResNet, val_accuracy_ResNet = model_ResNet.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_ResNet:.4f}')
    print(f'Validation Accuracy: {val_accuracy_ResNet:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/resnet/resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5
94765736/94765736 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (ResNet50):
Model: "sequential_4"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ resnet50 (Functional) │ ? │ 23,587,712 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_4 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_12 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_8 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_13 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_9 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_14 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 23,587,712 (89.98 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 23,587,712 (89.98 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 94s 1s/step - accuracy: 0.2606 - loss: 1.7344 - val_accuracy: 0.2893 - val_loss: 1.5052
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 67s 992ms/step - accuracy: 0.2525 - loss: 1.5070 - val_accuracy: 0.2335 - val_loss: 1.4806
Epoch 2: early stopping
Restoring model weights from the end of the best epoch: 1.
9/9 ━━━━━━━━━━━━━━━━━━━━ 3s 246ms/step - accuracy: 0.2515 - loss: 1.5305
Validation Loss: 1.5281
Validation Accuracy: 0.2538
CPU times: user 2min 52s, sys: 10.9 s, total: 3min 3s
Wall time: 2min 49s

EfficientNet
EfficientNet is a family of convolutional neural network (CNN) architectures designed to achieve state-of-the-art accuracy with
significantly fewer computational resources compared to previous models. It introduces a novel scaling method that uniformly
scales the network's depth, width, and resolution.

Key Features of EfficientNet:

Compound Scaling: EfficientNet uses a compound scaling method that scales the network's depth, width, and resolution in a
balanced manner. This ensures that the network's performance improves while maintaining computational efficiency.
EfficientNet-B0 to EfficientNet-B7: The EfficientNet family consists of a series of models, ranging from EfficientNet-B0 to
EfficientNet-B7. These models differ in their size and complexity, allowing for a trade-off between accuracy and computational cost.
MobileNetV2 Building Blocks: EfficientNet is based on MobileNetV2 building blocks, which are highly efficient and effective for
mobile and embedded vision applications.

Benefits of EfficientNet:

High Accuracy: EfficientNet achieves state-of-the-art accuracy on various image classification benchmarks, often surpassing
previous models with significantly fewer parameters.
Computational Efficiency: EfficientNet is designed to be computationally efficient, making it suitable for deployment on resource-
constrained devices.
Scalability: The compound scaling method allows EfficientNet to be easily scaled to different sizes and complexities, providing
flexibility for various applications.

Applications of EfficientNet:

Image Classification: EfficientNet is widely used for image classification tasks, such as object recognition, scene classification, and
fine-grained categorization.
Object Detection: EfficientNet has been incorporated into object detection architectures like Faster R-CNN and SSD.
Semantic Segmentation: EfficientNet has been applied to semantic segmentation tasks, where the goal is to assign a semantic
label to each pixel in an image.

In conclusion, EfficientNet is a powerful and efficient CNN architecture that has made significant contributions to the field of
computer vision. Its compound scaling method and efficient building blocks make it a popular choice for various image
analysis tasks, especially in scenarios where computational resources are limited.
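
One way to see compound scaling is to build a few family members without downloading weights and compare their canonical input resolutions and parameter counts; the exact counts printed may vary slightly across Keras versions.

from tensorflow.keras.applications import EfficientNetB0, EfficientNetB3, EfficientNetB7

# Larger variants scale depth, width and input resolution together
for name, constructor, resolution in [('B0', EfficientNetB0, 224),
                                      ('B3', EfficientNetB3, 300),
                                      ('B7', EfficientNetB7, 600)]:
    model = constructor(weights=None, include_top=False, input_shape=(resolution, resolution, 3))
    print(f'EfficientNet{name}: input {resolution}x{resolution}, {model.count_params():,} parameters')
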
%%time

import tensorflow as tf
from tensorflow.keras.applications import EfficientNetB0
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D, Dropout
from tensorflow.keras import regularizers

# Create a MirroredStrategy
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained EfficientNetB0 model without the top classification layer
    base_model_EfficientNet = EfficientNetB0(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_EfficientNet.layers:
        layer.trainable = False

    # Create a new model and add the EfficientNetB0 base model
    model_EfficientNet = Sequential()
    model_EfficientNet.add(base_model_EfficientNet)

    # Add a global average pooling layer and output layers for classification
    model_EfficientNet.add(GlobalAveragePooling2D())
    model_EfficientNet.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_EfficientNet.add(Dropout(0.4))
    model_EfficientNet.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_EfficientNet.add(Dropout(0.2))
    model_EfficientNet.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (EfficientNetB0):")
    model_EfficientNet.summary()
    print()

    # Compile the model
    model_EfficientNet.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model with EarlyStopping
    history_EfficientNet = model_EfficientNet.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                                  callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_EfficientNet, val_accuracy_EfficientNet = model_EfficientNet.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_EfficientNet:.4f}')
    print(f'Validation Accuracy: {val_accuracy_EfficientNet:.4f}')

Downloading data from https://storage.googleapis.com/keras-applications/efficientnetb0_notop.h5


16705208/16705208 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (EfficientNetB0):
Model: "sequential_5"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ efficientnetb0 (Functional) │ ? │ 4,049,571 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_5 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_15 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_10 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_16 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_11 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_17 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 4,049,571 (15.45 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 4,049,571 (15.45 MB)
Epoch 1/2
2024-08-24 19:36:54.302463: E tensorflow/core/grappler/optimizers/meta_optimizer.cc:961] layout failed: INVALID_ARGUMENT: Size of values 0 does not match size of permutation 4 @ fanin shape inStatefulPartitionedCall/cond/else/_742/cond/StatefulPartitionedCall/sequential_5_1/efficientnetb0_1/block2b_drop_1/stateless_dropout/SelectV2-2-TransposeNHWCToNCHW-LayoutOptimizer
60/60 ━━━━━━━━━━━━━━━━━━━━ 98s 1s/step - accuracy: 0.2314 - loss: 1.6641 - val_accuracy: 0.2183 - val_loss: 1.5422
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 65s 959ms/step - accuracy: 0.2513 - loss: 1.5095 - val_accuracy: 0.2284 - val_loss: 1.4732
Epoch 2: early stopping
Restoring model weights from the end of the best epoch: 1.
9/9 ━━━━━━━━━━━━━━━━━━━━ 2s 193ms/step - accuracy: 0.1881 - loss: 1.5437
Validation Loss: 1.5347
Validation Accuracy: 0.1878
CPU times: user 2min 52s, sys: 10.5 s, total: 3min 3s
Wall time: 2min 51s

NASNet
NASNet is a convolutional neural network (CNN) architecture that was designed using neural architecture search (NAS)
techniques. This means that the network's architecture was not manually designed by humans but rather was automatically
generated by a machine learning algorithm.

Key Features of NASNet:

Neural Architecture Search: NASNet was generated using a reinforcement learning algorithm that searched through a vast space
of possible architectures to find the best one.
Inception-like Modules: NASNet is based on Inception-like modules, which are a combination of different convolutional filters with
different sizes.
Transfer Learning: NASNet can be used for transfer learning, where a pre-trained model is fine-tuned on a smaller dataset to solve
a related task.

Benefits of NASNet:

High Accuracy: NASNet has achieved state-of-the-art performance on various image classification benchmarks.
Efficient Architecture: The architecture of NASNet is designed to be computationally efficient.
Automation: The use of neural architecture search eliminates the need for manual design, making it easier to explore a wider range
of architectures.

Applications of NASNet:

Image Classification: NASNet is widely used for image classification tasks, such as object recognition, scene classification, and
fine-grained categorization.
Object Detection: NASNet has been incorporated into object detection architectures like Faster R-CNN and SSD.
Semantic Segmentation: NASNet has been applied to semantic segmentation tasks, where the goal is to assign a semantic label to
each pixel in an image.

In conclusion, NASNet is a powerful and efficient CNN architecture that has made significant contributions to the field of
computer vision. Its use of neural architecture search allows for the automatic generation of high-performing models, making it
a valuable tool for researchers and practitioners.
%%time

import tensorflow as tf
from tensorflow.keras.applications import NASNetMobile
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D, Dropout
from tensorflow.keras import regularizers

# Create a MirroredStrategy (adjust device names if needed)
strategy = tf.distribute.MirroredStrategy(devices=['/gpu:0', '/gpu:1'])

# Open a strategy scope
with strategy.scope():

    # Load the pre-trained NASNetMobile model without the top classification layer
    base_model_NASNet = NASNetMobile(weights='imagenet', include_top=False, input_shape=(image_height, image_width, 3))

    # Set the layers of the base model as non-trainable (freeze them)
    for layer in base_model_NASNet.layers:
        layer.trainable = False

    # Create a new model and add the NASNetMobile base model
    model_NASNet = Sequential()
    model_NASNet.add(base_model_NASNet)

    # Add a global average pooling layer and output layers for classification
    model_NASNet.add(GlobalAveragePooling2D())
    model_NASNet.add(Dense(128, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_NASNet.add(Dropout(0.4))
    model_NASNet.add(Dense(64, activation='relu', kernel_regularizer=regularizers.l2(0.001)))
    model_NASNet.add(Dropout(0.2))
    model_NASNet.add(Dense(4, activation='softmax'))

    # Model summary
    print("Model Summary (NASNetMobile):")
    model_NASNet.summary()
    print()

    # Compile the model
    model_NASNet.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

    # Train the model with EarlyStopping
    history_NASNet = model_NASNet.fit(train_data, epochs=EPOCHS, validation_data=test_generator,
                                      callbacks=[early_stopping, reduce_lr])

    # Validate the model
    val_loss_NASNet, val_accuracy_NASNet = model_NASNet.evaluate(test_generator, steps=len(test_generator))
    print(f'Validation Loss: {val_loss_NASNet:.4f}')
    print(f'Validation Accuracy: {val_accuracy_NASNet:.4f}')

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/nasnet/NASNet-mobile-no-top.h5
19993432/19993432 ━━━━━━━━━━━━━━━━━━━━ 0s 0us/step
Model Summary (NASNetMobile):
Model: "sequential_6"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ NASNet (Functional) │ ? │ 4,269,716 │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ global_average_pooling2d_6 │ ? │ 0 (unbuilt) │
│ (GlobalAveragePooling2D) │ │ │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_18 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_12 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_19 (Dense) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dropout_13 (Dropout) │ ? │ 0 (unbuilt) │
├─────────────────────────────────┼────────────────────────┼───────────────┤
│ dense_20 (Dense) │ ? │ 0 (unbuilt) │
└─────────────────────────────────┴────────────────────────┴───────────────┘
Total params: 4,269,716 (16.29 MB)
Trainable params: 0 (0.00 B)
Non-trainable params: 4,269,716 (16.29 MB)
Epoch 1/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 140s 1s/step - accuracy: 0.4293 - loss: 1.5059 - val_accuracy: 0.5635 - val_loss: 1.4621
Epoch 2/2
60/60 ━━━━━━━━━━━━━━━━━━━━ 64s 948ms/step - accuracy: 0.6694 - loss: 1.0520 - val_accuracy: 0.5584 - val_loss: 1.4447
Restoring model weights from the end of the best epoch: 1.
9/9 ━━━━━━━━━━━━━━━━━━━━ 2s 186ms/step - accuracy: 0.5102 - loss: 1.5139
Validation Loss: 1.5138
Validation Accuracy: 0.5025
CPU times: user 3min 48s, sys: 12.3 s, total: 4min
Wall time: 3min 37s

Model Performance Comparison


data = {
    'VGG16': val_accuracy_VGG16,
    'MobileNet': val_accuracy_MobileNet,
    'DenseNet': val_accuracy_DenseNet,
    'Inception': val_accuracy_Inception,
    'ResNet': val_accuracy_ResNet,
    'EfficientNet': val_accuracy_EfficientNet,
    'NASNet': val_accuracy_NASNet
}

df = pd.DataFrame.from_dict(data, orient='index', columns=['Accuracy'])
df = df.reset_index().rename(columns={'index': 'Model'})

plt.figure(figsize=[15, 5])

# Create bar chart
sns.barplot(x='Model', y='Accuracy', data=df)

# Add labels to bars
ax = plt.gca()
for bar in ax.containers:
    ax.bar_label(bar, label_type='edge', labels=[f"{x:.1%}" for x in bar.datavalues], fontsize=20)

# Adjust the layout
plt.tight_layout()

plt.show()
Prediction Result Samples
test_generator.reset()
img, label = next(test_generator)

prediction = model_Inception.predict(img)
test_pred_classes = np.argmax(prediction, axis=1)

plt.figure(figsize=[20, 20])
for i in range(20):
    plt.subplot(5, 4, i+1)
    plt.imshow(img[i])
    plt.axis('off')
    plt.title("Label : {}\n Prediction : {} {:.1f}%".format(class_names[np.argmax(label[i])],
                                                            class_names[test_pred_classes[i]],
                                                            100 * np.max(prediction[i])))
plt.show()

2/2 ━━━━━━━━━━━━━━━━━━━━ 12s 6s/step

