Path: blob/main/C2/W3/assignment/C2W3_Assignment.ipynb
Week 3: Transfer Learning
Welcome to this assignment! This week, you are going to use a technique called Transfer Learning
in which you utilize an already trained network to help you solve a similar problem to the one it was originally trained to solve.
TIPS FOR SUCCESSFUL GRADING OF YOUR ASSIGNMENT:
All cells are frozen except for the ones where you need to submit your solutions or when explicitly mentioned you can interact with it.
You can add new cells to experiment but these will be omitted by the grader, so don't rely on newly created cells to host your solution code, use the provided places for this.
You can add the comment # grade-up-to-here in any graded cell to signal the grader that it must only evaluate up to that point. This is helpful if you want to check if you are on the right track even if you are not done with the whole assignment. Be sure to remember to delete the comment afterwards!
Avoid using global variables unless you absolutely have to. The grader tests your code in an isolated environment without running all cells from the top. As a result, global variables may be unavailable when scoring your submission. Global variables that are meant to be used will be defined in UPPERCASE.
To submit your notebook, save it and then click on the blue submit button at the beginning of the page.
Let's get started!
Dataset
For this assignment, you will use the Horse or Human dataset
, which contains images of horses and humans.
All the images are contained within the ./data/
directory. The complete tree looks like this:
Now take a look at a sample image of each one of the classes. You will simply be picking the first image from each class in the train
folder.
By plotting the images with matplotlib
it is easy to see that these images have a resolution of 300x300 (look at the image axes) and are colored, but you can double check this by using the code below:
As expected, the sample image has a resolution of 300x300 and the last dimension is used for each one of the RGB channels to represent color.
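As a quick sketch (not part of the graded code), a small helper like the one below can confirm the resolution and channel count; the path you pass in is whatever image you pick from the train folder:

```python
import tensorflow as tf

# Hypothetical helper (not from the assignment): load one image and
# report its array shape to confirm resolution and channel count.
def show_image_shape(path):
    image = tf.keras.utils.load_img(path)       # PIL image at native resolution
    array = tf.keras.utils.img_to_array(image)  # numpy array, shape (H, W, C)
    print(array.shape)                          # for this dataset: (300, 300, 3)
    return array.shape
```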
Exercise 1: train_val_datasets
Now that you have a better understanding of the images you are dealing with, it is time for you to code the datasets that will feed these images to your network. For this, complete the train_val_datasets function below, in which you will be using the image_dataset_from_directory function from tf.keras.utils. For grading purposes, use a batch size of 32 for the datasets; you can later test what happens if you change this parameter.
Important Note: The images have a resolution of 300x300, but the image_dataset_from_directory function you will use allows you to set a target resolution. In this case, set an image_size of (150, 150). This will heavily lower the number of trainable parameters in your final network, yielding much quicker training times without compromising accuracy!
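A minimal sketch of such a function, assuming a single data directory with one sub-folder per class and an 80/20 split handled via image_dataset_from_directory's validation_split/subset arguments (the split fraction, seed, and directory layout are assumptions; the notebook may organize train/validation folders differently):

```python
import tensorflow as tf

# Sketch, assuming one sub-folder per class under data_dir and an assumed
# 80/20 validation split; only image_size=(150, 150) and batch_size=32 are
# required by the assignment text.
def train_val_datasets(data_dir="./data/"):
    train_dataset = tf.keras.utils.image_dataset_from_directory(
        data_dir,
        validation_split=0.2,    # assumed split fraction
        subset="training",
        seed=42,                 # assumed seed so both subsets split consistently
        image_size=(150, 150),   # downscale from the native 300x300
        batch_size=32,           # batch size required for grading
        label_mode="binary",     # matches the single-unit sigmoid head used later
    )
    validation_dataset = tf.keras.utils.image_dataset_from_directory(
        data_dir,
        validation_split=0.2,
        subset="validation",
        seed=42,
        image_size=(150, 150),
        batch_size=32,
        label_mode="binary",
    )
    return train_dataset, validation_dataset
```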
Expected Output:
Ultimately, you will want to use your trained model to predict new images, so it is always good to reserve some images for the test set. This will be images never seen by the model, which you can use to check your final model performance. As the original dataset doesn't contain a test set, you will create one by splitting the validation dataset.
Exercise 2: create_pre_trained_model
For this assignment, you will be using the pre-trained InceptionV3 model available in TensorFlow. In the model folder, you can already find the InceptionV3 weights, so you can use them to initialize the InceptionV3 model.
Complete the create_pre_trained_model
function below. You should specify the correct input_shape
for the model (remember that you set a new resolution for the images instead of the native 300x300). Remember to make all of the layers non-trainable, since you will be using the pre-trained weights you just loaded.
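A sketch of such a function is shown below; the local weights filename is a placeholder here (pass the actual path to the weights file found in the model folder):

```python
from tensorflow.keras.applications.inception_v3 import InceptionV3

# Sketch: local_weights_file is a placeholder argument; point it at the
# actual InceptionV3 weights file in the model folder.
def create_pre_trained_model(local_weights_file=None):
    pre_trained_model = InceptionV3(
        input_shape=(150, 150, 3),  # the resized resolution, not the native 300x300
        include_top=False,          # drop InceptionV3's fully-connected classifier head
        weights=None,               # don't download weights; load them locally below
    )
    if local_weights_file is not None:
        pre_trained_model.load_weights(local_weights_file)
    # Freeze every layer so the pre-trained weights are not updated during training
    for layer in pre_trained_model.layers:
        layer.trainable = False
    return pre_trained_model
```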
Expected Output:
Now print the summary for the pre_trained_model. If you scroll down to the end of the output you will see that the layers in the model are set to non-trainable, since the number of Total params is the same as the number of Non-trainable params.
Creating callbacks for later
You do not want your model to train more than necessary, so you will be creating a callback to stop the training once an accuracy of 99.9% is reached. Since you have already worked with callbacks earlier in this specialization, this callback is provided for you; just run the cell below.
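A sketch of the kind of callback the notebook provides is shown below; the metric key "accuracy" assumes the model will be compiled with metrics=["accuracy"]:

```python
import tensorflow as tf

# Sketch: stop training once the epoch's training accuracy reaches 99.9%.
# The logs key "accuracy" assumes metrics=["accuracy"] at compile time.
class EarlyStoppingCallback(tf.keras.callbacks.Callback):
    def on_epoch_end(self, epoch, logs=None):
        logs = logs or {}
        if logs.get("accuracy", 0.0) >= 0.999:
            print("\nReached 99.9% accuracy so cancelling training!")
            self.model.stop_training = True
```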
Exercise 3: output_of_last_layer
Now that the pre-trained model is ready, you need to "glue" it to your own model to solve the task at hand. For this you will need the last output of the pre-trained model, since this will be the input for your own. Complete the output_of_last_layer
function below.
Note: For grading purposes use the mixed7
layer as the last layer of the pre-trained model. However, after submitting feel free to come back here and play around with this.
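A minimal sketch of what this function can look like, using get_layer to grab mixed7 and expose its output:

```python
# Sketch: fetch the mixed7 layer and return its output tensor, which will
# serve as the input to the new classification head.
def output_of_last_layer(pre_trained_model):
    last_desired_layer = pre_trained_model.get_layer("mixed7")
    last_output = last_desired_layer.output
    print("last layer output shape:", last_output.shape)
    return last_output
```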
Check that everything works as expected:
Expected Output (if mixed7
layer was used):
Now you will create the final model by adding some additional layers on top of the pre-trained model.
Complete the create_final_model
function below. You will need to use TensorFlow's Functional API for this, since the pre-trained model has been created using it.
Let's double check this first:
Exercise 4: create_final_model
To create the final model, you will use the tf.keras.Model class by defining the appropriate inputs and outputs. If you need any help doing this, you can check the official docs.
There is more than one way to implement the final layer for this kind of binary classification problem. For this exercise, use a layer with a single unit and a sigmoid activation function along with an appropriate loss function. This way the number of parameters to train is consistent with the expected outputs presented later.
To help you build the full model, remember that you can get the input from any existing model by using its input attribute, and by using the Functional API you can use the last layer directly as output when creating the final model.
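A sketch of such a function is below; the hidden-layer size (1024 units) and the RMSprop learning rate are assumptions chosen to match a typical setup for this assignment, not values confirmed by the text:

```python
import tensorflow as tf

# Sketch: the 1024-unit hidden layer and the RMSprop learning rate are
# assumed hyperparameters; the single sigmoid unit and binary_crossentropy
# loss follow the assignment's instructions.
def create_final_model(pre_trained_model, last_output):
    x = tf.keras.layers.Flatten()(last_output)             # flatten mixed7's feature maps
    x = tf.keras.layers.Dense(1024, activation="relu")(x)  # assumed hidden layer
    x = tf.keras.layers.Dense(1, activation="sigmoid")(x)  # single unit for binary output

    # Functional API: reuse the pre-trained model's input as the model input
    model = tf.keras.Model(inputs=pre_trained_model.input, outputs=x)
    model.compile(
        optimizer=tf.keras.optimizers.RMSprop(learning_rate=1e-4),
        loss="binary_crossentropy",  # pairs with the sigmoid output
        metrics=["accuracy"],
    )
    return model
```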
Expected Output:
Wow, that is a lot of parameters!
After submitting your assignment later, try re-running this notebook but using the original resolution of 300x300, you will be surprised to see how many more parameters there are for that case.
Before training the model, there is one small preprocessing step you need to apply to the input images. According to the inception_v3 documentation, the model expects you to apply tf.keras.applications.inception_v3.preprocess_input to the images, which simply scales the input pixels to the range [-1, 1]. Run the cell below to define a preprocess function, which you can then apply to the data.
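A sketch of such a function (the dataset variable names in the map example are assumptions):

```python
import tensorflow as tf

# The inception_v3 preprocessing rescales pixels from [0, 255] to [-1, 1]
def preprocess(image, label):
    image = tf.keras.applications.inception_v3.preprocess_input(image)
    return image, label

# Applied lazily via map (variable names assumed):
# train_dataset = train_dataset.map(preprocess)
# validation_dataset = validation_dataset.map(preprocess)
```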
Now that you have defined your model and the preprocessing function, go ahead and train it. Note the map
method used to apply the preprocessing to the train, validation and test datasets.
The training should have stopped after fewer than 5 epochs and it should have reached an accuracy over 99.9% (firing the callback). This happened so quickly because of the pre-trained model you used, which already contained information to classify humans from horses. Really cool!
Now take a quick look at the training and validation accuracies for each epoch of training. Of course, since the training was done so fast you will not have many points to visualize.
Testing your model
Now that you have trained your full model, you can go ahead and test the performance on the test data you created earlier. You can simply use the .evaluate
method for this purpose:
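As a minimal, self-contained illustration of the .evaluate call (a toy model and random data standing in for the notebook's trained model and test split):

```python
import numpy as np
import tensorflow as tf

# Toy stand-in for the trained model, just to show the .evaluate call shape
model = tf.keras.Sequential([
    tf.keras.Input(shape=(4,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Random stand-in data for the held-out test split
x = np.random.rand(8, 4).astype("float32")
y = np.random.randint(0, 2, size=(8, 1)).astype("float32")
test_dataset = tf.data.Dataset.from_tensor_slices((x, y)).batch(4)

# .evaluate returns the loss followed by each compiled metric
loss, accuracy = model.evaluate(test_dataset, verbose=0)
print(f"test loss: {loss:.4f}, test accuracy: {accuracy:.4f}")
```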
Congratulations on finishing this week's assignment!
You have successfully implemented a convolutional neural network that leverages a pre-trained network to help you solve the problem of classifying humans from horses.
Keep it up!