📚 The CoCalc Library - books, templates and other resources
License: OTHER
Credits: Forked from deep-learning-keras-tensorflow by Valerio Maggio
Convolution Nets for MNIST
Deep Learning models can take quite a bit of time to run, particularly if GPU isn't used.
In the interest of time, you could sample a subset of observations (e.g. ) that are a particular number of your choice (e.g. ) and observations that aren't that particular number (i.e. ).
We will build a model using that and see how it performs on the test dataset
Loading Data
Data Preparation
A simple CNN
Step 1: Model Definition
Step 2: Compile
Step 3: Fit
Step 4: Evaluate
Let's plot our model Predictions!
Adding more Dense Layers
Adding Dropout
Adding more Convolution Layers
Exercise
The above code has been written as a function.
Change some of the hyperparameters and see what happens.
Batch Normalisation
Normalize the activations of the previous layer at each batch, i.e. applies a transformation that maintains the mean activation close to 0 and the activation standard deviation close to 1.