Details of the CNN structure in the demo, as well as the mathematical derivation of backpropagation, can be found in "Derivation of Backpropagation in Convolutional Neural Network (CNN)", which was written specifically for this demo.
The CNN implementation uses a trimmed version of the DeepLearnToolbox by R. B. Palm.
- MATLAB (R2015a)
The MNIST dataset used here contains 50,000 digit images for training and 10,000 for testing. Each image is 28x28 pixels, and the digits range from 0 to 9 (10 categories).
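For reference, loading the data in the DeepLearnToolbox format might look like the following sketch (the file name `mnist_uint8` and the variable names `train_x`, `train_y`, `test_x`, `test_y` are assumptions based on the original toolbox, not necessarily the trimmed demo):

```matlab
% Minimal sketch: load MNIST in DeepLearnToolbox format (assumed file/variable names).
load mnist_uint8;                                        % train_x, train_y, test_x, test_y (uint8)
train_x = double(reshape(train_x', 28, 28, [])) / 255;   % 28x28 images, scaled to [0, 1]
test_x  = double(reshape(test_x',  28, 28, [])) / 255;
train_y = double(train_y');                              % one-hot labels, 10 categories
test_y  = double(test_y');
```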
To run the demo, execute the following in the MATLAB command window:

```matlab
>> demo_CNN_MNIST
```
Note that only 1 epoch will be performed by default. If you want to run more epochs, modify the variable num_epochs in the file demo_CNN_MNIST.m (line 62).
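For example, the change at that line might read as follows (a sketch; the surrounding code is not reproduced here):

```matlab
% demo_CNN_MNIST.m, line 62: number of training epochs
num_epochs = 200;   % default is 1; the results below were obtained with 200 (and 500) epochs
```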
Running the demo for 200 epochs gives the classification accuracy shown below. Note that the results may differ slightly from run to run because of the random initialization of the convolutional kernels.
num_epochs | Training accuracy | Testing accuracy |
---|---|---|
200 | 99.34% | 99.02% |
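If reproducible numbers are needed, one option is to fix MATLAB's random seed before running the demo (a sketch; the demo itself does not necessarily do this):

```matlab
rng(0);            % fix the random number generator so kernel initialization is repeatable
demo_CNN_MNIST;    % then run the demo as usual
```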
The loss function used in this demo is a squared-error (MSE) loss of the form

$$L = \tfrac{1}{2}\,\lVert y - \hat{y} \rVert_2^2,$$

where $y$ and $\hat{y}$ denote the true label and the prediction, respectively.
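As a minimal sketch (not the demo's exact code, and with hypothetical variable names), the loss and its gradient with respect to the prediction could be computed as:

```matlab
% Squared-error loss for one mini-batch; y and y_hat are 10 x batch_size matrices
% holding one-hot true labels and network outputs, respectively.
err      = y_hat - y;
L        = 0.5 * sum(err(:).^2) / size(y, 2);   % loss averaged over the mini-batch
dL_dyhat = err / size(y, 2);                    % gradient backpropagated into the output layer
```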
The first convolutional layer has 6 kernels, and the second has 6x12 kernels (12 output channels, each connected to the 6 input channels). All kernels are of size 5x5.
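In DeepLearnToolbox-style notation, this prototype structure could be written roughly as follows (a sketch; the field names follow the original toolbox and may differ in the trimmed demo):

```matlab
cnn.layers = {
    struct('type', 'i')                                       % input layer (28x28 image)
    struct('type', 'c', 'outputmaps',  6, 'kernelsize', 5)    % conv layer 1: 6 channels, 5x5 kernels
    struct('type', 's', 'scale', 2)                           % 2x2 pooling (sub-sampling)
    struct('type', 'c', 'outputmaps', 12, 'kernelsize', 5)    % conv layer 2: 12 channels, 5x5 kernels
    struct('type', 's', 'scale', 2)                           % 2x2 pooling (sub-sampling)
};
```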
The classification accuracy increases when a wider or deeper CNN is used. Here, "wider" means increasing the number of channels in each convolutional (pooling) layer, and "deeper" means increasing the number of convolutional (pooling) layers.
We consider the CNN structure used in the demo as the prototype, which has 6 and 12 channels in the first and second convolutional (pooling) layers, respectively. The wider CNN uses 12 and 24 channels in the first and second convolutional (pooling) layers, respectively. The deeper CNN has three convolutional (pooling) layers with 6, 12, and 24 channels. The training and testing accuracy are shown below, and a sketch of the corresponding layer definitions follows the tables.
num_epochs=200 | Training accuracy | Testing accuracy | Training MSE |
---|---|---|---|
Wider | 99.60% | 99.16% | 0.0052 |
Deeper | 99.69% | 99.28% | 0.0042 |
num_epochs=500 | Training accuracy | Testing accuracy | Training MSE |
---|---|---|---|
Wider | 99.83% | 99.20% | 0.0022 |
Deeper | 99.84% | 99.37% | 0.0016 |
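For reference, the wider and deeper variants above might be defined as follows, again in the DeepLearnToolbox-style notation sketched earlier (an illustration, not the demo's exact code; the 3x3 kernel in the deeper network's third layer is an assumption, since after two rounds of 5x5 convolution and 2x2 pooling the feature maps are only 4x4 and cannot fit another 5x5 kernel):

```matlab
% Wider: 12 and 24 channels instead of 6 and 12, all kernels 5x5.
cnn_wider.layers = {
    struct('type', 'i')
    struct('type', 'c', 'outputmaps', 12, 'kernelsize', 5)
    struct('type', 's', 'scale', 2)
    struct('type', 'c', 'outputmaps', 24, 'kernelsize', 5)
    struct('type', 's', 'scale', 2)
};

% Deeper: three convolutional (pooling) layers with 6, 12, and 24 channels.
cnn_deeper.layers = {
    struct('type', 'i')
    struct('type', 'c', 'outputmaps',  6, 'kernelsize', 5)
    struct('type', 's', 'scale', 2)
    struct('type', 'c', 'outputmaps', 12, 'kernelsize', 5)
    struct('type', 's', 'scale', 2)
    struct('type', 'c', 'outputmaps', 24, 'kernelsize', 3)   % 3x3 kernel is an assumption (4x4 maps cannot fit 5x5)
    struct('type', 's', 'scale', 2)
};
```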