Dog Breed Classifier

I build a simple CNN model from scratch and this model is neither too deep nor too shallow. It has five blocks of Conv2D layer followed by MaxPooling2D layer. I added a dropout layer after every two blocks of Conv2D and MaxPooing2D layers to avoid overfitting. This model didn't perform well and achieved only 5% accuracy on the test dataset.

I used six different models with pre-trained weights to classify dog breeds. The models include VGG16, VGG19, InceptionV3, ResNet50, EfficientNetB4 and Xception. Of all the models trained, the EfficientNetB4 model performed the best on the validation dataset. It achieved an accuracy of 91% on the validation data. Trained model weights are stored in EfficientNetB4_trained_weights folder. The accuracy of other models was below 83% on the validation data.

List of Dependencies

The requirements folder list all the libraries/dependencies required to run this project.

Instructions to use the repository

Clone this github repository. git clone https://github.com/Ankit-Kumar-Saini/Dog_Breed_Classifier
Download the dog dataset. Unzip the folder and prepare image label pairs for training the model.
Download the human dataset. Unzip the folder and prepare images for the face detector model.

File Descriptions

The haarcascades folder contains the pre-trained weights in the xml file format to use with the OpenCv face detector class that has been used in this project.
The test_images folder contains the sample images that are used to test the predictions of the final algorithm in this project.
The results folder contains the results of the algorithm tested on the test images. These are used for the purpose of quick demonstration in the results section below.
The extract_bottleneck_features.py file contains the code to use pre-trained imagenet models as feature extractors for transfer learning.
The dog_app.ipynb file is the main file for this project. It is a jupyter notebook containing code of face detector, dog detector and dog breed classifier models. The final algorithm that uses all these three models to make predictions is also implemented in this notebook.

Results

The step by step explanation of the project can be found at the post available here.

Some visualizations of the predictions made by the algorithm on test images

Conclusion

This project serves as a good starting point to enter into the domain of deep learning. Data exploration and visualizations are extremely important before training any Machine Learning model as it helps in choosing a suitable performance metric for evaluating the model. CNN models in Keras need image data in the form of a 4D tensor. All images need to be reshaped into the same shape for training the CNN models in batch.

Building CNN models from scratch is extremely simple in Keras. But training CNN models from scratch is computationally expensive and time-consuming. There are many pre-trained models available in Keras (trained on ImageNet dataset) that can be used for transfer learning.

The most interesting thing to note is the power of transfer learning to achieve good results with small computation. It works well when the task is similar to the task on which the pre-trained model weights are optimized.

Tips to improve the performance

Get more images per class
Make the dataset balanced
Use image augmentation methods such as CutOut, MixUp, and CutMix
Use VAEs/GANs to generate artificial data
Use activation maps to interpret the model predictions
Use deep learning-based approaches to detect human faces (MTCNN)

Licensing, Authors, Acknowledgements

Must give credit to Udacity for the data and python 3 notebook.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dog Breed Classifier

Table of Contents

Project Overview

Problem Statement

Performance Metric

Data Exploration and Visualization

Data Preprocessing

Human Detector

Dog Detector