rohinsequeira / cifar10_image_recognition

This project forked from arijit-datascience/cifar10_image_recognition

cifar10_image_recognition's Introduction

EVA6_Session7_Advanced_Concepts

Time to try our hand at something more than just digits. How about some cars ... planes ... maybe a few animals here and there? Welcome to our experiments with Advanced Concepts on the CIFAR-10 dataset.

Understanding the CIFAR-10 dataset

The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images.

The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.
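As a quick sanity check, the counts quoted above can be tied together with a few lines of arithmetic. This is only a sketch built from the numbers in the text; nothing is downloaded or loaded, and the constant names are made up for illustration.

```python
# Figures taken from the dataset description above (no data is loaded here).
NUM_CLASSES = 10
IMAGES_PER_CLASS = 6000
TRAIN_BATCHES = 5
TEST_BATCHES = 1
IMAGES_PER_BATCH = 10000
TEST_IMAGES_PER_CLASS = 1000

total_images = NUM_CLASSES * IMAGES_PER_CLASS          # whole dataset
train_images = TRAIN_BATCHES * IMAGES_PER_BATCH        # five training batches
test_images = TEST_BATCHES * IMAGES_PER_BATCH          # one test batch
train_images_per_class = IMAGES_PER_CLASS - TEST_IMAGES_PER_CLASS

# A 32x32 colour image has 3 channels, so each image holds 32 * 32 * 3 values.
values_per_image = 32 * 32 * 3

print(total_images, train_images, test_images, train_images_per_class, values_per_image)
```

The split is consistent: the training and test counts add up to the full dataset, and each class contributes 5000 training images.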

Here are the classes in the dataset (the original page also shows 10 random images from each): airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck.

The classes are completely mutually exclusive. There is no overlap between automobiles and trucks. "Automobile" includes sedans, SUVs, things of that sort. "Truck" includes only big trucks. Neither includes pickup trucks.

Source: https://www.cs.toronto.edu/~kriz/cifar.html

Concept Time!

Dilated Convolution

Source: Rohan Shravan

Dilated convolution increases the receptive field (global view) of the network exponentially while the number of parameters grows only linearly. This makes it useful in applications that care more about integrating knowledge of the wider context at a lower cost.
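To make the "exponential receptive field, linear parameters" point concrete, here is a minimal sketch. It assumes a stack of stride-1 3x3 convolutions whose dilation doubles at every layer (1, 2, 4, ...). That schedule is an illustrative assumption, not necessarily the one used in this repo's model, and the helper names are hypothetical.

```python
def effective_kernel_size(kernel_size, dilation):
    """Span covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 convolutions with the given dilations."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


def weights_per_channel_pair(kernel_size, num_layers):
    """Kernel weights per (input channel, output channel) pair, biases ignored."""
    return num_layers * kernel_size * kernel_size


# Assumed schedule: dilation doubles with every extra layer.
for n in range(1, 5):
    dilations = [2 ** i for i in range(n)]
    print(n, dilations, receptive_field(3, dilations), weights_per_channel_pair(3, n))
```

Under these assumptions, going from 1 to 4 layers grows the receptive field from 3 to 31 pixels (roughly doubling each time), while the weight count only goes from 9 to 36 per channel pair. Note that dilation does not add weights: a 3x3 kernel with dilation 2 spans 5 pixels but still has 9 weights.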

The key application the dilated convolution authors have in mind is dense prediction: vision applications where the predicted object has a similar size and structure to the input image. Examples include semantic segmentation with one label per pixel, image super-resolution, denoising, demosaicing, bottom-up saliency, and keypoint detection.

In many such applications one wants to integrate information from different spatial scales and balance two properties:

∙ local, pixel-level accuracy, such as precise detection of edges, and

∙ integrating the knowledge of the wider, global context

Source: Rohan Shravan

Depthwise Separable Convolution

Source: Rohan Shravan

Objectives

Code Structure

Code is split into different modules (as it should be!). If you are looking for the final notebook, you can find it here.

dataset contains the code for data downloading, prepping and preprocessing. You can find code related to transformations and augmentations here.
- dataset.py: Data loading and processing code is here.
models will take you to our modelling directory which contains code for our network structure and the training and testing modules.
- model.py: Network Architecture code.
- test.py: Test code.
- train.py: Train code.
utils has code for our visualization needs.
- plots.py: Visualization for Train, Test logs and sample images.
CIFAR10_Image_Recognition.ipynb is the one notebook to rule them all! Open it to see the final results of our experiments.

Logs

Model Summary

Training and Validation Loss

Training and Validation Accuracy

Conclusions and notes

Objectives Achieved

Notes:

In place of max pooling, we used a depthwise convolution with a kernel size of 3 and a stride of 2, which halves the spatial size of each channel (feature map).
Using depthwise convolution greatly reduced the number of parameters required, since there is only one filter per input channel.
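The two notes above can be checked with some rough arithmetic. This is a sketch only: the channel counts (64 in, 64 out), the padding of 1, and the function names are assumptions chosen for illustration, not values read from model.py, and biases are ignored.

```python
def conv_output_size(size, kernel_size, stride, padding):
    """Spatial output size of a convolution (dilation 1, floor division)."""
    return (size + 2 * padding - kernel_size) // stride + 1


def standard_conv_params(c_in, c_out, k):
    """Weights of a regular k x k convolution: every output sees every input channel."""
    return c_in * c_out * k * k


def depthwise_conv_params(channels, k):
    """Weights of a depthwise convolution: one k x k filter per input channel."""
    return channels * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Depthwise k x k convolution followed by a 1 x 1 pointwise convolution."""
    return depthwise_conv_params(c_in, k) + c_in * c_out


# Hypothetical layer: 64 -> 64 channels, 3x3 kernel, on a 32x32 feature map.
print(conv_output_size(32, kernel_size=3, stride=2, padding=1))  # spatial size after stride 2
print(standard_conv_params(64, 64, 3))
print(depthwise_conv_params(64, 3))
print(depthwise_separable_params(64, 64, 3))
```

With these assumed numbers, the stride-2 layer turns a 32x32 map into 16x16, and the depthwise separable layer needs 4672 weights against 36864 for a regular 3x3 convolution, roughly an 8x reduction.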

Collaborators

Abhiram Gurijala
Arijit Ganguly
Rohin Sequeira
