Giter Site home page Giter Site logo

rvk007 / deepnet Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 556 KB

A python library for computer vision applications

License: MIT License

Python 100.00%
object-recognition computer-vision python pytorch cnn gradcam deep-learning cifar10 monocular-depth-estimation mask-detection

deepnet's Introduction

tensornet

Open in Colab

Deepnet is an open-source library that can be used for solving problems of Computer vision in Deep Learning.

NOTE: This documentation applies to the MASTER version of DeepNet only.

Install Dependencies

Install the required packages
pip install -r requirements.txt

Features

DeepNet currently supports the following features:

Models

Models Description
ResNet ResNet-18
ResModNet A modified version of ResNet-18
CustomNet A modified version of ResNet-18
MaskNet3 A model to predict the Segmentation mask of the given image.
DepthMaskNet8 A model to predict the Monocular Depth Maps of the given image.

Training and Validation

Functionality Description
Train Training and Validation of the model
Model Handles all the function for training a model
Dataset Contains classes to handle data for training the model

Metrics

Losses

Loss Description
Dice ResNet-18
SSIM A modified version of ResNet-18
MSE Mean squared error (squared L2 norm) between each element in the input and target
BCE Binary Cross Entropy between the target and the output
BCEWithLogitsLoss Combination of Sigmoid layer and the BCE in one single class
RMSE Root mean squared error (squared L2 norm) between each element in the input and target

Weighted Combination of loss functions

Scheduler

  • StepLR
  • ReduceLROnPlateau
  • OneCycleLR

Data Augmentation

  • Resize
  • Padding
  • Random Crop
  • Horizontal Flip
  • Vertical Flip
  • Gaussian Blur
  • Random Rotation
  • CutOut

Utilities

Utility Description
GRADCAM Calculates GradCAM(Gradient-weighted Class Activation Map) saliency map
GradCAMpp Calculate GradCAM++ salinecy map using heatmap and image
LRFinder Range test to calculate optimal Learning Rate
Checkpoint Loading and saving checkpoints
ProgressBar Display Progress bar
Tensorboard Creates Tensorboard visualization
Summary Display model summary
Plot Plot the graph of a metric, prediction image and class accuracy

Dependencies

DeepNet has the following third-party dependencies

  • numpy
  • torch
  • torchvision
  • torchsummary
  • tqdm
  • matplotlib
  • albumentations
  • opencv-python

For a demo on how to use these modules, refer to the notebooks present in the examples directory.

Contact/Getting Help

If you need any help or want to report a bug, raise an issue in the repo.

deepnet's People

Contributors

rvk007 avatar

Stargazers

 avatar

Watchers

 avatar  avatar

deepnet's Issues

Data Augmentation with Albumentation

Add data augmentations using albumentation package.
Following augmentations will be added:

  • Horizontal Flip
  • Vertical Flip
  • Rotate
  • Cutout
  • Gaussian Blur

Update metrics

Accuracy doesn't work for depth estimation, so incorporate evaluation metrics.

Add Examples

Add a Example folder that contains colab files that can referred to understand the package.

GradCAM

Add GradCAM functionality to the project

  • GradCAM
  • GradCAM++

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.