Giter Site home page Giter Site logo

-dl-for-computer-vision-'s Introduction

-Completed Assignments (My solution) for EECS 498-007 / 598-005: Deep Learning for Vision Fall 2019.

This course was offered by the University of Michigan to talk really deep about computer vision especially in deep learning.

Find course notes and assignments here and be sure to check out video lectrues for Fall 2019!

All the assignments before are done with Pytorch.

-Assignment 1:

Q1: PyTorch 101. Walk you through the basics of working with tensors in PyTorch.

Q2: k-Nearest Neighbor classifier. Walk you through implementing a kNN classifier.

-Assignment 2:

Q1: Linear Classifiers. Walk you through implmenting SVM and Softmax classifier.

Q2: Two-layer Neural Network. Walk you through implementing a two-layer neural network-based classifier.

-Assignment 3:

Q1: Fully-Connected Neural Network. Walk you through implementing Fully-Connected Neural Networks

Q2: Convolutional Neural Network. Walk you through implementing Convolutional Neural Networks.

-Assignment 4:

Q1: PyTorch Autograd. Introduce you to the different levels of abstraction that PyTorch provides for building neural network models. You will use this knowledge to implement and train Residual Networks for image classification.

Q2: Image Captioning with Recurrent Neural Networks. Walk you through the implementation of vanilla recurrent neural networks (RNN) and Long Short Term Memory (LSTM) RNNs. You will use these networks to train an image captioning model. You will then augment your implementation to perform spatial attention over image regions while generating captions.

Q3: Network Visualization. Walk you through the use of image gradients for generating saliency maps, adversarial examples, and class visualizations

Q4: Style Transfer. Learn how to create images with the artistic style of one image and the content of another image

-Assignment 5:

Q1: Single-Stage Detector. Walk you through the implementation of a fully-convolutional single-stage object detector similar to YOLO (Redmon et al, CVPR 2016). You will train and evaluate your detector on the PASCAL VOC 2007 object detection dataset.

Q2: Two-Stage Detector. Walk you through the implementation of a two-stage object detector similar to Faster R-CNN (Ren et al, NeurIPS 2015). This will combine a fully-convolutional Region Proposal Network (RPN) and a second-stage recognition network.

-Assignment 6:

Q1: Generative Adversarial Networks. walk you through the implementation of fully-connected and convolutional generative adversarial networks on the MNIST dataset.

-dl-for-computer-vision-'s People

Contributors

jeremiah-ropo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.