Giter Site home page Giter Site logo

patrick-llgc / learning-deep-learning Goto Github PK

View Code? Open in Web Editor NEW
1.1K 107.0 176.0 89.46 MB

Paper reading notes on Deep Learning and Machine Learning

Jupyter Notebook 100.00%
deep-learning paper literature-review machine-learning computer-vision cnn paper-reading paper-review reinforcement-learning medical medical-imaging point-cloud 3d-object-detection 3d-object-recognition

learning-deep-learning's Introduction

Paper notes

This repository contains my paper reading notes on deep learning and machine learning. It is inspired by Denny Britz and Daniel Takeshi. A minimalistic webpage generated with Github io can be found here.

About me

My name is Patrick Langechuan Liu. After about a decade of education and research in physics, I found my passion in deep learning and autonomous driving.

What to read

If you are new to deep learning in computer vision and don't know where to start, I suggest you spend your first month or so dive deep into this list of papers. I did so (see my notes) and it served me well.

Here is a list of trustworthy sources of papers in case I ran out of papers to read.

My review posts by topics

I regularly update my blog in Toward Data Science.

2024-06 (8)

2024-03 (11)

2024-02 (7)

2023-12 (4)

2023-09 (3)

2023-08 (3)

2023-07 (6)

2023-06 (5)

2023-05 (7)

2023-04 (1)

2023-03 (5)

2023-02 (4)

2023-01 (2)

2022-11 (1)

2022-10 (1)

2022-09 (3)

2022-08 (1)

2022-07 (8)

2022-06 (3)

2022-03 (1)

2022-02 (1)

2022-01 (1)

2021-12 (5)

2021-11 (4)

2021-10 (3)

2021-09 (11)

2021-08 (11)

2021-07 (1)

2021-06 (2)

2021-04 (5)

2021-03 (4)

2021-01 (7)

2020-12 (17)

2020-11 (18)

2020-10 (14)

2020-09 (15)

2020-08 (26)

2020-07 (25)

2020-06 (20)

2020-05 (19)

2020-04 (14)

2020-03 (15)

2020-02 (12)

2020-01 (19)

2019-12 (12)

2019-11 (20)

2019-10 (18)

2019-09 (17)

2019-08 (18)

2019-07 (19)

2019-06 (12)

2019-05 (18)

2019-04 (12)

2019-03 (19)

2019-02 (9)

2019-01 (10)

2018

2017 and before

Papers to Read

Here is the list of papers waiting to be read.

Deep Learning in general

Self-training

2D Object Detection and Segmentation

Fisheye

Video Understanding

Pruning and Compression

Architecture Improvements

Reinforcement Learning

3D Perception

Stereo and Flow

Traffic light and traffic sign

Datasets and Surveys

Unsupervised depth estimation

Indoor Depth

lidar

Egocentric bbox prediction

Lane Detection

Tracking

keypoints: pose and face

General DL

Mono3D

Radar Perception

SLAM

Radar Perception

Reviews and Surveys

Beyond Perception in Autonomous Driving

Prediction and Planning

Annotation and Tooling

Low level DL

Early NLP papers

Non-DL

Technical Debt

To be organized (CVPR 2021 and ICCV 2021 the pile to be read)

TODO

learning-deep-learning's People

Contributors

patrick-llgc avatar yushen1116 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

learning-deep-learning's Issues

Regarding review of Learning Joint 2D-3D Representations for Depth Completion

Hi,I have written a review of the paper mentioned in the subject line.Would like to have your opinion about the paper as well as my review.
Enclosed is the review.
Joint 2D-3D representation for depth completion

1. A lot of survey has been done in the field of depth estimation,the paper uses features from both 2D(camera) and 3D(lidar) ,and fuses them to get a more sharper and cleaner estimate of the depth.
2. Compared to other existing traditional 3D depth estimation approaches,what separates this paper is it’s ability to learn a better representation without relying heavily on complex data and labels.
3. The key defining feature is the usage of 2D-3D convolutional blocks.This allows the network to learn in two separate feature domains.The first  branch learns features in 2D using convolutional layers.The learning of features in 3D is achieved with the help of using continuous convolutional neural networks.
4. Notice that both continuous constitutional neural nets and traditional CNNs  representing the output as a weight sum of all the neighboring features.However whereas CNNs assume that data can be represented as a grid(which makes finding neighbors easy),the same assumption doesn’t hold for point cloud data.
5. Because of the sparse nature of lidar data,the first step is to use a k nearest neighboring algorithm to determine all the neighbors.Those K nearest neighbors  along with the input features are then fed to a Multi layer perceptron  to get the kernel parameters.Finally,this kernel is convoluted with the input feature to get the final result.
6. The 2D features and 3D features are then concatenated to get the final output.To ensure that output feature dimension is consistent with the input dimension,a final convolution is applied.Skip connections between the input and output are also added to facilitate training.
7. Ablation studies show that when evaluating the algorithm on the basis of RMSE,the approach tends to perform satisfactory results.
8. However,certain questions still need to be answered,some of the prominent ones are as follows:
    1. many autonomous driving applications require that camera and lidar should be articulated.Will the architecture be able to estimate the depth with same accuracy in presence of varying camera roll and pitch angles
    2. Recent studies show that RELU networks  tend to suffer from several issues.For example,the very essence of deep learning relies on being able to express any non linear function provided that the network is deep enough.But in practice,it has been observed that the number of activation patterns that RELU can learn is less than the theoretical result.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.