Giter Site home page Giter Site logo

autoencorder_simple's Introduction

Hi there! Amazing fellow human being ๐Ÿ‘‹

I am a data scientist and I love getting my hands ๐Ÿคš and feet ๐Ÿฆถ wet in problem infested โ˜  murky waters ๐Ÿšค of data, to unearth invaluable information and actionable insights ๐Ÿ’Ž.

What do I love?

  • ๐Ÿ Python
  • ๐Ÿค– Machine Learning, Deep Learning, Artificial Intelligence
  • ๐Ÿงฎ TensorFlow

Some of my publications

autoencorder_simple's People

Contributors

thushv89 avatar

Watchers

 avatar  avatar  avatar

Forkers

reiisky

autoencorder_simple's Issues

Errors while running optimize.minimize on laptop

Err1: Bad direction in the line search; refresh the lbfgs memory and restart the iteration.

Err2: Line search cannot locate an adequate point after 20 function and gradient evaluations. Previous x, f and g restored.
Possible causes: 1 error in function or gradient evaluation; 2 rounding error dominate computation.

Fix the issue with softmax classifier

Fixed the issue with the softmax classifier. The problem was, in the gradient descent I was doing this
delta[i] = y_vec[i]-actv[k] where y=k (y_vec is vectorized form of y, actv is current output activation vector for y)

This is WRONG! Correct implementation is
delta[i] = y_vec[i] - actv[i]

Disconnected gradient Issue THeano

theano.gradient.DisconnectedInputError: grad method was asked to compute the gradient with respect to a variable that is not part of the computational graph of the cost, or is used only by a non-differentiable operator: W1

can't multiply sequence by non-int of type 'float'

This issue was encountered in
updates = [
(param, param - l_rate*gparams)
for param, gparam in zip(self.theta,gparams)]

which should be
updates = [
(param, param - l_rate*gparam)
for param, gparam in zip(self.theta,gparams)]

If this error is occuring it's probably because you're trying to multiply a list by a float where you should multiply a scalar by a float

All the test cases get the same value - Iris dataset

The problem has been in the following code

for i in range(0, iter):
for idx in range(0, X.size[1]):
....
p_deriv_W2 = np.dot(delta3[:, None], np.transpose(a2[:, None]))
p_deriv_b2 = delta3

       delta_W2 = delta_W2 + p_deriv_W2
       delta_b2 = delta_b2 + p_deriv_b2

       self.W2 = self.W2 - (alpha * delta_W2) + (M * prev_delta_W2)
       self.b2 = self.b2 - (alpha * delta_b2) + (M * prev_delta_b2)

Having code like this inside training case loop causes delta_W2 to grow in very abnormal ways. The solutions is to have the following

       delta_W2 = p_deriv_W2
       delta_b2 = p_deriv_b2

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.