Comments (3)
Hello @kannadaraj ,
update:** I was able to recreate the issue in a clean vm install, just trying to figure out what is causing it.
Thanks for bringing this issue to my attention. I'm trying to recreate it on my end. So far, cloning master branch, and running $ python train.py
seems to do what I expect. I was wondering if you are running with the default hyper parameters or using your own (if you are using your own, do mind letting me know which ones you changed to help me re-create the issue)?
I am asking because I've recently realized that running alternative hyper parameters causes exceptions similar to that. It doesn't check to see if the model is compatible. My plan is to do that check, then create a new model if it is not compatible. I will try to make that fix by tonight. So far this is my best guess as to what the problem is.
I'm going to admit I'm not confident that is the actual problem from reading your stack trace. I will also do a clean install of everything in a vm later today, just incase there is some sort of dependency issue I've missed. What OS are you running this on?
from neural-sentiment.
Hello @kannadaraj
I believe I've found the root of the problem. I built this with Tensorflow version 0.6.0, and the issue occurs in version 0.7.1. According to the release notes found here
The non-public nn.rnn and the various nn.seq2seq methods now return just the final state instead of the list of all states.
Unfortunately this means that the states can't be averaged over time anymore. I have committed a 'fix'. Master should be good to pull and run now. I also partially fixed the inability to change hyper parameters. I'm not sure how this will affect network performance and training, I'll have to test it out over the next few days.
from neural-sentiment.
Thanks a lot Dominik
from neural-sentiment.
Related Issues (14)
- logistics regression layer not properly implemented
- memory is blowing up
- padding all sequences to same length is inefficient HOT 1
- Missing util/hyperparams? HOT 2
- ValueError: setting an array element with a sequence HOT 1
- Dropout used incorrectly! HOT 4
- Problem with tensorflow version 0.12 head HOT 4
- Tensorflow version HOT 2
- checkpoint names HOT 2
- There is no restoreHyperParameters function in the sample.py HOT 6
- Code does not run with the latest Tensorflow version. HOT 1
- Training is not working? HOT 1
- Model not found...
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from neural-sentiment.