
dinghanshen / swem


The TensorFlow code for the ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"

Python 65.83% Makefile 0.17% C++ 1.52% Jupyter Notebook 32.48%
deep-learning natural-language-processing representation-learning tensorflow

swem's People

Contributors

chunyuanli, dinghanshen


swem's Issues

Difference between the paper and the implementation

Thanks for sharing the code.
I have a question regarding the DBpedia experiment code.

https://github.com/dinghanshen/SWEM/blob/master/eval_dbpedia_emb.py#L96-L118

It seems to me that you are applying max-pooling (tf.nn.max_pool) followed by another max-pooling, while the paper says you applied average-pooling followed by max-pooling.
Also, the pooling window size (the max_len option) seems very large; it looks like plain max-pooling rather than hierarchical pooling.

Is this the intended implementation? Or am I missing something?
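For reference, the hierarchical pooling described in the paper (average-pool over each local window of word embeddings, then max-pool across the window averages) can be sketched as follows. This is a minimal numpy illustration, not the authors' TensorFlow code; the function name and the window size are illustrative.

```python
import numpy as np

def swem_hier(emb, win=5):
    """Hierarchical pooling sketch (SWEM-hier as described in the paper):
    average-pool each contiguous window of word embeddings (stride 1),
    then max-pool over the window averages.

    emb: (seq_len, dim) array of word embeddings
    win: local window size
    """
    seq_len, dim = emb.shape
    win = min(win, seq_len)
    # Average-pool every contiguous window of length `win`.
    avgs = np.stack([emb[i:i + win].mean(axis=0)
                     for i in range(seq_len - win + 1)])
    # Max-pool across windows, per embedding dimension.
    return avgs.max(axis=0)
```

Note that when `win` equals the sequence length this degenerates to plain average pooling, and when `win` is 1 it degenerates to plain max pooling, which is why a very large `max_len` window would effectively remove the hierarchical behaviour.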

Cannot find SWEM-hier

Hi, I can't seem to find the hierarchical encoder (SWEM-hier) mentioned in the paper. Very interested to see it :)

aver_emb_encoder

I don't think aver_emb_encoder is actually ever defined in the committed code base--though it's pretty straightforward to understand what it should be doing :-).
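Since the function is missing from the repo, here is a plausible reconstruction of what `aver_emb_encoder` should do: a padding-masked average over word embeddings. This is an assumption based on the paper's description of average pooling, written in numpy rather than the repo's TensorFlow; the argument names are guesses.

```python
import numpy as np

def aver_emb_encoder(x_emb, x_mask):
    """Assumed behaviour of the missing aver_emb_encoder:
    average word embeddings over real (non-padding) tokens.

    x_emb:  (batch, seq_len, dim) word embeddings
    x_mask: (batch, seq_len), 1.0 for real tokens, 0.0 for padding
    """
    summed = (x_emb * x_mask[:, :, None]).sum(axis=1)        # (batch, dim)
    counts = np.maximum(x_mask.sum(axis=1, keepdims=True), 1.0)
    return summed / counts                                   # (batch, dim)
```

The mask keeps padding tokens from diluting the average, and the `maximum(..., 1.0)` guards against division by zero for all-padding rows.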

Package-dependency errors occurring while loading snli_emb.py

When loading snli_emb.py with the current numpy==1.16.*, the following error occurs:

  File "/usr/local/lib/python2.7/dist-packages/numpy/lib/npyio.py", line 451, in load
    raise ValueError("Cannot load file containing pickled data "
ValueError: Cannot load file containing pickled data when allow_pickle=False

When I downgraded to numpy==1.14.*, a binary-compatibility error appeared instead:

ValueError: numpy.ufunc size changed, may indicate binary incompatibility. Expected 216 from C header, got 192 from PyObject

This looks like a numpy version-compatibility issue: the snli_emb.p pickle file was created with a different numpy version.

It would be really helpful if you could also mention the package versions you used for your imports (e.g., numpy, gensim, scipy).
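The first error above usually does not require a downgrade: numpy 1.16.3 changed the default of `np.load` to `allow_pickle=False` for security reasons, and passing `allow_pickle=True` restores the old behaviour for files you trust. A minimal sketch of the workaround (the file name is the one from the issue; whether the repo's loading code can simply be patched this way is an assumption):

```python
import numpy as np

def load_pickled(path):
    """Load a numpy file containing pickled (object) data.

    numpy >= 1.16.3 defaults to allow_pickle=False; opting back in
    works for trusted files such as the repo's own snli_emb.p.
    """
    return np.load(path, allow_pickle=True)
```

The binary-incompatibility `ufunc size changed` error on numpy 1.14, by contrast, typically means a compiled dependency (e.g. scipy or gensim) was built against a newer numpy, which is why pinning the full set of package versions, as the issue asks, is the robust fix.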

Experiment Setting on Section 4.1.1 in the paper

It seems that the repo does not contain code for Section 4.1.1 (Interpreting model predictions). I am really interested in that part; could you share it if you have time?

Also, I have some questions on the experiment setting of the section.
(1) Did you initialize the word embeddings randomly (as mentioned in the section), rather than from GloVe pretraining?
(2) What is your network architecture: Word_Embedding + MaxPooling + Classifier, or Word_Embedding + MaxPooling + MLP + Classifier?
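For concreteness, the second architecture the question describes (embeddings, max pooling, then an MLP before the classifier) can be sketched as a forward pass. This is only an illustration of the question's two options, not the paper's actual configuration; shapes and parameter names are made up.

```python
import numpy as np

def swem_max_mlp(emb, W1, b1, W2, b2):
    """Forward-pass sketch of Word_Embedding + MaxPooling + MLP + Classifier.

    emb: (seq_len, dim) word embeddings for one sentence
    W1, b1: hidden MLP layer parameters
    W2, b2: classifier layer parameters
    """
    pooled = emb.max(axis=0)                  # max-pool over words -> (dim,)
    hidden = np.maximum(pooled @ W1 + b1, 0)  # one ReLU MLP layer
    return hidden @ W2 + b2                   # class logits
```

Dropping the `hidden` line (feeding `pooled` straight into the classifier) gives the first option, Word_Embedding + MaxPooling + Classifier.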

About NASH

I am a first-year doctoral student at Beijing University of Posts and Telecommunications, China, majoring in computer science and technology. I have recently read your paper "NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing". It would make a really positive contribution to my work. I am wondering if you could kindly send me the source code of your experiments. I promise it will be used only for research purposes.
