wecmodel's Introduction

WECModel

An attempt to implement Shen, Yikang, et al. "Word Embedding Based Correlation Model for Question/Answer Matching." AAAI. 2017.
Training data: Yahoo! L4 (Yahoo! Answers Manner Questions) and L6 (Yahoo! Answers Comprehensive Questions and Answers, part 1).
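The core idea of a word-embedding-based correlation score can be sketched roughly as below. This is only an illustration, not the code in this repository: it assumes the word-level correlation is the cosine between a question word vector and an answer word vector transformed by a matrix M, and it replaces the sentence-level model with a simple max-then-mean pooling. All function names here are hypothetical.

```python
import math


def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def correlation_matrix(question_vecs, answer_vecs, M):
    """Word-level correlations: entry [i][j] = cos(q_i, M a_j)."""
    transformed = [matvec(M, a) for a in answer_vecs]
    return [[cosine(q, t) for t in transformed] for q in question_vecs]


def match_score(question_vecs, answer_vecs, M):
    """Illustrative pooling (an assumption, not the paper's sentence model):
    take the best-matching answer word per question word, then average."""
    if not question_vecs or not answer_vecs:
        return 0.0
    corr = correlation_matrix(question_vecs, answer_vecs, M)
    best_per_question_word = [max(row) for row in corr]
    return sum(best_per_question_word) / len(best_per_question_word)
```

In a trained model M would be learned from question/answer pairs; with the identity matrix the score reduces to plain embedding cosine similarity.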

Prerequisites

Python 3.6 and 2.7
Java 1.8.0
TensorFlow (runs on Python 3.6)
word2vec (runs on Python 2.7)
THUTag (runs on Linux)
Note: I plan to replace the heavy THUTag dependency with a lightweight Python script later.

Overview

Manual

THUTag

java -Xmx3G -jar tagsuggest.jar train.TrainWEC --input=../traindata/YahooPostL6-1-[time].dat --output=/mnt/hgfs/Data/thu-tag-workspace/trainWEC7 --config="dataType=KeywordPost;para=0.5;minwordfreq=10;mintagfreq=10;selfTrans=0.2;commonLimit=2"
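The --config value in the command above is a semicolon-separated list of key=value pairs. A small helper like the following could assemble the command from Python; this is a sketch, the function names are hypothetical, and the option keys are simply copied from the command above without any claim about what THUTag does with them.

```python
def build_config(options):
    """Join options into the 'key=value;key=value' form used by --config."""
    return ";".join(f"{key}={value}" for key, value in options.items())


def build_train_command(input_path, output_path, options, heap="3G"):
    """Return the THUTag training command as an argument list
    (suitable for subprocess.run, which needs no shell quoting)."""
    return [
        "java",
        f"-Xmx{heap}",
        "-jar",
        "tagsuggest.jar",
        "train.TrainWEC",
        f"--input={input_path}",
        f"--output={output_path}",
        f"--config={build_config(options)}",
    ]


# Option values taken from the command shown in this README.
WEC_OPTIONS = {
    "dataType": "KeywordPost",
    "para": 0.5,
    "minwordfreq": 10,
    "mintagfreq": 10,
    "selfTrans": 0.2,
    "commonLimit": 2,
}
```

Passing the resulting list to subprocess.run(cmd, check=True) would launch the training, assuming java and tagsuggest.jar are available in the working directory.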

Performance

Currently 48% under the DAG@1 evaluation method, without using negative labels.


songrb / wecmodel
