ntmc-community / awesome-neural-models-for-semantic-match Goto Github PK

View Code? Open in Web Editor NEW

772.0 53.0 125.0 162 KB

A curated list of papers dedicated to neural text (semantic) matching.

License: MIT License

Python 24.23% Ruby 3.98% HTML 71.79%

deep-learning semantic-matching neu-ir information-retrieval text-similarity question-answering

awesome-neural-models-for-semantic-match's Introduction

Awesome Neural Models for Semantic Match

_{A collection of papers maintained by MatchZoo Team.}
_{Checkout our open source toolkit MatchZoo for more information!}

Text matching is a core component in many natural language processing tasks, where many task can be viewed as a matching between two texts input.

Where s and t are source text input and target text input, respectively. The psi and phi are representation function for input s and t, respectively. The f is the interaction function, and g is the aggregation function. More detailed explaination about this formula can be found on A Deep Look into Neural Ranking Models for Information Retrieval. The representative matching tasks are as follows:

Tasks	Source Text	Target Text
Ad-hoc Information Retrieval	query	document (title/content)
Community Question Answering	question	question/answer
Paraphrase Identification	string1	string2
Natural Language Inference	premise	hypothesis
Response Retrieval	context/utterances	response
Long Form Question Answering	question+document	answer

Healthcheck

pip3 install -r requirements.txt
python3 healthcheck.py

awesome-neural-models-for-semantic-match's People

Contributors

Stargazers

Watchers

Forkers

wqh17101 dextercoder karan2k hxyshare syjbupt liujian19911023 gokunwu 0x01111 breadsh zhouyonglong pankajmehar pilgrim2go zuiwufenghua tcxdgit brittneygogogo leephan kevinking lixinsu ch812248495 hurmean curiouscowboy allensmile akhileshydv howardchenhd cuiyi0501 nipengmath nlpformyself ramdhanoriya ymingzhu shubhampachori12110095 qgzang lazuraslong hhh920406 lindgew sabirdvd cxncu001 wuxiaobo charlottesean pli76 cyzhangathit nonva jinsongpan xiaodanjiao magicalchao souvikdgp16 tangzy7 lenepalu herobring yunhenk wangzhen-nlp chetannitk ssgalitsky miracle24 flamato jkcha71 paul0m dh434 amallia jiniaoxu strategist922 fuxianghua morindaz learningfish nidhoggurz srihari-palivela githubmyk sudhu26 c00h00g surefirelin hitum-dev jieli4970 kaelchen zjms karndeb yucoian fishredleaf rothsword rishistyping abelpy1 datasci-rigo loveningbo nghuyong datafields-team markwjj luzhongqiu hccngu qingyaoai hate-deadline jivnesh horsedongmin canyuchen suchana34 xaiocaibi dalek-who dyuyang yasasdy eddiebarry chenlu19 xiaodeng-1 qianrenjian

awesome-neural-models-for-semantic-match's Issues

End-to-End Retrieval in Continuous Space

https://arxiv.org/pdf/1811.08008.pdf

ESIM model

Enhanced LSTM for Natural Language Inference

new push to response retrieval

add CONV-KNRM

CONV-KNRM is a paper published in WSDM 2018.

NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval

https://arxiv.org/pdf/1810.12936.pdf

Add HiNT from SIGIR 2018

Implement the HiNT model from SIGIR 2018 paper.

Add MIX from KDD 2018

MIX: Multi-Channel Information Crossing for Text Matching
website: http://www.kdd.org/kdd2018/accepted-papers/view/mix-multi-channel-information-crossing-for-text-matching

pdf: http://delivery.acm.org/10.1145/3220000/3219928/p110-chen.pdf?ip=89.146.52.65&id=3219928&acc=OPENTOC&key=4D4702B0C3E38B35%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35%2E054E54E275136550&__acm__=1534239354_8f9980fa3cf636970710270d04a0a46f

Together We Stand: Siamese Networks for Similar Question Retrieval

SCQA

update Response retrieval to add recently published models and datasets

update new models and datasets recently published

Recommandation (unsupervised/low ressource text alignment)

Hey guys,

Good work both on MatchZoo and this list!
I would be interested in quick advices/pointers on something related: I'd like to match related parts of texts.

More formally, I've a document, made of different sections (each with multiple sentences), and I'd like to map it to a similar text (transcription in fact), which is a bit longer, with some noise but talk about the same thing (lot of similarities) and in the same order. I made a dynamic programming algorithms which maximize a cosine similarities between sentence embeddings. Results aren't too bad, but I'd like to experiment other stuff.

Any idea?

Thanks a lot for any clue / references that seems relevant. We could discuss through gitter.im as well.

Paul

I've not much gold data (i.e. suitable segments to be training pairs), which is why I mention unsupervised/low ressources).

A Decomposable Attention Model for Natural Language Inference

Here is an important work for NLI task, which can be added:
A Decomposable Attention Model for Natural Language Inference [EMNLP 2016, by Google]
open code
please consider it

Add Siamese-LSTM

for sentence level & character level similarity.

DeepTileBars: Visualizing Term Distribution for Neural Information Retrieval

https://arxiv.org/pdf/1811.00606.pdf

NAACL 2018/2019 relevant papers

NAACL 2018:
DeepAlignment: Unsupervised Ontology Matching With Refined Word Vectors
http://www.dit.unitn.it/~pavel/OM/articles/Kolyvakis_N18.pdf
DR-BILSTM: DEPENDENT READING BIDIRECTIONAL LSTM FOR NATURAL LANGUAGE
https://arxiv.org/pdf/1802.05577.pdf
Learning to Disentangle Interleaved Conversational Threads with a Siamese Hierarchical Network and Similarity Ranking
https://www.aclweb.org/anthology/N18-1164
Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering
https://arxiv.org/pdf/1710.03430.pdf

NAACL 2019:
A Complex-valued Network for Matching
https://arxiv.org/pdf/1904.05298.pdf
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
https://arxiv.org/pdf/1810.08854.pdf

AAAI 2019 relevant papers

“match”

Yang, Xiao, et al. "Adversarial training for community question answer selection based on multi-scale matching."
https://arxiv.org/abs/1804.08058

Kim, Seonhoon, et al. "Semantic sentence matching with densely-connected recurrent and co-attentive information."
https://arxiv.org/abs/1805.11360

Zhao, Boming, et al. "Preference-Aware Task Assignment in On-demand Taxi Dispatching: An Online Stable Matching Approach." (2019).
https://www.tik.ee.ethz.ch/file/5c75a1f030e8d090d46ef165f1805d34/aaai19-zhao.pdf

Tang, Min, Jiaran Cai, and Hankz Hankui Zhuo. "Multi-Matching Network for Multiple Choice Reading Comprehension." (2019).
http://xplan-lab.org/Paper_PDF/AAAI-19.pdf

Lai, Yuxuan, et al. "Lattice CNNs for Matching Based Chinese Question Answering." arXiv preprint arXiv:1902.09087 (2019).
https://arxiv.org/abs/1902.09087

Zhang, Kun, et al. "DRr-Net: Dynamic Re-read Network for Sentence Semantic Matching." (2019).
http://staff.ustc.edu.cn/~cheneh/paper_pdf/2019/Kun-Zhang-AAAI.pdf

“retrieval”

Tang, Zhiwen, and Grace Hui Yang. "Deeptilebars: Visualizing term distribution for neural information retrieval."
https://arxiv.org/abs/1811.00606

“answer”

Hu, Minghao, et al. "Read+ verify: Machine reading comprehension with unanswerable questions."
https://arxiv.org/abs/1808.05759

Yang, Xiao, et al. "Adversarial training for community question answer selection based on multi-scale matching.
https://arxiv.org/abs/1804.08058

Pang, Liang, et al. "HAS-QA: Hierarchical Answer Spans Model for Open-domain Question Answering." arXiv preprint arXiv:1901.03866 (2019).
https://arxiv.org/abs/1901.03866

(not sure)
Answer Identification from Product Reviews for User Questions by Multi-‐task Attentive Networks Long Chen (Northwest University of China)*; Ziyu Guan (Northwest University); Wei Zhao (Xidian University); Wanqing
http://web.cse.ohio-state.edu/~sun.397/docs/AAAI19_ProdQA.pdf

For text match problem, what is the different between question-question match and question-answer match?

I know question-question match is a text similarity problem.
What about question-answer match or question-doc match? It is used in information retrieval.
question-question match is indeed text similarity. But how do you define question-answer similarity?
Thank you!!