
rid17pawar / semantic-search-model-experiments


Experiments in semantic search using the BM-25 algorithm and Mean of Word Vectors, along with the state-of-the-art Transformer-based models USE and SBERT.

Jupyter Notebook 100.00%
bm25 fasttext fasttext-embeddings glove glove-embeddings information-retrieval sbert semantic-search universal-sentence-encoder word2vec

Semantic-Search-Model-Experiments

Dataset Used For Semantic Search / Information Retrieval:

CISI Dataset - Kaggle

Experiments:

Experiment-1. Using BM-25 Algorithm and Parameter Tuning For Semantic Search

BM-25 Algorithm variations used:

  • BM25Okapi
  • BM25L
  • BM25Plus

Result:

BEST MODEL: BM25Plus
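
To illustrate the kind of scoring and parameter tuning this experiment refers to, here is a minimal pure-Python sketch of Okapi BM25. It is not the notebook's code: the function name `bm25_okapi_scores`, the toy corpus, the whitespace tokenizer, the smoothed IDF form, and the values `k1=1.5` and `b=0.75` are all assumptions chosen for illustration. The variant names listed above match the classes exposed by the `rank_bm25` Python package, which may be what the notebook uses.

```python
import math
from collections import Counter


def bm25_okapi_scores(query_tokens, corpus_tokens, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25.

    k1 controls term-frequency saturation and b controls length
    normalization; both are the parameters one would tune.
    """
    n_docs = len(corpus_tokens)
    avgdl = sum(len(doc) for doc in corpus_tokens) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in corpus_tokens for term in set(doc))

    scores = []
    for doc in corpus_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            # Smoothed IDF (assumption: the "+ 1" keeps it non-negative).
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical toy corpus, tokenized by whitespace.
corpus = [
    "information retrieval with probabilistic ranking",
    "library cataloguing and indexing practice",
    "ranking documents for information retrieval queries",
]
tokenized = [doc.split() for doc in corpus]
query = "information retrieval ranking".split()

scores = bm25_okapi_scores(query, tokenized)
# Document indices ordered from most to least relevant.
ranking = sorted(range(len(corpus)), key=lambda i: scores[i], reverse=True)
```

Tuning would then amount to re-running the ranking over a grid of `k1` and `b` values and comparing the retrieval quality on the labelled queries.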

Experiment-2. Using Mean of Word Vectors (MWV) with Pretrained Embeddings For Semantic Search

Pretrained embeddings used:

  • word2vec
  • GloVe
  • FastText

Result:

BEST MODEL: word2vec
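
The Mean of Word Vectors approach can be sketched as follows: each text is represented by the average of its word embeddings, and documents are ranked by cosine similarity to the query vector. This is a minimal illustration, not the notebook's code. The 3-dimensional `EMBEDDINGS` table is invented for the example and stands in for pretrained word2vec, GloVe, or FastText vectors; the helper names `mean_vector`, `cosine`, and `rank` are hypothetical.

```python
import math

# Toy embeddings invented for illustration; a real run would load
# pretrained vectors with hundreds of dimensions.
EMBEDDINGS = {
    "search": [0.9, 0.1, 0.0],
    "retrieval": [0.8, 0.2, 0.1],
    "document": [0.7, 0.3, 0.0],
    "cooking": [0.0, 0.1, 0.9],
    "recipe": [0.1, 0.0, 0.8],
}


def mean_vector(tokens, embeddings):
    """Average the vectors of all in-vocabulary tokens; None if there are none."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs, embeddings):
    """Return (similarity, doc_index) pairs sorted from best to worst match."""
    q = mean_vector(query.split(), embeddings)
    scored = []
    for i, doc in enumerate(docs):
        d = mean_vector(doc.split(), embeddings)
        sim = cosine(q, d) if q is not None and d is not None else 0.0
        scored.append((sim, i))
    return sorted(scored, reverse=True)


docs = ["document retrieval", "cooking recipe"]
ranked = rank("search", docs, EMBEDDINGS)
```

Out-of-vocabulary words are simply skipped here, which is one common choice; FastText can instead build vectors for unseen words from subword units.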

Experiment-3. Using LDA Topic Modelling For Semantic Search

Result:

Performs worse than BM-25.

Experiment-4. Using Universal Sentence Encoder (USE) For Semantic Search

USE Model variations used:

  • Transformer Encoder
  • Deep Averaging Network (DAN) Encoder

Result:

BEST MODEL: USE-Transformer

Experiment-5. Using Pretrained and Finetuned Sentence Transformers (SBERT) For Semantic Search

Result:

BEST MODEL: Finetuned SBERT

Final Result:

Overall Best Model: Finetuned SBERT
