Giter Site home page Giter Site logo

pbusienei / conference_helper Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 1.33 MB

This is Nashville Analytics Summit conference helper in searching for the top three sessions of your interest.

License: MIT License

Jupyter Notebook 98.76% Python 1.24%

conference_helper's Introduction

Nashville Analytics Summit conference helper

Overview

Since its inception in 2013 Nashville Analytics Summit event is held once a year in the months of September-October. There are dozens of talks covering analytics, data science, AI, career growth, new tools, specific industries and more. It has seen tremendous growth in sessions and participation increase.

The problem

The recent exponential need for data and its solutions, companies are looking forward to ways to leverage data in their organizations in order meet their business objectives. This has made it challenging for participants to navigate so many sessions and presentations at Nashville Analytics Summit. Therefore, if you are thinking of attending, Nashville Analytics Summit which talks might be of interest to you? Using a sentence similarity model, we'll pick out the top 3 talks for a participant based on their description of what they hope to get out of the Session/Talk.

semantic search

This project utilizes Transformers sentence similarity model multi-qa-MiniLM-L6-cos-v1 to search for abstracts that have cosine similarity that matches the search query.The user will input a phrase of or a word of interest and the model will search for similarity contexts that matches the phrase and output the top 3 sessions that matches the phrase using Transformers model semantic search over conference abstracts.

The model

Semantic search models embed all the entries in the corpus (the abstracts) into a vector space. When the user inputs the query, its embed into the same vector space and the closest embeddings from the corpus (abstract) are found. Hence, the entries with high semantic overlap with query are ranked (high similarity at the top).

Cosine Similarity

Two types of Semantic Search:

  1. Symmetric - query and entries in corpus are about the same length and have same amount of content
  2. Asymmetric - short query and output longer paragraphs
    This project uses Asymmetric semantic search.

Huggingface spaces

Please take a look at my spaces at Huggingfaces

Model Card

Model card is found here

Critical Analysis

  • speech-to-text: Create a trained text to speech app using Speech2Text Models in conjunction with Wav2Vec2 (speech translation)
  • domain-specific: Using domain specific may increase the accuracy of the model
  • Different-models Trying out different models (models tuned for dot-product) that prefers retrieval of longer documents

Code Demonstration

To run the streamlit app install streamlit in your machine

From command line/shell navigate to the abstract-search folder

Run the command below:

streamlit run app.py

This will open a web interface in your browser like the one below;

Deploying the Streamlit App

Streamlit App

Accessing the app locally

Local URL: http://localhost:8501 Network URL: http://10.76.227.68:8501

Or click the link that was provided once you run the streamlit in your command line You can enter your search query in the text box given and you will get your top three Sessions of interest.

Summit event 2022

semantic search

2022 Analytics Summit will be held on Wednesday Sep 28, 2022 and Thursday Sep 29, 2022. See you there!

Link to the Recording

Resource Links

conference_helper's People

Contributors

pbusienei avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.