Giter Site home page Giter Site logo

hotel-reviews's Introduction

Hotel-Reviews

Suggest-Inn, a hotel recommender that finds the best valued hotel for the lowest price.

www.suggest-inn.com

Motivation

According to Michele Walters, Co-Founder of Origin World Labs, hotel data science seems to have a plethora of untapped, solvable problems.

(article by Michele Walters)

"This shortage will hit the hospitality industry especially hard as it tends to be at the bottom of the totem pole for attracting analytical and technical talent. Unfortunately, hospitality is still perceived as an industry where soft skills are overwhelmingly more important than hard skills."

"Companies such as Marriott and Disney have realized that hospitality is a data-intensive business and that there is a wealth of creative strategies and tactics that can be found when the data is analyzed by professionals. Yet, even for these big brands the single biggest obstacle to making data-driven progress is their inability to find enough qualified talent to fill their analytics positions."

Data Sources:

Anonymized hotel_id data was obtained by Professor Hongning Wang.

He and co-authors have accompanying machine learning papers discovering latent aspects in the rating. Overview of data: 2232 Hotels, 37181 Reviews, 34187 Reviewers, 96.5 Avg Len, [3.92-1.23, 3.929+1.23] Rating.

Data page

"Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach", paper, slides

"Latent Aspect Rating Analysis without Aspect Keyword Supervision", paper, slides

Model

K-Means

Clustering on four aspect ratings Value, Room, Location, Cleanliness (the features)

from the table H_normed in the database app.db available in the web-app folder.

Pipeline

Store data from [Data page] into SQL databases. Group ratings by hotel_id and average the predictions for the aspect ratings.

Future work

  1. Scrape hotel data from Yelp using their API and have non-anonymized data.
  2. Create LARA, a combination of regression and maximum likelihood estimation, code for python to get prediction for the four aspects listed above in Model.
  3. Use word2vec on the keywords extracted from LARA to build an alternative model for unsupervised learning.

hotel-reviews's People

Contributors

rkw0k avatar

Stargazers

Alex Canizales avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.