
grelp-me's Introduction

Objective

Develop a model combining data from Yelp and Groupon to predict the long-term success of a business. The project will focus on business failure first by obtaining and analyzing the Yelp pages of thousands of closed businesses, then augmenting the training set with random samples of successful businesses from each category.
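As a rough illustration of the training-set construction described above, the sketch below labels every closed business as a failure and then draws a random sample of open businesses from each category. The function name `build_training_set`, the `per_category` parameter, and the input shapes are hypothetical and chosen only for this example; the project's actual sampling scheme is not specified here.

```python
import random


def build_training_set(closed, open_by_category, per_category, seed=0):
    """Combine closed businesses with per-category samples of open ones.

    closed: list of business ids known to have failed (label 1).
    open_by_category: dict mapping a category name to a list of ids of
        businesses that are still open (label 0).
    per_category: hypothetical cap on how many open businesses to draw
        from each category.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    rows = [(biz, 1) for biz in closed]
    for category in sorted(open_by_category):
        pool = open_by_category[category]
        k = min(per_category, len(pool))  # never ask for more than exists
        rows.extend((biz, 0) for biz in rng.sample(pool, k))
    return rows
```

Sampling within each category, rather than from the pooled set of open businesses, keeps large categories from crowding out small ones in the negative class.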

This project will also have a lighter side. We will develop a website called Grelp Me! that uses this combined model to predict whether a current Groupon deal is a good deal, and whether the company stands a chance of staying in business over the next few months.

Underneath, however, this project is really about the asymmetric web and walled gardens. Yelp hides failed businesses from its search, but search engines still scrape them. Groupon also does not make past deals available to the public. While Google claims it has over 2 million results for closed businesses in its index, it doesn’t allow scraping or even human access to more than 500 results.

The Data

Yelp's Business API returns an "is_closed" boolean flag indicating whether a business has permanently or temporarily closed. However, this flag is not exposed in the Search API. It is also impossible to search for closed businesses through the consumer-facing channels (web, mobile, etc.). Yelp even removes closed businesses from the general search within a year or two (source: Yelp forums). Fortunately, Yelp leaves the business information exposed if you know the URL or the Yelp business_id (which, it turns out, are actually the same; there goes a half day of work). It has not been easy (more on this later), but I’ve accumulated a list of 10,000 failed businesses and their Yelp ids from search engine results, mostly Google. Businesses that also use Groupon are obviously a smaller subset. However, the nature of the discount and the profitability of the deal can be estimated and tied to the prediction.
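To make the URL-to-id relationship concrete, here is a minimal sketch that pulls the business id out of a Yelp page URL and reads the "is_closed" flag from a Business API response body. It assumes business pages live under a `/biz/<id>` path and that the response is a JSON object with a top-level "is_closed" key. The helper names and the sample URL are hypothetical, and no network request is made.

```python
import json
from urllib.parse import urlparse


def business_id_from_url(url):
    """Return the path segment after /biz/ in a Yelp URL, or None.

    Assumes the /biz/<id> layout; query strings and fragments are ignored.
    """
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) >= 2 and parts[0] == "biz":
        return parts[1]
    return None


def is_closed(response_body):
    """Read the is_closed flag from a Business API response (JSON text).

    A missing flag is treated as "not closed", which is an assumption
    made for this sketch rather than documented API behavior.
    """
    record = json.loads(response_body)
    return bool(record.get("is_closed", False))
```

For example, `business_id_from_url("https://www.yelp.com/biz/example-cafe-springfield?osq=coffee")` yields the id that would then be passed to the Business API, and `is_closed` is applied to whatever body comes back.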

Groupon as an unsustainable model

"Sucking value out of the small business market will ultimately damage the local merchants that are the bread and butter of Groupon’s base. Groupon’s model is not sustainable. In a race to the bottom everyone drowns."--CEO, SpaBoom and CoverBoom, Bill Bice (Sep 20, 2012)

Groupon as a Hail Mary pass

"...if you’re going out of business (in which case the Groupon is a Hail Mary pass) then Groupon might be for you."--Ding Dong, Daily Deals Are Dead

References

There are hundreds of papers written on Groupon and Yelp, and even the influence of Groupon on Yelp reviews:
