Giter Site home page Giter Site logo

iphlouxmaster / recoprod Goto Github PK

View Code? Open in Web Editor NEW

This project forked from keshavramaswamy/recoprod

0.0 2.0 0.0 709 KB

Content based Recommender System which implements sentiment analysis(Naive Bayes,SVMs) on Amazon product reviews. Built in Python(Beautiful Soup,SciPy,NumPy,matplotlib),Java and RapidMiner

Python 86.07% Java 13.93%

recoprod's Introduction

" Content Recommender System "(Filmfundr)

About the Project


Filmfundr - a Content selection and reccomendation system which uses sentiment analysis techniques to provide the best types of content based on content demo prefernces. The techniques involved in Sentiment Analysis are used to classify the reviews by the sentiment of the words into positive or negative. Using the sentiment expressed in the words, opinions on any entity can be categorized into positive or negative.

The system uses the existing product reviews and sentiment classification of the reviews is carried out. RecoProd consists of an Information Retrieval component which extracts the reviews from the e-commerce websites using the product names as queries.

Sentiment Analysis algorithms like Naive Bayes and SVM (Support Vector Machine) are used to categorize the reviews and opinion scores are assigned to the reviews. A comparative study on the accuracy of the sentiment analysis algorithms used is also carried out. Aspect based summary of opinions for each product is carried out and visually compared. The products are displayed to the user.

The project is developed in Python,Java and RapidMiner.

Language(s) required:

  1. Python 2.7

  2. Java 7

Software(s) Required:

  1. RapidMiner 5

About the Dataset:

The dataset contains two directories - 'pos' and 'neg' . Each folder contains 1000 reviews each.

The dataset must be used to train the classifier model.

How To Run The Project:

  1. The Python files required to run the project are

    a.IR_Component_module1.py - Information Retrieval Component. Web scraper built using Beautiful Soup. Basic Front-End built using Tkinter.

    b.mod2.py - Sentiment Analysis Component with predictive model built in RapidMiner.

    c.rating.py - Sentiment Rating given for each product.

    d.module3_int.py - Aspect based Opinion Summarisation for each product.

    e.plotmod3inp.py - Aspect based Opinion Visual Summarisation using matplotlib and NumPy.

    f.mod4.py - Content based Recommender component using clustering based on Euclidian Distance.matplotlib used.

    g.new_gui.py - Final GUI built using Tkinter.

  2. The Java file required to run the project is:

    a.Sample.java - Integrates the Rapidminer processes and python files.

  3. Three RapidMiner processes are used. The respective XML codes are stored in testfiles are

    a.linsvmmodel.txt - Training the Classifier Model b.linsvmtest.txt - Classification for Module 2 c.mod3.txt - Classification for Module 3

  4. The Excelfiles and folders stored in the "Files" MUST not be deleted.

  5. The RapidMiner processes must be created using the XML codes and suitable editing must be made to execute them.

  6. The paths of the files created in Python must be suitably edited.

  7. The project can be run by executing the Java file:Sample.java

WARNING:A fast working internet connection is required. The folders and spreadsheets created before must not be deleted.

recoprod's People

Contributors

keshavramaswamy avatar iphlouxmaster avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.