Giter Site home page Giter Site logo

perceptanalyze's Introduction

PerceptAnalyze

This Repo focusses on Data Extraction and Network Analysis using Google 2 gram Dataset

Google N-gram dataset link http://storage.googleapis.com/books/ngrams/books/datasetsv2.html

Requirements

NOTE (Your files and code should be in the same folder or place)

Python Installed on PC preferably(Python 3.5 or Python 3)

Two .csv perceptual data files for example here they are flavornetPercepts.csv & superscentPercepts.csv

For Python 2.7

Change urllib.request to urllib everywhere in the program (As it was shifted in Python 3)

Remove encoding="utf8" everywhere in the program

For running script.py (Data extraction)

Open Command Prompt

cd to the loaction of code and flavornetPercepts.csv , superscentPercepts.csv files

type "python script.py"

THE CODE WILL START

The errors and information will be logged in sample.log file which will be automatically created in the same folder

The final output in JSON format for flavornetPercepts will be in json.txt and for superscentPercepts will be in newjson.txt

For running graph.py (Data Analysis)

Run this after running script.py

Additional Requirements/Dependecies

pip install networkx

Two .csv perceptualEdges files i.e in this case edgesFlav.csv and edgesSuperSc.csv

Also Added newjson.txt and json.txt for Reference

Open Command Prompt

cd to the loaction of code and PerceptualEdges files

type "python graph.py"

THE CODE WILL START

The final output similarity comparing Superscent and Flavournet will be printed on screen

Final Results Obtained were

Similarity[0,inf) in Flavournet Graphs is 23903599.9347 Similarity[0,inf) in Superscent Graphs is 38827186.0992

For running newscript.py (Data extraction for Stem Words)

You Will need word.csv file

STEM WORDS are words whose all kind of occurances we want to find like taint* will include taint,tainted,tainting......

Open Command Prompt

cd to the loaction of code and word.csv file

type "python newscript.py"

THE CODE WILL START

The errors and information will be logged in log.out file which will be automatically created in the same folder

The final output in JSON format for word will be in newjson.txt

perceptanalyze's People

Contributors

rijuldhir avatar riteshcanfly avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.