Giter Site home page Giter Site logo

final-year-project's Introduction

Final Year Project

Introduction

This project deals with the analysis of characters of the book, predicting their attributes and relationships amongst them. During the adaptation of a novel/book in any form, there goes a lot of human effort, which proves to be cumbersome and unwanted. Also, the human brain tends to ignore several minor details about the happenings/characters in the book. The above forth mentioned scenario can often lead to inaccuracy in the plot of the adaptation. The project is therefore an innovation that aims at aiding an easy and accurate adaptation of a book, thus making the process a whole lot simpler and precise. The model aims at scanning the humongous amounts of texts present in the book. Post scanning, the model will show interesting insights which are derived from the given book by applying a variety of analytical techniques based on a combination of natural language processing, sentiment/emotion analysis, and social network analysis method

Repository Layout

Regarding the layout of the repository, the project has been organized into three folders:-

  1. Human Names Generator - This folder consists of notebooks which are responsible for generating a master CSV file of all possible human names.

    1. There are three CSV files, Indian-Female-Names.csv, Indian-Male-Names.csv and Foreign-Names.csv which have been used.
    2. The iPython Notebook Names List Generator.ipynb is responsible for generating the list of human names which are stored into a pickle file.
    3. The pickle file humanNames.txt is the output of this particular stint and is used in the further course of the project.
  2. Any Book - This folder performs analysis on an English novel, named Sense and Sensisbility.

    1. The file textFile.txt is the UTF-8 encoded version of the novel.
    2. The iPython Notebook Word CSV Generator.ipynb generates a CSV file words.csv which is used further in the course to perform analysis.
    3. The iPython Notebook Character List Generator.ipynb generates the list of main characters in the book and saves it in characterList.txt
    4. The iPython Notebook Word Cloud Generator.ipynb generates word cloud for any specified character The folder Word Clouds consists of results of Word Cloud Genrator.ipynb run on various characters.
    5. The iPython Notebook Sentiment Analysis.ipynb performs sentiment analysis, using NRC_emotion_lexicon_list.txt and generates visualisations to facilitate proper depiction of sentiment throughout the book.
  3. Mahabharata - This folder performs analysis on the epic Mahabharata.

    1. The folder data consists of the raw text data. It consists of all 18 books, which have been combined into mahafull.txt
    2. The iPython Notebook Word CSV Generator.ipynb generates a CSV file words.csv which is used further in the course to perform analysis.
    3. The iPython Notebook WordCloud For Any Character.ipynb generates word cloud for any specified character The folder Word Clouds consists of results of Word Cloud Genrator.ipynb run on various characters.
    4. The iPython Notebook Relation Generator For Any Character.ipynb generates the network for the specified characters, i.e. depicts the characters in the book with whom that particular character had encounters.
    5. The iPython Notebook Mahabharat Sentiment Analysis.ipynb performs sentiment analysis, using NRC_emotion_lexicon_list.txt and generates visualisations to facilitate proper depiction of sentiment throughout the book, as well as chap_with_emo_scores_normalized.csv which shows the degree of every emotion in every book.
    6. The iPython Notebook LDA_Mahabharata.ipynb performs LDA Analysis for topic modelling on Mahabharata. And generates an interactive output

Outputs

Sentiment Analysis For Mahabharata

Sentiment Analysis Mahabharata


Word Cloud for Lord Krishna


Topic Modelling Output for Mahabharata

Sentiment Analysis Mahabharata


Topic Modelling Interactive Output Screenshot

Sentiment Analysis Mahabharata


Sentiment Analysis for Sense and Sensibility

Sentiment Analysis Mahabharata


Character Cloud for Fanny from Sense and Sensibility

Tools

  1. Python - NLTK, spaCy, Seaborn, Matplotlib
  2. Algorithms - VADER, LDA
  3. Development Platform - Jupyter

Project Report

Report

final-year-project's People

Contributors

cruzer3008 avatar sadanand48 avatar

Watchers

 avatar  avatar

Forkers

vaibhav1seth

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.