Giter Site home page Giter Site logo

hindi_stories_author_identification's Introduction

Hindi_stories_author_identification

Pre-reqistis

  • Windows OS specific:
  • Microsoft Excel
  • Text Editor(preferably Notepad)
  • Python 3.0+
  • Jupiter Notebook
  • Weka Machine Learning Tool.

Basic Setup

  • Follow this link to install Python along with Jupiter notebook in your system.
  • Follow this link to download Weka machine learning tool.For further assistence refer to this video.

Instructions

  1. Clone the repo or download it in zip format.
  2. If you followed the link to install Jupiter Notebook as mentioned above.Place the above code folder in opencv/scripts.
  3. Else place the folder wherever your Jupiter notebook Files are being saved.
  4. Change the directory to where your text is present and run the code.
  5. Once fetures for the whole corpus of an author are extracted, import the txt files like conjuction.txt etc to excel files.
  6. Refer to excel file above for viewing the name of features along with names of autor's stories.
  7. Convert these excel files into csv files.
  8. Convert thse files to arff format using this link in weka.
  9. once done load the arff files in explorer section and choose J48 in classifier section.
  10. Go to Select attributes section and click on start.Form a new dataser using these attributes.
  11. Run different algorithms on this dataset for determining the best algorithm.

hindi_stories_author_identification's People

Contributors

mauryapari avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.