sharanyaa / nlp_naivebayes_textclassifier

Language: Python

A naive Bayes text classifier (nblearn.py, nbclassify.py) whose accuracy is compared with two popular machine learning toolkits, MegaM and SVM-Light. The project works with two datasets (emails and IMDB reviews) and performs binary classification on each: SPAM or HAM (not spam), and POSITIVE or NEGATIVE (sentiment analysis). Three machine learning techniques are compared on these tasks: naive Bayes classification, maximum entropy modeling (MegaM), and support vector machines (SVM-Light).

report.txt contains detailed instructions on the order in which to run the scripts and the command-line arguments to pass to each Python script.
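For readers unfamiliar with the technique, the following is a minimal sketch of multinomial naive Bayes with add-one smoothing, applied to a SPAM/HAM toy example. It is not the code from nblearn.py or nbclassify.py: the function names train and classify, the whitespace tokenization, the choice to skip unseen words, and the four toy documents are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def train(labeled_docs):
    """Count documents and words per label.

    labeled_docs is an iterable of (label, tokens) pairs (hypothetical format).
    """
    doc_counts = Counter()              # number of documents per label
    word_counts = defaultdict(Counter)  # per-label word frequencies
    vocab = set()
    for label, tokens in labeled_docs:
        doc_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return doc_counts, word_counts, vocab


def classify(model, tokens):
    """Return the label with the highest log posterior for the tokens."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training documents with this label.
        score = math.log(doc_counts[label] / total_docs)
        # Add-one smoothing: every vocabulary word gets one extra count.
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokens:
            if tok in vocab:  # assumption: words never seen in training are skipped
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data, invented for this sketch.
toy = [
    ("SPAM", "win cash now".split()),
    ("SPAM", "cheap cash offer".split()),
    ("HAM", "meeting at noon".split()),
    ("HAM", "lunch at noon tomorrow".split()),
]
model = train(toy)
print(classify(model, "cash offer now".split()))
```

Working in log space avoids numeric underflow when many small probabilities are multiplied, which matters for long documents such as movie reviews.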

Python 100.00%

nlp_naivebayes_textclassifier's People

Contributors

sharanyaa

nlp_naivebayes_textclassifier's Issues

A use example would be nice.

Interesting program. A usage example in the README would help speed up testing. If such examples already exist elsewhere, please close this ticket.

Oops, OK, I found it. It's in report.txt. Never mind. Thanks! Closing ticket.

How to run on Linux?

Ubuntu 14.04

Getting an error when trying to install:

speed@speed-desktop:~/Downloads/nlptxt$ sudo pip3 install https://github.com/sharanyaa/nlp_naivebayes_textclassifier
Downloading/unpacking https://github.com/sharanyaa/nlp_naivebayes_textclassifier
Downloading nlp_naivebayes_textclassifier (unknown size): 16kB downloaded
Cleaning up...
Exception:
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/pip/basecommand.py", line 122, in main
    status = self.run(options, args)
  File "/usr/lib/python3/dist-packages/pip/commands/install.py", line 278, in run
    requirement_set.prepare_files(finder, force_root_egg_info=self.bundle, bundle=self.bundle)
  File "/usr/lib/python3/dist-packages/pip/req.py", line 1198, in prepare_files
    do_download,
  File "/usr/lib/python3/dist-packages/pip/req.py", line 1376, in unpack_url
    self.session,
  File "/usr/lib/python3/dist-packages/pip/download.py", line 582, in unpack_http_url
    unpack_file(temp_location, location, content_type, link)
  File "/usr/lib/python3/dist-packages/pip/util.py", line 645, in unpack_file
    and is_svn_page(file_contents(filename))):
  File "/usr/lib/python3/dist-packages/pip/util.py", line 211, in file_contents
    return fp.read().decode('utf-8')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte

Storing debug log for failure in /home/speed/.pip/pip.log
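
A possible reading of the error above, offered as a hedged sketch rather than a confirmed diagnosis: the byte 0x8b at position 1 matches the second byte of the gzip magic number (0x1f 0x8b), which suggests pip received gzip-compressed data from the repository page URL and tried to decode it as UTF-8 text. The snippet below reproduces the same message with Python's standard gzip module; the payload contents are invented for illustration.

```python
import gzip

# Compress arbitrary bytes; any gzip stream starts with the magic number 0x1f 0x8b.
payload = gzip.compress(b"example archive contents")
print(payload[:2])

message = None
try:
    # Decoding compressed bytes as UTF-8 fails on the second byte (0x8b).
    payload.decode("utf-8")
except UnicodeDecodeError as err:
    message = str(err)
    print(message)
```

If that reading is right, the URL passed to pip points at a web page rather than an installable package archive. The repository description lists standalone scripts, so downloading the files and following report.txt may be the intended route.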
