Comments (8)

amirmohammadkz commented on May 24, 2024

Hello,
Thanks for using our repository. The n_job parameter controls parallel execution: -1 means it uses all of your processors to get the result as fast as possible. If you change it, you reduce the speed of the classifier, which is why your run takes much longer than 7 minutes.
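To illustrate the point about parallelism, here is a minimal sketch assuming the parameter is passed through to scikit-learn's `n_jobs` argument on a bagging ensemble of SVCs. The synthetic data, the `build_bagging_svc` helper, and the number of estimators are assumptions made for the example, not the repository's actual code.

```python
import os

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.svm import SVC

# Synthetic stand-in data (assumption); the repository's real features are not used here.
X, y = make_classification(n_samples=200, n_features=20, random_state=0)


def build_bagging_svc(n_jobs):
    """Hypothetical helper: a bagging ensemble of SVCs with a given n_jobs."""
    # n_jobs=-1 asks scikit-learn to use all available CPU cores;
    # n_jobs=1 fits the base estimators one after another.
    return BaggingClassifier(SVC(), n_estimators=8, n_jobs=n_jobs, random_state=0)


parallel_model = build_bagging_svc(-1).fit(X, y)
serial_model = build_bagging_svc(1).fit(X, y)

print("CPU cores available:", os.cpu_count())
print("Estimators fitted:", len(parallel_model.estimators_), len(serial_model.estimators_))
```

Both models fit the same number of base estimators; only the way the work is spread across cores differs, so lowering the value mainly costs wall-clock time.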

I just tested the whole code again on a new laptop, using Python 3.7 with a fresh virtual environment and all the requirements installed, and I did not run into that issue. Are you using something different (different versions, or another distribution of Python such as Anaconda)?

from personality_detection.

Arkhemis commented on May 24, 2024

Hi,

Just faced the same issue, with Python 3.7.6 and pip env. I'll change the n_job parameter to see how it goes.

EDIT: OK, apparently the issue is caused by IPython, so basically by PyCharm. When running the script in Windows PowerShell with the n_job parameter unchanged (so still -1), it does not encounter any issues (I'm running it right now and will confirm once it's done).

Arkhemis commented on May 24, 2024

A quick update: the script still hasn't finished running (it's been ~10h now), despite a more than adequate CPU (i7-4790K, 4 GHz). What could explain this?

amirmohammadkz commented on May 24, 2024

Hi,
Are you getting the accuracy as printed output?
The pushed version of the model predicts all 5 traits and uses cross-validation for each of them to report the final result, so the final result is ready only once all of that has finished. If you want to test it faster, use a single CV fold in the training loop instead of the full 10-fold cross-validation. Let me know if the problem persists.
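The suggestion of running one fold instead of all ten could look roughly like the sketch below. It assumes a scikit-learn `StratifiedKFold` splitter and a plain `SVC` on synthetic data; the `evaluate` function and its `n_folds_to_run` argument are hypothetical names for the example and do not come from the repository's training loop.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold
from sklearn.svm import SVC

# Synthetic stand-in data (assumption), playing the role of one trait's labels.
X, y = make_classification(n_samples=300, n_features=20, random_state=0)


def evaluate(X, y, n_folds_to_run):
    """Hypothetical helper: score only the first n_folds_to_run of 10 folds."""
    splitter = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
    scores = []
    for fold_index, (train_idx, test_idx) in enumerate(splitter.split(X, y)):
        if fold_index >= n_folds_to_run:
            break  # stop early: the remaining folds are never trained
        model = SVC().fit(X[train_idx], y[train_idx])
        scores.append(model.score(X[test_idx], y[test_idx]))
    return scores


full_scores = evaluate(X, y, 10)   # full 10-fold cross-validation
quick_scores = evaluate(X, y, 1)   # single fold: about a tenth of the training work

print("10-fold mean accuracy:", sum(full_scores) / len(full_scores))
print("single-fold accuracy:", quick_scores[0])
```

The single-fold score is a noisier estimate than the 10-fold mean, which is the trade-off for the shorter run.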

Arkhemis commented on May 24, 2024

I was actually using my own dataset (in which each line, i.e. each user, has a specific personality and the related Mairesse CSV features) and followed all the steps in the README.

This file is ~4000 lines long, so roughly double the size of essays.csv. When running python svm_result_calculator.py with it, I did get some accuracy output, but only every 4 hours or so. My file having just twice as many rows as essays.csv should not explain such a large increase in waiting time.

As of now, I'm running the standard steps (so using essays.csv) to see how long that takes, and whether the slowdown is due to my CSV or to some other common problem.

Arkhemis commented on May 24, 2024

A quick update: the problem seems to remain, even with essays.csv. The process seems faster (I'm seeing more printed output of the accuracy and the Bagging SVC, around 5 minutes per Bagging SVC), but it's been 1 hour since I started the script, and it is still running.

EDIT: I just finished the first Y, so for the 5 traits it should take around ~7h. To be honest, I do not understand why it is taking so long.

amirmohammadkz commented on May 24, 2024

When you say around 5 minutes per bagging SVC, that means the model is working fine.
We use 10-fold cross-validation so that our results are more reliable. If you do not need that, just pick one of the CV folds as the test set and use the rest for training. You can also change the bagging SVM to a regular SVM; you may lose a bit of accuracy, but not much, if you really need the speed. Finally, you can also check out our newer model:
https://github.com/yashsmehta/personality-prediction
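As a rough illustration of swapping the bagging SVM for a regular SVM, the sketch below trains both on one hold-out split of synthetic data using scikit-learn. The data, the split size, and the number of estimators are assumptions for the example; the accuracy difference on the real features may well be different.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Synthetic stand-in data (assumption); one 90/10 split replaces the 10-fold loop.
X, y = make_classification(n_samples=300, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.1, stratify=y, random_state=0
)

models = {
    # Ensemble of 10 SVCs, each trained on a bootstrap sample: slower to fit.
    "bagging_svc": BaggingClassifier(SVC(), n_estimators=10, random_state=0),
    # A single SVC: one fit instead of ten.
    "plain_svc": SVC(),
}

accuracies = {
    name: model.fit(X_train, y_train).score(X_test, y_test)
    for name, model in models.items()
}

for name, accuracy in accuracies.items():
    print(name, "hold-out accuracy:", accuracy)
```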

kkkkangx commented on May 24, 2024

Hi,

I completed all the steps and now have another question: how can I save the trained model so that I can predict the personality scores of my own text?
