Giter Site home page Giter Site logo

Comments (8)

a1a2y3 avatar a1a2y3 commented on May 12, 2024 3

from http://pythonhosted.org/joblib/parallel.html#common-usage
I found this "Under Windows, it is important to protect the main loop of code to avoid recursive spawning of subprocesses when using joblib.Parallel."..."No code should run outside of the “if name == ‘main’” blocks, only imports and definitions."

Problem solved.

from mlbox.

a1a2y3 avatar a1a2y3 commented on May 12, 2024 1

Thank you for reply. I think joblib or multiprocessing cause this problem, and trying to solve it.
I use windows10 + anaconda + python3.6 + vs2015, may have conflict with joblib?

set n_jobs=1, seems OK
reading csv : train.csv ...
cleaning data ...
CPU time: 0.22528505325317383 seconds
reading csv : test.csv ...
cleaning data ...
CPU time: 0.1932668685913086 seconds

set n_jobs=2, it dies.

from mlbox.

a1a2y3 avatar a1a2y3 commented on May 12, 2024

Drift_thresholder() has same problem.
It takes 1.38s on kaggle kernel, and176s on my PC with E5-2683v3 cpu.

from mlbox.

AxeldeRomblay avatar AxeldeRomblay commented on May 12, 2024

Hum... sounds very weird ! Because it takes only 2 sec on my computer (7 cores). Have you tried to set n_jobs = 1 and run again ?

from mlbox.

AxeldeRomblay avatar AxeldeRomblay commented on May 12, 2024

Yes this is what I was wondering. At the moment, MLBox does not support Windows but soon :)
Thank you very much for reporting this issue !!

from mlbox.

DarquesM avatar DarquesM commented on May 12, 2024

I've got same issue, where should I set n_jobs=1 ?
mlbox.preprocessing.Reader does not have "n_jobs" parameter

from mlbox.

AxeldeRomblay avatar AxeldeRomblay commented on May 12, 2024

Hello @DarquesM !
The problem is due to windows... At the moment what you can do is to set n_jobs=1 in the source code :

df = pd.concat(Parallel(n_jobs=-1)(delayed(convert_list)(df[col]) for col in df.columns), axis=1)

df = pd.concat(Parallel(n_jobs=-1)(delayed(convert_float_and_dates)(df[col]) for col in df.columns), axis=1)

Otherwise, I will release soon a new version with reading and cleaning separate classes...

from mlbox.

AxeldeRomblay avatar AxeldeRomblay commented on May 12, 2024

Hello, thanks for reporting this issue. I will close it since this will be fixed in a next release (MLBox 0.7.1 probably)

from mlbox.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.