@quinngroup/bigneuron I'm trying to replicate the errors you're gett

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

BlueData cluster setup for testing about dr1dl-pyspark HOT 5 CLOSED

magsol commented on July 30, 2024

BlueData cluster setup for testing

from dr1dl-pyspark.

Comments (5)

milad181 commented on July 30, 2024

Perfect! Please use T 176 and P 2687340 for the 4tasks03.txt.
The 4task01.txt was the largest dataset that worked on the local mode with 16 GB RAM. (-T 176 -P 895780)

from dr1dl-pyspark.

magsol commented on July 30, 2024

What about r and e? I believe we generally set m to 100.

On Thu, Mar 10, 2016 at 4:27 PM milad181 [email protected] wrote:

Perfect! Please use T 176 and P 2687340 for the 4tasks03.txt.
The 4task01.txt http://bd.hafni.cs.uga.edu/test/4tasks01.txt was the
largest dataset that worked on the local mode with 16 GB RAM. (-T 176 -P
895780)

—
Reply to this email directly or view it on GitHub
#67 (comment)
.

iPhone'd

from dr1dl-pyspark.

milad181 commented on July 30, 2024

We generally used r 0.07 -m 5 -e 0.01 to obtain results faster.

from dr1dl-pyspark.

magsol commented on July 30, 2024

@quinngroup/bigneuron

I seem to have a reliable BlueData image working. It's currently crunching the 4tasks03.txt dataset; so far it's working. I also implemented a few optimizations--broadcasting the random seed at the start of each iteration, and representing v with a SparseVector object--to see how they work. They're not fully tested yet so the job may crash at some point.

In the meantime, feel free to use the image and stress test it against either the cluster I've spun up or your own custom cluster. Let me know if there are any problems.

from dr1dl-pyspark.

MOJTABAFA commented on July 30, 2024

@magsol
Dear Dr.quinn, would you please set some credentials for me to work with your cluster ?
Thanks

from dr1dl-pyspark.

Recommend Projects

BlueData cluster setup for testing about dr1dl-pyspark HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent