Comments (3)
Hi Aswin-roy,
Could you please provide us with:
- content of your MASTER environment variable
- complete code/script you are trying to execute
Without this information I'm not able to reproduce the issue. I suppose you are using Sparkling-shell ?
Thanks!, Kuba
from sparkling-water.
I am trying to run a Spark-Streaming job which does some operations and then moves on to predictions (from trained model on disk). I am running the job on a spark 1.6.0 standalone setup (single node). The amount of executors I have specified in conf is 49. But, when I log my h2oContext it comes to something like
Sparkling Water Context:
-
H2O name: sparkling-water-root_612514996
-
number of executors: 48
-
list of used executors:
(executorId, host, port)
....Open H2O Flow in browser: http://null:0 (CMD + click in Mac OSX)
even though in the spark ui, I can see that 49 executors are up. Why is this happening? After this, I get the ERROR JobScheduler: Error running job streaming job 1460453735000 ms.0 java.lang.ArrayIndexOutOfBoundsException: 65535
on the h2oFrame creation line.
I often get these exceptions too :
Exception in thread "main" java.lang.RuntimeException: Cloud size under 48
ERROR LiveListenerBus: Listener anon1 threw an exception
java.lang.IllegalArgumentException: Executor without H2O instance discovered, killing the cloud!
I run the job using spark-submit and using the --packages ai.h2o:sparkling-water-core_2.10:1.6.1
parameter. What might be wrong with my setup?
Thanks!
from sparkling-water.
Hi aswin-roy,
thanks for the detailed explanation!
I already saw that you reacted in the issue #4 . The issue explained there is exactly why in some cases we are not able to initiate H2OContext. It's because we weren't able to find all Spark executors during the creation of H2OContext.
In Sparkling Water you are using we created a listener which checks for changes in the cluster topology and just kills the cloud if new executor without H2O instance appeared. It's not great, but at lest we get notified about what is happening.
We are undergoing an architectural discussion with the rest of the Sparkling Water team & the community what could be the best approach when dealing with this.
The java.lang.ArrayIndexOutOfBoundsException
is probably just a consequence of the failed H2OContext initialisation.
I will close this one and redirect you please to comment at #4.
Thanks, Kuba!
from sparkling-water.
Related Issues (20)
- QA/ H2O frame to delta lake HOT 1
- AWS Glue Jobs with: {"error":"TypeError: 'JavaPackage' object is not callable","errorType":"EXECUTION_FAILURE"} HOT 2
- H2OFrame repartitioning has no API in Python HOT 2
- How to export the AutoMl Models on real-time? HOT 4
- water.AutoBuffer$AutoBufferException: java.io.IOException: Error writing request body to server HOT 3
- Updating previously trained XGBoost HOT 2
- H2O node/pod becomes unhealthy HOT 8
- How resouces are shared in case of multiple request to external sparking water backend in k8?? HOT 3
- Deployment configuration error - request reached a non-leader H2O node HOT 6
- Sparkling water (Internal Backend mode) model training stuck at 100% with OOM Error HOT 5
- Ref count mismatch for vec ERROR while training GLM models HOT 23
- Latin/UTF-8 character encoding HOT 2
- Error when training XGBoost or conduct target encoding on data with high cardinality features on sparkling water HOT 13
- Errors for fitting sparkling water GLM HOT 12
- hello-world HOT 3
- water.exceptions.H2OIllegalArgumentException: Can only convert jobs producing a single Model or ModelContainer. HOT 15
- Failed to Create H20Context HOT 3
- Save H2o Sparkling water models to Disk HOT 1
- Saving Pysparkling h2o mdeol on disk and then load into a single python application rather than pyspark HOT 3
- Bug in writing CV mojos HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sparkling-water.