2017-04-21

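H2OContext.getOrCreate deployment fails randomly

When we call H2OContext.getOrCreate with a valid SparkContext, the deployment fails intermittently with the following errors: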

17/04/21 17:21:32 ERROR TaskSchedulerImpl: Lost executor 0 on 172.17.0.4: Remote RPC client disassociated. Likely due to containers exceeding thresholds, or network issues. Check driver logs for WARN messages. 
17/04/21 17:21:38 ERROR LiveListenerBus: Listener ExecutorAddNotSupportedListener threw an exception 
java.lang.IllegalArgumentException: Executor without H2O instance discovered, killing the cloud! 
    at org.apache.spark.listeners.ExecutorAddNotSupportedListener.onExecutorAdded(H2OSparkListener.scala:27) 
    at org.apache.spark.scheduler.SparkListenerBus$class.doPostEvent(SparkListenerBus.scala:61) 
    at org.apache.spark.scheduler.LiveListenerBus.doPostEvent(LiveListenerBus.scala:36) 
    at org.apache.spark.scheduler.LiveListenerBus.doPostEvent(LiveListenerBus.scala:36) 
    at org.apache.spark.util.ListenerBus$class.postToAll(ListenerBus.scala:63) 
    at org.apache.spark.scheduler.LiveListenerBus.postToAll(LiveListenerBus.scala:36) 
    at org.apache.spark.scheduler.LiveListenerBus$$anon$1$$anonfun$run$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(LiveListenerBus.scala:94) 
    at org.apache.spark.scheduler.LiveListenerBus$$anon$1$$anonfun$run$1$$anonfun$apply$mcV$sp$1.apply(LiveListenerBus.scala:79) 
    at org.apache.spark.scheduler.LiveListenerBus$$anon$1$$anonfun$run$1$$anonfun$apply$mcV$sp$1.apply(LiveListenerBus.scala:79) 
    at scala.util.DynamicVariable.withValue(DynamicVariable.scala:58) 
    at org.apache.spark.scheduler.LiveListenerBus$$anon$1$$anonfun$run$1.apply$mcV$sp(LiveListenerBus.scala:78) 
    at org.apache.spark.util.Utils$.tryOrStopSparkContext(Utils.scala:1252) 
    at org.apache.spark.scheduler.LiveListenerBus$$anon$1.run(LiveListenerBus.scala:77) 

The error is triggered by the H2OContext.getOrCreate call:

from pyspark.sql import SparkSession
from pysparkling import H2OContext
Context.spark_session = SparkSession.builder.getOrCreate()
Context.h2o_context = H2OContext.getOrCreate(Context.spark_session)

Any ideas, crew?

Answers

This is currently a known behavior of the Sparkling Water internal backend. To avoid it, you can use the external Sparkling Water backend. More information is available here: https://github.com/h2oai/sparkling-water/blob/master/doc/backends.md
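
For anyone who wants to try the external backend, here is a minimal sketch of the Spark properties that, as far as I recall from the backends document linked above, select it. The property names and values are assumptions to verify against the documentation for your Sparkling Water version; the helper functions and the cluster name "my-h2o-cluster" are hypothetical and exist only for illustration. In manual start mode the H2O cluster is assumed to be started separately under the same cluster name.

```python
# Sketch only: the property names below are recalled from the Sparkling Water
# backends document and should be checked against your version before use.

def external_backend_properties(cloud_name, start_mode="manual"):
    """Return Spark properties that request the external Sparkling Water backend."""
    if start_mode not in ("manual", "auto"):
        raise ValueError("start_mode must be 'manual' or 'auto'")
    return {
        # Switch from the default internal backend to the external one.
        "spark.ext.h2o.backend.cluster.mode": "external",
        # "manual" assumes the H2O cluster is started outside of Spark.
        "spark.ext.h2o.external.start.mode": start_mode,
        # Name of the H2O cluster that Sparkling Water should connect to.
        "spark.ext.h2o.cloud.name": cloud_name,
    }


def as_submit_args(properties):
    """Render the properties as --conf arguments for spark-submit."""
    args = []
    for key in sorted(properties):
        args.extend(["--conf", "{}={}".format(key, properties[key])])
    return args


if __name__ == "__main__":
    # "my-h2o-cluster" is a placeholder name, not taken from the question.
    props = external_backend_properties("my-h2o-cluster")
    print(" ".join(as_submit_args(props)))
```

The same key/value pairs could instead be passed through SparkSession.builder.config(key, value) before calling H2OContext.getOrCreate.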

I am currently working on a JIRA that should eliminate the behavior described above. It is in progress, and you can track its status at https://0xdata.atlassian.net/browse/SW-369.

Great. Thanks for the explanation and the support. – deepelement