6

我试图按照quick start guide中提到的部署推荐引擎。 我完成了构建引擎的步骤。现在我想要训练推荐引擎。我在快速入门指南中提到过。 (执行pio train)。然后我得到了冗长的错误日志,我无法在这里粘贴所有内容。所以我把错误的前几行。在Predictionio中训练数据时出现异常

[INFO] [Console$] Using existing engine manifest JSON at /home/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/manifest.json 
[INFO] [Runner$] Submission command: /home/PredictionIO/PredictionIO-0.9.6/vendors/spark-1.5.1-bin-hadoop2.6/bin/spark-submit --class io.prediction.workflow.CreateWorkflow --jar/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/target/scala-2.10/template-scala-parallel-recommendation_2.10-0.1-SNAPSHOT.jar,file:/home/PredictionIO/PredictionIO-0.9.6/bndation/target/scala-2.10/template-scala-parallel-recommendation-assembly-0.1-SNAPSHOT-deps.jar --files file:/home/PredictionIO/PredictionIO-0.9.6/conf/log4j.properties --driver/home/PredictionIO/PredictionIO-0.9.6/conf:/home/PredictionIO/PredictionIO-0.9.6/lib/postgresql-9.4-1204.jdbc41.jar:/home/PredictionIO/PredictionIO-0.9.6/lib/mysql-connector-jav file:/home/PredictionIO/PredictionIO-0.9.6/lib/pio-assembly-0.9.6.jar --engine-id qokYFr4rwibijNjabXeVSQKKFrACyrYZ --engine-version ed29b3e2074149d483aa85b6b1ea35a52dbbdb9a --et file:/home/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/engine.json --verbosity 0 --json-extractor Both --env PIO_ENV_LOADED=1,PIO_STORAGE_REPOSITORIES_METADATA_NAME=pFS_BASEDIR=/root/.pio_store,PIO_HOME=/home/PredictionIO/PredictionIO-0.9.6,PIO_FS_ENGINESDIR=/root/.pio_store/engines,PIO_STORAGE_SOURCES_PGSQL_URL=jdbc:postgresql://localhost/pGE_REPOSITORIES_METADATA_SOURCE=PGSQL,PIO_STORAGE_REPOSITORIES_MODELDATA_SOURCE=PGSQL,PIO_STORAGE_REPOSITORIES_EVENTDATA_NAME=pio_event,PIO_STORAGE_SOURCES_PGSQL_PASSWORD=pio,PIURCES_PGSQL_TYPE=jdbc,PIO_FS_TMPDIR=/root/.pio_store/tmp,PIO_STORAGE_SOURCES_PGSQL_USERNAME=pio,PIO_STORAGE_REPOSITORIES_MODELDATA_NAME=pio_model,PIO_STORAGE_REPOSITORIES_EVENTDGSQL,PIO_CONF_DIR=/home/PredictionIO/PredictionIO-0.9.6/conf 
[INFO] [Engine] Extracting datasource params... 
[INFO] [WorkflowUtils$] No 'name' is found. Default empty String will be used. 
[INFO] [Engine] Datasource params: (,DataSourceParams(MyApp3,None)) 
[INFO] [Engine] Extracting preparator params... 
[INFO] [Engine] Preparator params: (,Empty) 
[INFO] [Engine] Extracting serving params... 
[INFO] [Engine] Serving params: (,Empty) 
[WARN] [Utils] Your hostname, test-digin resolves to a loopback address: 127.0.1.1; using 192.168.2.191 instead (on interface p5p1) 
[WARN] [Utils] Set SPARK_LOCAL_IP if you need to bind to another address 
[INFO] [Remoting] Starting remoting 
[INFO] [Remoting] Remoting started; listening on addresses :[akka.tcp://[email protected]:56574] 
[WARN] [MetricsSystem] Using default name DAGScheduler for source because spark.app.id is not set. 
[INFO] [Engine$] EngineWorkflow.train 
[INFO] [Engine$] DataSource: [email protected] 
[INFO] [Engine$] Preparator: [email protected] 
[INFO] [Engine$] AlgorithmList: List([email protected]) 
[INFO] [Engine$] Data sanity check is on. 
[INFO] [Engine$] duo.TrainingData does not support data sanity check. Skipping check. 
[INFO] [Engine$] duo.PreparedData does not support data sanity check. Skipping check. 
[WARN] [BLAS] Failed to load implementation from: com.github.fommil.netlib.NativeSystemBLAS 
[WARN] [BLAS] Failed to load implementation from: com.github.fommil.netlib.NativeRefBLAS 
[WARN] [LAPACK] Failed to load implementation from: com.github.fommil.netlib.NativeSystemLAPACK 
[WARN] [LAPACK] Failed to load implementation from: com.github.fommil.netlib.NativeRefLAPACK 
Exception in thread "main" org.apache.spark.SparkException: Job aborted due to stage failure: Task serialization failed: java.lang.StackOverflowError 
java.io.ObjectStreamClass.invokeWriteObject(ObjectStreamClass.java:1028) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1496) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:348) 
scala.collection.immutable.$colon$colon.writeObject(List.scala:379) 
sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source) 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 
java.lang.reflect.Method.invoke(Method.java:498) 
java.io.ObjectStreamClass.invokeWriteObject(ObjectStreamClass.java:1028) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1496) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 
java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1509) 
java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1548) 

我能做些什么来克服这个问题?

+0

这似乎是内存问题。您是否尝试过增加驱动程序内存限制? – Anzel

+0

我正在使用4核,6GB RAM和Ubuntu 14.04服务器。我在监视模型的同时监视服务器的性能,但没有使用交换内存,甚至没有使用全部6GB。所以我认为这个例外是另一回事。 –

+0

但是从上面发布的例外来看,确实与内存有关。尝试使用' - driver'内存'和'--executor-memory'来运行'4G'或更高,并查看是否有帮助 – Anzel

回答

4

您的错误说为java.lang.StackOverflowError因为您可以减少numIterations parameterengine.json文件。请参阅this

-1

我在8GB MacOS机器上遇到类似问题。将/MyRecommendation/engine.json中的numIterations参数更改为10(默认为20)可以解决此问题。在pio火车上使用--driver-memory和--executor-memory没有。

+0

欢迎来到Stack Overflow!虽然我们感谢您的回答,但如果它能提供其他答案的附加价值会更好。在这种情况下,您的答案不会提供额外的价值,因为另一个用户已经发布了该解决方案。如果以前的答案对你有帮助,你应该投票,而不是重复相同的信息。 –

相关问题