2015-07-09 61 views

回答

1

在Mahout中的MapReduce支持NaiveBayes,该模型将被保存到由-o参数指定的目录,如果培训是通过CLI:

mahout trainnb 
    -i ${PATH_TO_TFIDF_VECTORS} 
    -o ${"path/to/model}/model 
    -li ${PATH_TO_MODEL}/labelindex 
    -ow 
    -c 

参见:http://mahout.apache.org/users/classification/bayesian.html

,并通过检索:

NaiveBayesModel model = NaiveBayesModel.materialize(("/path/to/model"), getConf()); 

替代方案LY,使用亨利马乌-轮回的火花备份朴素贝叶斯,模型可以通过命令行训练,将类似地被输出到由-o参数指定的路径:

mahout spark-trainnb 
    -i ${PATH_TO_TFIDF_VECTORS} 
    -o ${/path/to/model} 
    -ow 
    -c 

或模型可以从被训练

val model = SparkNaiveBayes.train(aggregatedObservations, labelIndex, false) 

输出到(HD)FS由:经由应用内

model.dfsWrite("/path/to/model") 

和检索经由:

val retrievedModel = NBModel.dfsRead("/path/to/model") 

参见:http://mahout.apache.org/users/environment/classify-a-doc-from-the-shell.html

相关问题