2017-03-16 91 views
-1

我正在练习一些火花代码,并开始自己我被困在下面的错误。RDD不可迭代错误

newRDD = sc.textFile("/user/cloudera/sqoop-import/products") 

17/03/15 20:41:31 INFO storage.MemoryStore: Block broadcast_1 stored as values in memory (estimated size 198.0 KB, free 419.3 KB) 17/03/15 20:41:31 INFO storage.MemoryStore: Block broadcast_1_piece0 stored as bytes in memory (estimated size 23.3 KB, free 442.7 KB) 17/03/15 20:41:31 INFO storage.BlockManagerInfo: Added broadcast_1_piece0 in memory on 192.168.50.129:49855 (size: 23.3 KB, free: 530.2 MB) 17/03/15 20:41:31 INFO spark.SparkContext: Created broadcast 1 from textFile at NativeMethodAccessorImpl.java:-2

for i in newRDD: 
...  print(i) 
... 

回溯(最近通话最后一个): 文件 “”,1号线,在 类型错误: 'RDD' 对象不是可迭代

+4

可能的重复[我如何在apache spark(scala)中迭代RDD](http://stackoverflow.com/questions/25914789/how-do-i-iterate-rdds-in-apache-spark-scala) –

回答

0

我的坏。我没有对数据采取任何行动。只有变革才会将数据传递给驱动程序,因为所有这些都是懒惰的评估。

相关问题