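How to open a file stored in HDFS in pySpark using open
How do I open a file that is stored in HDFS? The input file here lives in HDFS. If I pass the path as shown below, the file cannot be opened and I get a "file not found" error.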
from pyspark import SparkConf, SparkContext

conf = SparkConf()
sc = SparkContext(conf=conf)

def getMovieName():
    movieNames = {}
    with open("/user/sachinkerala6174/inData/movieStat") as f:
        for line in f:
            fields = line.split("|")
            mID = fields[0]
            mName = fields[1]
            movieNames[int(fields[0])] = fields[1]
    return movieNames

nameDict = sc.broadcast(getMovieName())
My assumption was that I could use something like
with open(sc.textFile("/user/sachinkerala6174/inData/movieStat")) as f:
but that did not work either.
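
For context on why both attempts fail: Python's built-in open() only reads the driver's local filesystem, so it cannot see an HDFS path, and sc.textFile() returns an RDD of lines rather than a path or file object, so it cannot be passed to open(). One possible direction is to let Spark read the file and collect the parsed pairs back to the driver. The following is a minimal sketch, not a definitive fix. The helper names parse_movie_line and get_movie_names are made up for illustration, and it assumes the file is small enough to fit in driver memory and that every line has at least two "|"-separated fields with an integer ID first.

```python
def parse_movie_line(line):
    """Turn one 'id|name|...' line into an (int id, name) pair."""
    fields = line.split("|")
    return int(fields[0]), fields[1]


def get_movie_names(sc, path):
    """Build the {movie id: movie name} dict from a file read through Spark.

    sc is an existing SparkContext; path is resolved against the cluster's
    default filesystem (HDFS in the setup described in the question).
    """
    # textFile() gives an RDD with one element per line of the file.
    lines = sc.textFile(path)
    # map() parses each line on the executors; collectAsMap() brings the
    # resulting (key, value) pairs back to the driver as a plain dict.
    return lines.map(parse_movie_line).collectAsMap()


# Hypothetical usage with the SparkContext from the question:
# nameDict = sc.broadcast(
#     get_movie_names(sc, "/user/sachinkerala6174/inData/movieStat"))
```

Keeping the line parsing in its own function means it can be checked without a cluster, and the resulting dict can be broadcast exactly as in the original code.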