0
我想用pyspark运行sql查询。我使用下面的代码:错误与RDD.registerAsTable?
from pyspark import SparkConf, SparkContext
from pyspark.sql import SQLContext, Row
import os
spark_config = SparkConf().setMaster("local").setAppName("Project_SQL")
sc = SparkContext(conf = spark_config)
sqlctx = SQLContext(sc)
df = sqlctx.read.format("jdbc").option(url="..."
, driver = "oracle.jdbc.driver.OracleDriver"
, dbtable = "UDM_CDS.TRANSACTIONS"
, user="..."
, password="...").load()
df_rdd = df.rdd
df_rdd.registerAsTable("transactions")
sqlctx.sql('SELECT TRANSACTION_KEY FROM transactions').show(5)
但它口口声声说RDD对象有没有atributte registerAsTable,虽然它有!我很困惑! 任何人都可以帮助我吗?
*'registerTempTable' – philantrovert
你说得对!谢谢 – MaFF
起初我是这么做的,但仍然有同样的错误,说DataFrame对象没有属性registerAsTable! –