2017-04-18

Answers

1

You can try setting the mode to "DROPMALFORMED" as follows (Scala):

val df = sqlContext.read.format("com.databricks.spark.csv").option("mode", "DROPMALFORMED")... 

Python

df = sqlContext.read.format('com.databricks.spark.csv').options(mode = "DROPMALFORMED")... 

According to the documentation, this mode:

"...drops lines which have fewer or more tokens than expected."
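To make the quoted rule concrete, here is a minimal plain-Python sketch of the same idea: keep a line only if it has exactly the expected number of tokens. This is not Spark's parser, and it only mirrors the sentence quoted above. The function name `drop_malformed`, the `expected_tokens` parameter, and the sample data are hypothetical and chosen for illustration.

```python
import csv
import io


def drop_malformed(text, expected_tokens):
    """Hypothetical helper: keep only rows with exactly expected_tokens fields.

    Illustrates the quoted rule ("fewer or more tokens than expected");
    it is not how Spark implements the DROPMALFORMED mode.
    """
    reader = csv.reader(io.StringIO(text))
    return [row for row in reader if len(row) == expected_tokens]


# Made-up sample: one line has too few tokens, one has too many.
sample = "id,name,age\n1,Ann,30\n2,Bob\n3,Cy,41,extra\n4,Dee,25\n"

rows = drop_malformed(sample, expected_tokens=3)
print(rows)
```

With this sample, the lines `2,Bob` and `3,Cy,41,extra` are dropped and the other rows are kept, which is the behavior the documentation sentence describes for lines with a mismatched token count.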

+0

Now I get this error: value option is not a member of org.apache.spark.sql.DataFrame

+1

I think the above uses Python syntax. For Scala, use spark.read.option("mode", "DROPMALFORMED").csv(path)