
I am getting a "Module not found" error in Scala. I am trying to open a JDBC connection to Oracle, join two tables, and then print the result.

My Scala file:

import org.apache.spark.SparkContext 
import org.apache.spark.SparkContext._ 
import org.apache.spark.SparkConf 
import org.apache.spark.sql.SQLContext 

object sparkJDBC { 
  def main(args: Array[String]): Unit = { 
    val conf = new SparkConf().setAppName("Simple Application") 
      .setMaster("local[2]").set("spark.executor.memory", "1g") 
    val sc = new SparkContext(conf) 
    val sqlContext = new SQLContext(sc) 
    // Load the two Oracle tables over JDBC 
    val chrttyp = sqlContext.load("jdbc", 
      Map("url" -> "jdbc:oracle:thin:gductv1/[email protected]//localhost:1521/XE", 
          "dbtable" -> "chrt_typ")) 
    val clntlvl1 = sqlContext.load("jdbc", 
      Map("url" -> "jdbc:oracle:thin:gductv1/[email protected]//localhost:1521/XE", 
          "dbtable" -> "clnt_lvl1")) 
    // Join the two tables on their key columns and print the result 
    val join2 = chrttyp.join(clntlvl1, chrttyp.col("chrt_typ_key") === clntlvl1.col("lvl1_key")) 
    join2.foreach(println) 
    join2.printSchema() 
  } 
} 

My build.sbt file:

name := "sparkJDBC" 
    version := "0.1" 
    scalaVersion := "2.11.7" 

    libraryDependencies += "org.apache.spark" %% "spark-core" % "1.5.1" 
    libraryDependencies += "org.apache.tika" % "tika-core" % "1.11" 
    libraryDependencies += "org.apache.tika" % "tika-parsers" % "1.11" 
    libraryDependencies += "org.apache.hadoop" % "hadoop-client" % "2.7.1" 
    libraryDependencies += "org.apache.spark" % "spark-sql" % "1.0.0" 

The error output is:

[warn] module not found: org.apache.spark#spark-sql;1.0.0 
[warn] ==== local: tried 
[warn] C:\Users\.ivy2\local\org.apache.spark\spark-sql\1.0.0\ivys\ivy.xml 
[warn] ==== public: tried 
[warn] https://repo1.maven.org/maven2/org/apache/spark/spark-sql/1.0.0/spark-sql-1.0.0.pom 
[info] Resolving jline#jline;2.12.1 ... 
[warn] :::::::::::::::::::::::::::::::::::::::::::::: 
[warn] ::   UNRESOLVED DEPENDENCIES   :: 
[warn] :::::::::::::::::::::::::::::::::::::::::::::: 
[warn] :: org.apache.spark#spark-sql;1.0.0: not found 
[warn] :::::::::::::::::::::::::::::::::::::::::::::: 

[error] (*:update) sbt.ResolveException: unresolved dependency: org.apache.spark#spark-sql;1.0.0: not found 

Please help me figure out what is causing this.


Question: doesn't the current answer resolve what you asked? –

Answer


To make sure you have the correct dependency, you can use a site like mvnrepository: https://mvnrepository.com/artifact/org.apache.spark/spark-sql_2.10/1.0.0

libraryDependencies += "org.apache.spark" % "spark-sql_2.10" % "1.0.0" 

That's a great pointer to the dependency, Thomas. I no longer get the 'Module not found' problem, but now I get: [error] Modules were resolved with conflicting cross-version suffixes in {file:/C:/apps/spark-2.1.0/ScalaFiles/}scalafiles: [error] org.json4s:json4s-ast _2.11, _2.10 [error] com.twitter:chill _2.11, _2.10 [error] org.json4s:json4s-jackson _2.11, _2.10 [error] org.json4s:json4s-core _2.11, _2.10 [error] org.apache.spark:spark-core _2.11, _2.10 –
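
The conflicting cross-version suffixes above come from mixing the _2.10 spark-sql artifact with the _2.11 artifacts that %% "spark-core" pulls in under scalaVersion := "2.11.7". Below is a minimal build.sbt sketch that keeps every Spark module on one Scala suffix and one Spark version; 1.5.1 is only an assumption taken from the spark-core line in the question, any single matching version should do:

name := "sparkJDBC" 

version := "0.1" 

scalaVersion := "2.11.7" 

// Same Spark version everywhere, with %% so each artifact gets the _2.11 suffix 
libraryDependencies += "org.apache.spark" %% "spark-core" % "1.5.1" 
libraryDependencies += "org.apache.spark" %% "spark-sql" % "1.5.1" 
libraryDependencies += "org.apache.hadoop" % "hadoop-client" % "2.7.1" 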


I removed the Scala version reference (scalaVersion := "2.11.7") from the .sbt file, and it no longer gives me any conflicting cross-version errors. Now it gives me some SQLContext errors, but it has gotten past the first hurdle. Thanks to Thomas and Alexey for the quick responses. –
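
The remaining SQLContext errors are probably because sqlContext.load(...) was deprecated in Spark 1.4 and dropped in later releases, and the path in the comments suggests Spark 2.1.0 is installed. A rough sketch of the same two loads with the read.format("jdbc") API on Spark 2.x (the object name is arbitrary; the URL, table and column names are copied from the question, and the Oracle JDBC driver jar still has to be on the classpath):

import org.apache.spark.sql.SparkSession 

// Sketch only: Spark 2.x entry point, where SparkSession replaces SQLContext 
object SparkOracleJoin { 
  def main(args: Array[String]): Unit = { 
    val spark = SparkSession.builder() 
      .appName("Simple Application") 
      .master("local[2]") 
      .getOrCreate() 

    val url = "jdbc:oracle:thin:gductv1/[email protected]//localhost:1521/XE" 

    // Load the two Oracle tables over JDBC 
    val chrttyp = spark.read.format("jdbc") 
      .option("url", url) 
      .option("dbtable", "chrt_typ") 
      .load() 

    val clntlvl1 = spark.read.format("jdbc") 
      .option("url", url) 
      .option("dbtable", "clnt_lvl1") 
      .load() 

    // Same join as in the question 
    val join2 = chrttyp.join(clntlvl1, chrttyp.col("chrt_typ_key") === clntlvl1.col("lvl1_key")) 
    join2.printSchema() 
    join2.show() 
  } 
} 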