2017-05-27 79 views
1

我试图运行Spring Boot YARN示例(Windows上为https://spring.io/guides/gs/yarn-basic/)。在application.yml我改变了fsUriresourceManagerHost指向我的虚拟主机192.168...。 但是,当我试图运行的应用程序Exceprion出现:Spring Boot YARN无法在Hadoop 2.8.0客户端上运行客户端无法访问DataNode

DFSClient: Exception in createBlockOutputStream 
java.net.ConnectException: Connection timed out: no further information 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:717) 
    at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) 
    at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:531) 
    at org.apache.hadoop.hdfs.DFSOutputStream.createSocketForPipeline(DFSOutputStream.java:1508) 
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.createBlockOutputStream(DFSOutputStream.java:1284) 
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1237) 
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:449) 
[2017-05-27 19:59:49.570] boot - 7728 INFO [Thread-5] --- DFSClient: Abandoning BP-646365587-10.0.2.15-1495898351938:blk_1073741830_1006 
[2017-05-27 19:59:49.602] boot - 7728 INFO [Thread-5] --- DFSClient: Excluding datanode DatanodeInfoWithStorage[10.0.2.15:50010,DS-f909ec7a-8374-4cdd-9cfc-0e778810d98c,DISK] 
[2017-05-27 19:59:49.647] boot - 7728 WARN [Thread-5] --- DFSClient: DataStreamer Exception 
org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /app/gs-yarn-basic/gs-yarn-basic-container-0.1.0.jar could only be replicated to 0 nodes instead of minReplication (=1). There are 1 datanode(s) running and 1 node(s) are excluded in this operation. 

这意味着,数据管理部是不是从我的主机访问。出于这个原因,我加入到hdfs-site.xml中

<property> 
    <name>dfs.client.use.datanode.hostname</name> 
    <value>true</value> 
    <description>Whether clients should use datanode hostnames when 
    connecting to datanodes. 
    </description> 
</property> 

但它仍然会抛出该异常。

我的虚拟机上运行Hadoop 2.8.0。这是conf。文件:

核心的site.xml

<configuration> 
    <property> 
     <name>fs.defaultFS</name> 
     <value>hdfs://0.0.0.0:9000</value> 
    </property> 

</configuration> 

HDFS-site.xml中

<configuration> 
     <property> 
      <name>dfs.replication</name> 
      <value>1</value> 
     </property> 
     <property> 
      <name>dfs.namenode.name.dir</name> 
      <value>/usr/local/hadoop/hadoop-2.8.0/data/namenode</value> 
     </property> 

     <property> 
      <name>dfs.datanode.data.dir</name> 
      <value>/usr/local/hadoop/hadoop-2.8.0/data/datanode</value> 
     </property> 

     <property> 
      <name>dfs.permissions.enabled</name> 
      <value>false</value> 
     </property> 

     <property> 
      <name>dfs.client.use.datanode.hostname</name> 
      <value>true</value> 
      <description>Whether clients should use datanode hostnames when 
       connecting to datanodes. 
      </description> 
     </property> 
    </configuration> 

mapred-site.xml中

<configuration>  
    <property> 
     <name>mapreduce.framework.name</name> 
     <value>yarn</value> 
    </property> 
</configuration> 

纱的site.xml

<configuration> 
    <property> 
     <name>yarn.nodemanager.aux-services</name> 
     <value>mapreduce_shuffle</value> 
    </property> 
    <property> 
     <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> 
     <value>org.apache.hadoop.mapred.ShuffleHandler</value> 
    </property> 
    <property> 
     <name>yarn.scheduler.maximum-allocation-mb</name> 
     <value>8192</value> 
    </property> 
     <property> 
     <name>yarn.nodemanager.resource.memory-mb</name> 
     <value>8192</value> 
    </property> 
    <property> 
     <name>yarn.nodemanager.disk-health-checker.max-disk-utilization-per- 
      disk-percentage</name> 
     <value>99</value> 
    </property>  
</configuration> 
+1

为什么在'core-site.xml中放入'0.0.0.0:9000'?这应该是IP或主机名。 –

+0

@RameshMaharjan,它工作后改为IP,谢谢 – Markiza

回答

2

您的core-site.xml应指向Namenode地址,但其当前指向0.0.0.0这意味着本地计算机上的所有地址。这会造成模糊的结果,因为每台机器都应被视为Namenode

Namenode应该只在hadoop集群中有一个。

Namenodeiphostname替换0.0.0.0应解决您所面临的问题。

+0

它真的有效! 谢谢!:) – Markiza

+0

你认为它不应该被删除? – Markiza

+0

这并不糟糕 – Markiza

1

将core-site.xml中的0.0.0.0:9000更改为[VM的IP]:9000后,Spring连接到YARN。感谢@RameshMaharjan

+0

@RameshMaharjan,Ohhh,对不起:)谢谢! – Markiza

+0

@RameshMaharjan,当然:) – Markiza