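Hadoop YARN job stays in UNDEFINED status after submission
I am trying to run an MR job on a pseudo-distributed cluster in a VirtualBox VM (RHEL 6.5, 8 GB RAM, 100 GB HDD), but after submission the job gets stuck at this point: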
INFO: mapreduce.Job: Running job: job_1437483993_001
The application tracking URL (http://localhost:8088/cluster/applicationID) shows the following:
User: root
Name: grep-search
Application Type: MAPREDUCE
State: ACCEPTED
FinalStatus: UNDEFINED
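The same fields can also be read from the ResourceManager REST API instead of the web page. The following is a minimal sketch, assuming the ResourceManager web address is http://localhost:8088 as in this post; the helper names (`summarize_app`, `is_waiting_for_am`, `fetch_app`) are hypothetical and only illustrate the idea.

```python
import json
from urllib.request import urlopen

# Assumption: the ResourceManager web UI/REST endpoint is reachable here.
RM_URL = "http://localhost:8088"


def summarize_app(payload):
    """Pick the scheduling-related fields out of a /ws/v1/cluster/apps/<id> reply."""
    app = payload["app"]
    return {
        "state": app.get("state"),
        "finalStatus": app.get("finalStatus"),
        "diagnostics": app.get("diagnostics", ""),
    }


def is_waiting_for_am(summary):
    """ACCEPTED with UNDEFINED final status: the application is queued and
    its ApplicationMaster has not registered yet."""
    return summary["state"] == "ACCEPTED" and summary["finalStatus"] == "UNDEFINED"


def fetch_app(app_id, rm_url=RM_URL):
    """Fetch one application's report from the ResourceManager REST API."""
    with urlopen(f"{rm_url}/ws/v1/cluster/apps/{app_id}", timeout=10) as resp:
        return json.load(resp)
```

Calling `summarize_app(fetch_app("application_1461732411884_0001"))` on the VM should return the state, final status, and any diagnostics text the scheduler has recorded for the application.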
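I have tried:
- Changing the minimum and maximum memory allocation in yarn-site.xml and mapred-site.xml, following this tutorial (http://hortonworks.com/blog/how-to-plan-and-configure-yarn-in-hdp-2-0/)
- Making sure there is enough disk space to accommodate the new job.
jps shows all services running normally.
But no luck. Please guide me.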
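The memory settings mentioned in the list above can be sanity-checked with a little arithmetic. This is only a rough sketch, not a description of what YARN does internally in every configuration: it assumes the scheduler rounds each request up to a multiple of yarn.scheduler.minimum-allocation-mb (other schedulers may use a different increment), it ignores per-queue limits, and the function names are hypothetical. The four inputs correspond to yarn.nodemanager.resource.memory-mb, yarn.scheduler.minimum-allocation-mb, yarn.scheduler.maximum-allocation-mb, and yarn.app.mapreduce.am.resource.mb.

```python
import math


def round_up(request_mb, step_mb):
    """Round a memory request up to the next multiple of step_mb."""
    return int(math.ceil(request_mb / step_mb)) * step_mb


def check_am_fits(node_mb, min_alloc_mb, max_alloc_mb, am_mb):
    """Return a list of reasons why the ApplicationMaster container
    could not be placed on a single NodeManager (empty if none found)."""
    problems = []
    # Assumption: requests are normalized to a multiple of the minimum allocation.
    granted = round_up(am_mb, min_alloc_mb)
    if am_mb > max_alloc_mb:
        problems.append(
            f"AM request {am_mb} MB exceeds maximum-allocation-mb {max_alloc_mb} MB"
        )
    if granted > node_mb:
        problems.append(
            f"AM container {granted} MB exceeds NodeManager memory {node_mb} MB"
        )
    return problems
```

With the stock defaults (8192, 1024, 8192, 1536) the check reports no problems; the values actually set in yarn-site.xml and mapred-site.xml on the VM are the ones to plug in.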
Edit: here are the logs:
[[email protected] ~]# hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar grep /user/pradeep output23 'dfs[a-z.]+'
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
16/04/27 10:21:09 INFO client.RMProxy: Connecting to ResourceManager at /127.0.0.1:8032
16/04/27 10:21:09 WARN mapreduce.JobResourceUploader: No job jar file set. User classes may not be found. See Job or Job#setJar(String).
16/04/27 10:21:09 INFO input.FileInputFormat: Total input paths to process : 4
16/04/27 10:21:10 INFO mapreduce.JobSubmitter: number of splits:4
16/04/27 10:21:11 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1461732411884_0001
16/04/27 10:21:11 INFO mapred.YARNRunner: Job jar is not present. Not adding any jar to the list of resources.
16/04/27 10:21:11 INFO impl.YarnClientImpl: Submitted application application_1461732411884_0001
16/04/27 10:21:11 INFO mapreduce.Job: The url to track the job: http://localhost:8088/proxy/application_1461732411884_0001/
16/04/27 10:21:11 INFO mapreduce.Job: Running job: job_1461732411884_0001
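I am trying to run a simple existing example application to test the infrastructure: hadoop jar hadoop-examples.jar grep input output 'dfs[a-z.]+'
Following your suggestion, I tried running a different MapReduce job, generated from a Pig script. After being submitted to the ResourceManager, that job still hangs.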