减速器停留在70％

我正在与hadoop一起编写一个非常初始的编程任务，并且还会使用经典的wordcount问题。减速器停留在70％

已经在hdfs上放了一个示例文件，并试图在其上运行wordcount。映射器经过得很好，但是，减速器停留在70％，永远不会前进。

我也尝试过使用本地文件系统上的文件，并获得相同的行为。

我能做什么错？这里有地图和减少功能 -

public void map(LongWritable key, Text value, 
     OutputCollector<Text, IntWritable> output, Reporter reporter) 
     throws IOException { 
    // TODO Auto-generated method stub 
    String line = value.toString(); 

    String[] lineparts = line.split(","); 

    for(int i=0; i<lineparts.length; ++i) 
    { 
     output.collect(new Text(lineparts[i]), new IntWritable(1)); 
    } 


public void reduce(Text key, Iterator<IntWritable> values, 
       OutputCollector<Text, IntWritable> output, Reporter reporter) 
      throws IOException { 
     // TODO Auto-generated method stub 
     int count = 0; 
     while(values.hasNext()) 
     { 
      count=count+1; 
     } 
     output.collect(key , new IntWritable(count)); 
    }

来源

2016-08-21 Gyan

你永远不叫你的迭代器next()，所以你基本上是创建一个无限循环。

作为一个侧面说明，要实现这个字数例子的首选方法是不是1递增计数，但使用的值改为：

IntWritable value = values.next(); 
count += value.get();

这样，您就可以重用Reducer作为Combiner，以便它将计算每个映射器的部分计数，并从给定的映射器向reducer发出（“wordX”，7）而不是7次出现的（“wordX”，1）。你可以阅读更多关于梳妆台here。

来源

2016-08-21 18:48:38

完美，感谢您快速查找错误。 – Gyan

减速器停留在70％

回答

相关问题