Hadoop reduce停止运行
我现在有一个可怕的问题。当我在hadoop中运行一个作业时,映射过程是正常的,达到了100%,没有任何错误发生。然而,当reduce进程运行时,它在达到67%时停止。这很奇怪。我是hadoop的新手,在网上搜索了很多资料,但现在仍然让我感到困惑。Follow是一个输出Hadoop reduce停止运行,hadoop,mapreduce,Hadoop,Mapreduce,我现在有一个可怕的问题。当我在hadoop中运行一个作业时,映射过程是正常的,达到了100%,没有任何错误发生。然而,当reduce进程运行时,它在达到67%时停止。这很奇怪。我是hadoop的新手,在网上搜索了很多资料,但现在仍然让我感到困惑。Follow是一个输出 13/10/25 21:40:00 INFO input.FileInputFormat: Total input paths to process : 2 13/10/25 21:40:01 INFO mapred.JobCli
13/10/25 21:40:00 INFO input.FileInputFormat: Total input paths to process : 2
13/10/25 21:40:01 INFO mapred.JobClient: Running job: job_201310252001_0003
13/10/25 21:40:02 INFO mapred.JobClient: map 0% reduce 0%
13/10/25 21:40:30 INFO mapred.JobClient: map 1% reduce 0%
13/10/25 21:40:37 INFO mapred.JobClient: map 2% reduce 0%
13/10/25 21:40:39 INFO mapred.JobClient: map 3% reduce 0%
13/10/25 21:40:40 INFO mapred.JobClient: map 4% reduce 0%
13/10/25 21:40:42 INFO mapred.JobClient: map 5% reduce 0%
13/10/25 21:40:43 INFO mapred.JobClient: map 6% reduce 0%
13/10/25 21:40:45 INFO mapred.JobClient: map 7% reduce 0%
13/10/25 21:40:46 INFO mapred.JobClient: map 9% reduce 0%
13/10/25 21:40:48 INFO mapred.JobClient: map 10% reduce 0%
13/10/25 21:40:49 INFO mapred.JobClient: map 11% reduce 0%
13/10/25 21:40:52 INFO mapred.JobClient: map 14% reduce 0%
13/10/25 21:40:55 INFO mapred.JobClient: map 17% reduce 0%
13/10/25 21:40:58 INFO mapred.JobClient: map 19% reduce 0%
13/10/25 21:41:01 INFO mapred.JobClient: map 22% reduce 0%
13/10/25 21:41:04 INFO mapred.JobClient: map 23% reduce 0%
13/10/25 21:41:05 INFO mapred.JobClient: map 24% reduce 0%
13/10/25 21:41:07 INFO mapred.JobClient: map 26% reduce 0%
13/10/25 21:41:08 INFO mapred.JobClient: map 27% reduce 0%
13/10/25 21:41:10 INFO mapred.JobClient: map 28% reduce 0%
13/10/25 21:41:11 INFO mapred.JobClient: map 29% reduce 0%
13/10/25 21:41:13 INFO mapred.JobClient: map 30% reduce 0%
13/10/25 21:41:14 INFO mapred.JobClient: map 31% reduce 0%
13/10/25 21:41:16 INFO mapred.JobClient: map 32% reduce 0%
13/10/25 21:41:20 INFO mapred.JobClient: map 34% reduce 0%
13/10/25 21:41:23 INFO mapred.JobClient: map 35% reduce 0%
13/10/25 21:41:26 INFO mapred.JobClient: map 36% reduce 0%
13/10/25 21:41:34 INFO mapred.JobClient: map 37% reduce 0%
13/10/25 21:41:39 INFO mapred.JobClient: map 38% reduce 0%
13/10/25 21:41:43 INFO mapred.JobClient: map 40% reduce 0%
13/10/25 21:41:44 INFO mapred.JobClient: map 40% reduce 6%
13/10/25 21:41:46 INFO mapred.JobClient: map 42% reduce 6%
13/10/25 21:41:49 INFO mapred.JobClient: map 43% reduce 6%
13/10/25 21:41:51 INFO mapred.JobClient: map 44% reduce 6%
13/10/25 21:41:52 INFO mapred.JobClient: map 45% reduce 6%
13/10/25 21:41:55 INFO mapred.JobClient: map 46% reduce 6%
13/10/25 21:41:57 INFO mapred.JobClient: map 47% reduce 6%
13/10/25 21:41:58 INFO mapred.JobClient: map 48% reduce 9%
13/10/25 21:42:01 INFO mapred.JobClient: map 51% reduce 12%
13/10/25 21:42:04 INFO mapred.JobClient: map 54% reduce 12%
13/10/25 21:42:07 INFO mapred.JobClient: map 56% reduce 12%
13/10/25 21:42:10 INFO mapred.JobClient: map 58% reduce 12%
13/10/25 21:42:13 INFO mapred.JobClient: map 60% reduce 12%
13/10/25 21:42:16 INFO mapred.JobClient: map 61% reduce 12%
13/10/25 21:42:19 INFO mapred.JobClient: map 62% reduce 15%
13/10/25 21:42:22 INFO mapred.JobClient: map 63% reduce 15%
13/10/25 21:42:23 INFO mapred.JobClient: map 65% reduce 15%
13/10/25 21:42:26 INFO mapred.JobClient: map 66% reduce 15%
13/10/25 21:42:28 INFO mapred.JobClient: map 67% reduce 15%
13/10/25 21:42:29 INFO mapred.JobClient: map 68% reduce 15%
13/10/25 21:42:32 INFO mapred.JobClient: map 69% reduce 15%
13/10/25 21:42:34 INFO mapred.JobClient: map 70% reduce 18%
13/10/25 21:42:35 INFO mapred.JobClient: map 72% reduce 18%
13/10/25 21:42:38 INFO mapred.JobClient: map 75% reduce 18%
13/10/25 21:42:41 INFO mapred.JobClient: map 77% reduce 18%
13/10/25 21:42:44 INFO mapred.JobClient: map 80% reduce 18%
13/10/25 21:42:47 INFO mapred.JobClient: map 82% reduce 18%
13/10/25 21:42:50 INFO mapred.JobClient: map 85% reduce 18%
13/10/25 21:42:53 INFO mapred.JobClient: map 87% reduce 18%
13/10/25 21:42:56 INFO mapred.JobClient: map 88% reduce 18%
13/10/25 21:42:59 INFO mapred.JobClient: map 89% reduce 18%
13/10/25 21:43:02 INFO mapred.JobClient: map 90% reduce 18%
13/10/25 21:43:05 INFO mapred.JobClient: map 91% reduce 18%
13/10/25 21:43:18 INFO mapred.JobClient: map 94% reduce 21%
13/10/25 21:43:21 INFO mapred.JobClient: map 97% reduce 21%
13/10/25 21:43:24 INFO mapred.JobClient: map 99% reduce 27%
13/10/25 21:43:27 INFO mapred.JobClient: map 100% reduce 30%
13/10/25 21:43:30 INFO mapred.JobClient: map 100% reduce 67%
这里的症状是reduce阶段的代码被“卡住”,要么是因为无限循环,要么就是接收到的数据量太大,要么是其他原因(可能是发布reduce代码?) 以下是百分比在减速器中的工作方式:
在JobTracker界面中,查看您的作业,并查看还原程序正在获取多少数据。如果reducer中的记录数在增加,这意味着您可能有太多的数据进入reducer。如果这个数字保持不变,你可能会有某种无限循环。显然这里没有任何人可以帮助的。为什么?你能说更多细节吗?可能是因为我的程序不正确吗?我面临着类似的问题!你应该检查JobTracker(URL=:50030)中的日志。在阅读日志时,我发现我的一个数据节点无法通过ssh连接到自身。我必须更新它的/etc/hosts文件。嗨,Donald,我也面临着同样的问题,只有几个字节的数据…知道为什么吗?我的输入是3个文件,每个文件只包含2行…这个答案帮助了我!我被困在16%,这是洗牌阶段。发现其中一个数据节点无法与其他计算机通信。这对我也有帮助。我的程序停留在70%的水平。所以我重温了reducer代码,并意识到它将进入无限循环。解决了这个问题。