Hadoop Pig: Map or Reduce Query
I have the following data sample:
AGE,EDU,SEX,SALARY
67,10th,Male,<=50K
17,10th,Female,<=50K
40,Assoc-voc,Male,>50K
35,Assoc-voc,Male,<=50K
57,Assoc-voc,Male,<=50K
49,Assoc-voc,Male,>50K
42,Bachelors,Male,>50K
30,Bachelors,Male,>50K
23,Bachelors,Female,<=50K
Thank you very much.

Yes. When you launch the job, you will see a line like this in the log:
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - detailed locations: M: Alias1[73,14] C: Alias2[20, 9] R: Alias3[90, 78]
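M stands for the mapper, C for the combiner, and R for the reducer; each alias is listed under the phase in which it runs. In general, though, a single query may run on both the mapper and the reducer.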
Elapsed: 35sec
Diagnostics:
Average Map Time: 12sec
Average Shuffle Time: 10sec
Average Merge Time: 0sec
Average Reduce Time: 2sec