Apache pig 清管器运行总计:总计超过()
我正在使用以下pig脚本计算运行总数(pig本地模式) 但是,我得到了以下错误 未知总金额(C1.工资) 有人有什么想法吗 编辑: 我自己想出了答案。这是:Apache pig 清管器运行总计:总计超过(),apache-pig,cumulative-sum,Apache Pig,Cumulative Sum,我正在使用以下pig脚本计算运行总数(pig本地模式) 但是,我得到了以下错误 未知总金额(C1.工资) 有人有什么想法吗 编辑: 我自己想出了答案。这是: Register /home/ec2-user/pig*/bin/piggybank-0.12.0.jar ; define Over org.apache.pig.piggybank.evaluation.Over(); define Stitch org.apache.pig.piggybank.evaluation.St
Register /home/ec2-user/pig*/bin/piggybank-0.12.0.jar ;
define Over org.apache.pig.piggybank.evaluation.Over();
define Stitch org.apache.pig.piggybank.evaluation.Stitch();
A = load '/home/ec2-user/staff_data.csv' using PigStorage(',') as (id:int, name:chararray, salary:int, department:chararray);
B = group A by department;
C = foreach B {
C1 = order A by salary;
generate flatten(Stitch(C1, Over(C1.salary, 'sum(int)')));
};
你试过使用SUM和OVER吗?而不是累加
Register /home/ec2-user/pig*/bin/piggybank-0.12.0.jar ;
define Over org.apache.pig.piggybank.evaluation.Over();
define Stitch org.apache.pig.piggybank.evaluation.Stitch();
A = load '/home/ec2-user/staff_data.csv' using PigStorage(',') as (id:int, name:chararray, salary:int, department:chararray);
B = group A by department;
C = foreach B {
C1 = order A by salary;
generate flatten(Stitch(C1, Over(C1.salary, 'sum(int)')));
};