Apache Pig: ERROR 1070 when counting grouped data
I just want to count how many players each team had in 2011. Grouping the data by tmID works fine. However, when I try to count the grouped data, I get ERROR 1070.
load_file = load 'Assignment2/basketball_players.csv' using PigStorage(',');
temp = foreach load_file generate
(chararray)$3 AS tmID,
(int)$1 AS year,
(chararray)$0 AS playerID;
fil_data = filter temp by year == 2011;
group_data = group fil_data by tmID;
count_data = foreach group_data generate group, count($1);
dump count_data;
The error message is as follows:
<file script.pig, line 8, column 48> Failed to generate logical plan. Nested exception: org.apache.pig.backend.executionengine.ExecException: ERROR 1070: Could not resolve count using imports: [, java.lang., org.apache.pig.builtin., org.apache.pig.impl.builtin.]
Can anyone help me solve this? Thanks.

The COUNT function name is case-sensitive. Try this:
count_data = foreach group_data generate group, COUNT($1);
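It is advisable to use the alias fil_data instead of $1, since it is more readable. All functions need to be uppercase. Keywords such as foreach, generate, and group by can be written in either case.

Not all functions are uppercase. Many functions, such as IsEmpty() and all of the DateTime functions (AddDuration(), CurrentTime(), and so on), are not written in all capitals.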
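
For reference, here is a sketch of the whole script with both suggestions applied: the built-in COUNT written in uppercase, and the bag referenced by its alias fil_data rather than by $1. The file path and column positions are taken from the question. The output field name num_players is only an illustrative choice, and the keywords are written in uppercase purely by convention.

```pig
-- Load the CSV; columns are referenced by position, as in the question.
load_file = LOAD 'Assignment2/basketball_players.csv' USING PigStorage(',');

-- Keep only the fields we need and give them names and types.
temp = FOREACH load_file GENERATE
    (chararray)$3 AS tmID,
    (int)$1 AS year,
    (chararray)$0 AS playerID;

fil_data = FILTER temp BY year == 2011;
group_data = GROUP fil_data BY tmID;

-- COUNT must be uppercase: built-in function names are case-sensitive.
-- fil_data is the bag of tuples inside each group (the same thing as $1).
count_data = FOREACH group_data GENERATE group AS tmID, COUNT(fil_data) AS num_players;

DUMP count_data;
```

Running DESCRIBE group_data; shows the schema of the grouped relation, which makes it easy to confirm that the bag is named fil_data.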