Apache pig Pig:如何解析元素数量可变的元组?

Apache pig Pig:如何解析元素数量可变的元组?,apache-pig,Apache Pig,这是我的输出文件,我用另一个Pig脚本编写: 1 3,5 2 4,6,7 我试图将每一行解析为(chararray,tuple) 但当我尝试转储元组时,它们是空的: rows = foreach data generate y; () ()试试这个 X = LOAD 'pigtuple.txt' AS (str:chararray); X1 = FOREACH X GENERATE FLATTEN(STRSPLIT(str, '\\s+')) AS (id:int,

这是我的输出文件,我用另一个Pig脚本编写:

1   3,5 
2   4,6,7
我试图将每一行解析为(chararray,tuple)

但当我尝试转储元组时,它们是空的:

rows = foreach data generate y;
() ()

试试这个

   X = LOAD 'pigtuple.txt' AS (str:chararray);

   X1 = FOREACH X GENERATE FLATTEN(STRSPLIT(str, '\\s+')) AS (id:int, attr:chararray);

   X3 = FOREACH X1 GENERATE id, STRSPLIT(attr, ',') AS (y:tuple());

   X4 = foreach X3 GENERATE id,y;

   dump X4;
如果要访问元组中的每个元素

   X4 = foreach X3 GENERATE y.$0,y.$1;
   X4 = foreach X3 GENERATE y.$0,y.$1;