Apache Pig: splitting a column into rows


I have a file with the following content:

id_v^id_f^id_s1,id_s2,id_s3,id_s4
id_v1^id_f1^id_s2,id_s3,id_s4
id_v2^id_f2^id_s2,id_s1,id_s4
The file is a CSV delimited by "^". I want to use Pig to normalize it into:

id_v^id_f^id_s1
id_v^id_f^id_s2
id_v^id_f^id_s3
id_v^id_f^id_s4
id_v1^id_f1^id_s2
id_v1^id_f1^id_s3
id_v1^id_f1^id_s4
...
Can anyone help me?


Regards

OK. Suppose this is my file, named id_v.txt:

id_v^id_f^id_s1,id_s2,id_s3,id_s4
id_v1^id_f1^id_s2,id_s3,id_s4
id_v2^id_f2^id_s2,id_s1,id_s4
Here is the Pig code:

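-- Load the file, splitting each line on '^' into three fields.
A = load '/user/cloudera/inputfiles/id_v.txt' using PigStorage('^') as (v:chararray, f:chararray, data:chararray);
-- TOKENIZE splits data on its default delimiters (which include the comma) into a bag; FLATTEN emits one row per token.
B = foreach A generate v, f, FLATTEN(TOKENIZE(data)) as data;
dump B;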
Here is the output produced by the code above:

(id_v,id_f,id_s1)
(id_v,id_f,id_s2)
(id_v,id_f,id_s3)
(id_v,id_f,id_s4)
(id_v1,id_f1,id_s2)
(id_v1,id_f1,id_s3)
(id_v1,id_f1,id_s4)
(id_v2,id_f2,id_s2)
(id_v2,id_f2,id_s1)
(id_v2,id_f2,id_s4)
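
The dump output above is Pig's tuple display form, not the "^"-delimited lines the question asks for. One way to get that format might be to store the relation with PigStorage('^') instead of dumping it. The following is a minimal sketch, not a tested job: the output directory is a hypothetical path chosen for illustration, and it assumes a Pig version whose TOKENIZE accepts an explicit delimiter argument.

```pig
-- Load the '^'-delimited input into three chararray fields.
A = LOAD '/user/cloudera/inputfiles/id_v.txt' USING PigStorage('^') AS (v:chararray, f:chararray, data:chararray);
-- Split data on commas only, then emit one row per token.
B = FOREACH A GENERATE v, f, FLATTEN(TOKENIZE(data, ',')) AS data;
-- Assumption: hypothetical output directory; it must not already exist.
STORE B INTO '/user/cloudera/outputfiles/id_v_normalized' USING PigStorage('^');
```

Passing ',' explicitly keeps values that contain spaces or parentheses intact, since the default TOKENIZE delimiters would split on those as well.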

Thank you @Surender. That is exactly what I needed.