Apache pig 从pig脚本中的映射中删除空值
这是猪的剧本Apache pig 从pig脚本中的映射中删除空值,apache-pig,Apache Pig,这是猪的剧本 set default_parallel 100; REGISTER '/x/home/eap/prod/lib/*.jar'; data = LOAD 'eap-prod://event' USING EAPDataLoader('{"attributes": { "all": "false", "values" : ["customer_id","second_party_cookie_id","email_name","user_guid"] }}'); data_pr
set default_parallel 100;
REGISTER '/x/home/eap/prod/lib/*.jar';
data = LOAD 'eap-prod://event' USING EAPDataLoader('{"attributes": { "all": "false", "values" : ["customer_id","second_party_cookie_id","email_name","user_guid"] }}');
data_proj = foreach data generate eapVars.timestamp, attributes;
store data_proj into '/user/eap_test';
在输出中,我正在打印一个可以有4个值的映射(属性)。
下一步我要做的就是删除输出中没有值的键
目前的输出是:
1392833008295[secondary_cookie_id#abcd,user_guid#,email_name#,customer_id#1234]
我希望是这样
1392833008295[secondary_cookie_id#abcd,customer_id#1234]
如何去掉输出中没有值的键 编写一个UDF,迭代所有映射并删除空值