Hadoop 配置单元查询数组作为字段

Hadoop 配置单元查询数组作为字段,hadoop,hive,hiveql,Hadoop,Hive,Hiveql,我有两张蜂箱桌: 客户表: id,name,salary 1 ,John, 10000 2 ,Melissa, 5000 科目表: id,account_number,client_id 1 ,00920202, 1 2 ,00920203, 1 3 ,00920204, 1 4 ,00920205, 2 5 ,00920206, 2 我需要返回以下结果的配置单元查询: id,name,salary,accounts 1 ,John, 10000, {00920202, 00920203,

我有两张蜂箱桌:

客户表:

id,name,salary 
1 ,John, 10000
2 ,Melissa, 5000
科目表:

id,account_number,client_id
1 ,00920202, 1
2 ,00920203, 1
3 ,00920204, 1
4 ,00920205, 2
5 ,00920206, 2
我需要返回以下结果的配置单元查询:

id,name,salary,accounts
1 ,John, 10000, {00920202, 00920203, 00920204}
2 ,Melissa, 5000, {00920205, 00920206}

提前感谢

如果您确定账号是唯一的,请使用
收款清单
。否则使用
collect\u set
消除重复项

select c.id,c.name,c.salary,collect_list(a.account_number) as all_accounts
from client c
join account a on a.client_id=c.id
group by c.id,c.name,c.salary

如果您确定帐号是唯一的,请使用
collect\u list
。否则使用
collect\u set
消除重复项

select c.id,c.name,c.salary,collect_list(a.account_number) as all_accounts
from client c
join account a on a.client_id=c.id
group by c.id,c.name,c.salary