Apache Pig: problem encountered when analyzing JSON data on Hive


I want to analyze JSON data in Pig. Here is my valid JSON data:

[{"author":"gjkfhvk","title":"gdfjhsdgfjk","published":1997, "reviews":[{"name":"fvdjk","stars":5},{"name":"dhjk","stars":4}]}, {"author":"ggkjhk","title":"gdfghfjhgh","published":1998,"reviews":[{"name":"jhjk","stars":6},{"name":"fghh","stars":6}]}]
This is my Pig command:

data = load '/home/user/Desktop/tej/pig.json' using JsonLoader('author:chararray,title:chararray,year:int,reviews:{review:(name:chararray,stars:int)}');
When I display the contents of data with this command: dump data;

the output I get is:

Input(s):
Successfully read 3 records from: "/home/user/Desktop/pig.json"

Output(s):
Successfully stored 3 records in: "file:/tmp/temp1826337556/tmp244945211"

(,)
(,)
(gdfghfjhgh,{(jhjk),(fghh)})
I cannot get the title and reviews data for the first element of the array.

Can you help me?

The JSON is invalid. I have corrected it; see below.

Output


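I changed the JSON, but now the output I get is: (gjkfhvk,gdfjhsdgfjk,1997,{(fvdjk,5),(dhjk,4)}) (,,,)

I have a question about a SQL query: finding the number of followers for a specific user ID. [User_ID] [Follower_ID] 12 13 12 15 16 17 [User_ID] 12 12 16 [Follower_ID] 13 14 15 17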
{"author":"gjkfhvk","title":"gdfjhsdgfjk","published":1997,"reviews":[{"name":"fvdjk","stars":5},{"name":"dhjk","stars":4}]}
{"author":"ggkjhk","title":"gdfghfjhgh","published":1998,"reviews":[{"name":"jhjk","stars":6},{"name":"fghh","stars":6}]}
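
The corrected data drops the enclosing [ ] and the commas between records, leaving one standalone JSON object per record. A minimal sketch of doing that conversion with a script instead of by hand is shown below. It assumes that JsonLoader expects one JSON object per input line, which is what the correction suggests; the helper name json_array_to_lines is hypothetical and not part of Pig.

```python
import json


def json_array_to_lines(text):
    """Turn the text of a JSON array of objects into one JSON object per line.

    Hypothetical helper for illustration; it is not part of Pig.
    """
    records = json.loads(text)
    if not isinstance(records, list):
        raise ValueError("expected a top-level JSON array")
    # Compact separators keep every record on a single line.
    # json.dumps preserves the key order of each parsed object.
    lines = [json.dumps(record, separators=(",", ":")) for record in records]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Sample taken from the question: an array holding two records.
    sample = (
        '[{"author":"gjkfhvk","title":"gdfjhsdgfjk","published":1997,'
        '"reviews":[{"name":"fvdjk","stars":5},{"name":"dhjk","stars":4}]},'
        ' {"author":"ggkjhk","title":"gdfghfjhgh","published":1998,'
        '"reviews":[{"name":"jhjk","stars":6},{"name":"fghh","stars":6}]}]'
    )
    print(json_array_to_lines(sample), end="")
```

Writing the returned text to the file that the load statement points at gives Pig one record per line. If a record still comes back empty, as in the (,,,) output above, it may be worth checking that line for stray or invisible characters picked up while copying the data.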