Python: API request into a BigQuery table
I'm facing a problem: I need a good way to modify a JSON file using Python.
JSON request:
{
"reports": [{
"data": {
"rows": [{
"metrics": [{
"values": ["27.8", "4", "4", "6.95", "1.0", "0.0", "3.8834951456310676"]
}
],
"dimensions": ["TEST1", "20180725"]
}, {
"metrics": [{
"values": ["75.0", "12", "12", "6.25", "1.0", "0.0", "3.4782608695652173"]
}
],
"dimensions": ["TEST2", "20180725"]
}
],
"maximums": [{
"values": ["1665.0", "140", "126", "65.0", "3.0", "0.0", "50.0"]
}
],
"minimums": [{
"values": ["0.0", "0", "0", "0.0", "0.0", "0.0", "0.0"]
}
],
"isDataGolden": true,
"totals": [{
"values": ["27045.99", "3274", "2831", "8.260839951130116", "1.1564818085482163", "0.0", "4.949387227049424"]
}
],
"rowCount": 358
},
"columnHeader": {
"dimensions": ["ga:productName", "ga:date"],
"metricHeader": {
"metricHeaderEntries": [{
"type": "CURRENCY",
"name": "ga:itemRevenue"
}, {
"type": "INTEGER",
"name": "ga:itemQuantity"
}, {
"type": "INTEGER",
"name": "ga:uniquePurchases"
}, {
"type": "CURRENCY",
"name": "ga:revenuePerItem"
}, {
"type": "FLOAT",
"name": "ga:itemsPerPurchase"
}, {
"type": "CURRENCY",
"name": "ga:productRefundAmount"
}, {
"type": "PERCENT",
"name": "ga:buyToDetailRate"
}
]
}
}
}
]
}
Looking for:
{
"ga:productName": "NAME", #from dimension
"ga:date": "NAME", #from dimension
"ga:itemRevenue": "value1", #from metricHeaderEntries
"ga:itemQuantity": "value2", #from metricHeaderEntries
...
}
{
"ga:productName": "NAME2", #from dimension
"ga:date": "NAME2", #from dimension
"ga:itemRevenue": "value3", #from metricHeaderEntries
"ga:itemQuantity": "value4", #from metricHeaderEntries
...
}
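That is, a matrix of values keyed by the names in "dimensions" and "metricHeaderEntries".
What is a clean way to modify (or re-create) the report so that it looks like this?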
LINE1 - {"ga:productName": "NAME","ga:date": "NAME","ga:itemRevenue": "value1", "ga:itemQuantity": "value2", ... }
LINE2 - {"ga:productName": "NAME","ga:date": "NAME","ga:itemRevenue": "value1", "ga:itemQuantity": "value2", ... }
EDIT1:
It works this way:
"metrics": [{"values": ["27.8", "4", "4", "6.95", "1.0", "0.0","3.8834951456310676"] #headers in metricHeaderEntries
"dimensions": ["TEST1", "20180725"] #header in dimension
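or something similar (I'm not interested in the totals etc.).
I'm looking for a solution, a sample, or an explanation of how to do this in a way that BQ will accept.
Extra: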
I understand how to get data out of a JSON response, like this:
response[][][]
but this case is too tricky for me :(
Example:
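Ideally, this is what the table should look like.
This is the way Google suggests printing the data (but it needs to be converted into the format I explained above):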
def print_response(response):
    """Print the dimensions and metrics of every row in the response."""
    for report in response.get('reports', []):
        columnHeader = report.get('columnHeader', {})
        dimensionHeaders = columnHeader.get('dimensions', [])
        metricHeaders = columnHeader.get('metricHeader', {}).get('metricHeaderEntries', [])
        for row in report.get('data', {}).get('rows', []):
            dimensions = row.get('dimensions', [])
            dateRangeValues = row.get('metrics', [])
            # Dimension names and values are parallel lists
            for header, dimension in zip(dimensionHeaders, dimensions):
                print(header + ': ' + dimension)
            # Each entry in "metrics" holds the values for one date range
            for i, values in enumerate(dateRangeValues):
                print('Date range: ' + str(i))
                for metricHeader, value in zip(metricHeaders, values.get('values')):
                    print(metricHeader.get('name') + ': ' + value)
So your edit still doesn't match the JSON: under "dimensions" in the column header you have a list of names, not key:value pairs.
"dimensions": [
"ga:productName",
"ga:date"
],
This means there are no values to read from that node, so your example is incorrect.
Under "metricHeaderEntries" you have the following:
"metricHeader": {
"metricHeaderEntries": [
{
"type": "CURRENCY",
"name": "ga:itemRevenue"
},
{
"type": "INTEGER",
"name": "ga:itemQuantity"
},
{
"type": "INTEGER",
"name": "ga:uniquePurchases"
},
{
"type": "CURRENCY",
"name": "ga:revenuePerItem"
},
{
"type": "FLOAT",
"name": "ga:itemsPerPurchase"
},
{
"type": "CURRENCY",
"name": "ga:productRefundAmount"
},
{
"type": "PERCENT",
"name": "ga:buyToDetailRate"
}
]
}
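So this doesn't match your example either, because under "metricHeaderEntries" there are no values for "ga:itemRevenue" or "ga:itemQuantity" like the ones you show; there are only the names and types.
In any case, once the JSON is parsed you can treat it like a Python dictionary: select elements by key in dictionaries and by index in lists.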
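As a minimal sketch of that key/index access, assuming the response is available as a JSON string (the `raw` sample below is a trimmed copy of the question's data, and the variable names are only illustrative):

```python
import json

# A trimmed copy of the response from the question, kept as a JSON string.
raw = '''
{"reports": [{"data": {"rows": [
  {"metrics": [{"values": ["27.8", "4"]}], "dimensions": ["TEST1", "20180725"]}
]}}]}
'''

# json.loads turns JSON objects into dicts and JSON arrays into lists.
response = json.loads(raw)

# Walk down: key "reports", index 0, key "data", key "rows", index 0.
first_row = response["reports"][0]["data"]["rows"][0]
product_name = first_row["dimensions"][0]            # first dimension value
item_revenue = first_row["metrics"][0]["values"][0]  # first metric value, still a string

print(product_name, item_revenue)
```

Note that the metric values come back as strings, so they would need converting before any arithmetic.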
If I have time, I'll try to solve your problem by taking the values from your example, even though the nodes you gave are not correct.
Answer:
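I solved your problem, although I hard-coded the keys instead of extracting them from the original JSON, just to give you an idea of how it works. Please let me know if this is what you expected: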
new_list = []
# `a` is your JSON response already read into a dictionary (e.g. with json.load)
rows = a["reports"][0]["data"]["rows"]
for row in rows:  # build one dictionary per row
    dict_line = {}
    dict_line["ga:productName"] = row["dimensions"][0]  # first dimension: product name
    dict_line["ga:date"] = row["dimensions"][1]  # second dimension: date
    values = row["metrics"][0]["values"]  # metric values, in metricHeaderEntries order
    dict_line["ga:itemRevenue"] = values[0]
    dict_line["ga:itemQuantity"] = values[1]
    dict_line["ga:uniquePurchases"] = values[2]
    dict_line["ga:revenuePerItem"] = values[3]
    dict_line["ga:itemsPerPurchase"] = values[4]
    dict_line["ga:productRefundAmount"] = values[5]
    dict_line["ga:buyToDetailRate"] = values[6]
    new_list.append(dict_line)
print(new_list)
The result (pretty-printed) is:
[
    {
        "ga:productName": "TEST1",
        "ga:itemRevenue": "27.8",
        "ga:uniquePurchases": "4",
        "ga:date": "20180725",
        "ga:revenuePerItem": "6.95",
        "ga:productRefundAmount": "0.0",
        "ga:itemQuantity": "4",
        "ga:itemsPerPurchase": "1.0",
        "ga:buyToDetailRate": "3.8834951456310676"
    },
    {
        "ga:productName": "TEST2",
        "ga:itemRevenue": "75.0",
        "ga:uniquePurchases": "12",
        "ga:date": "20180725",
        "ga:revenuePerItem": "6.25",
        "ga:productRefundAmount": "0.0",
        "ga:itemQuantity": "12",
        "ga:itemsPerPurchase": "1.0",
        "ga:buyToDetailRate": "3.4782608695652173"
    }
]
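The hard-coded keys can also be read from the response itself. The following is only a sketch of one way to do that, not part of the original answer: `flatten_report` is a hypothetical helper name, it uses only the first date range of each row, and the `sample` data is trimmed to two metrics for brevity.

```python
def flatten_report(report):
    """Return one flat dict per row, with keys read from columnHeader."""
    header = report.get("columnHeader", {})
    dimension_names = header.get("dimensions", [])
    metric_names = [
        entry["name"]
        for entry in header.get("metricHeader", {}).get("metricHeaderEntries", [])
    ]
    flat_rows = []
    for row in report.get("data", {}).get("rows", []):
        # Pair each dimension name with its value for this row.
        flat = dict(zip(dimension_names, row.get("dimensions", [])))
        # Assumption: only the first date range is of interest.
        metrics = row.get("metrics", [])
        values = metrics[0].get("values", []) if metrics else []
        flat.update(zip(metric_names, values))
        flat_rows.append(flat)
    return flat_rows


# Trimmed sample in the same shape as the response in the question.
sample = {
    "reports": [{
        "data": {"rows": [
            {"metrics": [{"values": ["27.8", "4"]}], "dimensions": ["TEST1", "20180725"]},
            {"metrics": [{"values": ["75.0", "12"]}], "dimensions": ["TEST2", "20180725"]},
        ]},
        "columnHeader": {
            "dimensions": ["ga:productName", "ga:date"],
            "metricHeader": {"metricHeaderEntries": [
                {"type": "CURRENCY", "name": "ga:itemRevenue"},
                {"type": "INTEGER", "name": "ga:itemQuantity"},
            ]},
        },
    }]
}

rows = flatten_report(sample["reports"][0])
print(rows)
```

Because the names come from `columnHeader`, the same function should keep working if the set of dimensions or metrics in the request changes.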
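Hi Denis, based on the JSON example above, can you tell me the expected result? In your LINE1 example you have "ga:productName": "NAME", but honestly there is no value for "ga:productName". If you tell me what you need, I'll be able to help you.
ga:productName and ga:date are taken from the dimensions - "dimensions": ["ga:productName", "ga:date"], and the rest of the headers come from "metricHeaderEntries".
@Carlo1585 I edited the question (take a look).
@Carlo1585 I added a picture of what the table should look like.
Check the answer in my other post. I just edited it and it is working, but I hard-coded all the keys; you should edit it to get the different keys dynamically ;) The final result is the same as what you expected ;)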
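On the "way that BQ will accept" part of the question: BigQuery load jobs can read newline-delimited JSON, one object per line. As far as I know, column names are by default limited to letters, digits and underscores, so the colon in names like ga:itemRevenue would probably have to be replaced. The sketch below assumes that; `to_bq_name`, `convert` and `to_ndjson` are hypothetical helper names, and the type mapping is my own guess at a sensible conversion rather than anything stated in the thread.

```python
import json
import re


def to_bq_name(name):
    """Replace every character that is not a letter, digit or underscore with '_'."""
    return re.sub(r"[^0-9A-Za-z_]", "_", name)


def convert(value, ga_type):
    """Cast a string value according to its metricHeaderEntries type."""
    if ga_type == "INTEGER":
        return int(value)
    if ga_type in ("FLOAT", "CURRENCY", "PERCENT"):
        return float(value)
    return value  # dimensions and unknown types stay strings


def to_ndjson(rows, types):
    """Serialize flat rows as newline-delimited JSON, one object per line."""
    lines = []
    for row in rows:
        clean = {to_bq_name(k): convert(v, types.get(k)) for k, v in row.items()}
        lines.append(json.dumps(clean))
    return "\n".join(lines)


# Flat rows in the shape produced by the answer, trimmed to two metrics.
rows = [
    {"ga:productName": "TEST1", "ga:date": "20180725",
     "ga:itemRevenue": "27.8", "ga:itemQuantity": "4"},
    {"ga:productName": "TEST2", "ga:date": "20180725",
     "ga:itemRevenue": "75.0", "ga:itemQuantity": "12"},
]
# Metric name -> type, as listed under metricHeaderEntries.
types = {"ga:itemRevenue": "CURRENCY", "ga:itemQuantity": "INTEGER"}

ndjson = to_ndjson(rows, types)
print(ndjson)
```

The resulting text could be written to a file and loaded as newline-delimited JSON; the date stays a string here, so turning it into a DATE column would need a separate conversion step.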