Python dict中唯一嵌套值的计数
我有一个这样的模式,我想得到Python dict中唯一嵌套值的计数,python,python-3.x,dictionary,count,nested,Python,Python 3.x,Dictionary,Count,Nested,我有一个这样的模式,我想得到杂志,其中提到了最大数量的独特药物 my_list = [{'atccode': 'A04AD', 'drug': 'DIPHENHYDRAMINE', 'mentioned_in': [{'date': '01/01/2019', 'journal': 'Journal of emergency nursing'}, {'date': '01/01/2019',
杂志
,其中提到了最大数量的独特药物
my_list = [{'atccode': 'A04AD',
'drug': 'DIPHENHYDRAMINE',
'mentioned_in': [{'date': '01/01/2019',
'journal': 'Journal of emergency nursing'},
{'date': '01/01/2019',
'journal': 'Journal of emergency nursing'
},
{'date': '1 January 2020',
'journal': 'Journal of emergency nursing'
},
{'date': '1 January 2020',
'journal': 'Journal of emergency nursing'},
{'date': '1 January 2020',
'journal': 'Journal of emergency nursing'
}]},
{'atccode': 'S03AA',
'drug': 'TETRACYCLINE',
'mentioned_in': [{'date': '02/01/2020',
'journal': 'American journal of veterinary research'
},
{'date': '2020-01-01',
'journal': 'Psychopharmacology'}]},
{'atccode': 'V03AB',
'drug': 'ETHANOL',
'mentioned_in': [{'date': '2020-01-01',
'journal': 'Psychopharmacology'
}]},
{'atccode': 'A01AD',
'drug': 'EPINEPHRINE',
'mentioned_in': [{'date': '01/02/2020',
'journal': 'The journal of allergy and clinical '
'immunology. In practice'},
{'date': '01/03/2020',
'journal': 'The journal of allergy and clinical '
'immunology. In practice'
},
{'date': '27 April 2020',
'journal': 'Journal of emergency nursing'
}]}]
因此,结果将如下所示:
{
"journal":"Psychopharmacology",
"unique_drug_mentions" : 2
},
{
"journal" : "Psychopharmacology",
"unique_drug_mentions":2
}
到目前为止我一直在尝试的是
from collections import Counter
mentions_counts = Counter(d['journal'] for d in my_list)
most_common = {'unique_drug_mentions': mentions_counts.most_common(1)[0][0], "journal" :d["journal"]}
但它不起作用。我只是循环浏览我的列表:
# store counts of unique drugs here
counts = {}
# loop through your dicts in the list
for d in my_list:
# look in each journal mention
for d2 in d['mentioned_in']:
# if we haven't seen this journal before
if d2['journal'] not in counts:
counts[d2['journal']] = set()
counts[d2['journal']].add(d['drug'])
# this would have all your verbose info as you want it
unique_drug_counts = [
{
"journal": journal,
"unique_drug_mentions": len(drugs)
}
for journal, drugs in counts.items()
]
# max value (the answer to your question
max(counts.items(), key=lambda x: len(x[1]))
我会循环浏览我的列表:
# store counts of unique drugs here
counts = {}
# loop through your dicts in the list
for d in my_list:
# look in each journal mention
for d2 in d['mentioned_in']:
# if we haven't seen this journal before
if d2['journal'] not in counts:
counts[d2['journal']] = set()
counts[d2['journal']].add(d['drug'])
# this would have all your verbose info as you want it
unique_drug_counts = [
{
"journal": journal,
"unique_drug_mentions": len(drugs)
}
for journal, drugs in counts.items()
]
# max value (the answer to your question
max(counts.items(), key=lambda x: len(x[1]))
在你的说明书中,药物的“独特性”意味着什么?@joao我的意思是不会有重复计数请回答你的问题,并提供更多的样本输入数据和处理结果所需的输出。“没有重复计数”不够具体。每个日志中的drup计数,在所有日志中,还是什么?再次展示更多不同的输入和预期结果。药物在您的说明书中“独特”意味着什么?@joao我的意思是不会重复计数。请回答您的问题,并提供更多样本输入数据和处理结果所需的输出。“不重复计数”不够具体。每个日志中的drup计数,在所有日志中,还是什么?再次显示更多不同的输入和预期结果。哎哟,有一个输入错误,更新到您对我的列表的命名我已经尝试过了,但它给了我作为计数所有这些1在您的测试数据集中,两个日志的唯一计数都是1,不是吗?你试过更大的一组了吗?我用正确的数据更新了我的问题,并且输出了正确的结果,这是我的输出:
[{'journal':'journal of emergency nursing','unique_drug_tensions':2},{'journal':'American journal of veterinary research','unique_drug_tensions':1},{'journal':'精神药理学','独特药物'提到:2},{'journal':'过敏与临床免疫学杂志。在实践中,'独特药物'提到:1}]
Whoops,有一个输入错误,更新到了我的列表的命名我已经尝试过了,但它给了我一个计数。在你的测试数据集中,两个日志的唯一计数都是1,没有?你在更大的集合上尝试过吗?我已经用正确的数据和输出更新了我的问题嗯,看起来对我来说,这是我的输出:[{'journal':'journal of emergency nursing','unique_drug_Tentions':2},{'journal':'American journal of veterinary research','unique_drug_Tentions':1},{'journal':'The journal of allergy and clinical immunology.在实践中,'unique_drug_Tentions':1}