Python 将两个字典中的元素放入列表中
我正在读一个文件,统计整个fasta文件中的氨基酸数量。到目前为止,一切进展顺利。 然后我需要计算百分比,并输出最高的五个,带有氨基酸缩写、计数计数和百分比。我有所有的元素,但在那之后我遇到了麻烦。我有两个字典,两个字典中的键相同,但值不同(tally和percentage)。我曾试图将这两本词典合并成一个单子,但我很难做到Python 将两个字典中的元素放入列表中,python,list,dictionary,Python,List,Dictionary,我正在读一个文件,统计整个fasta文件中的氨基酸数量。到目前为止,一切进展顺利。 然后我需要计算百分比,并输出最高的五个,带有氨基酸缩写、计数计数和百分比。我有所有的元素,但在那之后我遇到了麻烦。我有两个字典,两个字典中的键相同,但值不同(tally和percentage)。我曾试图将这两本词典合并成一个单子,但我很难做到 counts = {} #open file, read line at a time for line in open('e_coli_k12_dh10b.faa',
counts = {}
#open file, read line at a time
for line in open('e_coli_k12_dh10b.faa', 'r'):
line = line.rstrip()
#ignore header line
if line.startswith('>'):continue
for aa in line:
#if key in dict, add 1
if aa in counts:
counts[aa] += 1
#else, (if empty) for aa, set to 1
else:
counts.update({aa:1})
#get sum of all dictionary values
total = sum(counts[item] for item in counts)
#iterate over values, add to dictionary divided by total * 100
#new dict for percentages
centages = {}
for aa, tally in counts.items():
#maths for percentages
percent = (1. * tally / total * 100)
#within for loop, add aa and percent to centages
centages.update({aa: percent})
print(counts.keys())
print(counts.values())
print(centages.keys())
print(centages.values())
计数字典
世纪词典
这就是我遇到的问题-我有列表中的第一个dictionary元素,但需要将centages.values添加到适当的位置。我一直在尝试:
#for loop to set aa to list as keys - [counts.keys,
counts.values, centages.values]
L = []
for aa, tally in counts.items():
L.append([aa, tally])
#add centages.values to list L at aa
for i in range(len(counts)):
for aa, percent in centages.items():
if(L[i] == centages.keys):
L[i].append(centages.values)
print(L) #just aa, tally so far
电流输出:
[['A', 123885], ['C', 14983], ['E', 74992], ['D', 66618], ['G', 95475], ['F', 50554], ['I', 77836], ['H', 29255], ['K', 57151], ['M', 36759], ['L', 139002], ['N', 50492], ['Q', 57732], ['P', 57595], ['S', 74803], ['R', 71819], ['U', 3], ['T', 69645], ['W', 20019], ['V', 91683], ['Y', 36836], ['X', 1]]
因此,我需要添加的最后一个元素是不添加。我很确定这很简单
预期产出应为:
['A',123885,9.55],[etc]如果您只是尝试获取一个嵌套列表,其中每个子列表包含一个键和每个dict中的相应值,那么您可以执行以下操作:
counts = {'A': 123885, 'C': 14983, 'E': 74992, 'D': 66618, 'G': 95475, 'F': 50554, 'I': 77836, 'H': 29255, 'K': 57151, 'M': 36759, 'L': 139002, 'N': 50492, 'Q': 57732, 'P': 57595, 'S': 74803, 'R': 71819, 'U': 3, 'T': 69645, 'W': 20019, 'V': 91683, 'Y': 36836, 'X': 1}
centages = {'A': 9.550641489186193, 'C': 1.1550814177057491, 'E': 5.781343234104621, 'D': 5.1357681295282385, 'G': 7.360435050087193, 'F': 3.8973493953611724, 'I': 6.000595156413581, 'H': 2.2553498548342583, 'K': 4.405930594894298, 'M': 2.8338542236832165, 'L': 10.71605334204996, 'N': 3.8925696417805975, 'Q': 4.450721511512268, 'P': 4.44015979795519, 'S': 5.766772694963835, 'R': 5.5367277806987385, 'U': 0.0002312783990600846, 'T': 5.369128034179864, 'W': 1.5433207569279443, 'V': 7.068099153675245, 'Y': 2.839790369259092, 'X': 7.709279968669486e-05}
results = [[key, counts[key], centages[key]] for key in counts]
print(results)
# [['A', 123885, 9.550641489186193], ['C', 14983, 1.1550814177057491], ['E', 74992, 5.781343234104621], ['D', 66618, 5.1357681295282385], ['G', 95475, 7.360435050087193], ['F', 50554, 3.8973493953611724], ['I', 77836, 6.000595156413581], ['H', 29255, 2.2553498548342583], ['K', 57151, 4.405930594894298], ['M', 36759, 2.8338542236832165], ['L', 139002, 10.71605334204996], ['N', 50492, 3.8925696417805975], ['Q', 57732, 4.450721511512268], ['P', 57595, 4.44015979795519], ['S', 74803, 5.766772694963835], ['R', 71819, 5.5367277806987385], ['U', 3, 0.0002312783990600846], ['T', 69645, 5.369128034179864], ['W', 20019, 1.5433207569279443], ['V', 91683, 7.068099153675245], ['Y', 36836, 2.839790369259092], ['X', 1, 7.709279968669486e-05]]
您还应该包括您的预期输出!感谢您的编辑-更可读!真是太棒了!非常感谢。我知道它必须更简单!
[['A', 123885], ['C', 14983], ['E', 74992], ['D', 66618], ['G', 95475], ['F', 50554], ['I', 77836], ['H', 29255], ['K', 57151], ['M', 36759], ['L', 139002], ['N', 50492], ['Q', 57732], ['P', 57595], ['S', 74803], ['R', 71819], ['U', 3], ['T', 69645], ['W', 20019], ['V', 91683], ['Y', 36836], ['X', 1]]
counts = {'A': 123885, 'C': 14983, 'E': 74992, 'D': 66618, 'G': 95475, 'F': 50554, 'I': 77836, 'H': 29255, 'K': 57151, 'M': 36759, 'L': 139002, 'N': 50492, 'Q': 57732, 'P': 57595, 'S': 74803, 'R': 71819, 'U': 3, 'T': 69645, 'W': 20019, 'V': 91683, 'Y': 36836, 'X': 1}
centages = {'A': 9.550641489186193, 'C': 1.1550814177057491, 'E': 5.781343234104621, 'D': 5.1357681295282385, 'G': 7.360435050087193, 'F': 3.8973493953611724, 'I': 6.000595156413581, 'H': 2.2553498548342583, 'K': 4.405930594894298, 'M': 2.8338542236832165, 'L': 10.71605334204996, 'N': 3.8925696417805975, 'Q': 4.450721511512268, 'P': 4.44015979795519, 'S': 5.766772694963835, 'R': 5.5367277806987385, 'U': 0.0002312783990600846, 'T': 5.369128034179864, 'W': 1.5433207569279443, 'V': 7.068099153675245, 'Y': 2.839790369259092, 'X': 7.709279968669486e-05}
results = [[key, counts[key], centages[key]] for key in counts]
print(results)
# [['A', 123885, 9.550641489186193], ['C', 14983, 1.1550814177057491], ['E', 74992, 5.781343234104621], ['D', 66618, 5.1357681295282385], ['G', 95475, 7.360435050087193], ['F', 50554, 3.8973493953611724], ['I', 77836, 6.000595156413581], ['H', 29255, 2.2553498548342583], ['K', 57151, 4.405930594894298], ['M', 36759, 2.8338542236832165], ['L', 139002, 10.71605334204996], ['N', 50492, 3.8925696417805975], ['Q', 57732, 4.450721511512268], ['P', 57595, 4.44015979795519], ['S', 74803, 5.766772694963835], ['R', 71819, 5.5367277806987385], ['U', 3, 0.0002312783990600846], ['T', 69645, 5.369128034179864], ['W', 20019, 1.5433207569279443], ['V', 91683, 7.068099153675245], ['Y', 36836, 2.839790369259092], ['X', 1, 7.709279968669486e-05]]