Python 数据帧中每第二列的值计算一列的值
我有这个数据框:Python 数据帧中每第二列的值计算一列的值,python,pandas,Python,Pandas,我有这个数据框: dummy_dataset = {'sentences': ['a','b','c','d','e','f'], 'classes': [1,2,1,3,3,2] } dataframe = pd.DataFrame(dummy_dataset) sentences classes 0 a 1 1 b 2 2 c 1 3 d 3 4 e 3 5 f 2 我要找的是: output = { 1 : ['a','c'], 2
dummy_dataset = {'sentences': ['a','b','c','d','e','f'], 'classes': [1,2,1,3,3,2] }
dataframe = pd.DataFrame(dummy_dataset)
sentences classes
0 a 1
1 b 2
2 c 1
3 d 3
4 e 3
5 f 2
我要找的是:
output = { 1 : ['a','c'], 2 : ['b','f'], 3: ['d','e'] }
我尝试了dict方法:
dict_count = {}
for m in range(len(dfg)):
if dfg['classes'].iloc[m] not in dict_count:
dict_count[dfg['classes'].iloc[m]] = [dfg['sentences'].iloc[m]]
else:
dict_count[dfg['classes'].iloc[m]].append(dfg['sentences'].iloc[m])
如何使用pandascount
和groupby
方法执行此操作?在classes
列和as列表中使用,然后:
输出:
{1: ['a', 'c'], 2: ['b', 'f'], 3: ['d', 'e']}
{1: ['a', 'c'], 2: ['b', 'f'], 3: ['d', 'e']}