比较python数据帧中2列的列表,并计算相同的项
如何比较python数据帧中两列的列表,并计算数据帧中这两列之间的相同列表。例如:比较python数据帧中2列的列表,并计算相同的项,python,list,dataframe,Python,List,Dataframe,如何比较python数据帧中两列的列表,并计算数据帧中这两列之间的相同列表。例如: column A | column B ==================================== ['a', 'b', 'c'] | ['a', 'b'] ['a', 'b'] | ['a'] ['b'] | ['a'] 我想得到这个结果: column A | column B
column A | column B
====================================
['a', 'b', 'c'] | ['a', 'b']
['a', 'b'] | ['a']
['b'] | ['a']
我想得到这个结果:
column A | column B | count_same_item
======================================================
['a', 'b', 'c'] | ['a', 'b'] | 2
['a', 'b'] | ['a'] | 1
['b'] | ['a'] | 0
非常感谢您的帮助试试这个:
df['count_same_item'] = df.apply(lambda x: len(set(x['column A']) & set(x['column B'])), axis=1)
print(df)
输出:
column A column B count_same_item
0 [a, b, c] [a, b] 2
1 [a, b] [a] 1
2 [b] [a] 0
试试这个:
df['count_same_item'] = df.apply(lambda x: len(set(x['column A']) & set(x['column B'])), axis=1)
print(df)
输出:
column A column B count_same_item
0 [a, b, c] [a, b] 2
1 [a, b] [a] 1
2 [b] [a] 0