Python: summing the count of duplicates

I am trying to sum up the duplicate rows, but I get a KeyError:

Traceback (most recent call last):
  File "checkdup.py", line 9, in <module>
    dups_data = df.duplicated(subset=['VIP_ID_SOURCE'], keep = False).sum()
  File "/usr/local/lib/python2.7/dist-packages/pandas/core/frame.py", line 4381, in duplicated
    raise KeyError(diff)
KeyError: Index([u'VIP_ID_SOURCE'], dtype='object')
Here is the sample CSV file:

VIP_ID|SYSTEM_CODE|VIP_CODE|VIP_ID_SOURCE|GENDER|TIER|ACTIVE_MEMBER|EMAIL_ADDRESS|VIP_CENTER_CODE|TELEPHONE
826914|QQ|123321123as|F08210020322|M|AD|1||1234|28386078
798089|QQ|123321123as|F08210020322|M|AD|1||1234|73848487
798087|QQ|123321123as|F08210020905|F|AD|1||1234|30608803
796490|QQ|123321123as|F08210020403|M|AD|1||1234|38581312

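The problem is this line:
df = pd.read_csv("test.csv", delimiter='|', header=None)
header=None tells pandas that the file has no header row, so it generates integer column labels 0, 1, 2, ... and the name VIP_ID_SOURCE does not exist, hence the KeyError. Use header=None only when the CSV has no header. The fix is to read the file with:
pd.read_csv("test.csv", delimiter='|')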
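A minimal sketch of the difference, assuming the sample rows from the question. Here io.StringIO stands in for test.csv, and the variable names (CSV_TEXT, no_header, dups_count) are illustrative rather than taken from the original script.

```python
import io

import pandas as pd

# Sample rows copied from the question; StringIO stands in for test.csv.
CSV_TEXT = """VIP_ID|SYSTEM_CODE|VIP_CODE|VIP_ID_SOURCE|GENDER|TIER|ACTIVE_MEMBER|EMAIL_ADDRESS|VIP_CENTER_CODE|TELEPHONE
826914|QQ|123321123as|F08210020322|M|AD|1||1234|28386078
798089|QQ|123321123as|F08210020322|M|AD|1||1234|73848487
798087|QQ|123321123as|F08210020905|F|AD|1||1234|30608803
796490|QQ|123321123as|F08210020403|M|AD|1||1234|38581312
"""

# header=None: the first line is treated as data and the columns are
# labelled 0..9, so subset=['VIP_ID_SOURCE'] would raise KeyError.
no_header = pd.read_csv(io.StringIO(CSV_TEXT), delimiter="|", header=None)
print(list(no_header.columns))

# Default header handling: the first line supplies the column names.
df = pd.read_csv(io.StringIO(CSV_TEXT), delimiter="|")

# keep=False marks every row of a duplicate group, so sum() counts them all.
dups_count = int(df.duplicated(subset=["VIP_ID_SOURCE"], keep=False).sum())
print(dups_count)
```

With the sample data, only F08210020322 appears more than once in VIP_ID_SOURCE, so both of its rows are counted.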
I found the answer:
dups_data = df.reset_index().duplicated(subset=[3], keep=False).sum()
Thanks.