Pandas groupby（[]）.sum（）上出现意外结果 n=df1。分组依据['Year'、'State'、'Regulator'、'Industry'、'Product'、'Count']。总和_Pandas_Pandas Groupby

Pandas groupby（[]）.sum（）上出现意外结果 n=df1。分组依据['Year'、'State'、'Regulator'、'Industry'、'Product'、'Count']。总和

pandas

Pandas groupby（[]）.sum（）上出现意外结果 n=df1。分组依据['Year'、'State'、'Regulator'、'Industry'、'Product'、'Count']。总和,pandas,pandas-groupby,Pandas,Pandas Groupby,问题已解决。我已经运行了代码，包括并排除了代码中的列['Count']，这给了我好结果和坏结果的混合。出于某种原因，CSV没有被正确读取（如果有意义的话）。列['Count']是dtypes int，但它似乎被读取为字符串。所以我做了一个.applypd.to_numeric，删除了“Count”，并重新运行了解决问题的单元格下面是groupby/sum的最终代码： n = df1.groupby(['Year', 'State', 'Regulator', 'Industry','Produ

问题已解决。我已经运行了代码，包括并排除了代码中的列['Count']，这给了我好结果和坏结果的混合。出于某种原因，CSV没有被正确读取（如果有意义的话）。列['Count']是dtypes int，但它似乎被读取为字符串。所以我做了一个.applypd.to_numeric，删除了“Count”，并重新运行了解决问题的单元格

下面是groupby/sum的最终代码：

n = df1.groupby(['Year', 'State', 'Regulator', 'Industry','Product'])['Count'].sum()

请同时包含错误消息实际发生的错误。没有实际的错误消息，错误是获取了错误的值。例如，在groupby和sum之后，我得到的不是住房抵押贷款=3，而是住房抵押贷款=1

Year | 2012
State | Alabama
Regulator | FDIC
Debit Card | 1
Residential Mortgage | 1

n = df1.groupby(['Year', 'State', 'Regulator', 'Industry','Product'])['Count'].sum()