Python: select only the first highest value for each entry in a list


python pandas dataframe


Here is my data:

Column       IV         Source
RRD          5.795765   Personal_Demographics
RRD          5.795765   Cust360_Agreement
RRD          5.792729   External_Data
WO           4.361066   Cust360_Asset
Rating       3.600918   Personal_Demographics

My expected result:

Column       IV         Source
RRD          5.795765   Personal_Demographics
WO           4.361066   Cust360_Asset
Rating       3.600918   Personal_Demographics

What I tried:

inds = df.groupby(['Column'])['IV'].transform(max) == df['IV']

But the result of filtering with this mask is:

Column       IV         Source
RRD          5.795765   Personal_Demographics
RRD          5.795765   Cust360_Agreement
WO           4.361066   Cust360_Asset
Rating       3.600918   Personal_Demographics

The first two RRD rows have the same IV value, so both are kept, but I need only one row per Column, like this:

Column       IV         Source
RRD          5.795765   Personal_Demographics
WO           4.361066   Cust360_Asset
Rating       3.600918   Personal_Demographics

Regards

Try sort_values followed by drop_duplicates:

out = df.sort_values('IV',ascending=False).drop_duplicates('Column')
Out[121]: 
   Column        IV                 Source
0     RRD  5.795765  Personal_Demographics
3      WO  4.361066          Cust360_Asset
4  Rating  3.600918  Personal_Demographics
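A self-contained sketch of the same approach, with the question's data rebuilt by hand. One assumption is added here that is not in the original answer: kind="stable" is passed to sort_values, because the default sort algorithm does not promise to keep tied rows in their original order, and the goal is to keep the first of the tied RRD rows.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "Column": ["RRD", "RRD", "RRD", "WO", "Rating"],
    "IV": [5.795765, 5.795765, 5.792729, 4.361066, 3.600918],
    "Source": [
        "Personal_Demographics",
        "Cust360_Agreement",
        "External_Data",
        "Cust360_Asset",
        "Personal_Demographics",
    ],
})

# Sort by IV descending with a stable sort so that tied rows keep
# their original relative order, then keep the first row per Column.
out = (
    df.sort_values("IV", ascending=False, kind="stable")
      .drop_duplicates("Column")
)
print(out)
```

drop_duplicates keeps the first occurrence by default (keep="first"), so after the descending sort each Column retains its highest-IV row.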

If you prefer groupby:

df.sort_values('IV',ascending=False).groupby(['Column']).head(1)
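Another possible variant, not from the original answer, skips the sort and uses idxmax, which returns the index label of the first occurrence of the maximum within each group. This is only a sketch; it assumes the index labels are unique and that IV has no missing values.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "Column": ["RRD", "RRD", "RRD", "WO", "Rating"],
    "IV": [5.795765, 5.795765, 5.792729, 4.361066, 3.600918],
    "Source": [
        "Personal_Demographics",
        "Cust360_Agreement",
        "External_Data",
        "Cust360_Asset",
        "Personal_Demographics",
    ],
})

# idxmax gives, per group, the label of the first row holding the
# maximum IV; sort=False keeps groups in order of first appearance.
first_max_idx = df.groupby("Column", sort=False)["IV"].idxmax()
result = df.loc[first_max_idx]
print(result)
```

Because no sorting is involved, the rows come back in their original order, and ties are resolved in favour of the earliest row.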

Note that drop_duplicates should be applied to the single column 'Column' here, not to the whole row. Thanks @NabihBawazir, see the update.