Python 仅为列表中的一个条目选择第一个最高值
这是我的数据Python 仅为列表中的一个条目选择第一个最高值,python,pandas,dataframe,duplicates,Python,Pandas,Dataframe,Duplicates,这是我的数据 Column IV Source RRD 5.795765 Personal_Demographics RRD 5.795765 Cust360_Agreement RRD 5.792729 External_Data WO 4.361066 Cust360_Asset Rating 3.600918 Personal_Demographics 我
Column IV Source
RRD 5.795765 Personal_Demographics
RRD 5.795765 Cust360_Agreement
RRD 5.792729 External_Data
WO 4.361066 Cust360_Asset
Rating 3.600918 Personal_Demographics
我的预期结果
Column IV Source
RRD 5.795765 Personal_Demographics
WODate 4.361066 Cust360_Asset
Rating 3.600918 Personal_Demographics
我所尝试的
inds = df.groupby(['Column'])['IV'].transform(max) == df['IV']
但结果如何
Column IV Source
RRD 5.795765 Personal_Demographics
RRD 5.795765 Cust360_Agreement
WO 4.361066 Cust360_Asset
Rating 3.600918 Personal_Demographics
第一个是有类似的值,但我只需要一个类似的输出
Column IV Source
RRD 5.795765 Personal_Demographics
WO 4.361066 Cust360_Asset
Rating 3.600918 Personal_Demographics
关于尝试删除重复项+
排序值
out = df.sort_values('IV',ascending=False).drop_duplicates('Column')
Out[121]:
Column IV Source
0 RRD 5.795765 Personal_Demographics
3 WO 4.361066 Cust360_Asset
4 Rating 3.600918 Personal_Demographics
如果您想要groupby
df.sort_values('IV',ascending=False).groupby(['Column']).head(1)
要知道,删除重复项只能在一列上工作,thanks@NabihBawazir检查更新