Python 删除数据帧中的重复项时出现问题_Python_Pandas_Dataframe

Python 删除数据帧中的重复项时出现问题

python pandas dataframe

Python 删除数据帧中的重复项时出现问题,python,pandas,dataframe,Python,Pandas,Dataframe,我有这个数据框： Ubicacion lat lon 0 a 19.28034 -99.17121 1 b 19.28333 -99.17535 2 c 19.28028 -99.16887 3 a 19.28034 -99.17121 4 b 19.28333 -99.17535 5 c 19.28028 -99.16887 6 b 19.2

我有这个数据框：

  Ubicacion       lat       lon
0         a  19.28034 -99.17121
1         b  19.28333 -99.17535
2         c  19.28028 -99.16887
3         a  19.28034 -99.17121
4         b  19.28333 -99.17535
5         c  19.28028 -99.16887
6         b  19.28333 -99.17535
7         d  19.29259 -99.17757
8         d  19.29259 -99.17757
9         d  19.29259 -99.17757

我想删除所有重复的行，因此我使用：

ubicaciones_finales = ubicaciones_finales.drop_duplicates(keep="first")

我明白了：

  Ubicacion       lat       lon
0         a  19.28034 -99.17121
1         b  19.28333 -99.17535
2         c  19.28028 -99.16887
7         d  19.29259 -99.17757

除了行分别为0、1、2和7之外，其他一切看起来都很好。所以当我跑步时：

 for k, row in ubicaciones_finales.iterrows():
    print(k)

I get:
0
1
2
7

我如何解决这个问题？顺便说一句，已经查过了

这是一样的，从0到2再加上1。谢谢您的时间。

IIUC，使用

reset\u index

或直接传递

ignore\u index=True

：

df = df.drop_duplicates(keep='first').reset_index(drop=True)

# or 

df = df.drop_duplicates(keep='first', ignore_index=True)

输出：

  Ubicacion       lat       lon
0         a  19.28034 -99.17121
1         b  19.28333 -99.17535
2         c  19.28028 -99.16887
3         d  19.29259 -99.17757

谢谢，在我知道之前，我不明白ignore_索引是关于什么的

  Ubicacion       lat       lon
0         a  19.28034 -99.17121
1         b  19.28333 -99.17535
2         c  19.28028 -99.16887
3         d  19.29259 -99.17757