Python 如何从pandas df中选择多个列并将其存储在另一个df中？_Python_Pandas

Python 如何从pandas df中选择多个列并将其存储在另一个df中？

python pandas

Python 如何从pandas df中选择多个列并将其存储在另一个df中？,python,pandas,Python,Pandas,我有一个具有以下键的数据帧： tourist_spot, nearest landmark, longitute(landmark), latitude(landmark), nearest police station, longitute(PS), latitude(PS), nearest bus stop, longitute(BS), latitude(BS), nearest taxi stand, longitute(TS), latitude(TS) 还有其他一些类似的专栏。我

我有一个具有以下键的数据帧：

tourist_spot,
nearest landmark,
longitute(landmark),
latitude(landmark),
nearest police station,
longitute(PS),
latitude(PS),
nearest bus stop,
longitute(BS),
latitude(BS),
nearest taxi stand,
longitute(TS),
latitude(TS)

还有其他一些类似的专栏。我想要实现的是这样的目标：

name,
type,
latitude,
longitude,
nearest_to

名称

将具有地标或PS或BS等的名称。

Type

将指示地点的类型。例如，警察局的PS等。

纬度

和

经度

将是每个地方的位置，

最近的

将是

旅游景点的名称

样本数据：

tourist_spot                ts1
nearest landmark            nl1
longitute(landmark)         4
latitude(landmark)          5
nearest police station      ps1
longitute(PS)               7
latitude(PS)                8
nearest bus stop            bs1
longitute(BS)               9
latitude(BS)                10
nearest taxi stand          ts1
longitute(TS)               11 
latitude(TS)                12

请将其转换为类似于表的结构。这里很难摆一张大桌子

期望输出：

 name   type   longitude   latitude   nearest_to
    nl1    landmark   4         5           ts1
    ps1    PS         7         8           ts1

到目前为止，我的代码是：

import pandas as pd
df = pd.read_excel("C:\\Users\\NCOG1\\Desktop\\Meghalaya\\Daribokgre.xlsx")
df1 = pd.DataFrame(columns=['name','type_id','longitude','latitude', 'nearby_to'])
df1['name'] = df['Nearest Landmark Name'] 
df1['type_id'] = df['Nearest Landmark Name']
df1['longitude'] = df['Longitude (of Nearest Landmark)']
df1['latitude'] = df['Latitude (of Nearest Landmark)']
df1['nearby_to'] = df['Name of Tourist Spot']

首先，必须使用

纬度

和

经度

分析列，前面有一列：

df = pd.read_excel('Daribokgre.xlsx', sheet_name='Sheet2', index_col=0)
c = df.columns[df.columns.str.startswith('Latitude')]
idx = df.columns.get_indexer(c)
df1 = df.iloc[:, np.sort(np.r_[idx-1, idx, idx + 1])[1:]].reset_index()
#print (df1)

然后将数据提取到helper DataFrame中的新列：

df2 = df1.columns.to_frame(index=False)
m = df2[0].str.contains('Latitude|Longitude')
df2['type'] = df2[0].mask(m).ffill()
df2['b'] = np.where(m, df2[0].str.split(r'\s+\(', n=1).str[0], 'name')

最后一次创建多索引并通过堆栈重新整形：

df1.columns = [df2['type'], df2['b']]
df1 = (df1.stack(0)
         .reset_index(level=0, drop=True)
         .reset_index().rename(columns={1:'type'})
         .rename_axis(None, axis=1))
print (df1)
                                                type     Latitude  \
0                                           Bus Stop  25⁰33'05" N   
1                                           Landmark  25⁰33'05" N   
2                               Name of Tourist Spot  25⁰29'24" N   
3                              Nearest Hospital Name  25⁰36'01" N   
4                            Nearest Taxi Stand Name  25⁰30'45" N   
5                                     Police Station  25⁰35'53" N   
6          Restaurants or other eating joints nearby   2529'22' N   
7  Staying Options nearby (Hotels, Resorts, Homes...  25⁰29'22" N   

     Longitude                                               name  
0  90⁰19'42" E           Oragitok Jnc. Old Williamnagar-Tura road  
1  90⁰19'42" E               Oragitok Williamnagar-Tura  old road  
2  90⁰19'22" E                                         Daribokgre  
3  90⁰16'25" E                                        Asanang PHC  
4  90⁰13'01" E                                  Tura Bazaar, Tura  
5  90⁰15'14" E  Rongram Beat House whoever Daribokgre falls un...  
6   9019'21" E                                Daribokgre Homestay  
7  90⁰19'21" E                            1. Daribokgre Homestay,

添加示例数据和预期输出。请您对您的问题进行更多解释，例如

Dataframe

实例和您程序的预期行为。因此，您将要添加

最近的

字段，该字段通过

经度和纬度定义位置，哪个是距离名称字段最近的位置？我想组合最近的地标，最近的警察局
将其列为名称
。与每个和附近的相对应的坐标将包含每个和最近的旅游景点的名称。@KaranGupta-您的熊猫版本是什么？类型错误：一元数的操作数类型错误~：“NoneType”。我想要类似的东西。我们可以私下聊聊吗？我可以给你发送真实的文件。发送到你的电子邮件。是的，它的第2页