Python: finding matching points in 2 separate numpy arrays


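I have two arrays of different sizes, each containing 3D points. I want to compare the two arrays efficiently, find the points that occur in both, and ultimately return just the number of matching points.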

pA=[[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB=[[14,1,0],[1,2,4],[1,20,1],[15,1,0]]

#returns 2

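At the moment I do this with a sloppy loop, but it performs poorly. That is a problem because I need to match many pairs of arrays that contain far more points than this example.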

# pA and pB must be NumPy arrays here, e.g. pA = np.array(pA)
t = np.array([pA[x] == pB for x in range(len(pA))]).sum(2)
print(np.sum(t == 3))

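I just don't know how to compare two multidimensional arrays of different sizes efficiently, and then how to repeat that for a large number of pairs.

Edit

I found a fast solution: concatenate the two arrays, build a de-duplicated version of the combined array, and compare the lengths of the two.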

pts=np.concatenate((pA,pB),axis=0)
pts2 = np.unique(pts.view([('', pts.dtype)]*pts.shape[1]))
return len(pts)-len(pts2)
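
The concatenate-and-unique idea above can be sketched as a self-contained function. This is only an illustration under some assumptions: the name count_matching_points is hypothetical, it relies on the axis argument of np.unique (available in NumPy 1.13 and later) instead of the structured-dtype view, and it de-duplicates each input first so that a point repeated inside one array is not counted as a cross-array match.

```python
import numpy as np

def count_matching_points(pA, pB):
    """Count distinct points that occur in both pA and pB (hypothetical helper)."""
    # De-duplicate each input so repeats inside a single array
    # cannot be mistaken for a match between the two arrays.
    uA = np.unique(np.asarray(pA), axis=0)
    uB = np.unique(np.asarray(pB), axis=0)
    # After de-duplication, a row that appears twice in the stacked
    # array must have come from both inputs.
    stacked = np.concatenate((uA, uB), axis=0)
    _, counts = np.unique(stacked, axis=0, return_counts=True)
    return int(np.sum(counts == 2))

pA = [[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB = [[14,1,0],[1,2,4],[1,20,1],[15,1,0]]
print(count_matching_points(pA, pB))  # prints 2
```

For many pairs of arrays, the same function can simply be called once per pair.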

I'm not sure how this will perform on your full dataset, but try SciPy's kdtree:

from scipy.spatial import cKDTree

pA=[[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB=[[14,1,0],[1,2,4],[1,20,1],[15,1,0]]

kdtree = cKDTree(pA)
dists, inds = kdtree.query(pB, distance_upper_bound=1e-5)
result = (dists == 0).sum()
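
If the coordinates are floats, an exact test against zero may be too strict. As a rough sketch, the same query can count every point in pB that has a neighbour in pA within a tolerance, relying on the fact that cKDTree.query reports an infinite distance when no neighbour lies within distance_upper_bound. The function name count_near_matches and the default tolerance are assumptions for illustration only.

```python
import numpy as np
from scipy.spatial import cKDTree

def count_near_matches(pA, pB, tol=1e-5):
    """Count points of pB with a neighbour in pA closer than tol (hypothetical helper)."""
    tree = cKDTree(np.asarray(pA, dtype=float))
    # Points with no neighbour inside the bound get an infinite distance.
    dists, _ = tree.query(np.asarray(pB, dtype=float), k=1, distance_upper_bound=tol)
    return int(np.isfinite(dists).sum())

pA = [[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB = [[14,1,0],[1,2,4],[1,20,1],[15,1,0]]
print(count_near_matches(pA, pB))  # prints 2
```

Note that this counts points of pB, so a point repeated in pB is counted once per occurrence.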

Here is a numpy approach that uses diff together with np.all(... == 0, 1):

import numpy as np

# Inputs
pA=[[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB=[[14,1,0],[1,2,4],[1,20,1],[15,1,0]]

# Form concatenate array of pA and pB
pts = np.concatenate((pA,pB),axis=0)

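# Sort pts lexicographically by rows so identical points end up adjacent
# (sorting on a single column, as in pts[pts[:,1].argsort()], does not guarantee this)
spts = pts[np.lexsort(pts.T)]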

# Finally get counts by DIFFing along rows and counting all zero rows
counts = np.sum(np.diff(np.all(np.diff(spts,axis=0)==0,1)+0)==1)

In [152]: counts
Out[152]: 2
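
The sort-and-diff steps can be unpacked into named intermediates, which may make the logic easier to follow. This is a sketch, not the answer's exact code: the function name count_repeated_points is hypothetical, np.lexsort is used so that identical rows are guaranteed to be adjacent, and the mask is padded with a leading False so that a run of duplicates starting at the very first sorted row is also counted.

```python
import numpy as np

def count_repeated_points(pA, pB):
    """Count distinct points occurring more than once in pA and pB combined."""
    pts = np.concatenate((pA, pB), axis=0)
    # Lexicographic sort: identical rows become neighbours.
    spts = pts[np.lexsort(pts.T)]
    # True where a row equals the row directly above it.
    same_as_prev = np.all(np.diff(spts, axis=0) == 0, axis=1)
    # Pad with False, then mark where each run of True values begins.
    padded = np.concatenate(([False], same_as_prev))
    run_starts = padded[1:] & ~padded[:-1]
    return int(run_starts.sum())

pA = [[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2]]
pB = [[14,1,0],[1,2,4],[1,20,1],[15,1,0]]
print(count_repeated_points(pA, pB))  # prints 2
```

Each run of equal rows is counted once, however long it is. A point repeated only inside pA (or only inside pB) is counted as well, so this equals the number of cross-array matches only when neither input contains internal duplicates.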

# Inputs
pA=[[0,0,0],[0,1,0],[1,2,4],[10,3,4],[1,20,1],[5,3,2],[1,2,4]]
pB=[[14,1,0],[1,2,4],[1,20,1],[15,1,0],[1,2,4]]

counts = np.sum(np.all(np.diff(spts,axis=0)==0,1))

pts = np.concatenate((pA, pB), axis=0)
pts2 = np.unique(pts.view([('', pts.dtype)] * pts.shape[1]))
counts = len(pts) - len(pts2)
