Python MemoryError: Unable to allocate array with shape and data type object


I want to perform an inner join between two datasets. The first one looks like this:

theme_ids.head()
       id    Loan Theme ID          Loan Theme Type  Partner ID
0  638631  a1050000000skGl                  General         151
1  640322  a1050000000skGl                  General         151
2  641006  a1050000002X1ij         Higher Education         160
3  641019  a1050000002X1ij         Higher Education         160
4  641594  a1050000002VbsW  Subsistence Agriculture         336

I tried:

data = pd.merge(theme_ids, theme_reg, on='Partner ID', how='inner') 
which raises:

MemoryError: Unable to allocate array with shape (15, 144356281) and data type object


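15 x 144356281 x 8 bytes = 17,322,753,720 bytes, or about 17.32 GB. In your case it is worse, because object-dtype data needs more than 8 bytes per cell: the 8 bytes only cover the pointer, and every Python object it points to takes additional memory. The shape (15, 144356281) means the merge is trying to allocate 15 by 144,356,281 cells. Since pandas stores each internal block with the columns along the first axis, this most likely corresponds to 15 columns and about 144 million rows. Is that what you intended? A result this large usually means that 'Partner ID' is repeated many times in both frames, so the inner join pairs every matching row on the left with every matching row on the right.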
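One way to check this before running the merge is to count how many rows the join would produce. The following is a minimal sketch, not a definitive implementation: the helper names estimate_inner_join_rows and estimate_pointer_bytes are made up for illustration, and the two small frames are toy stand-ins for theme_ids and theme_reg (the real theme_reg is not shown in the question).

```python
import pandas as pd


def estimate_inner_join_rows(left, right, key):
    """Estimate the row count of an inner join on a single key column.

    Hypothetical helper: each key value contributes
    (occurrences on the left) * (occurrences on the right) rows.
    value_counts() skips NaN keys by default, so they are not counted here.
    """
    left_counts = left[key].value_counts()
    right_counts = right[key].value_counts()
    # Keys present on only one side are filled with 0 and contribute nothing.
    return int(left_counts.mul(right_counts, fill_value=0).sum())


def estimate_pointer_bytes(n_rows, n_cols):
    """Lower bound for an object-dtype result: 8 bytes per cell for the pointer."""
    return n_rows * n_cols * 8


# Toy stand-ins for the two frames (assumed data, for illustration only).
left = pd.DataFrame({"Partner ID": [151, 151, 160, 336], "id": [1, 2, 3, 4]})
right = pd.DataFrame({"Partner ID": [151, 151, 151, 160, 999],
                      "region": ["a", "b", "c", "d", "e"]})

rows = estimate_inner_join_rows(left, right, "Partner ID")
merged = pd.merge(left, right, on="Partner ID", how="inner")

print(rows, len(merged))
print(estimate_pointer_bytes(144356281, 15))
```

If the estimated row count is far larger than either input, the key is probably not unique on either side. In that case it may help to deduplicate or aggregate one frame on 'Partner ID' first, or to join on a more specific combination of columns.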