Python MemoryError:无法分配具有形状和数据类型对象的数组
我想在两个数据集之间执行内部联接,如下所示:Python MemoryError:无法分配具有形状和数据类型对象的数组,python,pandas,numpy,Python,Pandas,Numpy,我想在两个数据集之间执行内部联接,如下所示: theme_ids.head() id Loan Theme ID Loan Theme Type Partner ID 0 638631 a1050000000skGl General 151 1 640322 a1050000000skGl General 151 2 641006 a1050000002X1ij Higher Education 160 3 641019 a1050000002X1ij Hig
theme_ids.head()
id Loan Theme ID Loan Theme Type Partner ID
0 638631 a1050000000skGl General 151
1 640322 a1050000000skGl General 151
2 641006 a1050000002X1ij Higher Education 160
3 641019 a1050000002X1ij Higher Education 160
4 641594 a1050000002VbsW Subsistence Agriculture 336
及
我试过:
data = pd.merge(theme_ids, theme_reg, on='Partner ID', how='inner')
其中提出:
MemoryError:无法分配具有形状(15,144356281)和
数据类型对象
15 x 144356281 x 8字节=17.3227537 GB
。在您的情况下,情况更糟,因为您的数据不仅仅是每个单元格8个字节。shape(5144356281)表示您正在分配5行和144356281列,这是您的要求吗?
data = pd.merge(theme_ids, theme_reg, on='Partner ID', how='inner')