Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/performance/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/ruby-on-rails/62.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Pandas 熊猫:有效地用代表值替换装箱柱_Pandas_Performance_Binning - Fatal编程技术网

Pandas 熊猫:有效地用代表值替换装箱柱

Pandas 熊猫:有效地用代表值替换装箱柱,pandas,performance,binning,Pandas,Performance,Binning,我想对数据进行分类,并为每个分类选择一个特定的集合 import pandas as pd df = pd.DataFrame({ 'A': [1, 2, 3, 4], 'B': [1, 2, 3, 4], }) groups = pd.cut(df['A'], bins=2, labels=False) group_reps = df.groupby([groups]).agg(A=('A', 'mean')) # ... some magic happens here to rep

我想对数据进行分类,并为每个分类选择一个特定的集合

import pandas as pd
df = pd.DataFrame({ 
  'A': [1, 2, 3, 4],
  'B': [1, 2, 3, 4],
})
groups = pd.cut(df['A'], bins=2, labels=False)
group_reps = df.groupby([groups]).agg(A=('A', 'mean'))
# ... some magic happens here to replace values in A by group_reps ...
# 
# expected result
# A, B
# 1.5, 1
# 1.5, 2
# 3.5, 3
# 3.5, 4

对于大小接近机器内存的数据,如何有效地实现这一点?

如果要更改一列,可以单独处理。另外,
transform
帮助您将聚合与原始索引对齐:

df['A'] = df['A'].groupby(groups).transform('mean')

df['A']=df['A'].groupby(groups).transform('mean')
?是的,谢谢。这就是我要找的。正在与合并、加入、查找进行斗争。。。你能给我一个答复吗?