Python 3.x 无法分配pandas.Dataframe中的行_Python 3.x_Pandas

Python 3.x 无法分配pandas.Dataframe中的行

python-3.x pandas

Python 3.x 无法分配pandas.Dataframe中的行,python-3.x,pandas,Python 3.x,Pandas,我试图计算数据帧中在指定列col上具有相同值的行的平均值。然而，我被困在分配一行数据帧上这是我的密码： def code(data, col): """ Finds average value of all rows that have identical col values from column col . Returns new Pandas.DataFrame with the data """ values = pd.unique(data[

我试图计算数据帧中在指定列

col

上具有相同值的行的平均值。然而，我被困在分配一行数据帧上

这是我的密码：

def code(data, col):
    """ Finds average value of all rows that have identical col values from column col .
        Returns new Pandas.DataFrame with the data
    """
    values = pd.unique(data[col])
    rows = len(values)
    res = pd.DataFrame(np.zeros(shape = (rows, len(data.columns))), columns = data.columns)
    for i, v in enumerate(values):
        e = data[data[col] == v].mean().to_frame().transpose()
        res[i:i+1] = e
    return res

问题是代码只适用于第一行，而将NaN值放在下一行。我已经检查了e的值并确认它是好的，因此赋值

res[I:I+1]=e

有问题。我也尝试过做

res.iloc[I]=e

，但是我得到了“ValueError:Incompatible indexer with Series”有其他方法吗？这看起来很直截了当，我很困惑为什么它不起作用

例如：

wdata
    Out[78]: 
        Die  Subsite  Algorithm  Vt1           It1          Ignd
    0     1        0          0  0.0 -2.320000e-07 -4.862400e-08
    1     1        0          0  0.1 -1.000000e-04  1.000000e-04
    2     1        0          0  0.2 -1.000000e-03  1.000000e-03
    3     1        0          0  0.3 -1.000000e-02  1.000000e-02
    4     1        1          1  0.0  3.554000e-07 -2.012000e-07
    5     1        2          2  0.0  5.353000e-08 -1.684000e-07
    6     1        3          3  0.0  9.369400e-08 -2.121400e-08
    7     1        4          4  0.0  3.286200e-08 -2.093600e-08
    8     1        5          5  0.0  8.978600e-08 -3.262000e-07
    9     1        6          6  0.0  3.624800e-08 -2.507600e-08
    10    1        7          7  0.0  2.957000e-08 -1.993200e-08
    11    1        8          8  0.0  7.732600e-08 -3.773200e-08
    12    1        9          9  0.0  9.300000e-08 -3.521200e-08
    13    1       10         10  0.0  8.468000e-09 -6.990000e-09
    14    1       11         11  0.0  1.434200e-11 -1.200000e-11
    15    2        0          0  0.0  8.118000e-11 -5.254000e-11
    16    2        1          1  0.0  9.322000e-11 -1.359200e-10
    17    2        2          2  0.0  1.944000e-10 -2.409400e-10
    18    2        3          3  0.0  7.756000e-11 -8.556000e-11
    19    2        4          4  0.0  1.260000e-11 -8.618000e-12
    20    2        5          5  0.0  7.122000e-12 -1.402000e-13
    21    2        6          6  0.0  6.224000e-11 -2.760000e-11
    22    2        7          7  0.0  1.133400e-08 -6.566000e-09
    23    2        8          8  0.0  6.600000e-13 -1.808000e-11
    24    2        9          9  0.0  6.861000e-08 -4.063400e-08
    25    2       10         10  0.0  2.743800e-10 -1.336000e-10

     Die  Subsite  Algorithm   Vt1      It1     Ignd
0    1      4.4        4.4  0.04 -0.00074  0.00074
0    2      5.5        5.5    0  6.792247e-09 -4.023330e-09

预期输出：

wdata
    Out[78]: 
        Die  Subsite  Algorithm  Vt1           It1          Ignd
    0     1        0          0  0.0 -2.320000e-07 -4.862400e-08
    1     1        0          0  0.1 -1.000000e-04  1.000000e-04
    2     1        0          0  0.2 -1.000000e-03  1.000000e-03
    3     1        0          0  0.3 -1.000000e-02  1.000000e-02
    4     1        1          1  0.0  3.554000e-07 -2.012000e-07
    5     1        2          2  0.0  5.353000e-08 -1.684000e-07
    6     1        3          3  0.0  9.369400e-08 -2.121400e-08
    7     1        4          4  0.0  3.286200e-08 -2.093600e-08
    8     1        5          5  0.0  8.978600e-08 -3.262000e-07
    9     1        6          6  0.0  3.624800e-08 -2.507600e-08
    10    1        7          7  0.0  2.957000e-08 -1.993200e-08
    11    1        8          8  0.0  7.732600e-08 -3.773200e-08
    12    1        9          9  0.0  9.300000e-08 -3.521200e-08
    13    1       10         10  0.0  8.468000e-09 -6.990000e-09
    14    1       11         11  0.0  1.434200e-11 -1.200000e-11
    15    2        0          0  0.0  8.118000e-11 -5.254000e-11
    16    2        1          1  0.0  9.322000e-11 -1.359200e-10
    17    2        2          2  0.0  1.944000e-10 -2.409400e-10
    18    2        3          3  0.0  7.756000e-11 -8.556000e-11
    19    2        4          4  0.0  1.260000e-11 -8.618000e-12
    20    2        5          5  0.0  7.122000e-12 -1.402000e-13
    21    2        6          6  0.0  6.224000e-11 -2.760000e-11
    22    2        7          7  0.0  1.133400e-08 -6.566000e-09
    23    2        8          8  0.0  6.600000e-13 -1.808000e-11
    24    2        9          9  0.0  6.861000e-08 -4.063400e-08
    25    2       10         10  0.0  2.743800e-10 -1.336000e-10

     Die  Subsite  Algorithm   Vt1      It1     Ignd
0    1      4.4        4.4  0.04 -0.00074  0.00074
0    2      5.5        5.5    0  6.792247e-09 -4.023330e-09

相反，我得到的是：

         Die  Subsite  Algorithm   Vt1      It1     Ignd
    0    1      4.4        4.4  0.04 -0.00074  0.00074
    0    NaN      NaN      NaN  NaN   NaN      NaN

例如，此代码导致：

In[81]: wdata[wdata['Die'] == 2].mean().to_frame().transpose()
Out[81]: 
   Die  Subsite  Algorithm  Vt1           It1          Ignd
0    2      5.5        5.5    0  6.792247e-09 -4.023330e-09

我的作品：

def code(data, col):
    """ Finds average value of all rows that have identical col values from column col .
        Returns new Pandas.DataFrame with the data
    """
    values = pd.unique(data[col])
    rows = len(values)
    res = pd.DataFrame(columns = data.columns)
    for i, v in enumerate(values):
        e = data[data[col] == v].mean()
        res.loc[i,:] = e
    return res

col = 'Die'
print (code(data, col))
  Die Subsite Algorithm   Vt1          It1         Ignd
0   1     4.4       4.4  0.04 -0.000739957  0.000739939
1   2       5         5     0  7.34067e-09 -4.35482e-09

但同样的产出对总产出产生了影响：

发布问题几分钟后，我通过在

中添加

.values

来解决问题

e = data[data[col] == v].mean().to_frame().transpose().values

然而，事实证明，我想做的事已经被熊猫完成了。谢谢马祖

df.groupBy(col).mean()

您可以添加一些示例数据和预期输出吗？

code（数据，列）

中的

col

是什么

Vt1

？

Col

带有列名的字符串。在我的例子中，它的

Die

是在

df.groupby（'Die'）.mean（）

或

df.groupby（'Die'，如_index=False）。mean（）

-它将生成您想要的数据集？确实这就是我想要的！结果是我重新发明了轮子。谢谢是的，我也删除了一些代码。但最好是将

groupby

与

mean

一起使用。